Build Agents with Real-time Context
Most agents reason over stale snapshots. Kinetica resolves vector, SQL, graph, and spatial in one GPU query plan — on live streams, across billions of rows.
Three ways teams build agent retrieval today. Two stitch separate systems together — one runs it all in a single query plan.
Fast OLAP. No vector, no graph, no spatial.
Built for sub-second analytics on structured tables. Excellent at filtering and aggregating, but the agent's retrieval surface — semantic search over documents, graph traversal, geospatial joins — lives in other systems. The agent has to choreograph all of them.
Converged is easy to claim. GPU is why it's fast — and fresh.
A single SELECT can run ANN vector lookup, filter on tabular predicates, traverse a graph, and apply a geofence — all on the same scan, all on GPU.
Embeddings are computed and indexed as data arrives. There is no nightly re-embed batch and no drift between operational facts and what the agent retrieves.
MCP for tool-using agents, NL2SQL for LLMs writing their own queries, LangChain for orchestration, Postgres wire for everything else.
Independent benchmarks, same hardware, published. The engine that removes the network hops also wins on raw speed.
Structured retrieval
Can the engine run the full enterprise SQL suite an agent's structured queries resemble? JOIN-heavy, aggregation-heavy, at scale.
Semantic retrieval
How fast can new embeddings be ingested and made queryable — so the agent's recall stays fresh inside the turn?
In the loop
When a turn needs vector + filter + join + traverse together, how many systems and network hops does it cross?
An agent makes six to twelve blocking retrievals per turn, each gating the next. It's a new kind of user — with a latency budget.
As Janakiram MSV observed in Forbes after OpenAI's Rockset acquisition: production AI didn't need another vector database — it needed real-time retrieval over operational data. Pure vector stores are a feature, not a product.
Parallel KV lookups at 100k+ reads/sec against the same tables — no separate Redis.
CAGRA & HNSW on GPU. 5× faster ingest on VectorDBBench; embeddings stay fresh.
All 99 TPC-DS queries where ClickHouse runs a partial suite — vector + filter + join in one statement.
Native graph over the same tables as your SQL. Solve, match, traverse in one query.
Vectorized ASOF and WINDOW operators on GPU. Continuous views as ticks arrive.
Native ST_* operators and in-database tile rendering, fused with vector and SQL.
Every modality boundary is a network hop, an auth surface, and a consistency window. Errors compound: a stale read at step 3 of a 12-step agent loop is confidently wrong by step 12. Strong consistency isn't a nice-to-have for agents — it's a correctness requirement.
All six retrieval modes run as column types in the same engine. The agent issues one SQL statement; Kinetica fans out across vector, structured, graph, time-series, and spatial inside a single query plan. One auth surface. One transactional view. No drift.
However your agent talks, Kinetica answers on the same engine and the same tables — the framework never constrains the database.
Kinetica is an MCP server.
Any MCP-capable agent — Claude, Cursor, Copilot, Codex — discovers Kinetica's tables, schemas, and tools and queries them through the standard protocol. No glue code, no custom adapter.
Hybrid retrieval (vector + filter + join + traverse) returns as a single tool result.
# point any MCP client at the server connect mcp://kinetica.your-domain.com # the agent now sees tables + tools tools: query_sql, vector_search, graph_solve, st_filter
Embedding and inference next to the data.
Kinetica invokes NIM-hosted embedding and LLM models directly from SQL. Embeddings are generated as data arrives — no separate pipeline, no extra hop, no model server round-trip per row. The retrieval path and the generation path run on the same GPU fabric.
Your coding agent already knows how to use Kinetica.
Two skills. One install command. Eleven agent platforms.
kinetica-execute teaches agents SQL analytics, geospatial, graph, time-series, security, and admin — with a live dual-runtime CLI for running queries directly. kinetica-code teaches the Python SDK and embedded SQL for application developers. Both install in one command and activate based on what the agent is being asked to do.
$npx skills add kineticadb/agent-skills
github.com/kineticadb/agent-skills
One engine.
Every retrieval mode.
Frequently asked questions
How is this different from a vector database like Pinecone or Milvus?
Do I need to maintain a separate embedding pipeline?
How does an agent connect to Kinetica?
What is hybrid retrieval, exactly?
Will my coding agent know how to write Kinetica SQL?
kinetica-execute teaches agents SQL
analytics, geospatial, graph, time-series, security, and admin (with a dual-runtime CLI for
running queries directly). kinetica-code teaches the Python SDK and embedded SQL. One
install command — npx skills add kineticadb/agent-skills — activates them across
Claude Code, Cursor, OpenAI Codex, Windsurf, Gemini CLI, GitHub Copilot, Roo Code, Cline, Aider,
Continue, Amazon Q, and any agent that reads SKILL.md.