Keyword search that ranks like the best
Multi-field full-text search with best-in-class relevance and a fast path that skips any document that cannot reach the top results.
Agent Search· Torque
Torque is a search engine rebuilt from scratch in Rust that serves every query from memory, so most lookups finish in well under a millisecond. It speaks the same API as Typesense, so existing apps switch with a config change, and underneath it adds true semantic search, hybrid ranking, and a built-in interface agents can query directly.
Why Torque
Every search stack makes you pick a side. Bolt vectors onto a keyword engine and semantic search feels like an afterthought. Run a vector database and exact phrases, filters, and facets fall apart. And almost all of them lean on disk, adding latency to every single query.
Torque keeps the entire index in memory and does keyword and meaning-based search in one engine, while staying a drop-in replacement for the tools teams already run.
Capabilities
Multi-field full-text search with best-in-class relevance and a fast path that skips any document that cannot reach the top results.
Meaning-based vector search blended with keyword results, so a single query gets both exact-match precision and conceptual recall.
Fuzzy typo correction, prefix autocomplete, synonyms, and per-language stopwords, tuned across seventeen languages.
Rich filtering by number, geo-radius, nested field, or array, faceted counts with statistics, and sort by relevance, distance, or recency.
Upsert and delete in sub-milliseconds, or swap an entire freshly-built index atomically without ever blocking a query.
A native Model Context Protocol endpoint exposes search as a self-describing tool, so an agent queries and manages the engine with no glue code.
Pin or hide specific results, run up to fifty searches in one parallel request, and save reusable query presets.
Reference fields let a search pull in related documents from another collection and filter across the link.
Built different
Not a fork. The index is stored as zero-copy bitmaps mapped straight from disk and served entirely from RAM, so it loads instantly and answers fast.
Uses RaBitQ binary quantization from recent database research for vectors up to thirty-two times smaller, with better accuracy than the usual approach and no retraining as data drifts.
Pointer-free layouts run on CPU today and accelerate on NVIDIA GPUs, with automatic fallback when no GPU is present.
The agent tool interface is part of the engine itself, a feature of the database, not a sidecar you bolt on.
Speaks the Typesense API, so existing clients and tools connect unchanged while you gain semantic search underneath.
By the numbers
Measured on real data, CPU only, in our own benchmarks.
Agent-first
Point an agent at Torque and it immediately has a self-describing search tool: discover collections, read schemas, run keyword or semantic queries with filters and facets, and read or write documents, with zero integration code. Sub-millisecond latency means an agent can fire many searches inside a single reasoning loop without stalling.
Run it yourself
Torque is also released as a standalone product by Truespar, our research arm. Run it on your own infrastructure, the getting-started guide takes it from there.
See HQ running in your own Slack or Teams, on the operating system we built for agents.