传送门
https://github.com/infiniflow/infinity
The AI-native database built for LLM applications, providing incredibly fast vector and full-text search
Roadmap 2024 | Twitter | Discord | YouTube |
Infinity is a cutting-edge AI-native database that provides a wide range of search capabilities for rich data types such as vectors, full-text, and structured data. It provides robust support for various LLM applications, including search, recommenders, question-answering, conversational AI, copilot, content generation, and many more RAG (Retrieval-augmented Generation) applications.
🌟 Key Features
Infinity comes with high performance, flexibility, ease-of-use, and many features designed to address the challenges facing the next-generation AI applications:
⚡️ Incredibly fast
- Achieves 0.1 milliseconds query latency on million-scale vector datasets.
- Up to 10K QPS on million-scale vector datasets.
See the Benchmark report for more information.
🔮 Fused search
Supports a fused search of multiple embeddings and full text, in addition to filtering.
🍔 Rich data types
Supports a wide range of data types including strings, numerics, vectors, and more.
🎁 Ease-of-use
- Intuitive Python API. See the Python API
- A single-binary architecture with no dependencies, making deployment a breeze.
