Lists (6)
Sort Name ascending (A-Z)
Stars
📚 Awesome papers and technical blogs on vector DB (database), semantic-based vector search or approximate nearest neighbor search (ANN Search, ANNS). Vector search is the key component of large-sca…
Scalable and Efficient Serverless Deployment for Large AI Models.
Tips for Writing a Research Paper using LaTeX
Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍
Serving LLMs on heterogeneous decentralized clusters.
Distributed vector search for AI-native applications
Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, …
STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases (https://stark.stanford.edu/)
A curated list of awesome works related to high dimensional structure/vector search & database
Rcmp: Reconstructing RDMA-based Memory Disaggregation via CXL
GPU-accelerated vector query processing system that supports large vector datasets beyond GPU memory.
Code for "Baleen: ML Admission & Prefetching for Flash Caches" (FAST 2024).
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
Puck is a high-performance ANN search engine
Large Language Model (LLM) Systems Paper List
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.
Build userspace NVMe drivers and storage applications with CUDA support
A library of algorithms for approximate nearest neighbor search in high dimensions, along with a set of useful tools for designing such algorithms.