Search Results for author: Xuanlin Jiang

Found 1 paper, 0 papers with code

RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation

no code implementations • 18 Apr 2024 • Chao Jin, Zili Zhang, Xuanlin Jiang, Fangyue Liu, Xin Liu, Xuanzhe Liu, Xin Jin

We implement RAGCache and evaluate it on vLLM, a state-of-the-art LLM inference system, and Faiss, a state-of-the-art vector database.
