Comprehensive LLM inference Tools in One Place | Creati.ai

Sponsored by Qoder - Qoder is an agentic coding platform for real software, Free to use the best model in preview.

Qoder - Qoder is an agentic coding platform for real software, Free to use the best model in preview.



LLM inference

rag-services
rag-services is an open-source microservices framework enabling scalable retrieval-augmented generation pipelines with vector storage, LLM inference, and orchestration.

0


0
Visit AI
What is rag-services?
rag-services is an extensible platform that breaks down RAG pipelines into discrete microservices. It offers a document store service, a vector index service, an embedder service, multiple LLM inference services, and an orchestrator service to coordinate workflows. Each component exposes REST APIs, allowing you to mix and match databases and model providers. With Docker and Docker Compose support, you can deploy locally or in Kubernetes clusters. The framework enables scalable, fault-tolerant RAG solutions for chatbots, knowledge bases, and automated document Q&A.
rag-services Core Features

Document storage service

Vector indexing and search

Embedding generation

Multiple LLM inference endpoints

Workflow orchestration API



Featured