Docs/RAG with Weaviate
Vector Search

RAG with Weaviate

Build retrieval-augmented generation pipelines using Weaviate's vector database. Store embeddings, run hybrid search, and feed grounded context into your LLM — all with Meridian's managed infrastructure.

Overview

Weaviate is an open-source vector database that powers semantic search, hybrid (keyword + vector) retrieval, and generative feedback loops. Meridian provisions a dedicated Weaviate cluster inside your VPC, pre-configured with TLS, backup snapshots, and a RESTful GraphQL API.

Embedding dims
384–4096
Index type
HNSW + flat
Consistency
Quorum / One

Quickstart

Once your cluster is provisioned, create a collection and start indexing.

Python SDK
import weaviate

client = weaviate.Client(
    url="https://your-cluster.meridian.dev",
    auth_client_secret=weaviate.AuthApiKey("sk-..."),
)

client.schema.create_class({
    "class": "Document",
    "vectorizer": "none",
    "properties": [
        {"name": "text", "dataType": ["text"]},
        {"name": "source", "dataType": ["string"]},
    ],
})

Hybrid Search

Combine BM25 keyword scoring with cosine-similarity vector search. Weaviate fuses results using a configurable alpha parameter — set alpha=0.5 for equal weighting.

GraphQL
{
  Get {
    Document(
      hybrid: {
        query: "transformer attention mechanism"
        alpha: 0.5
      }
      limit: 5
    ) {
      text
      source
      _additional { score }
    }
  }
}

End-to-End RAG Pipeline

1

Embed

Generate vectors with your model of choice — OpenAI Ada, Cohere Embed, or a self-hosted sentence-transformer.

2

Index

Batch-insert documents into Weaviate with metadata. Meridian handles sharding and replication.

3

Retrieve

Query with hybrid search. Weaviate returns the top-k chunks ranked by relevance.

4

Generate

Feed retrieved context into your LLM prompt. Grounded answers, no hallucination.

Limits & Pricing

TierVectorsQPSPrice
Starter1M10$49/mo
Pro10M100$299/mo
EnterpriseUnlimited1,000+Custom

Ready to ship RAG?

Provision a Weaviate cluster in under 90 seconds. No YAML, no Kubernetes — just your schema and your embeddings.

Launch Cluster