RAG with Weaviate

Build retrieval-augmented generation pipelines using Weaviate's vector database. Store embeddings, run hybrid search, and feed grounded context into your LLM — all with Meridian's managed infrastructure.

Overview

Weaviate is an open-source vector database that powers semantic search, hybrid (keyword + vector) retrieval, and generative feedback loops. Meridian provisions a dedicated Weaviate cluster inside your VPC, pre-configured with TLS, backup snapshots, and a RESTful GraphQL API.

Embedding dims

384–4096

Index type

HNSW + flat

Consistency

Quorum / One

Quickstart

Once your cluster is provisioned, create a collection and start indexing.

Python SDK

import weaviate

client = weaviate.Client(
    url="https://your-cluster.meridian.dev",
    auth_client_secret=weaviate.AuthApiKey("sk-..."),
)

client.schema.create_class({
    "class": "Document",
    "vectorizer": "none",
    "properties": [
        {"name": "text", "dataType": ["text"]},
        {"name": "source", "dataType": ["string"]},
    ],
})

Hybrid Search

Combine BM25 keyword scoring with cosine-similarity vector search. Weaviate fuses results using a configurable alpha parameter — set alpha=0.5 for equal weighting.

GraphQL

{
  Get {
    Document(
      hybrid: {
        query: "transformer attention mechanism"
        alpha: 0.5
      }
      limit: 5
    ) {
      text
      source
      _additional { score }
    }
  }
}

End-to-End RAG Pipeline

Embed

Generate vectors with your model of choice — OpenAI Ada, Cohere Embed, or a self-hosted sentence-transformer.

Index

Batch-insert documents into Weaviate with metadata. Meridian handles sharding and replication.

Retrieve

Query with hybrid search. Weaviate returns the top-k chunks ranked by relevance.

Generate

Feed retrieved context into your LLM prompt. Grounded answers, no hallucination.

Limits & Pricing

Tier	Vectors	QPS	Price
Starter	1M	10	$49/mo
Pro	10M	100	$299/mo
Enterprise	Unlimited	1,000+	Custom

Ready to ship RAG?

Provision a Weaviate cluster in under 90 seconds. No YAML, no Kubernetes — just your schema and your embeddings.

Launch Cluster