In-memory store
Fast prototyping, zero setup, instant feedback.
Store and retrieve embeddings at any scale. Four pluggable storage backends with a unified API. From in-memory prototyping to production-ready deployments with Qdrant or custom backends. Swap storage strategies without rewriting code. 100% local processing.
File-based, handles millions of vectors locally.
FileSystemVectorStore: directory-based IVectorStore with caching.
HNSW indexing, distributed, cloud-ready.
LM-Kit provides a unified embedding storage architecture that scales from quick
prototypes to production deployments. At its core is the DataSource
abstraction, which manages embeddings, metadata, and retrieval through a
consistent API regardless of where your vectors are stored.
Start with in-memory storage for rapid iteration, graduate to the built-in file-based vector database for local applications, or connect to Qdrant for distributed workloads. The same code works across all backends with zero modifications.
Think of it as SQLite for vectors: a self-contained, file-based engine that handles millions of embeddings without external infrastructure, while remaining fully compatible with cloud-scale solutions when you need them.
DataSource hierarchy
Choose the storage that fits your application's lifecycle. Switch between them seamlessly without rewriting code.
In-memory
DataSource.CreateInMemoryDataSource()
Embeddings are computed and stored in RAM, with optional serialization to disk via the Serialize() method. Zero setup required. Ideal for fast prototyping, testing, and live classification tasks.
Serialize() and Deserialize() for reusability
Recommended
DataSource.CreateFileDataSource()
Self-contained, file-based engine optimized for embedding workloads. Think of it as SQLite for dense vectors. No server, no configuration, just a file path.
New
FileSystemVectorStore
new FileSystemVectorStore(path)
An IVectorStore implementation that persists collections as individual files on disk. Each collection is stored as a separate .ds file, with in-memory caching for performance.
IVectorStore interface
Production
QdrantEmbeddingStore + DataSource.LoadFromStore()
High-performance, open-source vector database with HNSW indexing. Ideal for production workloads requiring distributed access and advanced filtering.
Custom
IVectorStore
Implement the IVectorStore interface
Full control over vector storage logic. Integrate with proprietary databases, internal APIs, or hybrid storage systems using the standardized contract.
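A custom backend can be sketched as below. Note the hedges: only the interface name and the members CollectionExistsAsync() and CreateCollectionAsync() appear on this page; all signatures, and the remaining members of the contract, are assumptions to be checked against the API reference.

```csharp
using System.Collections.Concurrent;
using System.Threading.Tasks;
using LMKit.Data.Storage;

// Hypothetical skeleton of a custom backend. Only IVectorStore,
// CollectionExistsAsync and CreateCollectionAsync are named on this page;
// the parameter shapes and the remaining members are assumptions.
public sealed class MyCompanyVectorStore : IVectorStore
{
    // Stand-in for a proprietary database or internal API.
    private readonly ConcurrentDictionary<string, object> _collections = new();

    public Task<bool> CollectionExistsAsync(string collectionName)
        => Task.FromResult(_collections.ContainsKey(collectionName));

    public Task CreateCollectionAsync(string collectionName)
    {
        // Map collection creation onto your storage system.
        _collections.TryAdd(collectionName, new object());
        return Task.CompletedTask;
    }

    // ... implement the remaining IVectorStore members (upserts, similarity
    // search, deletions, metadata) against your storage system.
}
```

Once the contract is implemented, the store plugs into the same pattern shown for Qdrant below: DataSource.CreateVectorStoreDataSourceAsync() to create a collection and DataSource.LoadFromStore() to reopen it.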
Everything you need to build production-grade embedding storage and retrieval.
Hierarchy
Organize embeddings into sections and partitions with optional metadata at each level. Manage multi-modal inputs within a single collection.
Metadata
Attach metadata to sections and partitions for filtering, tagging, and advanced retrieval scenarios across any vector backend.
Portability
Serialize DataSource instances to disk and reload anywhere. Enable checkpointing, debugging, and deployment without external services.
Updates
Efficient insertions, deletions, and metadata edits without rebuilding the entire dataset. Works with both built-in and external stores.
Search
SearchSimilar returns ranked results by vector similarity. Configure top-K, minimum scores, and metadata filters for precise retrieval.
Privacy
Local-only and on-prem options keep data secure and compliant. No external dependencies required for complete vector management.
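The Search feature above can be sketched as follows. SearchSimilar and the PartitionSimilarity members (SectionIdentifier, Similarity) are named on this page, but where SearchSimilar lives and its parameter names (topK, minScore) are assumptions; treat this as a hedged sketch, not the definitive signature.

```csharp
using System;
using LMKit.Data;
using LMKit.Model;

// Load the embedding model and reopen a previously built DataSource.
var model = LM.LoadFromModelID("embeddinggemma-300m");
var dataSource = DataSource.LoadFromFile("Ebooks.dat", readOnly: true);

// Rank stored partitions against the query; topK/minScore names are assumed.
var results = dataSource.SearchSimilar("star-crossed lovers", model,
                                       topK: 5, minScore: 0.5f);

// Each hit carries its section identifier and similarity score.
foreach (var hit in results)
    Console.WriteLine($"{hit.SectionIdentifier}: {hit.Similarity:F3}");
```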
Same API, different backends. Switch storage strategies without rewriting your application logic.
using LMKit.Model;
using LMKit.Data;
using LMKit.Retrieval;

// Load embedding model
var embedModel = LM.LoadFromModelID("embeddinggemma-300m");

// Create in-memory DataSource
var dataSource = DataSource.CreateInMemoryDataSource("my-collection", embedModel);

// Use RagEngine to import content
var ragEngine = new RagEngine(embedModel);
ragEngine.AddDataSource(dataSource);

// Import text with automatic chunking
ragEngine.ImportText(
    "Your document content here...",
    new TextChunking() { MaxChunkSize = 500 },
    "my-collection",
    "document-section");

// Optional: Serialize to disk for later reuse
dataSource.Serialize("./cache/my-collection.bin");

// Later: Deserialize from disk
var restored = DataSource.Deserialize("./cache/my-collection.bin", embedModel);
using LMKit.Model;
using LMKit.Data;
using LMKit.Retrieval;

// Load embedding model
var embedModel = LM.LoadFromModelID("embeddinggemma-300m");

const string DATA_SOURCE_PATH = "Ebooks.dat";
const string COLLECTION_NAME = "Ebooks";

DataSource dataSource;
if (File.Exists(DATA_SOURCE_PATH))
{
    // Load existing file-based DataSource
    dataSource = DataSource.LoadFromFile(DATA_SOURCE_PATH, readOnly: false);
}
else
{
    // Create new file-based DataSource
    dataSource = DataSource.CreateFileDataSource(DATA_SOURCE_PATH, COLLECTION_NAME, embedModel);
}

// Use RagEngine to import and query
var ragEngine = new RagEngine(embedModel);
ragEngine.AddDataSource(dataSource);

// Check if section already exists
if (!dataSource.HasSection("Romeo and Juliet"))
{
    string content = File.ReadAllText("romeo_and_juliet.txt");
    ragEngine.ImportText(content,
        new TextChunking() { MaxChunkSize = 500 },
        COLLECTION_NAME,
        "Romeo and Juliet");
}
using LMKit.Model;
using LMKit.Data;
using LMKit.Data.Storage.Qdrant;
using LMKit.Retrieval;

// Load embedding model
var embedModel = LM.LoadFromModelID("embeddinggemma-300m");

// Connect to Qdrant (docker run -p 6333:6333 -p 6334:6334 qdrant/qdrant)
var store = new QdrantEmbeddingStore(new Uri("http://localhost:6334"));

const string COLLECTION = "Ebooks";

DataSource dataSource;
if (await store.CollectionExistsAsync(COLLECTION))
{
    // Load existing collection from Qdrant
    dataSource = DataSource.LoadFromStore(store, COLLECTION);
}
else
{
    // Create new collection in Qdrant
    dataSource = await DataSource.CreateVectorStoreDataSourceAsync(store, COLLECTION, embedModel);
}

// Use RagEngine with Qdrant-backed DataSource
var ragEngine = new RagEngine(embedModel, vectorStore: store);
ragEngine.AddDataSource(dataSource);

// Import content (automatically stored in Qdrant)
string content = await new HttpClient().GetStringAsync(
    "https://gutenberg.org/cache/epub/1513/pg1513.txt");
ragEngine.ImportText(content,
    new TextChunking() { MaxChunkSize = 500 },
    COLLECTION,
    "Romeo and Juliet");
using LMKit.Model;
using LMKit.Data;
using LMKit.Data.Storage;
using LMKit.Retrieval;

// Load embedding model
var embedModel = LM.LoadFromModelID("embeddinggemma-300m");

// Create FileSystemVectorStore (NEW in 2026.1.1)
// Each collection is stored as a separate .ds file in the directory
var fsStore = new FileSystemVectorStore("./vector-collections");

const string COLLECTION = "Ebooks";

DataSource dataSource;
if (await fsStore.CollectionExistsAsync(COLLECTION))
{
    // Load existing collection (auto-cached in memory)
    dataSource = DataSource.LoadFromStore(fsStore, COLLECTION);
}
else
{
    // Create new collection
    dataSource = await DataSource.CreateVectorStoreDataSourceAsync(fsStore, COLLECTION, embedModel);
}

// FileSystemVectorStore implements the IVectorStore interface,
// so it works with RagEngine just like Qdrant
var ragEngine = new RagEngine(embedModel);
ragEngine.AddDataSource(dataSource);

// Directory structure: ./vector-collections/Ebooks.ds
Console.WriteLine($"Store path: {fsStore.DirectoryPath}");
From desktop tools to enterprise RAG systems, LM-Kit's vector storage adapts to your needs.
Search
Build intelligent search that understands meaning, not just keywords. Index documents, products, or knowledge bases for natural language queries.
Chatbot
Ground LLM responses with relevant context from your corpus. Use RagEngine with FindMatchingPartitions() and QueryPartitions() for accurate answers.
Memory
Give AI agents persistent memory with AgentMemory class. Store facts via SaveInformationAsync() and recall them automatically in conversations.
Documents
Index and retrieve from large document collections with DocumentRag. Support legal discovery, research assistants, and enterprise knowledge management.
Recommend
Find similar items, content, or users based on embedding similarity. Power product recommendations, content discovery, and personalization.
Offline
Ship portable AI modules with embedded vectors using FileSystemVectorStore. Support air-gapped environments and compliance-sensitive scenarios.
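The agent-memory use case can be sketched as below. AgentMemory, SaveInformationAsync(), and the MultiTurnConversation Memory property are named on this page; the constructor arguments, parameter names, and the chat-model ID are assumptions for illustration only.

```csharp
using LMKit.Model;

// Load an embedding model for the memory and a chat model for the agent.
// "your-chat-model-id" is a hypothetical placeholder, not a real model ID.
var embedModel = LM.LoadFromModelID("embeddinggemma-300m");
var chatModel = LM.LoadFromModelID("your-chat-model-id");

// Constructor arguments are assumed; check the API reference.
var memory = new AgentMemory(embedModel);

// Store a fact; it is embedded and indexed for semantic recall.
await memory.SaveInformationAsync("preferences",
    "The user prefers concise answers.");

// Attach the memory so stored facts are recalled automatically in conversation.
var chat = new MultiTurnConversation(chatModel) { Memory = memory };
```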
Core components for building vector storage solutions.
DataSource: Central container for embedding storage. Manages sections, partitions, and metadata. Create with CreateFileDataSource(), CreateInMemoryDataSource(), or LoadFromFile().
RagEngine: Orchestrates RAG workflows. Import text with automatic chunking via ImportText(). Query with FindMatchingPartitions() and QueryPartitions().
IVectorStore: Interface for custom vector storage backends. Implement it for proprietary databases. Methods include CollectionExistsAsync() and CreateCollectionAsync().
FileSystemVectorStore: File-system-based IVectorStore implementation. Persists collections as .ds files in a directory, with automatic caching.
QdrantEmbeddingStore: Qdrant connector implementing IVectorStore. Bridges LM-Kit.NET with Qdrant's high-performance vector database via gRPC.
PartitionSimilarity: Result of a similarity search. Contains the SectionIdentifier, Similarity score, Metadata, and partition content for retrieval workflows.
AgentMemory: Semantic memory for AI agents. SaveInformationAsync() stores facts with embeddings. Integrates with MultiTurnConversation via the Memory property.
TextChunking: Configures text splitting for embeddings. Set MaxChunkSize to control partition size. Used with RagEngine.ImportText() for automatic chunking.
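The RagEngine members listed above combine into a retrieve-then-answer flow, sketched below. The method names come from this page, but the exact signatures of FindMatchingPartitions() and QueryPartitions() are assumptions, and "your-chat-model-id" is a hypothetical placeholder.

```csharp
using System;
using LMKit.Data;
using LMKit.Model;
using LMKit.Retrieval;

// Reopen a previously built file-based DataSource.
var embedModel = LM.LoadFromModelID("embeddinggemma-300m");
var dataSource = DataSource.LoadFromFile("Ebooks.dat", readOnly: true);

var ragEngine = new RagEngine(embedModel);
ragEngine.AddDataSource(dataSource);

// Stage 1: retrieve the partitions most similar to the question
// (the topK parameter name is assumed).
var partitions = ragEngine.FindMatchingPartitions("Who does Juliet love?", topK: 3);

// Stage 2: answer the question grounded on those partitions using a chat model.
var chatModel = LM.LoadFromModelID("your-chat-model-id"); // hypothetical model ID
string answer = ragEngine.QueryPartitions("Who does Juliet love?", partitions, chatModel);
Console.WriteLine(answer);
```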
The right storage strategy is critical to performance, scalability, and developer productivity.
01
Same code works across all storage types. Just change the backend configuration.
02
Local-only and on-prem solutions keep data secure and compliant.
03
From desktop experiments to high-scale RAG systems with millions of vectors.
04
Clean APIs, comprehensive documentation, and consistent patterns across all backends.
Working console demos on GitHub, step-by-step how-to guides on the docs site, and the API reference for the classes used on this page.
Console demo: built-in vector store driving a full RAG pipeline.
Open on GitHub →
How-to guide: Index, embed, query, and rerank with built-in or external stores.
Read the guide →
API reference: The vector-store contract (filesystem, in-memory, Qdrant, custom).
Open the reference →
From in-memory experiments to durable local databases and scalable remote setups, LM-Kit makes switching storage backends effortless.