Seven capability pillars on one adaptive inference engine. Agents, document intelligence, vision, RAG, text analysis, speech, generation. One NuGet, zero cloud calls, full control of your data and your latency.
Most "local LLM" tools are inference engines. LM-Kit.NET is the runtime that sits on top: agents that reason and call tools, RAG with page-level citations, OCR that holds its own against commercial engines, structured extraction that emits typed C# objects, multilingual speech-to-text, image understanding, embeddings, and a growing catalog of built-in tools. Every capability ships in the same NuGet, runs on the same model graph, and respects the same adaptive sampling layer underneath.
LM-Kit.NET ships seven pillars and the local runtime they all sit on. Use the parts you need, ignore the rest.
01 · AI Agents
ReAct planning, supervisors, parallel and pipeline orchestrators, persistent memory, MCP clients, custom tools.
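To make the orchestration concrete without claiming LM-Kit's own Agent API: the sketch below hand-rolls a single custom tool over the standard Microsoft.Extensions.AI IChatClient abstraction, which the bridge section further down backs with a local LM-Kit model. LM-Kit's supervisors and orchestrators sit above exactly this kind of loop.

```csharp
using System;
using System.ComponentModel;
using System.Threading.Tasks;
using Microsoft.Extensions.AI;

class AgentToolSketch
{
    [Description("Returns the current UTC time in ISO-8601 format.")]
    static string GetUtcTime() => DateTime.UtcNow.ToString("O");

    // `backend` is any IChatClient; with the bridge described later on this
    // page, that can be a local LM-Kit-backed model.
    static async Task RunAsync(IChatClient backend)
    {
        IChatClient client = new ChatClientBuilder(backend)
            .UseFunctionInvocation() // auto-executes the tool calls the model emits
            .Build();

        var options = new ChatOptions
        {
            Tools = [AIFunctionFactory.Create(GetUtcTime)]
        };

        ChatResponse response = await client.GetResponseAsync(
            "What time is it in UTC right now?", options);

        Console.WriteLine(response.Text);
    }
}
```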
02 · Document Intelligence
PDF text and table extraction, on-device OCR reaching SOTA benchmark scores, structured field extraction with grammar-constrained generation.
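"Structured field extraction" as a contract: text in, typed C# object out. The sketch below stays on the generic IChatClient abstraction rather than LM-Kit's own extractor classes, and the Invoice record and prompt are illustrative; under grammar-constrained generation the final deserialize is guaranteed to succeed, where a plain prompt only makes it likely.

```csharp
using System;
using System.Text.Json;
using System.Threading.Tasks;
using Microsoft.Extensions.AI;

// Illustrative target shape; grammar-constrained generation guarantees the
// model's output deserializes into a type like this.
record Invoice(string InvoiceNumber, DateOnly Date, decimal Total);

class ExtractionSketch
{
    static async Task<Invoice?> ExtractAsync(IChatClient client, string documentText)
    {
        var options = new ChatOptions { ResponseFormat = ChatResponseFormat.Json };

        ChatResponse response = await client.GetResponseAsync(
            $"""
            Extract invoice_number, date (yyyy-MM-dd) and total from the text below.
            Respond with a single JSON object using exactly those keys.

            {documentText}
            """, options);

        // snake_case keys in the JSON map onto the PascalCase record properties.
        return JsonSerializer.Deserialize<Invoice>(response.Text,
            new JsonSerializerOptions { PropertyNamingPolicy = JsonNamingPolicy.SnakeCaseLower });
    }
}
```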
03 · Vision & Multimodal
Image understanding, classification, labeling, multimodal chat, image embeddings, VLM-OCR, background removal. Same conversation surface as LLMs.
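"Same conversation surface as LLMs" in practice: an image rides along inside an ordinary chat message. This sketch uses the standard Microsoft.Extensions.AI content types and assumes the backing client wraps a vision-language model.

```csharp
using System;
using System.IO;
using System.Threading.Tasks;
using Microsoft.Extensions.AI;

class VisionSketch
{
    // `client` must be backed by a vision-language model for image input to work.
    static async Task DescribeAsync(IChatClient client, string imagePath)
    {
        var message = new ChatMessage(ChatRole.User,
        [
            new TextContent("Describe this image in one sentence."),
            new DataContent(await File.ReadAllBytesAsync(imagePath), "image/png")
        ]);

        ChatResponse response = await client.GetResponseAsync([message]);
        Console.WriteLine(response.Text);
    }
}
```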
04 · RAG & Knowledge
Built-in vector store, Qdrant connector, embeddings, hybrid retrieval, document chunking, source citations.
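Retrieval reduced to its essentials: embed the chunks, embed the query, rank by cosine similarity. In production the built-in vector store, chunking, and citations replace all of this; the sketch only shows the math, over the standard IEmbeddingGenerator abstraction.

```csharp
using System;
using System.Linq;
using System.Threading.Tasks;
using Microsoft.Extensions.AI;

class RetrievalSketch
{
    static float Cosine(ReadOnlySpan<float> a, ReadOnlySpan<float> b)
    {
        float dot = 0, na = 0, nb = 0;
        for (int i = 0; i < a.Length; i++)
        {
            dot += a[i] * b[i];
            na  += a[i] * a[i];
            nb  += b[i] * b[i];
        }
        return dot / (MathF.Sqrt(na) * MathF.Sqrt(nb));
    }

    static async Task<string> NearestChunkAsync(
        IEmbeddingGenerator<string, Embedding<float>> embedder,
        string[] chunks, string query)
    {
        var chunkEmbeddings = await embedder.GenerateAsync(chunks);
        var queryEmbedding  = (await embedder.GenerateAsync(new[] { query }))[0];

        // Brute-force scan; a vector store does this with an index instead.
        return chunks
            .Select((text, i) => (text, score: Cosine(
                queryEmbedding.Vector.Span, chunkEmbeddings[i].Vector.Span)))
            .OrderByDescending(x => x.score)
            .First().text;
    }
}
```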
05 · Text Analysis
Built-in classifiers and an extractor that emits typed C# objects via grammar-constrained sampling. Sentiment, keywords, language detection.
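At its smallest, that looks like classification landing in a .NET type. The plain prompt plus Enum.Parse below can still throw on a stray token; the built-in classifiers constrain sampling so the label is always one of the allowed values.

```csharp
using System;
using System.Threading.Tasks;
using Microsoft.Extensions.AI;

enum Sentiment { Positive, Negative, Neutral }

class SentimentSketch
{
    static async Task<Sentiment> ClassifyAsync(IChatClient client, string text)
    {
        ChatResponse response = await client.GetResponseAsync(
            "Classify the sentiment of the following text as exactly one word: " +
            $"Positive, Negative or Neutral.\n\n{text}");

        // Constrained sampling makes this parse safe; a raw prompt does not.
        return Enum.Parse<Sentiment>(response.Text.Trim(), ignoreCase: true);
    }
}
```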
06 · Speech & Audio
A growing local speech-to-text stack: hallucination suppression, Voice Activity Detection, real-time translation, streaming output, 100+ languages.
07 · Text Generation
Single-turn, multi-turn, and stateless conversation primitives. Translate, correct, rewrite, summarise. Prompt templates, streaming, grammar-constrained outputs.
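Streamed multi-turn chat over the generic IChatClient surface, standing in here for LM-Kit's own conversation primitives: tokens print as they arrive, and the finished turn goes back into history for the next request.

```csharp
using System;
using System.Collections.Generic;
using System.Threading.Tasks;
using Microsoft.Extensions.AI;

class ChatSketch
{
    static async Task RunAsync(IChatClient client)
    {
        List<ChatMessage> history =
        [
            new(ChatRole.System, "You are a concise assistant."),
            new(ChatRole.User, "Summarise what a vector store does.")
        ];

        var assistantText = "";
        await foreach (ChatResponseUpdate update in
                       client.GetStreamingResponseAsync(history))
        {
            Console.Write(update.Text); // tokens arrive as they are generated
            assistantText += update.Text;
        }

        // Keep the turn so the next request sees the full conversation.
        history.Add(new(ChatRole.Assistant, assistantText));
    }
}
```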
The foundation
Every capability above runs on this runtime.
Foundation
The runtime all seven pillars sit on. The LM-Kit.NET NuGet ships the complete inference system: open-weight LLMs, vision-language models, embeddings, on-device speech-to-text, OCR, and classifiers, accelerated on CPU via AVX2 and on GPU via CUDA 12/13, Vulkan, or Metal. One package, zero cloud calls, predictable latency, full data and technology sovereignty.
Dynamic Sampling is the reason a 4B local model can match fine-tuned cloud behaviour on extraction, classification, and structured generation: an adaptive inference engine that sits underneath every LM-Kit call, steering each token with structural awareness, contextual signals, and grammar-aligned validation. Always on, model-agnostic, no retraining.
Pillar A
Constrained output
Dynamic grammar guarantees JSON, schemas, and tool-call shapes always parse. A novel hybrid path runs roughly twice as fast as classical grammar sampling.
Pillar B
Per-token contextual perplexity, semantic memory for codes and identifiers, structural rejection of malformed runs. Hallucinations drop, recoveries happen in place.
Pillar C
No architecture coupling, no fine-tuning, no per-model adapter. Drop in a new open-weight release and the layer keeps working from day one.
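What Pillar A buys at the call site, sketched with Microsoft.Extensions.AI's response-format API rather than LM-Kit's own grammar classes; the ForJsonSchema helper is that package's mechanism and is assumed here, so check its current surface before copying.

```csharp
using System;
using System.Text.Json;
using System.Threading.Tasks;
using Microsoft.Extensions.AI;

class ConstrainedOutputSketch
{
    static async Task RunAsync(IChatClient client)
    {
        // Declare the exact shape the model must emit.
        JsonElement schema = JsonSerializer.Deserialize<JsonElement>("""
        {
          "type": "object",
          "properties": {
            "city":    { "type": "string" },
            "country": { "type": "string" }
          },
          "required": ["city", "country"]
        }
        """);

        var options = new ChatOptions
        {
            ResponseFormat = ChatResponseFormat.ForJsonSchema(schema)
        };

        ChatResponse response = await client.GetResponseAsync(
            "Where is the Eiffel Tower? Answer as JSON.", options);

        // With grammar-constrained decoding underneath, this parse cannot fail;
        // without it, it merely usually succeeds.
        using JsonDocument doc = JsonDocument.Parse(response.Text);
        Console.WriteLine(doc.RootElement.GetProperty("city").GetString());
    }
}
```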
LM-Kit.NET is the answer when the alternatives don't fit. Three honest comparisons, no straw men.
Compare
No per-token bill. No data leaving your network. Latency you can predict. Inference cost equals the cost of compute you already own. Works offline by design.
Local vs Cloud, in depth
Compare
No FastAPI sidecar, no HTTP shim, no two-runtime tax. LM-Kit links into your .NET process, picks up the right native acceleration, and stays out of the way. Async/await all the way down.
LM-Kit vs LangChain
Compare
Most ship inference only. LM-Kit ships the full runtime: agents, RAG, OCR, structured extraction, speech, vision, classifiers, embeddings, plus the symbolic layer that makes small models behave.
LM-Kit vs LlamaSharp
Existing IChatClient pipelines work unchanged. LM-Kit becomes the local backend behind code you already wrote.
Bridge
Stream tokens, call functions, embed text. Every IChatClient, IEmbeddingGenerator, and middleware-aware abstraction you wrote against the official package keeps working with a local model behind it.
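Concretely: nothing downstream mentions LM-Kit. Constructing the adapter is the only LM-Kit-specific line, and it is deliberately left out of this sketch; everything below is the official abstraction.

```csharp
using System;
using System.Threading.Tasks;
using Microsoft.Extensions.AI;

class BridgeSketch
{
    // `localClient` is assumed to be the LM-Kit-backed IChatClient;
    // the concrete adapter type comes from the bridge documentation.
    static async Task RunAsync(IChatClient localClient)
    {
        ChatResponse response = await localClient.GetResponseAsync(
            "One sentence on why local inference helps with data sovereignty.");

        Console.WriteLine(response.Text);
    }
}
```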
Bridge
Use LM-Kit.NET as a Semantic Kernel connector. Plug local chat completion, embeddings, and function-calling into existing SK plans, planners, and skills without rewriting the orchestration layer.
Open the Semantic Kernel bridge
No sidecar service, no special runtime. LM-Kit links into your application, picks up the right native acceleration for the host, and gets out of the way.
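The registration pattern, in standard Semantic Kernel terms; the IChatCompletionService instance is assumed to come from the LM-Kit connector, whose concrete type name is omitted here.

```csharp
using System;
using System.Threading.Tasks;
using Microsoft.Extensions.DependencyInjection;
using Microsoft.SemanticKernel;
using Microsoft.SemanticKernel.ChatCompletion;

class SemanticKernelSketch
{
    // `lmkitChat` is assumed to be supplied by the LM-Kit SK connector.
    static async Task RunAsync(IChatCompletionService lmkitChat)
    {
        var builder = Kernel.CreateBuilder();
        builder.Services.AddSingleton(lmkitChat);
        Kernel kernel = builder.Build();

        // Existing SK orchestration keeps working; only the backend moved on-device.
        FunctionResult result = await kernel.InvokePromptAsync(
            "List three risks of sending customer documents to a third-party API.");

        Console.WriteLine(result);
    }
}
```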
Run the full SDK on your own hardware at no cost. Buy a commercial license when LM-Kit is part of a product you sell to customers.
Free · forever
Full SDK access for any company or individual. Build and deploy non-commercial applications, or evaluate LM-Kit before shipping.
Custom · per project
For products that ship LM-Kit to customers. Pricing is scaled to deployment size and value. Includes dedicated support and roadmap input.
Get started