Drop-in local provider for Semantic Kernel and Microsoft.Extensions.AI
Official bridge packages let LM-Kit.NET act as the local model provider for any existing Semantic Kernel or Microsoft.Extensions.AI application.
Install via NuGet, pick a model, and build your first AI-powered .NET application. Full feature access, no time limits, no registration required, zero cloud dependency.
Create a new project, install the NuGet package, and follow the getting-started guide to build your first AI application.
01 · Project
Start from a console app or add LM-Kit.NET to any existing .NET solution.
dotnet new console -n MyAIApp
02 · Install
Single dependency, in-process. Includes optimised CPU inference out of the box.
dotnet add package LM-Kit.NET
03 · Build
Initialise the runtime, load a model, generate a response. The getting-started guide walks every step.
Your first AI agentLM-Kit.NET ships as a single NuGet package with optimised CPU inference (AVX/AVX2) included. Add an optional backend package for GPU acceleration when you need it.
.NET CLI
From any terminal, in any project directory. Equivalent in the Package Manager Console: Install-Package LM-Kit.NET
CUDA 12 · Win
dotnet add package LM-Kit.NET.Backend.Cuda12.Windows
CUDA 12 · Linux
dotnet add package LM-Kit.NET.Backend.Cuda12.Linux
Vulkan & Metal
Both backends are bundled in the main package. No extra install required.
The catalog is continuously updated with releases from Google, Meta, Alibaba, Mistral, Microsoft and IBM. Models auto-download on first use.
Chat & code
Gemma, Qwen, Llama, Phi, Mistral, Granite, Falcon, SmolLM. Sub-1B to 30B+.
Reasoning & agents
QwQ, DeepSeek R1, GPT-OSS, GLM Flash, Nemotron, Magistral.
Vision & multimodal
Qwen-VL, MiniCPM, Ministral, Pixtral, Devstral, plus dedicated OCR models.
OCR
PaddleOCR, LightOnOCR. Structured text from images and scanned documents.
Embeddings
Qwen Embed, EmbeddingGemma, BGE, Nomic. Text and image embeddings, plus rerankers.
Speech
A growing local speech-to-text catalog. Multiple model sizes for different speed and accuracy trade-offs.
A single SDK covering the full surface of AI features. Everything runs 100% on-device, zero cloud dependency. Explore all use cases →
Agents
Multi-agent workflows with persistent memory, reasoning, function calling, MCP integration. Pipeline, parallel, supervisor, router patterns.
AgentsDocuments
Chat with PDFs, extract structured data, convert documents, perform OCR on images.
Document intelligenceRAG
Built-in vector database, semantic search, intelligent splitting, citation tracking.
RAG & knowledgeVision & speech
Image understanding with vision-language models. High-accuracy on-device speech-to-text.
Speech to textText Analysis
Sentiment, classification, translation, summarisation, named-entity recognition, PII extraction.
Text AnalysisText generation
Generation, rewriting, grammar correction, JSON-schema-constrained structured output.
Text generationChatbots
Multi-turn agents with persistent memory, streaming responses, agent skills, human escalation.
ChatbotsMCP & tools
Connect agents to external services via Model Context Protocol. 56 built-in tools for file I/O, HTTP, web search.
MCP & toolsOfficial bridge packages let LM-Kit.NET act as the local model provider for any existing Semantic Kernel or Microsoft.Extensions.AI application.
Targets .NET Standard 2.0 for maximum compatibility. Develop and deploy on the platform of your choice.
No. LM-Kit.NET includes optimised CPU inference (AVX/AVX2) out of the box, fine for smaller models and lower-throughput scenarios. For production workloads or larger models, GPU acceleration via CUDA, Vulkan or Metal significantly improves performance. See the GPU setup guide.
Yes. The Community Edition is free for developers, startups and open-source projects with no time limit, no feature lock and no registration required. For enterprise deployments, commercial licences are available on the pricing page.
A continuously-updated catalog of open-weight models from Google, Meta, Alibaba, Mistral, Microsoft and IBM. Chat, reasoning, vision, OCR, embedding and speech models. Models auto-download on first use. Browse the full catalog.
Yes. Once a model is downloaded, all inference runs 100% on-device with zero cloud dependency. Pre-download models and deploy in fully air-gapped environments. Ideal for regulated industries, healthcare, defence, and any scenario requiring complete data sovereignty.
Local inference eliminates per-token API costs, avoids rate limits, and keeps all data on your infrastructure. See the cost and performance comparison.
Yes. Official bridge packages exist for both Semantic Kernel and Microsoft.Extensions.AI.
The Community Edition gives developers, startups and open-source projects full, unrestricted access to LM-Kit.NET at no cost. No time limits, no feature locks.