Free download

Get LM-Kit.NET up and running in minutes.

Install via NuGet, pick a model, and build your first AI-powered .NET application. Full feature access, no time limits, no registration required, zero cloud dependency.

Free trial: no signup Cross-platform: Windows · macOS · Linux Acceleration: CPU · CUDA · Vulkan · Metal
Quick start

From zero to AI in three steps.

Create a new project, install the NuGet package, and follow the getting-started guide to build your first AI application.

01 · Project

Create a .NET project

Start from a console app or add LM-Kit.NET to any existing .NET solution.

dotnet new console -n MyAIApp

02 · Install

Install the NuGet package

Single dependency, in-process. Includes optimised CPU inference out of the box.

dotnet add package LM-Kit.NET

03 · Build

Build your first AI app

Initialise the runtime, load a model, generate a response. The getting-started guide walks every step.

Your first AI agent
Full getting-started guide Sample walkthroughs Browse on GitHub
Install

One command. NuGet does the rest.

LM-Kit.NET ships as a single NuGet package with optimised CPU inference (AVX/AVX2) included. Add an optional backend package for GPU acceleration when you need it.

CUDA 12 · Win

NVIDIA GPU on Windows

dotnet add package LM-Kit.NET.Backend.Cuda12.Windows

CUDA 12 · Linux

NVIDIA GPU on Linux

dotnet add package LM-Kit.NET.Backend.Cuda12.Linux

Vulkan & Metal

Cross-vendor GPU & Apple Silicon

Both backends are bundled in the main package. No extra install required.

Models

Choose the right model for your use case.

The catalog is continuously updated with releases from Google, Meta, Alibaba, Mistral, Microsoft and IBM. Models auto-download on first use.

Chat & code

General conversation and coding

Gemma, Qwen, Llama, Phi, Mistral, Granite, Falcon, SmolLM. Sub-1B to 30B+.

Reasoning & agents

Multi-step reasoning, tool use

QwQ, DeepSeek R1, GPT-OSS, GLM Flash, Nemotron, Magistral.

Vision & multimodal

VLMs and OCR

Qwen-VL, MiniCPM, Ministral, Pixtral, Devstral, plus dedicated OCR models.

OCR

Document text extraction

PaddleOCR, LightOnOCR. Structured text from images and scanned documents.

Embeddings

Vector search and reranking

Qwen Embed, EmbeddingGemma, BGE, Nomic. Text and image embeddings, plus rerankers.

Speech

High-accuracy audio transcription

A growing local speech-to-text catalog. Multiple model sizes for different speed and accuracy trade-offs.

Browse the full catalog Choosing the right model
Capabilities

What you can build.

A single SDK covering the full surface of AI features. Everything runs 100% on-device, zero cloud dependency. Explore all use cases →

Microsoft .NET AI ecosystem

Drop-in local provider for Semantic Kernel and Microsoft.Extensions.AI

Official bridge packages let LM-Kit.NET act as the local model provider for any existing Semantic Kernel or Microsoft.Extensions.AI application.

Semantic Kernel bridge Extensions.AI bridge
Compatibility

Works everywhere you build .NET.

Targets .NET Standard 2.0 for maximum compatibility. Develop and deploy on the platform of your choice.

Operating systems
Windows x64 · macOS (Intel & Apple Silicon) · Linux x64 / ARM64
GPU acceleration
CUDA 12 / 13 · Vulkan · Metal · CPU AVX/AVX2 fallback
.NET frameworks
.NET Standard 2.0 · .NET 8 / 9 / 10
FAQ

Frequently asked questions.

Do I need a GPU to use LM-Kit.NET?

No. LM-Kit.NET includes optimised CPU inference (AVX/AVX2) out of the box, fine for smaller models and lower-throughput scenarios. For production workloads or larger models, GPU acceleration via CUDA, Vulkan or Metal significantly improves performance. See the GPU setup guide.

Is the Community Edition really free?

Yes. The Community Edition is free for developers, startups and open-source projects with no time limit, no feature lock and no registration required. For enterprise deployments, commercial licences are available on the pricing page.

What models are supported?

A continuously-updated catalog of open-weight models from Google, Meta, Alibaba, Mistral, Microsoft and IBM. Chat, reasoning, vision, OCR, embedding and speech models. Models auto-download on first use. Browse the full catalog.

Does it work offline or in air-gapped environments?

Yes. Once a model is downloaded, all inference runs 100% on-device with zero cloud dependency. Pre-download models and deploy in fully air-gapped environments. Ideal for regulated industries, healthcare, defence, and any scenario requiring complete data sovereignty.

How does local inference compare to cloud APIs?

Local inference eliminates per-token API costs, avoids rate limits, and keeps all data on your infrastructure. See the cost and performance comparison.

Can I integrate with Microsoft Semantic Kernel or Extensions.AI?

Yes. Official bridge packages exist for both Semantic Kernel and Microsoft.Extensions.AI.

Ready to deploy

Get the free Community license.

The Community Edition gives developers, startups and open-source projects full, unrestricted access to LM-Kit.NET at no cost. No time limits, no feature locks.

Get free license View pricing Talk to us