Solutions · Document Intelligence

The complete local document intelligence platform.

Turn any document into structured, searchable, actionable data. Chat with PDFs, extract fields from invoices, split multi-document scans, and build RAG pipelines. All powered by on-device AI with adaptive processing that combines OCR, Vision Language Models, and layout analysis. Zero cloud dependency.

100% on-device Adaptive processing .NET SDK HIPAA / GDPR ready
Pipeline
  1. Ingest any document
    PDF, DOCX, XLSX, PPTX, HTML, Markdown, images
  2. Adaptive analysis
    Auto-selects OCR, VLM, or text extraction per page
  3. Structured output
    Chat answers, extracted fields, document segments, RAG results
0
Cloud calls
7+
Formats
100%
Traceable
3
AI engines
Process & convert

The complete document toolkit.

Industry-grade OCR, universal Markdown conversion, format-to-format converters, full PDF manipulation, email archive parsing. The infrastructure beneath every document workflow, exposed as first-class .NET APIs and as agent tools.

Adaptive processing

Three AI engines, one API.

Every page in every document is different. A digital PDF has clean text layers. A scanned invoice needs OCR. A complex form with tables and columns needs visual understanding. LM-Kit.NET's adaptive engine analyzes each page individually and selects the optimal extraction strategy automatically.

This content-aware approach means you never have to classify documents upfront or write format-specific code. One API call handles a 500-page batch containing digital contracts, scanned receipts, and image-heavy reports.

Built by IDP pioneers: This isn't a wrapper around generic RAG. It's purpose-built document intelligence from a team with 20+ years of experience processing billions of documents in production worldwide.

Recommended · Auto

PageProcessingMode.Auto

Analyzes each page and automatically selects the best strategy. Uses VLM for image-heavy pages, direct text extraction for digital content, OCR for scanned text. Zero configuration required.

Mode

TextExtraction

Extracts text directly from PDF structure with OCR fallback for scanned pages. Fastest processing, lowest resource usage. Ideal for clean digital documents.

Mode

DocumentUnderstanding

Vision Language Models analyze pages visually to understand layout, structure, tables, and relationships. Outputs structured markdown. Best for complex layouts, forms, and multi-column content.

Quickstart

Up and running in minutes.

LM-Kit.NET is a single NuGet package. No microservices, no Docker, no API keys. Load a model, point at a document, and start extracting intelligence.

  1. Install the NuGet package and load your preferred AI models (chat, embedding, vision)
  2. Create a PdfChat instance and feed it any document: PDF, Word, Excel, images, HTML
  3. Ask questions in natural language and get grounded answers with source attribution

The same models power all four pillars. Switch from document Q&A to data extraction to document splitting by changing one class.

Platform capabilities

Enterprise-grade document processing.

Built for production workloads that demand accuracy, traceability, and compliance.

Capability

Layout analysis engine

Deep document structure understanding: columns, paragraphs, lines, text regions, reading order. Purpose-built algorithms for real-world document layouts.

Capability

Source attribution

Every answer and extracted value is traced to its source document, page number, and passage. Full audit trail for compliance and verification.

Capability

Intelligent caching

Processed document embeddings are cached via IVectorStore. Subsequent loads are instant. Supports filesystem, Qdrant, and custom backends.

Capability

Vision Language Models

VlmOcr uses multimodal AI to transcribe pages as structured markdown. Understands tables, forms, multi-column layouts, and handwritten notes visually.

Capability

100% on-device

All processing runs on your infrastructure. Documents never leave. Air-gapped deployments, HIPAA, GDPR, and SOC 2 compliance ready out of the box.

Capability

Neuro-symbolic validation

Dynamic Sampling combined with symbolic validation layers eliminates LLM hallucinations. Confidence scores on every extraction for production-grade reliability.

File format support

Process any document format.

Native support for the most common document types in enterprise workflows.

.pdf

PDF documents

.docx

Word documents

.xlsx

Excel spreadsheets

.pptx

PowerPoint slides

.html

HTML pages

.md

Markdown files

.png .jpg

Images & scans

.txt

Plain text

Use cases

Built for real-world document workflows.

From mailroom automation to compliance audits, LM-Kit.NET handles the document intelligence that matters.

Use case

Invoice & receipt processing

Extract vendor, amounts, line items, tax, and payment terms from any invoice format. Schema-driven extraction with zero hallucinations.

Use case

Contract analysis

Query legal agreements for clauses, obligations, termination conditions, and payment terms. Multi-document comparison with full source attribution.

Use case

Compliance & audit

Verify regulatory compliance across document collections. Traceable source references create audit trails for HIPAA, GDPR, and SOC 2.

Use case

Mailroom automation

Split multi-document scans into individual files, classify each automatically, and route to the correct workflow. No templates needed.

Use case

Knowledge base & research

Build searchable knowledge bases from technical manuals, research papers, and specifications. Semantic search across thousands of documents.

Use case

Customer support automation

Ingest product documentation and answer customer questions automatically. Grounded responses ensure accuracy with zero fabrication.

Ready-to-run demos

See document intelligence in action.

Every capability ships with a complete, runnable console application. Download, build, and explore. Full source code on GitHub.

More document intelligence demos

LM-Kit.NET pillars

Seven pillars, one foundation.

The seven pillars of LM-Kit.NET, plus the local runtime they share. Highlighted card is where you are now.

The foundation

Every capability above runs on this runtime.

Foundation

Local Inference

The runtime all seven pillars sit on. The LM-Kit.NET NuGet ships the complete inference system: open-weight LLMs, vision-language models, embeddings, on-device speech-to-text, OCR and classifiers, accelerated on CPU, AVX2, CUDA 12/13, Vulkan or Metal. One package, zero cloud calls, predictable latency, full data and technology sovereignty.

Explore the foundation
Install the SDK

Ready to build document intelligence?

The most advanced local document processing platform for .NET. From chat to extraction to splitting. 100% on your infrastructure.

Download free View pricing