Logo

Energy Efficient

Document intelligence for AI engineering workflows.

Extract structured, machine-readable content from any document and feed it directly into AI agents, pipelines, and applications.

1

One API

95

File formats

Why Kreuzberg?

Built for AI engineering workflows

Speed That Unblocks Your Team

Speed That
Unblocks Your Team

Process documents in milliseconds instead of seconds! Your RAG pipeline moves at the speed of API calls, not extraction bottlenecks. Index millions of documents without waiting weeks for processing to complete.

Batch-Processing at Scale

Batch-Processing at
Scale

Effectively process large number of documents in bulk. Kreuzberg is built for batch processing, and our cloud infrastructure is designed to scale.

Embeddings

Embeddings

Ultra-fast embeddings via Rust-native fastembed-rs. 4 presets out of the box, extensible to any model. No separate embedding pipeline needed.

Chunking and Metadata

Chunking and
Metadata

Semantic chunking across code, markdown, and plain text. Token reduction, keyword extraction, and rich metadata - structured output ready for any AI pipeline.

Code Intelligence

Code
Intelligence

Extract functions, classes, imports, and symbols from code files across 306 programming languages. Structured output, ready for semantic chunking and RAG pipelines.

LLM-Powered Intelligence

LLM-Powered
Intelligence

Go beyond extraction. Use vision language models as an OCR backend, extract structured JSON from documents using a schema, and generate embeddings - all via 146 LLM providers, including local models with zero API key configuration.

How it works

Three steps. One API

01

Send the file

Upload via API, SDK, CLI, or Docker. Supports PDFs, images, scanned docs, DOCX, PPTX, XLSX, HTML, and 90+ more formats.

02

We process it

Layout detection, OCR when needed, table extraction, optional VLM, and schema validation - all in a single call.

03

Pipe it anywhere

JSON response with full document structure. Webhook delivery for async workflows. Plug directly into your embeddings pipeline or RAG framework.

Pricing

Pay only for what you extract - no seats, no minimums

Cloud · Pay-as-you-go

Production-ready extraction, managed by us.

$0.008/page

Check

First 10,000 pages free

Check

95 file formats, 306 code formats

Check

Images and scanned PDFs supported

Check

OCR, layout detection, table extraction

Check

No monthly minimum

Get started instantly, no card required

Try it For Free!

High volume

100K+ pages a month? Let's talk pricing.

Check

Everything from the Pay as you go plan

Check

Discounted per-page rate on the cloud

Self-hosted

your data never leaves your environment

Check

Same capabilities as Cloud

Check

Run on your own infrastructure

Check

No data leaves your environment

Check

Built for regulated industries

Frequently Asked Questions

How fast is 'fast'?
Kreuzberg is built on a high-performance Rust core, so most documents are processed almost instantly- in milliseconds instead of seconds. For bulk jobs that's thousands of pages per hour on a single API key.
What file types do you support?
PDFs (native and scanned), images (JPG, PNG), Microsoft Office (DOCX, PPTX, XLSX), web content, and plain text. We detect document type automatically and optimize extraction for each format.
Do you handle scanned documents?
Yes. Built-in OCR recognizes text in images and scanned PDFs. No additional configuration needed—just send the file and get structured output back.
What happens to my documents?
Documents are processed in memory and deleted immediately after extraction. No storage, no indexing. We don't train on your data or use it for model improvement.
I already use your open-source library with good results. Why should I try Kreuzberg cloud?
The open-source engine is fully usable and powerful on its own. Kreuzberg Cloud removes the operational complexity, so you can run it in production without worrying about managing infrastructure.

Start Building Today

Join thousands of developers already building document intelligence pipelines using Kreuzberg - in their language of choice!

We value your privacy

Kreuzberg uses cookies to improve your experience, personalize content, and analyze traffic. You can manage your preferences at any time.