# Frequently Asked Questions

## How fast is "fast"?

Kreuzberg is built on a high-performance Rust core, so most documents are processed almost instantly — in milliseconds instead of seconds. For bulk jobs that's thousands of pages per hour on a single API key.

## What file types do you support?

PDFs (native and scanned), images (JPG, PNG, TIFF, and more), Microsoft Office (DOCX, PPTX, XLSX), web content (HTML, XML), plain text, and code in 305 programming languages. We detect document type automatically and optimize extraction for each format. Full list: 92 document formats. See [capabilities](https://kreuzberg.dev/llms/capabilities.md).

## Do you handle scanned documents?

Yes. Built-in Tesseract OCR recognises text in images and scanned PDFs with no additional configuration — just send the file and get structured output back. The cloud API is fixed to Tesseract; the self-hosted library can also use vision-language-model OCR backends for complex layouts.

## What happens to my documents?

Documents are processed in memory and deleted immediately after extraction. No storage, no indexing. We don't train on your data or use it for model improvement.

## What license does Kreuzberg use?

The Kreuzberg open-source library is licensed under the Elastic License v2 (ELv2) from v4.8.0 onward. You can use it freely for personal projects, internal tools, and commercial applications. The one restriction: you cannot offer Kreuzberg as a managed service to third parties. Versions v4.7.x and below remain MIT-licensed.

Kreuzberg Cloud is a separate commercial product with its own terms. If you need a different licensing arrangement, contact <contact@kreuzberg.dev>.

## I already use the open-source library. Why try Kreuzberg Cloud?

The open-source engine is fully usable and powerful on its own. Kreuzberg Cloud removes the operational complexity, so you can run it in production without worrying about managing infrastructure — autoscaling, OCR backends, webhook delivery, and quota management are handled for you.

## See also

- [Pricing](https://kreuzberg.dev/llms/pricing.md)
- [Capabilities](https://kreuzberg.dev/llms/capabilities.md)
- [Extraction API](https://kreuzberg.dev/llms/api.md)
