
Textum

Textum Review interface preview

Textum is a desktop application for handwritten and printed text recognition. It includes a side-by-side review interface and covers every step of the workflow: importing files, recognizing text, reviewing the results manually, and exporting to several formats. Textum works fully offline and does not send any data to external services. The application is provided free of charge.

🛠️ The source code is available at codeberg.org/textum/app.

🖼️ If you want to see more of the interface, visit the Gallery.

Features

  • Easy installation. Just download, extract and start working
  • Process single images, PDFs, or folders of images
  • Printed text recognition with Kosmos-2.5
  • Handwritten text recognition with TrOCR (fine-tuned on Holocaust diaries)
  • Models are bundled with the application, no separate downloads needed
  • Intuitive side-by-side review interface
  • Automated highlighting of likely OCR errors, via dictionaries (currently German, English, Dutch)
  • Export to plain text, DOCX, or PDF. ALTO and PAGE XML are not yet supported
  • Works fully offline. No data is sent anywhere.
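
As a rough illustration of dictionary-based error highlighting, the sketch below flags every word that is missing from a word list and returns its character span, which a review interface could then highlight. This is not Textum's actual implementation: the function name `flag_suspect_words` and the tiny `KNOWN_WORDS` set are hypothetical, and a real dictionary would be loaded from a file for each supported language.

```python
import re

# Hypothetical mini-dictionary for illustration only.
KNOWN_WORDS = {"the", "diary", "was", "written", "in", "amsterdam"}

# Matches runs of letters (including non-ASCII letters), skipping digits and "_".
WORD_RE = re.compile(r"[^\W\d_]+")


def flag_suspect_words(text, known_words):
    """Return (start, end, word) for each word not found in known_words."""
    suspects = []
    for match in WORD_RE.finditer(text):
        word = match.group()
        # Compare case-insensitively so sentence-initial capitals pass.
        if word.lower() not in known_words:
            suspects.append((match.start(), match.end(), word))
    return suspects


if __name__ == "__main__":
    line = "The diarv was written in Amsterdam"
    print(flag_suspect_words(line, KNOWN_WORDS))
```

A plain lookup like this also flags correctly recognized names and rare words, which is why such highlights are best treated as hints for manual review rather than as automatic corrections.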

▶️ Next Steps

System Requirements

  • CUDA-capable GPU (recommended, but not required)
  • 8 GB RAM should work, but 16 GB is recommended
  • ~20 GB disk space
  • Windows 10/11 or Linux. macOS is untested

How It Works

  1. Preprocess - Image enhancement and normalization
  2. Segment - YOLO-based line and region detection (handwritten only)
  3. Recognize - Kosmos-2.5 (printed) or TrOCR (handwritten)
  4. Postprocess - Spell checking and corrections
  5. Review - Manual verification and editing
  6. Export - Save as text, DOCX, or PDF
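
The routing implied by the steps above can be sketched as follows: segmentation runs only for handwritten input, and the recognition model depends on the input type. This is an assumed outline, not code from Textum; the `Page` class, the `plan_steps` function, and the step labels are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Page:
    source: str        # path to the input image (hypothetical field)
    handwritten: bool  # True for handwriting, False for printed text


def plan_steps(page: Page) -> List[str]:
    """List the pipeline stages a page would pass through, in order."""
    steps = ["preprocess"]
    if page.handwritten:
        # Line and region detection applies to handwritten input only.
        steps.append("segment")
        steps.append("recognize:trocr")
    else:
        steps.append("recognize:kosmos-2.5")
    steps += ["postprocess", "review", "export"]
    return steps


if __name__ == "__main__":
    print(plan_steps(Page("letter.png", handwritten=True)))
    print(plan_steps(Page("newspaper.png", handwritten=False)))
```

Returning the plan as a list of labels keeps the sketch free of model dependencies; in a real application each label would map to a function that transforms the page.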