Kitodo.Presentation is a feature-rich framework for building a METS- or IIIF-based digital library. It is part of the Kitodo Digital Library Suite.
Updated May 22, 2024 - PHP
Read and extract text and other content from PDFs in C# (port of PDFBox)
OCR engine for all languages
Application that highlights ALTO XML coordinates so their values can be validated
Convert TIFF images into OCR XML using Tesseract
ALTO XML schema - latest and all former versions
A pipeline to transfer ground truth from Transkribus to eScriptorium.
Extract the MODS/ALTO metadata of a bunch of METS/ALTO files into pandas DataFrames for data analysis
Convert ALTO XML to plain text + minimal metadata
Python tools for performing various operations on ALTO XML files
Document layout analysis resources for development with PdfPig.
Text Overlay plugin for Mirador 3
Image Retrieval in Digital Libraries - A Multicollection Experimentation of Machine Learning techniques
Helper functions and web app for METS/ALTO archive viewing.
Data Mining Historical Newspaper Metadata (METS/ALTO formats)
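Several of the projects listed above convert ALTO XML to plain text. As a rough illustration of what that involves, here is a minimal sketch using only the Python standard library. It is not taken from any of the listed repositories: the function names `alto_to_text` and `local_name` and the sample document are hypothetical. It assumes the usual ALTO layout in which each `TextLine` element holds `String` children whose `CONTENT` attribute carries a word, and it ignores hyphenation (`HYP`) elements.

```python
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip the '{namespace}' prefix so any ALTO schema version matches."""
    return tag.rsplit("}", 1)[-1]


def alto_to_text(alto_xml):
    """Return the text of an ALTO document, one output line per TextLine."""
    root = ET.fromstring(alto_xml)
    lines = []
    for element in root.iter():
        if local_name(element.tag) != "TextLine":
            continue
        # Each String child carries one word in its CONTENT attribute.
        words = [
            child.attrib.get("CONTENT", "")
            for child in element
            if local_name(child.tag) == "String"
        ]
        lines.append(" ".join(words))
    return "\n".join(lines)


# Hypothetical two-line sample document, for illustration only.
SAMPLE = """<alto xmlns="http://www.loc.gov/standards/alto/ns-v4#">
  <Layout><Page><PrintSpace><TextBlock>
    <TextLine>
      <String CONTENT="Hello" HPOS="10" VPOS="10" WIDTH="50" HEIGHT="12"/>
      <SP/>
      <String CONTENT="world" HPOS="70" VPOS="10" WIDTH="50" HEIGHT="12"/>
    </TextLine>
    <TextLine>
      <String CONTENT="Second"/><SP/><String CONTENT="line"/>
    </TextLine>
  </TextBlock></PrintSpace></Page></Layout>
</alto>"""

if __name__ == "__main__":
    print(alto_to_text(SAMPLE))
```

Matching on the local element name rather than a fixed namespace keeps the sketch usable across ALTO schema versions, at the cost of not validating the document against any of them.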