- Dresden
Stars
Python tools for performing various operations on ALTO XML files
An inventory of all files from historical archives that deal with the history of the German-language book trade in the 19th and 20th centuries.
Documentation for FAIR GPT, a virtual RDM consultant
Layout analysis to find layout elements in documents (similar to P2PaLA)
Host repository for The Turing Way: a how-to guide for reproducible data science
Algorithm for Open Data Detection in Publications (ODDPub)
Forced alignment of lists of strings by fuzzy string matching
Modules used for separating articles in (historical) newspapers and similar documents. This repository is part of the European Union's Horizon 2020 project NewsEye. For more information about the p…
SitePackage and Configuration of Sachsen.Digital website
Scraping MDPI website to get the number of special issues for each of 74 journals with an IF
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Prototype for the presentation of a good-practice collection of Open Educational Resources
A very simple framework for state-of-the-art Natural Language Processing (NLP)
Highlighting various OCR formats directly in Solr
Models that were trained for the Origami BBZ project.
Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
OCR-D wrapper for arbitrary coords-preserving image operations
Utility scripts for using the Web of Science Links Article Match Retrieval (AMR) service.
Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen"