Skip to content
View wrznr's full-sized avatar
💭
Happy VIMing
💭
Happy VIMing

Organizations

@slub

Block or report wrznr

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Python tools for performing various operations on ALTO XML files

Python 45 16 Updated Feb 12, 2025
Python 1 1 Updated Jan 29, 2025

Ein Inventar aller Akten aus historischen Archiven, die sich mit der Geschichte des deutschsprachigen Buchhandels im 19. und 20. Jahrhundert befassen.

Jupyter Notebook 2 Updated Dec 9, 2021

A documentation for FAIR GPT, a virtual RDM consultant

14 1 Updated Oct 10, 2024

Layout analysis to find layout elements in documents (similar to P2PaLA)

Python 18 6 Updated Feb 20, 2025

Convert AWS Textract JSON to PRImA PAGE XML

Python 6 3 Updated Feb 3, 2025
Jupyter Notebook 2 Updated Jun 6, 2024

Host repository for The Turing Way: a how to guide for reproducible data science

TeX 1,978 670 Updated Feb 24, 2025

Algorithm for Open Data Detection in Publications (ODDPub)

R 36 9 Updated Feb 17, 2025

Docker integration of Kitodo.Production and OCR-D

XSLT 9 6 Updated Mar 12, 2024

unAPI Client (K10plus)

Python 2 Updated Sep 13, 2023

forced alignment of lists of string by fuzzy string matching

Python 10 1 Updated Sep 30, 2024

Modules used for separating articles in (historical) newspapers and similar documents. This repository is part of the European Union's Horizon 2020 project NewsEye. For more information about the p…

Python 20 Updated Sep 2, 2022

Leipzig music font

Python 5 4 Updated Sep 27, 2024

SitePackage and Configuration of Sachsen.Digital website

JavaScript 2 5 Updated Sep 6, 2024

Python Twitter API

Python 3,235 720 Updated Feb 10, 2025

Scraping MDPI website to get the number of special issues for each of 74 journals with an IF

HTML 42 10 Updated Mar 22, 2023

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,793 2,596 Updated Feb 6, 2025

Read-only unofficial mirror of Pynini

C++ 17 4 Updated May 7, 2019

Prototype for the presentation of a good-practice collection of Open Educational Ressources

CSS 4 5 Updated Feb 29, 2024

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Python 14,083 2,112 Updated Feb 21, 2025

Highlighting various OCR formats directly in Solr

HTML 84 13 Updated Feb 21, 2025

Models that were trained for the Origami BBZ project.

5 Updated Jan 21, 2021

Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)

JavaScript 187 24 Updated Feb 5, 2025

OCR-D wrapper for arbitrary coords-preserving image operations

Python 4 1 Updated Feb 15, 2025

Utility scripts for using the Web of Science Links Article Match Retrieval Service (AMR) service.

Python 17 3 Updated Feb 28, 2023

Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen"

Makefile 12 1 Updated Dec 17, 2021

A tiny shell-script based testing framework

Shell 7 3 Updated Dec 7, 2018

Augment line images for improving OCR datasets

Python 9 1 Updated Oct 4, 2023
Next
Showing results