Skip to content
Change the repository type filter

All

    Repositories list

    • Zotero Plugin for OCR
      JavaScript
      GNU Affero General Public License v3.0
      40589100Updated Feb 8, 2025Feb 8, 2025
    • Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
      TypeScript
      MIT License
      1.9k000Updated Feb 6, 2025Feb 6, 2025
    • Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
      JavaScript
      MIT License
      23185351Updated Feb 5, 2025Feb 5, 2025
    • malibu

      Public
      Mannheim library utilities
      PHP
      GNU General Public License v3.0
      1426171Updated Feb 4, 2025Feb 4, 2025
    • ape

      Public
      ALMA Print Extension
      PHP
      Other
      1950Updated Feb 4, 2025Feb 4, 2025
    • Kitodo.Presentation Community Edition
      JavaScript
      GNU General Public License v3.0
      44200Updated Feb 3, 2025Feb 3, 2025
    • Ground truth for digitized publications of UB Tübingen
      Python
      Creative Commons Zero v1.0 Universal
      1810Updated Jan 30, 2025Jan 30, 2025
    • paraly

      Public
      Jupyter Notebook
      Other
      0000Updated Jan 29, 2025Jan 29, 2025
    • DCC

      Public
      DCC: Digitalization, OCR, and structuring of the books "The Descendants of the Colonial Clergy"
      Python
      Other
      0000Updated Jan 29, 2025Jan 29, 2025
    • A node.js , gulp based development enviornment for Primo's new UI customizations (css,images,html and javascript)
      JavaScript
      BSD 3-Clause "New" or "Revised" License
      86000Updated Jan 28, 2025Jan 28, 2025
    • Test Repository for Course "Scientific Writing and Bibliographic Research" @ Uni Mannheim
      Shell
      MIT License
      672810Updated Jan 20, 2025Jan 20, 2025
    • MBI-KG

      Public
      MBI-KG: A knowledge graph of structured and linked economic research data extracted from the book "Die Maschinen-Industrie im Deutschen Reich" written by Herbert Patschan in 1937
      Python
      Other
      1300Updated Jan 15, 2025Jan 15, 2025
    • pero-ocr

      Public
      Python
      BSD 3-Clause "New" or "Revised" License
      22000Updated Jan 12, 2025Jan 12, 2025
    • dach-gt

      Public
      Ground truth and full text for selected prints of German libraries
      Shell
      Creative Commons Zero v1.0 Universal
      2200Updated Jan 11, 2025Jan 11, 2025
    • User-friendly WebUI for LLMs (Formerly Ollama WebUI)
      JavaScript
      MIT License
      8.3k000Updated Jan 10, 2025Jan 10, 2025
    • ojs

      Public
      Open Journal Systems
      PHP
      918000Updated Jan 10, 2025Jan 10, 2025
    • Omeka

      Public
      A flexible web publishing platform for the display of library, museum and scholarly collections, archives and exhibitions.
      PHP
      GNU General Public License v3.0
      198000Updated Jan 10, 2025Jan 10, 2025
    • kraken

      Public
      OCR engine for all the languages
      Python
      Apache License 2.0
      141300Updated Jan 10, 2025Jan 10, 2025
    • Python
      Other
      62220Updated Dec 23, 2024Dec 23, 2024
    • Maintained by @Erikmitk.
      JavaScript
      Other
      2003Updated Dec 23, 2024Dec 23, 2024
    • party

      Public
      Page-wise text recognition with lower-supervision line data models
      Python
      Apache License 2.0
      4000Updated Dec 21, 2024Dec 21, 2024
    • Kitodo.Production Community Edition
      Java
      GNU General Public License v3.0
      63120Updated Dec 16, 2024Dec 16, 2024
    • metadata

      Public
      Repository for metadata related topics
      GNU Affero General Public License v3.0
      1000Updated Dec 7, 2024Dec 7, 2024
    • blatt

      Public
      NLP-helper for OCR-ed pages in PAGE XML format
      Python
      MIT License
      1900Updated Dec 6, 2024Dec 6, 2024
    • Software and data related to "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger"
      Shell
      Apache License 2.0
      0440Updated Dec 5, 2024Dec 5, 2024
    • tesseract

      Public
      Tesseract Open Source OCR Engine (main repository)
      C++
      Apache License 2.0
      9.7k3.3k91Updated Nov 22, 2024Nov 22, 2024
    • This repository provides German documentation relating to the text recognition and transcription platform eScriptorium. The documentation was created in the context of the OCR-BW project.
      Creative Commons Zero v1.0 Universal
      2720Updated Nov 19, 2024Nov 19, 2024
    • Tools to process books in a cloud based pipeline system
      Go
      GNU General Public License v3.0
      4000Updated Nov 6, 2024Nov 6, 2024
    • ldll

      Public
      Find DLL dependencies required for a set of Windows binaries
      Python
      MIT License
      0000Updated Nov 5, 2024Nov 5, 2024
    • PalMA

      Public
      PalMA Team Monitor
      PHP
      Other
      16028253Updated Nov 5, 2024Nov 5, 2024