🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
-
Updated
Jun 3, 2024 - Python
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
FileTrove indexes files and creates metadata from them.
Michael Schiltz personal page
IFIscripts is an open-source digital preservation tool which facilitates collection management workflows within the IFI and further afield. It is freely available from the GitHub repository and subject to modification depending on the progressive needs of collections and based upon policies and preservation standards.
Archivematica Automated User Acceptance Tests (AMAUAT)
Tools to extract metadata and TIFFs from Alchemy database CD-ROMs
Archivematica API client module
Official Python package for ArchiveBox, the self-hosted internet archiving solution.
Home of the official apt/deb package for Ubuntu/Debian-based systems.
Source for the Github Wiki / ReadTheDocs documentation for AchiveBox, the self-hosted internet archiving solution.
Pyscripted Demystify enabling analysis of DROID and Siegfried reports from the browser
Engine for analysis of Siegfried export files and DROID CSV. The tool has three purposes, break the export into its components and store them within a SQLite database; create additional columns to augment the output where useful; and query the SQLite database, outputting results in a readable form useful for analysis by researchers and archivist…
Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.
What is the checksum of a directory?
Normalize file format identification results (DROID, Siegfried) into a single SQLite DB
List of open workflows and resources for A/V archiving
Add a description, image, and links to the digipres topic page so that developers can more easily learn about it.
To associate your repository with the digipres topic, visit your repo's landing page and select "manage topics."