Digitizing an independent newspaper 1987-1992.
WHAMO: World Herald Attitude Monitoring Operation.
In 1989 the newspaper rebranded as "Nebraska Observer."
- Used extract.ps1 (Windows PowerShell) to extract Windows file metadata from the original TIF files.
- Used sort.pl to draft a list of which TIFs belong in which PDF.
- Fixed many errors in sorted.txt by hand.
- pdf_and_ocr.pl merges multiple TIFs into a single TIF, converts that TIF to PDF, and then OCRs that PDF.
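
For readers without the Perl script, the merge, convert, and OCR steps can be roughed out in shell. This is only a sketch under assumptions: it uses tiffcp and tiff2pdf (from libtiff) and ocrmypdf, which may not be the tools pdf_and_ocr.pl actually calls, and the names run, build_issue, and the "-raw.pdf" suffix are made up for illustration.

```shell
#!/bin/sh
# Sketch: merge page TIFs into one multi-page TIF, convert it to PDF, then OCR.
# Assumption: tiffcp, tiff2pdf, and ocrmypdf are installed; pdf_and_ocr.pl
# may use different tools.
# Set DRY_RUN=1 to print the commands instead of running them.

run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        printf '%s\n' "$*"
    else
        "$@"
    fi
}

# build_issue NAME PAGE.tif [PAGE.tif ...]
# Produces NAME.tif (merged), NAME-raw.pdf (no text layer), NAME.pdf (OCRed).
build_issue() {
    name=$1
    shift
    run tiffcp "$@" "$name.tif" &&
    run tiff2pdf -o "$name-raw.pdf" "$name.tif" &&
    run ocrmypdf "$name-raw.pdf" "$name.pdf"
}
```

Running `DRY_RUN=1 build_issue WHAMO-1987-09-01 p1.tif p2.tif` prints the three commands without touching any files, which is a cheap way to check the page order before a long OCR run.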
Browse the archive. Upload documentation.
export DATE=1989-02-01; ./ia upload NebraskaObserver-$DATE NebraskaObserver-$DATE.pdf \
--metadata="date:$DATE" \
--metadata="title:Nebraska Observer $DATE" \
--metadata="creator:Frances Mendenhall" \
--metadata="description:Nebraska Observer. A citizen's organization providing an alternative voice." \
--metadata="subject:newspaper; Omaha; Nebraska; Nebraska Observer" \
--metadata="language:English" \
--metadata="mediatype:texts"
export DATE=1987-09-01; ./ia upload WHAMO-$DATE WHAMO-$DATE.pdf \
--metadata="date:$DATE" \
--metadata="title:WHAMO $DATE" \
--metadata="creator:Frances Mendenhall" \
--metadata="description:WHAMO. A citizen's organization providing an alternative voice." \
--metadata="subject:newspaper; Omaha; Nebraska; WHAMO" \
--metadata="language:English" \
--metadata="mediatype:texts"
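
The two commands above differ only in the identifier prefix, the title, and the date, so they could be wrapped in one function. A minimal sketch, assuming the same ./ia binary and metadata as above; upload_issue and run are hypothetical helper names, not part of the ia tool.

```shell
#!/bin/sh
# Sketch: one function for both upload commands shown above.
# upload_issue and run are hypothetical helpers, not part of the ia tool.
# Set DRY_RUN=1 to print the command instead of uploading.

run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        printf '%s\n' "$*"
    else
        "$@"
    fi
}

# upload_issue PREFIX TITLE DATE
# PREFIX is the identifier and file name prefix, TITLE the human-readable name.
upload_issue() {
    prefix=$1
    title=$2
    date=$3
    run ./ia upload "$prefix-$date" "$prefix-$date.pdf" \
        --metadata="date:$date" \
        --metadata="title:$title $date" \
        --metadata="creator:Frances Mendenhall" \
        --metadata="description:$title. A citizen's organization providing an alternative voice." \
        --metadata="subject:newspaper; Omaha; Nebraska; $title" \
        --metadata="language:English" \
        --metadata="mediatype:texts"
}
```

For example, `upload_issue WHAMO WHAMO 1987-09-01` or `upload_issue NebraskaObserver "Nebraska Observer" 1989-02-01`. Prefixing either with `DRY_RUN=1` prints the command for review first.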