Releases: quest-bih/oddpub
Volcano Island
Regular expressions were moved to a YAML file and updated to be more conservative when detecting data re-use.
It is no longer sufficient for a citation, accession number, or link to be in the same sentence as "was_available".
PDF to text conversion was improved to remove left margin text and deal with image pages containing isolated math symbols.
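The difference between a co-occurrence rule and a more conservative one can be sketched as follows. This is a language-neutral illustration in Python, not the package's code: the three patterns and both function names are hypothetical, and the release notes do not state what additional evidence is now required. The sketch assumes, purely for illustration, that a sentence containing a re-use cue is no longer counted.

```python
import re

# Hypothetical patterns for illustration only; the package keeps its real
# keyword lists in a YAML file that is not reproduced in these notes.
ACCESSION = re.compile(r"\b(GSE\d+|PRJNA\d+)\b")
WAS_AVAILABLE = re.compile(
    r"\b(is|are|was|were) (publicly )?(available|deposited)\b", re.I
)
REUSE_CUE = re.compile(
    r"\b(previously (published|reported)|obtained from|downloaded from)\b", re.I
)


def lenient_match(sentence: str) -> bool:
    """Co-occurrence rule: an accession number and an availability phrase
    appear in the same sentence."""
    return bool(ACCESSION.search(sentence) and WAS_AVAILABLE.search(sentence))


def conservative_match(sentence: str) -> bool:
    """Assumed stricter rule: co-occurrence counts only when the sentence
    carries no cue that the data were re-used from earlier work."""
    return lenient_match(sentence) and not REUSE_CUE.search(sentence)


own = "The sequencing data were deposited in GEO under accession GSE12345."
reuse = ("Previously published data were downloaded from GEO "
         "and are available under GSE67890.")

print(lenient_match(own), conservative_match(own))
print(lenient_match(reuse), conservative_match(reuse))
```

Under these assumed patterns, both rules accept the first sentence, while only the co-occurrence rule accepts the second.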
Taal
Major updates:
- open_data_search receives a new parameter, screen_das, which regulates the way data and code availability statements are screened.
- pdf_convert receives a new parameter, add_section_tags. With this parameter, the tags that denote the potential start of a section are now optional in the txt output.
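What an optional section tag means for the text output can be illustrated with a small sketch. It is written in Python for illustration and does not reproduce the package's implementation: the function render_lines, the section_starts argument, and the marker string are all hypothetical, and the actual tag used in the txt output is not shown in these notes. Only the parameter name add_section_tags comes from the release notes.

```python
# Placeholder marker; the real tag written by the package is an assumption here.
SECTION_TAG = "[[SECTION]] "


def render_lines(lines, section_starts, add_section_tags=True):
    """Join text lines into one string, optionally prefixing lines that were
    detected as potential section starts with a marker.

    lines            -- list of text lines from one converted document
    section_starts   -- set of indices of lines flagged as section starts
    add_section_tags -- when False, the output carries no markers at all
    """
    out = []
    for i, line in enumerate(lines):
        if add_section_tags and i in section_starts:
            out.append(SECTION_TAG + line)
        else:
            out.append(line)
    return "\n".join(out)


lines = [
    "Methods",
    "We sampled ten mice.",
    "Data availability",
    "Data are on Zenodo.",
]
starts = {0, 2}

tagged = render_lines(lines, starts, add_section_tags=True)
plain = render_lines(lines, starts, add_section_tags=False)
print(tagged)
print(plain)
```

With tagging switched off, the output is the unmodified text, which may suit downstream tools that do not expect markers.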
Minor updates:
- improved parsing of JAMA papers.
- changed all parameters to snake_case.
Luzon
New Features:
- PDF parsing now recognizes multi-column layouts and inserts (figures, tables, appendices), and tags potential section starts.
- Tagging in combination with common section titles is used to extract the Data and Code Availability Statements.
- Open data categories flagged include re-use, upon request, supplement, unknown/misspecified source. These categories do not satisfy our criteria for open data, but are of interest to meta-researchers. Some of these are still experimental and will be validated in the future.
Validation of the new release is planned for 2025.
ODDPub publication release (updated)
Updated ODDPub release version used for the publication.
ODDPub publication release
ODDPub release version used for the publication.
ODDPub publication release (pre-final)
ODDPub release version used for the publication.