A repository of text extracts from 17,000+ open access articles with subject tag 11 (Health Sciences) and an Altmetric Attention Score (AAS) ≥100. It contains the uncleaned PDF text extractions of these articles, which were produced with PyMuPDF. The included articles were identified with Altmetric Explorer on 6 June 2019. The dataset is still missing 300+ articles and requires secondary validation. However, it is a representative sample of the 'most popular' open access medical literature available.
Please contact me if you want to use the dataset in peer-reviewed literature.
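
For readers who want to reproduce or extend the extracts, the following is a minimal sketch of how raw text can be pulled from a PDF with PyMuPDF. It is not the exact script used to build this dataset. The folder layout, the one `.txt` file per PDF naming, and the function names `txt_path_for`, `extract_text` and `extract_folder` are assumptions made for illustration. It also assumes a recent PyMuPDF release where the page method is `get_text()`; older releases spell it `getText()`.

```python
from pathlib import Path


def txt_path_for(pdf_path, out_dir):
    """Map a PDF path to the .txt path its extract would be written to.

    The naming scheme is an assumption, not this repository's documented layout.
    """
    return Path(out_dir) / (Path(pdf_path).stem + ".txt")


def extract_text(pdf_path):
    """Return the raw, uncleaned text of every page in one PDF."""
    import fitz  # PyMuPDF; imported lazily so the path helper works without it

    doc = fitz.open(str(pdf_path))
    try:
        # get_text() with no arguments returns the plain text of a page.
        return "\n".join(page.get_text() for page in doc)
    finally:
        doc.close()


def extract_folder(pdf_dir, out_dir):
    """Write one .txt extract per PDF found in pdf_dir (hypothetical layout)."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    for pdf_path in sorted(Path(pdf_dir).glob("*.pdf")):
        text = extract_text(pdf_path)
        txt_path_for(pdf_path, out_dir).write_text(text, encoding="utf-8")
```

Calling `extract_folder("pdfs", "extracts")` would write one text file per PDF in a hypothetical `pdfs` folder. No cleaning is applied, so headers, footers, hyphenated line breaks and reference lists remain in the output, which matches the uncleaned state of the extracts described above.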