Skip to content

Source code for research into the automation of literature reviews using Python and NLTK. This case study searches abstracts for research into toxic cosmetic ingredients.

Notifications You must be signed in to change notification settings

rsonger/CosIng-Toxicity

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CosIng-Toxicity

This project contains source code for research into the automation of literature reviews using Python and NLTK. The CosIng-Toxicity case study in particular uses data about cosmetic ingredients to search for research into their toxicity.

This project was carried out in collaboration with the Kanazawa University Practical Pharmacology Laboratory. The goal was to verify an automated literature review process using natural language processing (NLP).

The data for this project comes from the following resources:

A presentation slideshow of this research is available on Slideshare at the link below.

A Natural Language Processing Approach to Reviewing Research Abstracts from Robert Songer

Research literature reviews have largely moved online and researchers must search through large quantities of digital documents to find research related to their academic pursuits. With recent developments in Natural Language Processing (NLP), computers can perform most of the searching and reduce the amount of time it takes researchers to find the papers they need. In this report, we introduce three basic NLP techniques (tokenization, frequency distributions, and in-sentence collocations) for searching the written texts of research abstracts downloaded from an online database. Real examples written in the Python programming language are provided along with a discussion of their efficacy in a project at Kanazawa University where an online research database was searched for research related to the adverse effects of hundreds of pharmaceutical compounds.

About

Source code for research into the automation of literature reviews using Python and NLTK. This case study searches abstracts for research into toxic cosmetic ingredients.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages