cq-notebooks

Notebooks for answering competency questions

Our goal here is to create a catalog of working examples that demonstrate how to access, transform, integrate and visualize the diverse data sources we intend to use for projects like Translator.

We are currently using Jupyter notebooks as our means of documenting, prototyping, and sharing code. As some of these experiments mature into working prototype pipelines, we intend to extract this functionality from the notebooks and migrate it into a production pipeline.

Workplan

Orange team queries are initially collected in the spreadsheet here, with tabs for different collections of notebooks (e.g. demonstrator-driven queries, general benchmarking queries). From this staging area, select queries are implemented in Jupyter or Zeppelin notebooks. A detailed overview of spreadsheet contents and the workflow for CQ development can be found in the documents here and here.

Notebooks under active development each have an associated directory in this repo that includes the notebook itself, a descriptive README, and any associated code or data. For each notebook, a Github ticket is also created and tagged with a notebook-status label to track its status, ownership, and outcomes. These tickets enable a dashboard-like overview of progress notebooks to be generated here.

Running Locally

One-time Setup

You will need Python (e.g., Python 3.5.2). If you do not have pip installed, you can install it with following command:

sudo easy_install pip

Once you have pip, run the following commands for first time setup

virtualenv env
source env/bin/activate
pip install jupyter ipython pandas requests

Running

After the initial setup, you only need to execute the commands below to bring up the notebooks

source env/bin/activate
jupyter notebook

APIs

API development guidelines

TODO: we should aim to drive this list from Smart API registry

Live

Pharos https://pharos.nih.gov/ (Purple) - drug info
Ginas http://ginas.ncats.nih.gov (Purple) - substances
BioLink https://api.monarchinitiative.org/api/ (Orange)
BioThings API for genes: MyGene.info (Orange)
BioThings API for variants: MyVariant.info (Orange)
BioThings API for drugs/Compounds: http://c.biothings.io (Orange)
BioThings API for taxonomy: http://t.biothings.io (Orange)
Wikidata SPARQL (Orange)
DGIdb API for drug-gene interactions: http://dgidb.genome.wustl.edu/api
Jaspar SPARQL endpoint: https://tfbsmotif.ncats.io/blazegraph/#query
Clinical profiles API: HAPI FHIR (Orange)
- Demo use

Hackathon

Disease prediction (Grey)

Translator TIDBITS Workflows

This is where the TIDBITS Workflows can be stored and edited.

In particular, you can use this git repository to track issues related to a given workflow.

One-time Setup

Upon git cloneing the project, you need to configure the mvp-modules-library git submodule:

$ git submodule init

Every time you git pull an update of the system, you may wish to also:

$ git submodule update

Name		Name	Last commit message	Last commit date
Latest commit History 1,226 Commits
BigGIM		BigGIM
Contributor_Docs		Contributor_Docs
FA_gene_sets		FA_gene_sets
Green_CQs		Green_CQs
OrangePurpleRedGreenGray		OrangePurpleRedGreenGray
Orange_Demonstrator_1_CQs		Orange_Demonstrator_1_CQs
Orange_Demonstrator_2_CQs		Orange_Demonstrator_2_CQs
Orange_Demonstrator_3_CQs		Orange_Demonstrator_3_CQs
Orange_QB1_Benchmark_CQs		Orange_QB1_Benchmark_CQs
Orange_QB2_Other_CQs		Orange_QB2_Other_CQs
PTSD-Stretch		PTSD-Stretch
Purple_BBTest1		Purple_BBTest1
WorkFlow9		WorkFlow9
Workflow2		Workflow2
Workflow3		Workflow3
Workflow4		Workflow4
Workflow5		Workflow5
Workflow7		Workflow7
Workflow8		Workflow8
blue		blue
docs/images		docs/images
greengamma		greengamma
hackathon_may_2018		hackathon_may_2018
indigo		indigo
infrared		infrared
mvp-module-library @ ac13f5c		mvp-module-library @ ac13f5c
ndex		ndex
pharos		pharos
swagger-utils		swagger-utils
tidbits		tidbits
wikidata		wikidata
xray		xray
.DS_Store		.DS_Store
.gitignore		.gitignore
.gitmodules		.gitmodules
API_dev_guidelines.md		API_dev_guidelines.md
BlueQ1		BlueQ1
CHEBIBioLinkGINAS.ipynb		CHEBIBioLinkGINAS.ipynb
README.md		README.md
green-requirements.txt		green-requirements.txt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

cq-notebooks

Workplan

Running Locally

One-time Setup

Running

APIs

Live

Next

Hackathon

Translator TIDBITS Workflows

One-time Setup

About

Releases

Packages

Contributors 42

Languages

ncats/translator-workflows

Folders and files

Latest commit

History

Repository files navigation

cq-notebooks

Workplan

Running Locally

One-time Setup

Running

APIs

Live

Next

Hackathon

Translator TIDBITS Workflows

One-time Setup

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 42

Languages

Packages