Skip to content

Commit 52ccadb

Browse files
author
mdiberna
committed
initial commit
1 parent 19851db commit 52ccadb

16 files changed

+1071
-1
lines changed

.gitignore

Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,10 @@
1+
# egg-info
2+
src/swissisoform.egg-info
3+
4+
# pycache
5+
src/swissisoform/__pycache__
6+
7+
# data
8+
data/genome_data/hg38.fa
9+
data/genome_data/hg38.ncbiRefSeq.gtf
10+
data/ribosome_profiling

LICENSE

Lines changed: 21 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,21 @@
1+
MIT License
2+
3+
Copyright (c) 2019, Massachusetts Institute of Technology All rights reserved.
4+
5+
Permission is hereby granted, free of charge, to any person obtaining a copy
6+
of this software and associated documentation files (the "Software"), to deal
7+
in the Software without restriction, including without limitation the rights
8+
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
9+
copies of the Software, and to permit persons to whom the Software is
10+
furnished to do so, subject to the following conditions:
11+
12+
The above copyright notice and this permission notice shall be included in all
13+
copies or substantial portions of the Software.
14+
15+
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
16+
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
17+
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
18+
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
19+
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
20+
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
21+
SOFTWARE.

README.md

Lines changed: 38 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1,38 @@
1-
# swissisoform
1+
# SwissIsoform
2+
3+
Tool for analyzing and visualizing alternative protein isoforms from ribosome profiling data.
4+
5+
## Installation
6+
7+
```bash
8+
# Clone repository
9+
git clone https://github.com/cheeseman-lab/swissisoform.git
10+
cd swissisoform
11+
12+
# Create conda environment
13+
conda env create --file=environment.yml
14+
15+
# Activate environment
16+
conda activate swissisoform
17+
```
18+
19+
## Data Requirements
20+
21+
1. Reference genome (FASTA)
22+
2. Gene annotations (GTF)
23+
24+
```bash
25+
# Move into genome_data folder
26+
cd data/genome_data
27+
28+
# Download genome data
29+
sh download_genome.sh
30+
```
31+
32+
3. Alternative start sites (BED)
33+
34+
35+
36+
## Usage
37+
38+
See `notebooks/visualize_isoforms.ipynb` for example usage.

data/genome_data/download_genome.sh

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,7 @@
1+
#!/bin/bash
2+
UCSC_BASE="https://hgdownload.soe.ucsc.edu/goldenPath/hg38/bigZips"
3+
4+
wget ${UCSC_BASE}/hg38.fa.gz
5+
wget ${UCSC_BASE}/genes/hg38.ncbiRefSeq.gtf.gz
6+
7+
gunzip *.gz

environment.yml

Lines changed: 14 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,14 @@
1+
name: swissisoform
2+
channels:
3+
- conda-forge
4+
- bioconda
5+
- anaconda
6+
dependencies:
7+
- python=3.10
8+
- numpy
9+
- pandas
10+
- biopython
11+
- pip
12+
- matplotlib
13+
- pip:
14+
- -e .

notebooks/naxe_5utr_NM_144772.3_2.png

71.1 KB
Loading

notebooks/naxe_features.csv

Lines changed: 17 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,17 @@
1+
chromosome,source,feature_type,start,end,score,strand,frame,gene_id,transcript_id,,exon_number,exon_id,gene_name
2+
chr1_MU273335v1_fix,ncbiRefSeq.2022-10-28,exon,150333,150543,.,+,.,NAXE,NM_144772.3,,1,NM_144772.3.1,NAXE
3+
chr1_MU273335v1_fix,ncbiRefSeq.2022-10-28,5UTR,150333,150361,.,+,.,NAXE,NM_144772.3,,1,NM_144772.3.1,NAXE
4+
chr1_MU273335v1_fix,ncbiRefSeq.2022-10-28,CDS,150362,150543,.,+,0,NAXE,NM_144772.3,,1,NM_144772.3.1,NAXE
5+
chr1_MU273335v1_fix,ncbiRefSeq.2022-10-28,exon,150658,150766,.,+,.,NAXE,NM_144772.3,,2,NM_144772.3.2,NAXE
6+
chr1_MU273335v1_fix,ncbiRefSeq.2022-10-28,CDS,150658,150766,.,+,1,NAXE,NM_144772.3,,2,NM_144772.3.2,NAXE
7+
chr1_MU273335v1_fix,ncbiRefSeq.2022-10-28,exon,150922,151032,.,+,.,NAXE,NM_144772.3,,3,NM_144772.3.3,NAXE
8+
chr1_MU273335v1_fix,ncbiRefSeq.2022-10-28,CDS,150922,151032,.,+,0,NAXE,NM_144772.3,,3,NM_144772.3.3,NAXE
9+
chr1_MU273335v1_fix,ncbiRefSeq.2022-10-28,exon,151114,151227,.,+,.,NAXE,NM_144772.3,,4,NM_144772.3.4,NAXE
10+
chr1_MU273335v1_fix,ncbiRefSeq.2022-10-28,CDS,151114,151227,.,+,0,NAXE,NM_144772.3,,4,NM_144772.3.4,NAXE
11+
chr1_MU273335v1_fix,ncbiRefSeq.2022-10-28,exon,151965,152112,.,+,.,NAXE,NM_144772.3,,5,NM_144772.3.5,NAXE
12+
chr1_MU273335v1_fix,ncbiRefSeq.2022-10-28,CDS,151965,152112,.,+,0,NAXE,NM_144772.3,,5,NM_144772.3.5,NAXE
13+
chr1_MU273335v1_fix,ncbiRefSeq.2022-10-28,exon,152439,152856,.,+,.,NAXE,NM_144772.3,,6,NM_144772.3.6,NAXE
14+
chr1_MU273335v1_fix,ncbiRefSeq.2022-10-28,CDS,152439,152638,.,+,2,NAXE,NM_144772.3,,6,NM_144772.3.6,NAXE
15+
chr1_MU273335v1_fix,ncbiRefSeq.2022-10-28,3UTR,152642,152856,.,+,.,NAXE,NM_144772.3,,6,NM_144772.3.6,NAXE
16+
chr1_MU273335v1_fix,ncbiRefSeq.2022-10-28,start_codon,150362,150364,.,+,0,NAXE,NM_144772.3,,1,NM_144772.3.1,NAXE
17+
chr1_MU273335v1_fix,ncbiRefSeq.2022-10-28,stop_codon,152639,152641,.,+,0,NAXE,NM_144772.3,,6,NM_144772.3.6,NAXE
69.7 KB
Loading

notebooks/test_line.gtf

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
chr1_MU273335v1_fix ncbiRefSeq.2022-10-28 transcript 150333 152856 . + . gene_id "NAXE"; transcript_id "NM_144772.3"; gene_name "NAXE";

0 commit comments

Comments
 (0)