Skip to content

Commit

Permalink
Updated README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
jiadong324 committed Jan 21, 2022
1 parent 39fbcea commit 9ecbe8c
Show file tree
Hide file tree
Showing 5 changed files with 20,950 additions and 3 deletions.
6 changes: 3 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -10,7 +10,7 @@ This repository provides support for SVision downstream CSV filter and analysis.

Please install pandas, numpy and [intervaltree](https://pypi.org/project/intervaltree/).

The call set for the paper is under ./supports.
The call set for the paper is ./supports/HG00733.svision.s5.graph.vcf.

#### Prepare config file

Expand All @@ -30,7 +30,7 @@ The config file requires:


```
python FilterMain.py -v svision.vcf -g graph_exact_match.txt -w ./workdir -i 0,3
python FilterMain.py supports/HG00733.svision.s5.graph.vcf -g ./supports/HG00733.graph_exactly_match.txt -w ./output_dir -i 0,3
```

This will generate three files:
Expand All @@ -39,5 +39,5 @@ This will generate three files:

*prefix*.Raw-CSVs.tsv: SVision CSVs filtered by graph structures.

*prefix*.HQ-CSVs.tsv: CSVs additionally filtered by simple repeats.
*prefix*.HQ-CSVs.tsv: CSVs additionally filtered by tandem repeats.

80 changes: 80 additions & 0 deletions supports/HG00733.HQ-CSVs.tsv
Original file line number Diff line number Diff line change
@@ -0,0 +1,80 @@
chr1 43593624 43594257 522 10 48.03 SINE
chr1 70446312 70446751 676_1 4 66.29 SINE
chr1 81194666 81195870 748 12 57.14 LINE
chr1 175231433 175232798 1167 12 33.19 LTR
chr1 187495696 187497594 1219 15 21.6 LTR
chr1 205209504 205209664 1311 18 0.0 None
chr1 246817940 246819350 1677 20 21.56 SINE
chr2 16225124 16225236 2006 22 0.0 None
chr2 45061833 45079498 2140 23 21.25 LINE
chr2 61473618 61476320 2214 12 11.36 SINE
chr2 65600546 65601136 2242 6 100.0 Retroposon
chr2 116216766 116224724 2555 14 33.02 LINE
chr2 124294179 124295658 2625 12 17.04 SINE
chr2 195114663 195119917 2991 26 13.99 LTR
chr3 41320665 41321935 3691 27 36.61 LTR
chr3 80013716 80016128 3931 23 6.51 LTR
chr3 95746924 95752141 4017 28 19.78 LINE
chr3 128657347 128657553 4187 29 63.11 SINE
chr3 146667402 146677317 4277 28 28.67 LINE
chr3 162807955 162829868 4351 15 14.14 LINE
chr4 69220529 69274512 5571 12 6.36 LTR
chr4 92646210 92649022 5702 40 15.36 LINE
chr4 145693889 145694397 5955 18 100.0 Retroposon
chr4 145694076 145694384 5956 10 100.0 Retroposon
chr4 162657531 162658465 6068 12 45.5 LINE
chr5 28932544 28934950 6699 23 6.32 LINE
chr5 51925593 51927635 6954 40 74.14 LTR
chr5 83060043 83060167 7124 16 0.0 None
chr5 85639621 85662534 7142 23 21.56 LINE
chr5 148173476 148175199 7448 27 16.42 SINE
chr5 155906888 155907080 7472 47 100.0 DNA
chr6 89213914 89214231 8293 10 84.23 LINE
chr6 94031356 94031699 8312 10 99.42 LINE
chr6 167220854 167286115 8809 23 6.63 LINE
chr7 32684001 32712828 9365 23 20.44 LINE
chr7 102081266 102084757 9851 27 18.99 LINE
chr8 47980291 47981120 10763 35 38.48 SINE
chr8 84952507 84962077 10943 22 26.22 LINE
chr9 29591760 29592481 11568 15 30.65 DNA
chr9 62806327 62822799 11681 27 13.96 LINE
chr9 74283222 74283473 11739 51 64.94 LINE
chr9 86539643 86541026 11808 15 41.87 LINE
chr9 105054361 105055065 11946 12 43.18 LTR
chr9 110262294 110274913 11980 28 6.74 LINE
chr10 57497186 57498223 13092 15 21.89 LINE
chr10 98735100 98735257 13271 22 41.4 LTR
chr10 125502145 125508663 13443 14 30.1 DNA
chr11 59277816 59282714 14188 49 56.78 LTR
chr11 94233146 94238782 14472_1 4 95.49 LINE
chr11 99819283 99820576 14510 54 27.15 LTR
chr11 126632640 126632771 14670 29 0.0 None
chr11 126632686 126632772 14671 55 0.0 None
chr12 25106528 25107673 15025 47 20.26 DNA
chr12 39466255 39466532 15134 15 46.21 DNA
chr12 65424994 65425174 15289 29 100.0 LINE
chr12 71315487 71316538 15316 12 8.47 SINE
chr12 77990099 77994485 15349 12 11.79 LTR
chr12 80452280 80464114 15364 23 18.91 LINE
chr12 124438675 124439617 15625 6 100.0 Low_complexity
chr12 132954337 132954479 15902 2 98.59 Low_complexity
chr13 79819615 79843210 16357 23 15.27 LINE
chr13 88458707 88460622 16430 14 18.59 LINE
chr13 113176446 113176543 16712 2 64.95 SINE
chr14 25142261 25144518 16885 27 13.42 SINE
chr14 47854765 47856277 17005 12 11.71 SINE
chr14 65375820 65376415 17087 15 50.42 SINE
chr14 65791559 65791687 17091 5 0.0 None
chr15 91438350 91446225 17865 28 19.59 LINE
chr15 95148445 95149529 17907 23 28.32 LTR
chr16 3632630 3634495 18122 4 16.51 SINE
chr16 48871382 48872323 18585 15 31.77 LINE
chr16 69727929 69728992 18686 12 27.28 SINE
chr16 81764812 81765622 18752 4 36.67 SINE
chr17 34854678 34855853 19449 12 9.28 DNA
chr19 16256389 16256892 20839 2 55.27 SINE
chr20 44677206 44683572 21975 23 5.23 LINE
chr20 61329281 61329476 22123 2 0.0 None
chr21 45055057 45055144 22747_1 8 0.0 None
chr22 18136755 18146780 22983 28 8.71 LTR
chr22 20075212 20075748 23001 6 50.93 SINE
122 changes: 122 additions & 0 deletions supports/HG00733.graph_exactly_match.txt

Large diffs are not rendered by default.

262 changes: 262 additions & 0 deletions supports/HG00733.svision.s5.graph.Raw-CSVs.tsv
Original file line number Diff line number Diff line change
@@ -0,0 +1,262 @@
chrom start end id graphid
chr1 875829 876429 5_1 1
chr1 1382229 1382905 46_1 2
chr1 4144330 4144902 156_1 4
chr1 14492637 14492882 274 5
chr1 18049516 18049997 311 2
chr1 23392264 23392801 361_1 6
chr1 30655194 30655767 436 2
chr1 31431744 31432233 447 8
chr1 41544599 41545712 512_1 4
chr1 43593624 43594257 522 10
chr1 55002739 55003091 592_1 4
chr1 70446312 70446751 676_1 4
chr1 81194666 81195870 748 12
chr1 154713957 154714648 1053 2
chr1 167056411 167057034 1125 6
chr1 175231433 175232798 1167 12
chr1 175650712 175651218 1171_1 13
chr1 186857071 186857179 1215 14
chr1 187495696 187497594 1219 15
chr1 194669488 194669624 1260 16
chr1 202204190 202205958 1290 4
chr1 205209504 205209664 1311 18
chr1 206923746 206924783 1325 4
chr1 243979526 243979761 1603_1 2
chr1 246817940 246819350 1677 20
chr2 909659 910332 1747_1 2
chr2 1861595 1861997 1800_1 21
chr2 4108493 4109072 1870 4
chr2 16225124 16225236 2006 22
chr2 45061833 45079498 2140 23
chr2 61473618 61476320 2214 12
chr2 65600546 65601136 2242 6
chr2 71617736 71618760 2264 4
chr2 102350215 102353227 2478 6
chr2 116216766 116224724 2555 14
chr2 124294179 124295658 2625 12
chr2 195114663 195119917 2991 26
chr2 233176292 233176815 3243 4
chr2 238908668 238909242 3322_2 4
chr2 241589975 241591269 3423 4
chr3 38084270 38084611 3667_1 2
chr3 41320665 41321935 3691 27
chr3 72337362 72337700 3864 2
chr3 80013716 80016128 3931 23
chr3 95746924 95752141 4017 28
chr3 128657347 128657553 4187 29
chr3 146667402 146677317 4277 28
chr3 162807955 162829868 4351 15
chr3 195941352 196000896 4579_1 30
chr4 566615 566968 4639 31
chr4 1045973 1046897 4671_1 6
chr4 1761392 1761577 4726_1 2
chr4 3814791 3815871 4777 6
chr4 40294166 40295804 5067 13
chr4 69220529 69274512 5571 12
chr4 92646210 92649022 5702 40
chr4 145693889 145694397 5955 18
chr4 145694076 145694384 5956 10
chr4 146285801 146287244 5959_1 4
chr4 150250849 150251650 5980 6
chr4 162657531 162658465 6068 12
chr5 344375 345564 6405 2
chr5 1539454 1540235 6462 6
chr5 3276084 3276473 6505 8
chr5 3323120 3324424 6507 41
chr5 3659930 3660273 6520_1 8
chr5 7371374 7372887 6550 13
chr5 28932544 28934950 6699 23
chr5 35708515 35708708 6735_1 2
chr5 51925593 51927635 6954 40
chr5 83060043 83060167 7124 16
chr5 85639621 85662534 7142 23
chr5 148173476 148175199 7448 27
chr5 155906888 155907080 7472 47
chr5 178585329 178585742 7610 8
chr5 181045776 181046363 7665 8
chr6 371034 372448 7676_2 6
chr6 2504651 2505153 7740_1 6
chr6 34071333 34072869 7945 4
chr6 35169470 35169826 7955_1 2
chr6 44044437 44045331 8025 6
chr6 89213914 89214231 8293 10
chr6 94031356 94031699 8312 10
chr6 149983637 149984559 8591 21
chr6 157147192 157147952 8653_1 4
chr6 157238317 157238525 8655 2
chr6 157270152 157270698 8656 2
chr6 160218388 160219387 8711 48
chr6 163329259 163329660 8758 2
chr6 167220854 167286115 8809 23
chr6 167651412 167652230 8843 4
chr6 167990008 167990237 8852_1 8
chr6 168659409 168660053 8899 1
chr6 170113592 170114254 8959 17
chr6 170140965 170141359 8966_1 4
chr7 494561 494801 9059 8
chr7 588425 589084 9062 2
chr7 8152568 8153003 9222_1 6
chr7 32684001 32712828 9365 23
chr7 39040051 39040207 9410_1 2
chr7 45610584 45611184 9446 6
chr7 85460318 85462084 9737 23
chr7 98810994 98811636 9816 41
chr7 100957465 100959069 9827 2
chr7 102081266 102084757 9851 27
chr7 153505688 153506577 10153 2
chr7 155792014 155792516 10216_1 2
chr7 158048548 158049239 10297 4
chr7 158915036 158917198 10363 4
chr7 159202671 159203771 10399_1 6
chr8 47980291 47981120 10763 35
chr8 84952507 84962077 10943 22
chr8 100426436 100426906 11021_1 2
chr8 130897680 130898405 11169 49
chr8 138205959 138206980 11219 4
chr8 141873771 141874474 11281 1
chr8 142012566 142013221 11291 4
chr8 143217982 143218404 11344 8
chr8 144020780 144023635 11367 50
chr8 144022028 144023618 11369 28
chr8 144125598 144126078 11371 8
chr8 144946514 144947412 11406 4
chr9 4713900 4714239 11453_1 2
chr9 29591760 29592481 11568 15
chr9 62806327 62822799 11681 27
chr9 74283222 74283473 11739 51
chr9 86539643 86541026 11808 15
chr9 89253936 89254765 11846 4
chr9 105054361 105055065 11946 12
chr9 110262294 110274913 11980 28
chr9 135096025 135096364 12230 6
chr9 137502684 137502739 12341 2
chr9 137669315 137669580 12350 2
chr10 717730 718502 12411_1 2
chr10 2392245 2392681 12503 8
chr10 3314409 3315277 12540_1 4
chr10 7526452 7526998 12607 4
chr10 11283800 11284554 12656_1 6
chr10 52833982 52835352 13059_1 40
chr10 57497186 57498223 13092 15
chr10 98735100 98735257 13271 22
chr10 123254343 123255576 13422 6
chr10 125502145 125508663 13443 14
chr10 127797394 127797973 13473 2
chr10 130790055 130791201 13519 4
chr10 131176223 131177195 13527 4
chr10 132364325 132364843 13577_1 6
chr10 132821799 132824744 13619 6
chr10 133010699 133011809 13647_1 6
chr10 133065761 133066778 13659 6
chr11 3096908 3097168 13822_1 2
chr11 36507099 36507635 14031 46
chr11 59277816 59282714 14188 49
chr11 92705288 92706101 14455_1 6
chr11 94233146 94238782 14472_1 4
chr11 99819283 99820576 14510 54
chr11 102730742 102731363 14524 2
chr11 126632640 126632771 14670 29
chr11 126632686 126632772 14671 55
chr11 128930488 128931006 14690_1 6
chr11 133082371 133082616 14722_1 2
chr11 133354198 133354361 14724 2
chr12 25106528 25107673 15025 47
chr12 39466255 39466532 15134 15
chr12 65424994 65425174 15289 29
chr12 71315487 71316538 15316 12
chr12 77990099 77994485 15349 12
chr12 80452280 80464114 15364 23
chr12 121255185 121255398 15590 14
chr12 124438675 124439617 15625 6
chr12 127696569 127697336 15667 4
chr12 128473093 128473755 15678 4
chr12 128708092 128708919 15683_1 4
chr12 130646038 130646478 15734 2
chr12 132954337 132954479 15902 2
chr13 35957933 35958563 16071 6
chr13 79819615 79843210 16357 23
chr13 85730003 85730414 16404 2
chr13 88458707 88460622 16430 14
chr13 108278557 108278735 16557_1 2
chr13 111567871 111568511 16636_1 4
chr13 113176446 113176543 16712 2
chr13 114155594 114156404 16795 6
chr14 25142261 25144518 16885 27
chr14 47854765 47856277 17005 12
chr14 65375820 65376415 17087 15
chr14 65791559 65791687 17091 5
chr14 100526670 100526977 17270 6
chr14 104214879 104215701 17331_1 4
chr14 105477654 105477838 17393 2
chr15 40050449 40051019 17556 6
chr15 91438350 91446225 17865 28
chr15 94051380 94052271 17897_1 6
chr15 95148445 95149529 17907 23
chr16 159885 160874 17997_1 4
chr16 818705 818950 18039_1 2
chr16 1185577 1186198 18078 2
chr16 3632630 3634495 18122 4
chr16 17071455 17071854 18217_1 6
chr16 48871382 48872323 18585 15
chr16 65321679 65322776 18668 17
chr16 69727929 69728992 18686 12
chr16 81764812 81765622 18752 4
chr16 83950270 83951069 18768_1 6
chr16 86464077 86464639 18812 4
chr16 89008042 89009232 18902_1 4
chr17 841373 841724 19024_1 2
chr17 6193888 6194529 19106 31
chr17 34854678 34855853 19449 12
chr17 41632907 41633812 19470 4
chr17 65499935 65500379 19572 4
chr17 81249165 81249564 19768 2
chr17 81425437 81426511 19773 4
chr17 82215620 82215738 19798_1 2
chr18 4510684 4511036 19905 2
chr18 13261934 13262972 19972 6
chr18 49067090 49067531 20152_1 6
chr18 59473638 59473905 20222_1 2
chr18 74204683 74204899 20309_1 31
chr18 74616745 74617186 20317_1 6
chr18 76966974 76967693 20349_1 4
chr18 79145203 79146652 20435_1 4
chr18 79200703 79200789 20437 2
chr19 350750 351963 20534 4
chr19 2713035 2714475 20637_1 4
chr19 6069985 6070267 20708_1 2
chr19 7152976 7153497 20736_1 6
chr19 16256389 16256892 20839 2
chr19 18441990 18442615 20859_1 4
chr19 33250056 33250964 21024_1 4
chr20 1877008 1877240 21375 31
chr20 19923970 19924690 21493 2
chr20 20337231 20337679 21498_1 2
chr20 20827812 20828165 21506 6
chr20 44404558 44405135 21973_2 4
chr20 44677206 44683572 21975 23
chr20 46788834 46789480 21987_1 2
chr20 61329281 61329476 22123 2
chr20 61331558 61331724 22124 2
chr20 62349655 62349852 22164 2
chr20 62509010 62510016 22181_1 4
chr20 63167415 63167596 22211 8
chr20 63948544 63948731 22238_1 8
chr20 64088566 64089018 22250_1 2
chr21 30570849 30571312 22556_1 4
chr21 39974421 39975159 22632_2 2
chr21 45055057 45055144 22747_1 8
chr21 45797255 45798798 22807 6
chr21 46054813 46055126 22845_1 2
chr22 18136755 18146780 22983 28
chr22 19202365 19202732 22991_1 6
chr22 20075212 20075748 23001 6
chr22 29119985 29120648 23056 2
chr22 35463370 35463836 23100 4
chr22 43980046 43980198 23205 2
chr22 48620585 48621284 23309 6
chr22 48757129 48757451 23324_1 2
chr22 49155766 49156162 23344_1 4
chr22 50276181 50276433 23405_1 2
chr22 50643623 50644313 23421_1 1
chr22 50683748 50683878 23422 2
Loading

0 comments on commit 9ecbe8c

Please sign in to comment.