-
Notifications
You must be signed in to change notification settings - Fork 2
Description
When uploading a Small Mutation using the command line ipr command, I get an 'ERROR gene finding failures for' message for records that contain all required fields plus hgvsProtein, hgvsCds and hgvsGenomic optional fields. If I remove the hgvsGenomic field only, I no longer get the gene finding failure error when uploading the report. I have reduced my dataset to a single record that can be used to reproduce the error. I also confirmed that the gene symbol is in the GraphKB database I am using.
I have a second case that does exhibits the same gene finding failure, but the removal of the hgvsGenomic line was not enough to have the gene be successfully found. Only the removal of all hgvs lines caused the gene to be found.
To Reproduce
I am running this from the pori_python repository at commit SHA: 3a7392b and have created a venv based on the instructions in the repository.
Steps to reproduce the behaviour:
- Run command 'ipr -c ~/testJsonImportingSmall.txt --ipr_url 'http://ipr/api' --password iprdemo --username iprdemo --graphkb_password secret --graphkb_username graphkb_importer --graphkb_url 'http://graphkb/api''
- with input 'testJsonImportingSmall.txt' as attached.
- See error:
Deprecated function 'create_report' called - use ipr_report instead
05-07-25 09:54:58 ipr INFO connecting to graphkb: http://graphkb/api
05-07-25 09:54:58 ipr INFO annotating 1 small mutations
05-07-25 09:54:59 ipr ERROR gene finding failures for ['TAS1R3']
05-07-25 09:54:59 ipr ERROR 1 gene finding failures for positional variants
05-07-25 09:54:59 ipr ERROR skipped 1 positional variants due to errors
05-07-25 09:54:59 ipr INFO matched 1 positional variants to 0 graphkb annotations
05-07-25 09:54:59 ipr INFO annotating 0 structural variants
05-07-25 09:54:59 ipr INFO matched 0 positional variants to 0 graphkb annotations
05-07-25 09:54:59 ipr INFO annotating 0 copy variants
05-07-25 09:54:59 ipr INFO Starting annotation of 0 copy category_variants
05-07-25 09:54:59 ipr INFO matched 0 copy category variants to 0 graphkb annotations
05-07-25 09:54:59 ipr INFO annotating 0 expression variants
05-07-25 09:54:59 ipr INFO Starting annotation of 0 expression category_variants
05-07-25 09:54:59 ipr INFO matched 0 expression variants to 0 graphkb annotations
05-07-25 09:54:59 ipr INFO fetching gene annotations
05-07-25 09:54:59 graphkb INFO fetching variant related genes list
05-07-25 09:54:59 graphkb INFO fetching oncogenes list
05-07-25 09:54:59 graphkb INFO fetching tumour supressors list
05-07-25 09:54:59 graphkb INFO fetching cancerGeneListMatch list
05-07-25 09:54:59 graphkb INFO fetching therapeutic associated genes lists
05-07-25 09:55:00 graphkb INFO Setting gene_info flags on 1 genes
05-07-25 09:55:00 ipr INFO generating analyst comments
05-07-25 09:55:01 ipr INFO Uploading to IPR http://ipr/api
Request payload size: 606 bytes
05-07-25 09:55:01 ipr INFO {'message': 'Report upload was successful', 'ident': 'd064abad-691d-4f79-814f-3a0990f6991f'}
05-07-25 09:55:01 ipr INFO made 109 requests to graphkb
05-07-25 09:55:01 ipr INFO average load 37 req/s
Expected behaviour
By removing the hgvsGenomic line, the gene finding failure is not longer present. I would not expect these optional input lines to have an effect on the gene finding. See attached testJsonImportingSmallNohgvsGenomic.txt and this is the log message:
ipr -c ~/testJsonImportingSmallNohgvsGenomic.txt --ipr_url 'http://ipr/api' --password iprdemo --username iprdemo --graphkb_password secret --graphkb_username graphkb_importer --graphkb_url 'http://graphkb/api'
Deprecated function 'create_report' called - use ipr_report instead
05-07-25 09:55:08 ipr INFO connecting to graphkb: http://graphkb/api
05-07-25 09:55:09 ipr INFO annotating 1 small mutations
05-07-25 09:55:09 ipr INFO matched 1 positional variants to 0 graphkb annotations
05-07-25 09:55:09 ipr INFO annotating 0 structural variants
05-07-25 09:55:09 ipr INFO matched 0 positional variants to 0 graphkb annotations
05-07-25 09:55:09 ipr INFO annotating 0 copy variants
05-07-25 09:55:09 ipr INFO Starting annotation of 0 copy category_variants
05-07-25 09:55:09 ipr INFO matched 0 copy category variants to 0 graphkb annotations
05-07-25 09:55:09 ipr INFO annotating 0 expression variants
05-07-25 09:55:09 ipr INFO Starting annotation of 0 expression category_variants
05-07-25 09:55:09 ipr INFO matched 0 expression variants to 0 graphkb annotations
05-07-25 09:55:09 ipr INFO fetching gene annotations
05-07-25 09:55:09 graphkb INFO fetching variant related genes list
05-07-25 09:55:09 graphkb INFO fetching oncogenes list
05-07-25 09:55:09 graphkb INFO fetching tumour supressors list
05-07-25 09:55:09 graphkb INFO fetching cancerGeneListMatch list
05-07-25 09:55:09 graphkb INFO fetching therapeutic associated genes lists
05-07-25 09:55:10 graphkb INFO Setting gene_info flags on 1 genes
05-07-25 09:55:10 ipr INFO generating analyst comments
05-07-25 09:55:12 ipr INFO Uploading to IPR http://ipr/api
Request payload size: 586 bytes
05-07-25 09:55:12 ipr INFO {'message': 'Report upload was successful', 'ident': 'aef54213-56c3-44c7-a4dc-85196ce8cc85'}
05-07-25 09:55:12 ipr INFO made 103 requests to graphkb
05-07-25 09:55:12 ipr INFO average load 33 req/s
Versions (please complete the following information):
-
OS: Ubuntu 22.04
-
Python: 3.12.4
-
Versions
-
- pori_python v0.1.2
-
- pori_ipr_api release/v8.0.1
-
- graphkb_api container: bcgsc/pori-graphkb-api:latest
-
GraphKB and IPR URL are a local kubernetes cluster at UofC.
Additional context
Similar behaviour exhibited for even smaller case. See testJsonImportingCdsSmall.txt and testJsonImportingCdsSmallNoHGVSLines.txt. I had to remove all hvgs lines from the input for gene finding to succeed.
testJsonImportingSmall.txt
testJsonImportingSmallNohgvsGenomic.txt
testJsonImportingCdsSmallNoHGVSLines.txt
testJsonImportingCdsSmall.txt