- added vignette
- removeLazyData
- DESCRIPTION updaates
- replacing url_exists with RCurl::url.exists
- added NEWS.md
- bug fix in fetch_kegg
- removed
min_count
andfdr_limit
arguments fromfunctional_enrichment
; filtering can be done afterwards - added a small Shiny app as an example of
fenr
- updates to vignette
- small fixes
- link to a separate GitHub Shiny app added
- added support for WikiPathways
- improved robustness
- more tests
fetch_reactome
provides two ways of retrieving data, via one downloadable file or via APIs
- Replacing ontologyIndex::get_ontology with a simpler parser
- Replacing KEGGREST with own simple API parsers
- Applying BiocStyle to the vignette
- improving test_functional_enrichment
- Style changes for BiocCheck
- Adding more tests
- Fixing a bug in
parse_kegg_genes
- Significant speed-up of enrichment by using Rfast::Hash in place of R lists
- KEGG improvements, recognizing flat file genes with no gene synonym
- Additional tests for Reactome
- Minor improvements and fixes
- Added functions
get_term_features
andget_feature_terms
to access data safely - HACK: BioPlanet server's SSL certificate expired, so need insecure download.
- Ditched large and clunky
Rfast
and using native R environments as fast hashes (see https://riptutorial.com/r/example/18339/environments-as-hash-maps) - A few tweaks and improvements
- Fixed a bug where there are some features at a term that are not present in the universe (all features). This could happen when the universe was particularly small. Potentially a serious bug.
- Added a fix to work correctly with integer feature IDs.
- New units tests added
- Pre-release Bioconductor version.
- BioPlanet database vanished from internet and there is no sign of it coming back. Removing all BioPlanet-related code and replacing BioPlanet with GO in the vignette and examples (this, alas, makes it longer to check).
- OK, it is back, but I keep GO examples and vignettes.
- Minor improvements to documentation.
- BioPlanet's tripod.nih.gov SSL certificate seems to be fixed, so reversing to the original read_csv code.
- Continuing issues with access to BioPlanet.
fetch_bp
example is now markeddonotrun
and testingfetch_bp
is removed to ensure smooth build and check even when BioPlanet server is down.
- Taking
BiocCheck
new warnings into account: adding@return
to data roxygens.
- Major overhaul following comments from Bioconductor's reviewer.
- The wording in the vignette was adjusted to more clearly convey the purpose of the package to users
- Rewritten the description in DESCRIPTION file to clearly convey the purpose of the package to users
- Removed BioPlanet for good, as their webpage is continuously down and the maintainer is not responding
- Removed KEGG from interactive example to speed up vignette building (GO and Reactome are sufficient for a simple example)
- Replaced yeast with simpler organisms in Wiki and KEGG tests to speed up testing
- Replaced yeast with simpler organisms in Wiki and KEGG examples to speed up checking
- Minor changes to prepare for Bioconductor release
- Reverting temporarily to readr version 1 to circumvent a vroom 1.6.4 bug
- First update after Bioconductor release
- Implemented changes to prevent the package from build/check fail, if one of the remote servers is not responding
- Moved from
httr
tohttr2
- Tests and examples now generate warnings in case of server failure
- Added tests for behaviour in case of a non-responsive server
- Extended test coverage to 100%, except for the interactive example
- Bug fixes, examples need on_error = "warn"
- Reinstated Bioplanet access, this time with graceful fail when the website is down.
- Minor code changes.
- Bug fix: if feature id - term id mapping is not unique (which can happen), features are duplicated in counting; fixed by
dplyr::distinct()
on mapping - Correction in vignette: using yeast genome for
topGO
, instead of human. - Improving test coverage
- Changed the Ensembl mapping file downloaded from Reactome to "Physical entity" mapping, as it contains gene symbols, in addition to the Ensembl IDs.
- Changed the name of GAF column
DB Object Synonym
fromgene_synonym
togene_id
for consistency with other methods. - Corrected Reactome test as it failed with multiple gene symbols per gene id.
- Replaced biomaRt with a single RESTful XML call; as biomaRt is used only once to obtain GO terms, this replacement reduced dependency footprint of the package
- Improved error handling with unresponsive servers - timeouts are now handled gracefully
- Further improving error handling, making sure
assert_url_path()
handles timeouts properly - Introduced on_error = "ignore" for test purposes
- Changed the way
assert_url_path()
handles some remote files - it turns out every time it was called, the entire file was unnecessary downloaded, leading to duplication. Now we only assert top directories. Should speed things up! - Increased default timeout to 30 s.
- Due to recurring issues with build and check on Bioconductor's machines, I have removed all database downloads from the vignette. Any glitch in the GO server, or simply an internet problem would cause the vignette build to crash. The GO-term information is now attached as data and loaded in the vignette.
- Made sure the package passes BUILD and CHECK with no internet connection.
- Correction in vignette: using yeast genome for
topGO
, instead of human (somehow it was not applied in 1.0.5).
- Go term namespace added to the information extracted by
fetch_go
.
- Attempted to fix a bizarre error message on Bioconductor's test machines with older version of MacOS. Windows and Linux are not affected; my laptop running Sequoia 5.2 does not show show errors. I suspect a memory leak in older systems. The error
vector memory limit of 64.0 Gb reached, see mem.maxVSize()
happened in the function parse_kegg_genes(), a flat-file parser for KEGG. It occurred around the call tidyr::separate(), which I replaced with an alternative approach. Will see if the error is fixed.