Skip to content

v0.6.0

Compare
Choose a tag to compare
@bluegenes bluegenes released this 06 Apr 21:38
· 15 commits to main since this release
255299a

v0.6.0 modifies the gbsketch strategy to reduce the number of calls to the NCBI REST API (necessary due to policy changes, but a better strategy regardless). Rather than downloading all files via the API, we download a single file via the API that contains direct fetch links to all requested accessions. We then download accessions in parallel -- currently limited to 30 simultaneously. It also enables streaming processing for both gbsketch and urlsketch, reducing memory requirements. Finally, it improves batch restart for --keep-fasta by avoiding re-downloading and overwriting of completed downloads.

Note: gbsketch now relies on gzip processing (internal crc32 checks) rather than md5sum checks to ensure we have complete downloads. urlsketch will still check md5sums if they are provided.

What's Changed

Full Changelog: v0.5.0...v0.6.0