A simple script (generated with GPT-4) to download and extract the SA-1B dataset files in parallel.

Requirements:
- Python >= 3.6
- requests >= 2.0
Install with:

```
pip install requests
```
The download.py script reads file names and URLs from an input file (sa1b_links.txt in the example below), downloads the archives, and extracts the images and mask annotations:

```
python download.py --processes 4 --input_file sa1b_links.txt --raw_dir raw --images_dir images --masks_dir annotations
```
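Each line of the input file is expected to pair one archive's file name with its download URL (the links file distributed for SA-1B is tab-separated). Below is a minimal parsing sketch under that assumption; read_links is a hypothetical helper, not part of download.py:

```python
# Sketch: read "<file_name><TAB><url>" pairs from the links file.
# Assumes the tab-separated layout of the SA-1B links file; adjust if yours differs.
def read_links(path):
    pairs = []
    with open(path) as f:
        for line in f:
            line = line.strip()
            if not line or line.startswith("file_name"):
                continue  # skip blank lines and an optional header row
            name, url = line.split(maxsplit=1)  # splits on the tab between name and URL
            pairs.append((name, url))
    return pairs
```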
The download.py script takes the following command-line arguments:
- `--processes`: number of processes to use for downloading and extracting files (default: 4)
- `--input_file`: path to the input file containing file names and URLs (default: 'sa1b.txt')
- `--raw_dir`: directory to store downloaded archives (default: 'raw')
- `--images_dir`: directory to store extracted .jpg image files (default: 'images')
- `--masks_dir`: directory to store extracted .json mask files (default: 'annotations')
- `--skip_existing`: skip extraction if the file has already been extracted (default: False)
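For orientation, here is a minimal sketch of how the parallel download-and-extract step could be implemented with multiprocessing, requests, and tarfile. It illustrates the approach rather than reproducing download.py: the helper name download_and_extract, the hard-coded settings, and the assumption that each archive is a .tar whose .jpg members are images and .json members are mask annotations are all hypothetical.

```python
import os
import tarfile
from multiprocessing import Pool

import requests


def download_and_extract(task):
    """Download one archive, then route .jpg and .json members to separate dirs."""
    name, url, raw_dir, images_dir, masks_dir = task
    archive_path = os.path.join(raw_dir, name)

    # Stream the archive to disk so it is never held fully in memory.
    resp = requests.get(url, stream=True, timeout=60)
    resp.raise_for_status()
    with open(archive_path, "wb") as f:
        for chunk in resp.iter_content(chunk_size=1 << 20):
            f.write(chunk)
    resp.close()

    # Unpack images and mask annotations into their target directories.
    with tarfile.open(archive_path) as tar:
        for member in tar.getmembers():
            if member.name.endswith(".jpg"):
                tar.extract(member, images_dir)
            elif member.name.endswith(".json"):
                tar.extract(member, masks_dir)
    return name


if __name__ == "__main__":
    # Hypothetical hard-coded settings; download.py takes these from its CLI flags.
    raw_dir, images_dir, masks_dir = "raw", "images", "annotations"
    for d in (raw_dir, images_dir, masks_dir):
        os.makedirs(d, exist_ok=True)

    # Same tab-separated name/URL layout assumed in the parsing sketch above.
    with open("sa1b_links.txt") as f:
        links = [line.split() for line in f
                 if line.strip() and not line.startswith("file_name")]

    tasks = [(name, url, raw_dir, images_dir, masks_dir) for name, url in links]
    with Pool(processes=4) as pool:
        for finished in pool.imap_unordered(download_and_extract, tasks):
            print("finished", finished)
```

Streaming the HTTP response and using imap_unordered keep memory usage flat and report each archive as soon as it finishes; a real run would also want the --skip_existing check and retry/error handling for interrupted downloads.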