datadings.sets.ImageNet21k_write module
- usage: ImageNet21k_write.py [-h] [-o PATH] [-y] [-s] [-t 0-2]
[--compress [quality 0-100]] [--subsampling {444,422,420,440,411,Gray}] INDIR
Create ImageNet21k winter release data set files.
This tool will look for the following files in the input directory:
winter21_whole.tar.gz
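Before starting a long conversion it can help to confirm that the expected archive is present. The following is a minimal sketch, not part of datadings itself; the helper name `missing_source_files` is hypothetical, and only the file name comes from the list above.

```python
from pathlib import Path
import tempfile

# File name taken from the list above.
REQUIRED_FILES = ("winter21_whole.tar.gz",)


def missing_source_files(indir):
    """Return the required file names that are not regular files in indir.

    Hypothetical helper for illustration; it only checks for presence and
    does not verify the archive contents.
    """
    indir = Path(indir)
    return [name for name in REQUIRED_FILES if not (indir / name).is_file()]


if __name__ == "__main__":
    # Demonstrate on an empty temporary directory.
    with tempfile.TemporaryDirectory() as tmp:
        print(missing_source_files(tmp))
```

An empty result means every listed file was found; the tool's own verification step (see `--skip-verification`) remains the authoritative check.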
Note
Registration is required to download this dataset. Please visit the website to download it. If you experience issues downloading, consider using BitTorrent: https://academictorrents.com/details/8ec0d8df0fbb507594557bce993920442f4f6477
Important
For performance reasons, samples are read in the same order as they are stored in the source tar files. It is recommended to use the datadings-shuffle command to create a shuffled copy.
Important
Samples have the following keys:
"key"
"image"
"label"
"label_tree"
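As a rough sketch of how downstream code might guard against incomplete samples, the check below compares a sample dictionary against the four keys listed above. The helper name `missing_keys` and the example values are hypothetical; the value types stored under each key are not specified here, so the sketch only looks at key names.

```python
# Key names taken from the list above.
EXPECTED_KEYS = ("key", "image", "label", "label_tree")


def missing_keys(sample):
    """Return the expected keys that are absent from a sample dictionary."""
    return [name for name in EXPECTED_KEYS if name not in sample]


if __name__ == "__main__":
    # Placeholder values for illustration only; real samples come from
    # the data set files written by this tool.
    example = {"key": "placeholder", "image": b"", "label": 0}
    print(missing_keys(example))
```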
Positional arguments
INDIR Directory that contains dataset source files.
- options:
- -h, --help
show this help message and exit
- -o PATH, --outdir PATH
Output directory. Defaults to indir.
- -y, --no-confirm
Don’t require user interaction.
- -s, --skip-verification
Skip verification of source files.
- -t 0-2, --threads 0-2
Number of threads for conversion. 0 uses all available CPUs (default 1).
- --compress [quality 0-100]
Use JPEG compression with optional quality. Default quality is 85. Big images are resized to roughly fit 500x375.
- --subsampling {444,422,420,440,411,Gray}
Color subsampling factor used with the --compress option. 444 is forced for small images to preserve details.
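The options above can be combined into a single invocation. The sketch below assembles an argument list for use with `subprocess.run`; the function `build_command` is hypothetical, and starting the tool with `python -m datadings.sets.ImageNet21k_write` is an assumption based on the module name, so adjust the first elements to match how the script is installed on your system.

```python
import sys

# Choices taken from the --subsampling option above.
SUBSAMPLING_CHOICES = ("444", "422", "420", "440", "411", "Gray")


def build_command(indir, outdir=None, no_confirm=False,
                  skip_verification=False, threads=None,
                  compress=None, subsampling=None):
    """Build an argument list for the ImageNet21k writer (illustrative)."""
    # Assumption: the module can be run with "python -m <module name>".
    cmd = [sys.executable, "-m", "datadings.sets.ImageNet21k_write"]
    if outdir is not None:
        cmd += ["--outdir", str(outdir)]
    if no_confirm:
        cmd.append("--no-confirm")
    if skip_verification:
        cmd.append("--skip-verification")
    if threads is not None:
        cmd += ["--threads", str(threads)]
    if compress is not None:
        if not 0 <= int(compress) <= 100:
            raise ValueError("compress quality must be in 0-100")
        # Always pass an explicit quality so the value after --compress
        # is never mistaken for INDIR.
        cmd += ["--compress", str(compress)]
    if subsampling is not None:
        if str(subsampling) not in SUBSAMPLING_CHOICES:
            raise ValueError(f"unknown subsampling: {subsampling}")
        cmd += ["--subsampling", str(subsampling)]
    # The positional INDIR argument goes last.
    cmd.append(str(indir))
    return cmd


if __name__ == "__main__":
    # "/data/imagenet21k" is a placeholder path.
    print(build_command("/data/imagenet21k", no_confirm=True,
                        threads=0, compress=85, subsampling="420"))
```

The returned list can be passed to `subprocess.run(cmd, check=True)`. Passing an explicit quality is a deliberate choice here: because the quality is optional, a bare `--compress` placed directly before INDIR could be parsed ambiguously.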