Two datasets will be used for the competition:
- DEV dataset which is available here for development and experimenting.
- TEST dataset which will be kept secret until after the competition and will be used for final scoring.
Both datasets are composed from publicly available images which were posted online under CC-BY license.
DEV dataset is defined by dev_dataset.csv
which contains URLs of the images along with bounding boxes
and classification labels.
dev_dataset.csv
is a table in
CSV
format with the following columns:
- ImageId - id of the image.
- URL - URL of the image.
- x1, y1, x2, y2 - bounding box of the area of interest in the image. Bounding box is relative, which means that all coordinates are between 0 and 1.
- TrueLabel - true label of the image.
- TargetClass - label for targeted adversarial attack.
- OriginalLandingURL - original landing page where this image was found.
- License - licence under which image was distributed by author.
- Author - author of the image.
- AuthorProfileURL - URL of the author's profile.
Dataset is labelled with ImageNet labels. Specific values of labels are compatible with pre-trained Inception models, which are available as a part of TF-Slim. In particular pre-trained Inception v3 and InceptionResnet v2 could be used to classify dataset with high accuracy.
dev_dataset.csv
contains only URLs of the images.
Actual images have to be downloaded before being used for experiments.
download_images.py
is a Python program which downloads images for all
records in dev_dataset.csv
. Usage:
# Replace CSV_FILE with path to dev_dataset.csv
CSV_FILE=dev_dataset.csv
# Replace OUTPUT_DIR with path to directory where all images should be stored
OUTPUT_DIR=images
# Download images
python download_images.py --input_file=${CSV_FILE} --output_dir=${OUTPUT_DIR}
All downloaded images will be cropped according to the bounding boxes in
dev_dataset.csv
and resized to 299x299 pixels.
Each image will be saved in PNG format with filename IMAGE_ID.png
where IMAGE_ID
is the id of the image from dev_dataset.csv
.