SnakeCLEF 2021
Please use the following text to cite this item or export to a predefined format:
Picek, Lukáš; Bolon, Isabelle; Durso, Andrew M. and Castañeda, Rafael Ruiz de, 2021,
SnakeCLEF 2021, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL),
http://hdl.handle.net/20.500.12800/1-4773.
Authors
Item identifier
Date issued
2021-01-01
Language(s)
Description
The dataset with 409,679 images belonging to 772 snake species from 188 countries and all continents (386,006 images with labels targeted for development and 23,673 images without labels for testing). In addition, we provide a simple train/val (90% / 10%) split to validate preliminary results while ensuring the same species distributions. Furthermore, we prepared a compact subset (70,208 images) for fast prototyping. The test set data consists of 23,673 images submitted to the iNaturalist platform within the "first four months of 2021.
All data were gathered from online biodiversity platforms (i.e., iNaturalist, HerpMapper) and further extended by data scraped from Flickr. The provided dataset has a heavy long-tailed class distribution, where the most frequent species (Thamnophis sirtalis) is represented by 22,163 images and the least frequent by just 10 (Achalinus formosanus).
Acknowledgement
Ministerstvo školství, mládeže a tělovýchovy České republiky
Project code:LM2018101
Project name:LINDAT/CLARIAH-CZ: Digitální výzkumná infrastruktura pro jazykové technologie, umění a humanitní vědy
SGS-2019-02
Project code:University of West Bohemia
Project name:Studentská Grantová Soutež
Universitaires de Genève
Project code:QS04-20
Project name:Fondation privée des Hôpitaux Universitaires de Genève
Collections
Files in this item
- Name
- ViT_PreProcessing-ops11-preprocessing-int-dynam_graph.onnx
- Size
- 1.13 GB
- Format
- application/octet-stream
- Description
- Unknown
- MD5
- 838ae0f8ed19ffeee89aba5fa8b50956

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- training_data.tar.gz
- Size
- 59.69 GB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- a39f0a73e6f35b4222e1aa1878ef20ca

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- min-train_metadata.csv
- Size
- 13.1 MB
- Format
- application/octet-stream
- Description
- Unknown
- MD5
- 52ec43c8595a3835da6aa16364aff75e

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- train_metadata.csv
- Size
- 73.24 MB
- Format
- application/octet-stream
- Description
- Unknown
- MD5
- f981dd4289cac2ea05f472b584668beb

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- BaseLine-EfficientNet-B0-224.ipynb
- Size
- 29.32 KB
- Format
- application/octet-stream
- Description
- Unknown
- MD5
- 1fff5bc5ef4c896d6b0ab12cb2c9a37f

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- species_to_country_mapping.csv
- Size
- 1.48 MB
- Format
- application/octet-stream
- Description
- Unknown
- MD5
- 7f45509480abef9df3f69e6685dc24ed

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz

