Please use the following text to cite this item or export to a predefined format:
Picek, Lukáš; Bolon, Isabelle; Durso, Andrew M. and Castañeda, Rafael Ruiz de, 2021,
SnakeCLEF 2021, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL),
http://hdl.handle.net/20.500.12800/1-4773.
| dc.contributor.author | Picek, Lukáš |
| dc.contributor.author | Bolon, Isabelle |
| dc.contributor.author | Durso, Andrew M. |
| dc.contributor.author | Castañeda, Rafael Ruiz de |
| dc.date.accessioned | 2022-06-21T12:11:15Z |
| dc.date.available | 2022-06-21T12:11:15Z |
| dc.date.issued | 2021-01-01 |
| dc.description | The dataset with 409,679 images belonging to 772 snake species from 188 countries and all continents (386,006 images with labels targeted for development and 23,673 images without labels for testing). In addition, we provide a simple train/val (90% / 10%) split to validate preliminary results while ensuring the same species distributions. Furthermore, we prepared a compact subset (70,208 images) for fast prototyping. The test set data consists of 23,673 images submitted to the iNaturalist platform within the "first four months of 2021. All data were gathered from online biodiversity platforms (i.e., iNaturalist, HerpMapper) and further extended by data scraped from Flickr. The provided dataset has a heavy long-tailed class distribution, where the most frequent species (Thamnophis sirtalis) is represented by 22,163 images and the least frequent by just 10 (Achalinus formosanus). |
| dc.identifier.uri | http://hdl.handle.net/20.500.12800/1-4773 |
| dc.language.iso | zxx |
| dc.publisher | CEUR Workshop Proceedings (CEUR-WS.org) |
| dc.relation.isreferencedby | https://dspace5.zcu.cz/bitstream/11025/47274/1/paper-125.pdf |
| dc.rights | BSD 3-Clause "New" or "Revised" license |
| dc.rights.label | PUB |
| dc.rights.uri | http://opensource.org/licenses/BSD-3-Clause |
| dc.subject | LifeCLEF |
| dc.subject | SnakeCLEF |
| dc.subject | global health |
| dc.subject | epidemiology |
| dc.subject | snake bite |
| dc.subject | snake |
| dc.subject | reptile |
| dc.subject | benchmark |
| dc.subject | biodiversity |
| dc.subject | machine learning |
| dc.subject | computer vision |
| dc.subject | Classification |
| dc.title | SnakeCLEF 2021 |
| dc.type | corpus |
| edm.type | IMAGE |
| local.branding | LINDAT / CLARIAH-CZ |
| local.contact.person | Lukáš Picek lukaspicek@gmail.com University of West Bohemia, Department of Cybernetics |
| local.dataProvider | iNaturalist, HerpMapper, Flickr |
| local.files.count | 6 |
| local.files.size | 65398371213 |
| local.has.files | yes |
| local.language.name | Nolinguistic content |
| local.sponsor | nationalFunds LM2018101 Ministerstvo školství, mládeže a tělovýchovy České republiky LINDAT/CLARIAH-CZ: Digitální výzkumná infrastruktura pro jazykové technologie, umění a humanitní vědy |
| local.sponsor | Other University of West Bohemia SGS-2019-02 Studentská Grantová Soutež |
| local.sponsor | nationalFunds QS04-20 Universitaires de Genève Fondation privée des Hôpitaux Universitaires de Genève |
Collections
Files in this item
- Name
- ViT_PreProcessing-ops11-preprocessing-int-dynam_graph.onnx
- Size
- 1.13 GB
- Format
- application/octet-stream
- Description
- Unknown
- MD5
- 838ae0f8ed19ffeee89aba5fa8b50956

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- training_data.tar.gz
- Size
- 59.69 GB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- a39f0a73e6f35b4222e1aa1878ef20ca

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- min-train_metadata.csv
- Size
- 13.1 MB
- Format
- application/octet-stream
- Description
- Unknown
- MD5
- 52ec43c8595a3835da6aa16364aff75e

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- train_metadata.csv
- Size
- 73.24 MB
- Format
- application/octet-stream
- Description
- Unknown
- MD5
- f981dd4289cac2ea05f472b584668beb

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- BaseLine-EfficientNet-B0-224.ipynb
- Size
- 29.32 KB
- Format
- application/octet-stream
- Description
- Unknown
- MD5
- 1fff5bc5ef4c896d6b0ab12cb2c9a37f

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- species_to_country_mapping.csv
- Size
- 1.48 MB
- Format
- application/octet-stream
- Description
- Unknown
- MD5
- 7f45509480abef9df3f69e6685dc24ed

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz

