Extended CLEF eHealth 2013-2015 IR Test Collection
Please use the following text to cite this item or export to a predefined format:
Pecina, Pavel and Saleh, Shadi, 2019,
Extended CLEF eHealth 2013-2015 IR Test Collection, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL),
http://hdl.handle.net/11234/1-2925.
Authors
Item identifier
Project URL
Date issued
2019-04-14
Size
7000000000 bytes
Description
This package contains an extended version of the test collection used in the CLEF eHealth Information Retrieval tasks in 2013--2015. Compared to the original version, it provides complete query translations into Czech, French, German, Hungarian, Polish, Spanish and Swedish and additional relevance assessment.
Acknowledgement
Grant Agency of Czech Republic
Project code:GAP103/12/G084
Project name:Center for large-scale multi-modal data interpretation
Collections
This item isPublicly Available
and licensed under:
Files in this item
- Name
- README
- Size
- 13.84 KB
- Format
- text/plain
- Description
- Text
- MD5
- 272d35f086025ab11d55ab97447863db

Extended CLEF eHealth Test Collection for
Cross-lingual Information Retrieval in the Medical Domain
version 1.0 (April 2019)
1. Description
This package contains an extended version of the test collection used in the
CLEF eHealth Information Retrieval tasks in 2013--2015. Compared to the original
version, it provides complete query translations into Czech, French, German,
Hungarian, Polish, Spanish and Swedish and additional relevance assessment. This
dataset is described in [5] and available from the LINDAT/CLARIN repository
http://hdl.handle.net/11234/1-2925.
2. Preamble
2.1 Source
The data is adopted from the CLEF eHealth Information retrieval tasks
2013-2015 (https://sites.google.com/site/clefehealth/) organized under the CLEF
initiative (http://clef-initiative.eu).
2.2 License
The original data (document collection, queries, relevance assessments) is
available under the original license (see
http://catalog.elra.info/product_info.php?products_id=1218 and
https://github.com/CLEFeHealth)
The newly added data is made available under the terms of the Creative
Commons Attribution-Noncommercial (CC-BY-NC) license, version 4.0 international.
You may use them for academic research and all non-commercial purposes as long
as the authors (cf. Authors, below) are properly credited and sources
acknowledged (cf. the Acknowledgment section). See
http://creativecommons.org/licenses/by-nc/4.0/ for a full description and
explanation of the licensing terms.
3. Data
This package contains the original English document collection (also
available from http://catalog.elra.info/product_info.php?products_id=1218), the
original queries in English and relevance assessments (also available from
https://github.com/CLEFeHealth), human-translated queries into 7 languages, and
machine-translated queries from the 7 languages back to . . .- Name
- data.tgz
- Size
- 6.31 GB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 834aa7d9f1f37acfb8a4b85be7a2c894

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz

