This is not the latest version of this item. The latest version can be found here.
Czech Named Entity Corpus 1.0
Please use the following text to cite this item:
Ševčíková, Magda; Žabokrtský, Zdeněk and Straková, Jana, 2007,
Czech Named Entity Corpus 1.0, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL),
http://hdl.handle.net/11858/00-097C-0000-0022-C73C-7.
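Authors
Ševčíková, Magda; Žabokrtský, Zdeněk; Straková, Jana
Item identifier
http://hdl.handle.net/11858/00-097C-0000-0022-C73C-7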
Date issued
2007
Size
6000 sentences
Language(s)
Czech
Description
The presented Czech Named Entity Corpus 1.0 is the first publicly available corpus providing a large body of manually annotated named entities in Czech sentences, including a fine-grained classification.
Acknowledgement
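Grantová agentura Akademie věd České republiky (Grant Agency of the Academy of Sciences of the Czech Republic)
Project code: 1ET101120503
Project name: Integrace jazykových zdrojů za účelem extrakce informací z přirozených textů (Integration of language resources for information extraction from natural-language texts)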
This item is Publicly Available.
Files in this item
- Name: Czech_Named_Entity_Corpus_1.0.zip
- Size: 9.13 MB
- Format: application/zip
- Description: Zip
- MD5: fe11eeb02e30a2709007ecb94770e3aa
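
The MD5 value listed for the archive can be used to check that a download arrived intact. The following is a minimal sketch in Python, not part of the corpus distribution: the function names `md5_of_file` and `verify_archive` are hypothetical, and the archive is assumed to sit in the current working directory under the file name given in this record.

```python
import hashlib
from pathlib import Path

# MD5 value as published in this item record.
EXPECTED_MD5 = "fe11eeb02e30a2709007ecb94770e3aa"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed local download location (hypothetical; adjust as needed).
    archive = Path("Czech_Named_Entity_Corpus_1.0.zip")
    if archive.exists():
        print("checksum OK" if verify_archive(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found")
```

MD5 is adequate here for detecting accidental corruption during transfer; it should not be treated as protection against deliberate tampering.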

- Czech_Named_Entity_Corpus_1.0
- README (3 kB)
- data
- tmt
- named_ent-92.tmt (301 kB)
- named_ent-6.tmt (227 kB)
- named_ent-43.tmt (235 kB)
- named_ent-81.tmt (265 kB)
- named_ent-59.tmt (257 kB)
- named_ent-97.tmt (261 kB)
- named_ent-32.tmt (244 kB)
- named_ent-70.tmt (234 kB)
- named_ent-110.tmt (268 kB)
- named_ent-48.tmt (222 kB)
- named_ent-21.tmt (279 kB)
- named_ent-86.tmt (303 kB)
- named_ent-37.tmt (305 kB)
- named_ent-10.tmt (206 kB)
- named_ent-75.tmt (246 kB)
- named_ent-115.tmt (243 kB)
- named_ent-26.tmt (237 kB)
- named_ent-64.tmt (217 kB)
- named_ent-104.tmt (266 kB)
- named_ent-15.tmt (287 kB)
- named_ent-53.tmt (219 kB)
- named_ent-91.tmt (281 kB)
- named_ent-69.tmt (260 kB)
- named_ent-5.tmt (328 kB)
- named_ent-42.tmt (262 kB)
- named_ent-109.tmt (232 kB)
- named_ent-80.tmt (251 kB)
- named_ent-58.tmt (226 kB)
- named_ent-96.tmt (266 kB)
- named_ent-31.tmt (358 kB)
- named_ent-47.tmt (243 kB)
- named_ent-85.tmt (277 kB)
- named_ent-20.tmt (229 kB)
- named_ent-36.tmt (234 kB)
- named_ent-74.tmt (228 kB)
- named_ent-114.tmt (266 kB)
- named_ent-25.tmt (211 kB)
- named_ent-63.tmt (256 kB)
- named_ent-103.tmt (227 kB)
- named_ent-79.tmt (300 kB)
- named_ent-14.tmt (243 kB)
- named_ent-52.tmt (251 kB)
- named_ent-90.tmt (292 kB)
- named_ent-4.tmt (302 kB)
- named_ent-68.tmt (217 kB)
- named_ent-41.tmt (233 kB)
- named_ent-108.tmt (317 kB)
- named_ent-19.tmt (270 kB)
- named_ent-57.tmt (234 kB)
- named_ent-30.tmt (246 kB)
- named_ent-95.tmt (239 kB)
- named_ent-9.tmt (260 kB)
- named_ent-46.tmt (219 kB)
- named_ent-84.tmt (272 kB)
- named_ent-35.tmt (258 kB)
- named_ent-73.tmt (270 kB)
- named_ent-113.tmt (315 kB)
- named_ent-24.tmt (304 kB)
- named_ent-89.tmt (233 kB)
- named_ent-62.tmt (248 kB)
- named_ent-102.tmt (248 kB)
- named_ent-13.tmt (219 kB)
- named_ent-78.tmt (228 kB)
- named_ent-51.tmt (246 kB)
- named_ent-118.tmt (219 kB)
- named_ent-29.tmt (252 kB)
- named_ent-67.tmt (206 kB)
- named_ent-3.tmt (261 kB)
- named_ent-40.tmt (240 kB)
- named_ent-107.tmt (261 kB)
- named_ent-18.tmt (305 kB)
- named_ent-56.tmt (223 kB)
- named_ent-94.tmt (274 kB)
- named_ent-8.tmt (231 kB)
- named_ent-45.tmt (263 kB)
- named_ent-83.tmt (277 kB)
- named_ent-34.tmt (307 kB)
- named_ent-99.tmt (266 kB)
- named_ent-72.tmt (251 kB)
- named_ent-112.tmt (301 kB)
- named_ent-88.tmt (311 kB)
- named_ent-23.tmt (263 kB)
- named_ent-61.tmt (257 kB)
- named_ent-101.tmt (270 kB)
- named_ent-39.tmt (262 kB)
- named_ent-77.tmt (274 kB)
- named_ent-12.tmt (245 kB)
- named_ent-50.tmt (260 kB)
- named_ent-117.tmt (272 kB)
- named_ent-28.tmt (238 kB)
- named_ent-66.tmt (233 kB)
- named_ent-2.tmt (275 kB)
- named_ent-106.tmt (259 kB)
- named_ent-17.tmt (292 kB)
- named_ent-55.tmt (221 kB)
- named_ent-93.tmt (249 kB)
- named_ent-7.tmt (235 kB)
- named_ent-44.tmt (218 kB)
- named_ent-82.tmt (283 kB)
- named_ent-33.tmt (320 kB)
- named_ent-98.tmt (265 kB)
- named_ent-71.tmt (233 kB)
- named_ent-111.tmt (257 kB)
- named_ent-49.tmt (241 kB)
- named_ent-22.tmt (242 kB)
- named_ent-87.tmt (325 kB)
- named_ent-60.tmt (234 kB)
- named_ent-100.tmt (299 kB)
- named_ent-38.tmt (253 kB)
- named_ent-76.tmt (204 kB)
- named_ent-11.tmt (278 kB)
- named_ent-116.tmt (275 kB)
- named_ent-27.tmt (280 kB)
- named_ent-1.tmt (229 kB)
- named_ent-65.tmt (250 kB)
- named_ent-105.tmt (277 kB)
- named_ent-16.tmt (244 kB)
- named_ent-54.tmt (227 kB)
- html
- named_ent.html (1 MB)
- plain
- named_ent_plain.txt (1 MB)
- xml_simple
- named_ent_xml_simple.txt (1 MB)
- tmt_split
- etest.tmt (2 MB)
- dtest.tmt (2 MB)
- train.tmt (22 MB)
- orig
- named_ent_orig.txt (1 MB)
- tmt
- tools
- statistics.pl (504 B)
- namedent_annotations_to_html.pl (3 kB)
- namedent_annotations_to_xml_simple.pl (559 B)
- compare_ne_outputs_v2.pl (14 kB)
- namedent_annotations_to_plain.pl (313 B)
- doc
- techrep-ne-2007.pdf (600 kB)
- doc.pdf (61 kB)
- statistics.txt (1 kB)
- doc.ps (137 kB)

