dc.contributor.author | Straková, Jana |
dc.date.accessioned | 2024-09-19T14:56:39Z |
dc.date.available | 2024-09-19T14:56:39Z |
dc.date.issued | 2024-08-30 |
dc.identifier.uri | http://hdl.handle.net/11234/1-5678 |
dc.description | This is a trained model for the supervised machine learning tool NameTag 3 (https://ufal.mff.cuni.cz/nametag/3/), trained jointly on several NE corpora: English CoNLL-2003, German CoNLL-2003, Dutch CoNLL-2002, Spanish CoNLL-2002, Ukrainian Lang-uk, and Czech CNEC 2.0, all harmonized to flat NEs with 4 labels: PER, ORG, LOC, and MISC. NameTag 3 is an open-source tool for both flat and nested named entity recognition (NER). It identifies proper names in text and classifies them into a set of predefined categories, such as names of persons, locations, and organizations. The model documentation can be found at https://ufal.mff.cuni.cz/nametag/3/models#multilingual-conll. |
dc.language.iso | eng |
dc.language.iso | deu |
dc.language.iso | nld |
dc.language.iso | spa |
dc.language.iso | ukr |
dc.language.iso | ces |
dc.publisher | Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) |
dc.rights | Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ |
dc.source.uri | https://ufal.mff.cuni.cz/nametag/3/ |
dc.subject | named entity recognition |
dc.subject | NER |
dc.subject | NameTag |
dc.subject | multilingual |
dc.title | NameTag 3 Multilingual CoNLL Model |
dc.type | languageDescription |
metashare.ResourceInfo#ContentInfo.mediaType | text |
metashare.ResourceInfo#ContentInfo.detailedType | mlmodel |
dc.rights.label | PUB |
has.files | yes |
branding | LINDAT / CLARIAH-CZ |
demo.uri | https://lindat.mff.cuni.cz/services/nametag/ |
contact.person | Jana Straková strakova@ufal.mff.cuni.cz Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) |
sponsor | Grantová agentura České republiky GX20-16819X LUSyD – Language Understanding: from Syntax to Discourse nationalFunds |
size.info | 1.7 GB |
files.size | 1811630949 |
files.count | 1 |
Files in this item
License category:
License: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
Publicly Available
- Name: nametag3-multilingual-conll-240830.zip
- Size: 1.69 GB
- Format: application/zip
- Description: Unknown
- MD5: eba18d3d76633aa6420caf8dd8ae2166
- nametag3-multilingual-conll-240830
- LICENSE (20 kB)
- README.md (2 kB)
- options.json (1 kB)
- udpipe.tokenizer (342 kB)
- checkpoint.weights.h5 (2 GB)
- mappings.pickle (330 B)
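The record lists both a byte size (files.size) and an MD5 checksum for the archive, so a download can be checked before unpacking. The following is a minimal sketch of such a check, not part of the NameTag 3 distribution: the function names `md5_of_file` and `verify_download` are hypothetical, and the local path is an assumption that the archive was saved under its listed name in the current directory.

```python
import hashlib
from pathlib import Path

# Values copied from the record above.
EXPECTED_MD5 = "eba18d3d76633aa6420caf8dd8ae2166"
EXPECTED_SIZE = 1811630949  # bytes, from files.size


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks to bound memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_md5=EXPECTED_MD5, expected_size=EXPECTED_SIZE):
    """Return True only when both the byte size and the MD5 digest match."""
    path = Path(path)
    # Cheap check first: a truncated download fails here without hashing.
    if path.stat().st_size != expected_size:
        return False
    return md5_of_file(path) == expected_md5


if __name__ == "__main__":
    # Assumed local path: the archive saved under its listed name.
    archive = Path("nametag3-multilingual-conll-240830.zip")
    if archive.exists():
        print("OK" if verify_download(archive) else "MISMATCH")
    else:
        print(f"{archive} not found; download it from the record first")
```

Comparing the size before hashing is a design choice that rejects incomplete downloads quickly; the MD5 digest then guards against corruption, although it should not be treated as a security guarantee.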