NameTag

NameTag is an open-source tool for named entity recognition (NER). NameTag identifies proper names in text and classifies them into predefined categories, such as names of persons, locations, organizations, etc.

NameTag 3 achieves state-of-the-art performance on 21 test datasets in 15 languages: Cebuano, Chinese, Croatian, Czech, Danish, English, Norwegian Bokmål, Norwegian Nynorsk, Portuguese, Russian, Serbian, Slovak, Swedish, Tagalog, and Ukrainian. It also delivers competitive results on Arabic, Dutch, German, Maghrebi, and Spanish, as of February 2025.

NameTag is available as an online demo NameTag Online Demo and web service NameTag Web Service hosted by LINDAT/CLARIN.

The linguistic models are free for non-commercial use and distributed under CC BY-NC-SA license, although for some models the original data used to create the model may impose additional licensing conditions.

NameTag is versioned using Semantic Versioning.

Copyright 2020 by Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University in Prague, Czech Republic.

Authors Jana Straková, Milan Straka
Homepage https://ufal.mff.cuni.cz/nametag
Development repository https://github.com/ufal/nametag3
Status version 3.1.0, 2.0, 1.1.0
OS Linux, Windows
License of the library MPL 2.0
License of the models CC BY-NC-SA 4.0
Contact straka@ufal.mff.cuni.cz