NameTag web service is available on
http(s)://lindat.mff.cuni.cz/services/nametag/api/
.
The web service is freely available. Respect the CC BY-NC-SA licence of the models – explicit written permission of the authors is required for any commercial exploitation of the system. If you use the service, you agree that data obtained by us during such use can be used for further improvements of the systems at UFAL. All comments and reactions are welcome.
The NameTag REST API can be accessed directly or via any other web programming tools that support standard HTTP request methods and JSON for output handling.
Service Request | Description | HTTP Method |
---|---|---|
models | return list of models and supported methods | GET/POST |
recognize | recognize named entities | GET/POST |
tokenize | tokenize supplied text | GET/POST |
Return the list of models available in the NameTag REST API, and for each model enumerate methods supported by this models. The default model (used when user supplies no model to a method call) is also returned. The default model (used when user supplies no model to a method call) is also returned – this is guaranteed to be the latest Czech model.
http://lindat.mff.cuni.cz/services/nametag/api/models |
{ "models": { "czech-140205-cnec2.0": [ ,"recognize" ,"tokenize" ] ,"czech-140205-cnec2.0-no_tokenizer": [ ,"recognize" ] } ,"default_model": "czech-140205-cnec2.0" }
Recognize named entities as described in the User's Manual. The output format is described later.
Parameter | Mandatory | Data type | Description |
---|---|---|---|
data | yes | string | Input text in UTF-8. |
model | no | string | Model to use; see model selection for model matching rules. |
input | no | string (untokenized /vertical /conllu ) | Input format to use: NameTag 1, NameTag 2; default is untokenized . |
output | no | string (xml /vertical /conll /conllu-ne ) | Output format to use: NameTag 1, NameTag 2; default is xml . |
http://lindat.mff.cuni.cz/services/nametag/api/recognize?data=Václav Havel byl prvním prezidentem České republiky. |
|
http://lindat.mff.cuni.cz/services/nametag/api/recognize?data=Václav Havel byl prvním prezidentem České republiky.&output=vertical |
Tokenize the supplied text as described in the User's Manual. The output format is described later.
Parameter | Mandatory | Data type | Description |
---|---|---|---|
data | yes | string | Input text in UTF-8. |
model | no | string | Model to use; see model selection for model matching rules. |
output | no | string (xml /vertical ) | Output format to use; default is xml . |
http://lindat.mff.cuni.cz/services/nametag/api/tokenize?data=Václav Havel byl prvním prezidentem České republiky. |
|
http://lindat.mff.cuni.cz/services/nametag/api/tokenize?data=Václav Havel byl prvním prezidentem České republiky.&output=vertical |
The response format of all methods is JSON. Except for the models method, the output JSON has the following structure:
{ "model": "Model used" ,"acknowledgements": ["URL with acknowledgements", ...] ,"result": "Output text" }
There are several possibilities how to select required model using
the model
option:
model
option is not specified, the default model
(returned by models method) is used – this is
guaranteed to be the latest Czech model.model
option can specify one of the models returned
by the models method.-YYMMDD
format can be left out when
supplying model
option – the latest avilable model will be
used.model
option may be only several first words of model
name. In this case, the latest most suitable model is used. Note that the last possibility allows using czech
or english
as models.
curl
. Several examples follow:
curl --data-urlencode 'data=Václav Havel byl prvním prezidentem České republiky.' http://lindat.mff.cuni.cz/services/nametag/api/recognize
curl -F 'data=@input_file' http://lindat.mff.cuni.cz/services/nametag/api/recognize
curl -F 'data=@input_file' -F 'output=vertical' http://lindat.mff.cuni.cz/services/nametag/api/recognize
curl -F 'data=@input_file' http://lindat.mff.cuni.cz/services/nametag/api/recognize | PYTHONIOENCODING=utf-8 python -c "import sys,json; sys.stdout.write(json.load(sys.stdin)['result'])"