Publisher: Masaryk University, NLP Centre - LINDAT/CLARIAH-CZ Catalog Search Results

21. SQAD

Creator:: Medveď, Marek and Horák, Aleš
Publisher:: Masaryk University, NLP Centre
Type:: text and corpus
Subject:: question answering, Simple Question Answering Database, and SQAD
Language:: Czech
Description:: The SQAD database consists of 3301 records obtained from Czech Wikipedia articles. Each record has the following structure: the original sentence(s) from Wikipedia; a question that is directly answered in the text; the expected answer to the question as it appears in the original text; the URL of the Wikipedia web page from which the original text was extracted; the name of the author of this SQAD record.
Rights:: GNU General Public Licence, version 3, http://opensource.org/licenses/GPL-3.0, and PUB

22. sqad 3.0

Creator:: Medveď, Marek and Horák, Aleš
Publisher:: Masaryk University, NLP Centre
Type:: text and corpus
Subject:: Simple Question Answering Database, Czech, and question answering
Language:: Czech
Description:: Simple question answering database version 3 (SQAD v3) created from Czech Wikipedia. The new version consists of 13477 records. Each record of SQAD consists of multiple files: question, answer extraction, answer selection, URL, question metadata, and in some cases, answer context.
Rights:: GNU Library or "Lesser" General Public License 3.0 (LGPL-3.0), http://opensource.org/licenses/LGPL-3.0, and PUB

23. SQAD 3.2

Creator:: Medveď, Marek
Publisher:: Masaryk University, NLP Centre
Type:: text and corpus
Subject:: QA, Question Answering, SQAD, and Czech QA
Language:: Czech
Description:: Simple question answering database version 3.2 (SQAD v3.2) created from Czech Wikipedia. The new version consists of more than 16000 records. Each record of SQAD consists of multiple files - question, answer extraction, answer selection, URL, question metadata, and in some cases, answer context.
Rights:: Attribution-ShareAlike 3.0 Unported (CC BY-SA 3.0), http://creativecommons.org/licenses/by-sa/3.0/, and PUB

24. The Internet Language Reference Book

Creator:: Šmerk, Pavel, Pravdová, Markéta, Beneš, Martin, Černá, Anna, Hlaváčková, Dana, Chromý, Jan, Konečná, Hana, Kopecký, Jakub, Mžourková, Hana, Pala, Karel, Prokšová, Hana, Prošek, Martin, Smejkalová, Kamila, Svobodová, Ivana, and Uhlířová, Ludmila
Publisher:: Institute of Czech Language, Czech Academy of Sciences and Masaryk University, NLP Centre
Type:: toolService and service
Subject:: literature
Language:: Czech and English
Description:: The ILRB was created by two cooperating teams, the team of the Institute of Czech Language, Czech Academy of Sciences, and the team of the NLP Centre at the Faculty of Informatics, Masaryk University (2004-2008). The tool consists of two sections: a wordlist section and a reference (explanatory) section. Comments and remarks are welcome and should be sent to poradna@ujc.cas.cz. 1. Wordlist section: It contains more than 60 000 dictionary entries and is based on the glossary of the School Rules of Czech Orthography, the Dictionary of Literary Czech, and selected entries from the New Dictionary of Words of Foreign Origin and the Dictionary of Neologisms. The entries typically include the information that users ask about most frequently. Inflectional forms of individual words are also offered as tables, thanks to the morphological analyzer ajka created at the Faculty of Informatics, MU. The wordlist section is linked to the reference section through hypertext links. 2. Reference section: It explains linguistic phenomena described in the Rules of Czech Orthography and contemporary Czech grammars that users of the Linguistic Advisory Line at the Institute of Czech Language ask about frequently and repeatedly. The explanations address typical spelling problems and include appropriate recommendations. The ILRB is regularly updated and extended; new expressions are added and made more precise. Academy of Sciences of the Czech Republic in project 1ET200610406 and Ministry of Education, Youth and Sports in projects LM2010013, LC536 and 2C06009.
Rights:: Not specified

25. Tigrinya Web Corpus

Creator:: Suchomel, Vít and Rychlý, Pavel
Publisher:: Masaryk University, NLP Centre
Type:: text and corpus
Subject:: text corpora, Ethiopian languages, web corpora, under-resourced languages, Tigrinya, and Tigrigna
Language:: Tigrinya
Description:: Tigrinya web corpus. Crawled by SpiderLing in January 2016. Encoded in UTF-8, cleaned, deduplicated.
Rights:: NLP Centre Web Corpus License, https://lindat.mff.cuni.cz/repository/xmlui/page/license-NLPC-WeC, and ACA
