Skip to search
Skip to main content
Skip to first result
Search
Search Results
Publisher:
University of Tartu
Format:
application/octet-stream
Type:
corpus
Language:
Estonian
Description:
Recordings of different Estonian dialects, 900000 words, transcribed and partly (400000 words) morphologically annotated
Rights:
Not specified
Publisher:
University of Tartu
Format:
text/plain
Type:
lexicalConceptualResource
Language:
Estonian
Description:
10000 most frequent lemmas, 1000 most frequent word forms, based on 1 million words of journals and fiction
Rights:
Not specified
Publisher:
University of Tartu
Format:
application/tei+xml
Type:
corpus
Language:
Estonian
Description:
Collection of Estonian texts (divided into subcorpora); ca 175 million words; TEI
Rights:
Not specified
Publisher:
Laboratory of Phonetics and Speech Technology, Tallinn University of Technology
Type:
toolService
Rights:
Not specified
Type:
corpus
Language:
English and Estonian
Description:
written EU legislation; 5 mio words Est, 7.8 mio words Eng; Sentence-aligned
Rights:
Not specified
Publisher:
Filosoft
Type:
toolService
Language:
Estonian
Rights:
Not specified
Type:
corpus
Language:
Estonian
Description:
written general; 600 000 words; local tagset; manually disambiguated
Rights:
Not specified
Publisher:
University of Tartu
Type:
corpus
Subject:
speech corpus
Language:
Estonian
Description:
Studio recordings of spontaneous Estonian segmented phonetically on word, sound, and other linguistic levels. Current size about 22 hours of speech, 155 000 words. Online search engine lets you search from word-level segments and returns matching 2 second sequences of sound and segmentation.
Rights:
Not specified
Type:
corpus
Language:
Estonian
Description:
written general; 300 000 words; local tagset (POS, syntactic functions)
Rights:
Not specified
Type:
corpus
Language:
Estonian
Description:
200 sentences, TIGER-XML
Rights:
Not specified