Rights: Not specified - LINDAT/CLARIAH-CZ Catalog Search Results

Start Over Rights Not specified Date Unknown

101. Corpus Work Bench CWB (CQP)

Publisher:: Institut Universitari de Lingüística Aplicada, Universitat Pompeu Fabra
Type:: toolService
Description:: This SOAP service implements the IMS Open Corpus Workbench (CWB), a collection of open-source tools for managing and querying large text corpora (ranging from 10 million to 2 billion words) with linguistic annotations. Its central component is the flexible and efficient query processor CQP. The service makes it possible to index a new corpus and query it.
Rights:: Not specified

102. CorpusExplorer

Creator:: Rüdiger, Jan Oliver
Publisher:: Jan Oliver Rüdiger
Type:: tool and toolService
Subject:: Corpus Linguisitics, NLP, conll, tei, XML, nlp, Natural Language Processing, linguistics, Linguistics, Computational Linguistics, corpus processing, tagger, POS tagger, lemmatization, text cleaning, CommonCrawl, epub, JSON, Twitter, Pandoc, Wikipedia, digital data, DTA, DSpin, MySQL, ElasticSearch, TextGrid, text corpora, TigerXML, and WeblichtXML
Language:: German, English, French, Italian, Dutch, Spanish, Polish, Arabic, Chinese, and Portuguese
Description:: Software for corpus linguists and text/data mining enthusiasts. The CorpusExplorer combines over 45 interactive visualizations under a user-friendly interface. Routine tasks such as text acquisition, cleaning or tagging are completely automated. The simple interface supports the use in university teaching and leads users/students to fast and substantial results. The CorpusExplorer is open for many standards (XML, CSV, JSON, R, etc.) and also offers its own software development kit (SDK). Source code available at https://github.com/notesjor/corpusexplorer2.0
Rights:: Not specified

103. Croatian Dependency Treebank

Publisher:: University of Zagreb, Faculty of Humanities and Social Sciences
Format:: application/octet-stream
Type:: corpus
Language:: Croatian
Description:: Manually tagged dependency treebank, analytical layer according to the PDT formalism adapted for Croatian
Rights:: Not specified

104. Croatian Lemmatization Server

Publisher:: University of Zagreb, Faculty of Humanities and Social Sciences
Type:: toolService
Language:: Croatian
Description:: On line service for lemmatization, full POS or MSD tagging of Croatian texts.
Rights:: Not specified

105. Croatian Morphological Lexicon

Publisher:: University of Zagreb, Faculty of Humanities and Social Sciences
Type:: lexicalConceptualResource
Language:: Croatian
Description:: 110,000+ lemmas; 3,900,000+ word-forms, MulText East lexica format
Rights:: Not specified

106. Croatian National Corpus

Publisher:: University of Zagreb, Faculty of Humanities and Social Sciences
Type:: corpus
Language:: Croatian
Description:: This is the reference corpus of standard Croatian. In its 3.0 version, which is accessible via noSketch Engine, it has 216.8 million tokens. In terms of annotation, the corpus is tokenised, lemmatised and tagged for MSDs (morphosyntactic descriptions).
Rights:: Not specified

107. CST's lemmatiser

Publisher:: Center for Sprogteknologi, University of Copenhagen
Type:: toolService
Language:: Danish, Dutch, English, German, Modern Greek (1453-), Icelandic, Norwegian, Russian, Slovenian, and Swedish
Description:: 1) Fully automatic rule based lemmatization of inflected languages 2) Fully automatic training of lemmatization rules based on full form-lemma list
Rights:: Not specified

108. CST's lemmatizer

Creator:: Jongejan, Bart
Publisher:: Københavns Universitet, Center for Sprogteknologi (CST)
Type:: toolService
Description:: 1) Fully automatic rule based lemmatization of inflected languages 2) Fully automatic training of lemmatization rules based on full form-lemma list
Rights:: Not specified

109. Cyril Belica : Kookkurrenzdatenbank CCDB

Publisher:: Institut für Deutsche Sprache
Type:: toolService
Language:: German
Description:: A co-occurrence database, developed by the Institut fuer Deutsche Sprache, for research in the field of collocation analysis in modern German. The database holds over 200,000 analysed words that can be browsed or searched and shown in context.
Rights:: Not specified

110. Czech Morphological Analyzer v1

Creator:: Hajič, Jan
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: toolService and service
Subject:: morphological analysis and lemmatization
Language:: Czech
Description:: One of the very first steps in automatic processing of Czech text is morphological analysis and lemmatization.
Rights:: Not specified

« Previous
Next »
1
2
…
7
8
9
10
11
12
13
14
15
…
49
50

101. Corpus Work Bench CWB (CQP)

102. CorpusExplorer

103. Croatian Dependency Treebank

104. Croatian Lemmatization Server

105. Croatian Morphological Lexicon

106. Croatian National Corpus

107. CST's lemmatiser

108. CST's lemmatizer

109. Cyril Belica : Kookkurrenzdatenbank CCDB

110. Czech Morphological Analyzer v1

Limit your search

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Show values starting with

Search

Search Constraints

Search Results

Limit your search

Contributor

Show values starting with

Coverage

Show values starting with

Creator

Show values starting with

Format

Language

Show values starting with

Publisher

Show values starting with

Rights

Subject

Show values starting with

Type

Original context has metadata only

Harvested from