CorpusExplorer
- Authors
- Rüdiger, Jan Oliver
- Identifier
- http://hdl.handle.net/11234/1-2634
- Project URL
- http://corpusexplorer.de
- Demo URL
- http://corpusexplorer.de
- Date issued
- 2018-03-14
- Type
- toolService
- Languages
- Arabic, Chinese, Dutch, English, French, German, Italian, Polish, Portuguese, Spanish
- Description
- Software for corpus linguists and text/data mining enthusiasts. CorpusExplorer combines more than 45 interactive visualizations under a user-friendly interface. Routine tasks such as text acquisition, cleaning, and tagging are fully automated. The simple interface supports use in university teaching and leads users and students to fast, substantial results. CorpusExplorer works with many standards (XML, CSV, JSON, R, etc.) and also offers its own software development kit (SDK). Source code is available at https://github.com/notesjor/corpusexplorer2.0
- Publisher
- Jan Oliver Rüdiger
- Keywords
- Corpus Linguistics, NLP, CoNLL, TEI, XML, Natural Language Processing, Linguistics, Computational Linguistics, corpus processing, tagger, POS tagger, lemmatization, text cleaning, CommonCrawl, epub, JSON, Twitter, Pandoc, Wikipedia, digital data, DTA, DSpin, MySQL, ElasticSearch, TextGrid, text corpora, TigerXML, WeblichtXML