Creator: Hajič, Jan / Harvested from: LINDAT/CLARIAH-CZ repository

94. VIADAT-REPO

Creator:: Košarko, Ondřej and Hajič, Jan
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: infrastructure and toolService
Subject:: digital data and digital repository
Description:: VIADAT-REPO is a modification of the lindat-dspace platform; it is part of the VIADAT project and as such will be part of a "virtual assistant" for processing, annotating, enriching, and accessing audio and video recordings.
Rights:: BSD 3-Clause "New" or "Revised" license, http://opensource.org/licenses/BSD-3-Clause, and PUB

95. VIADAT-REPO+DEPOSIT

Creator:: Košarko, Ondřej and Hajič, Jan
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: infrastructure and toolService
Subject:: digital data and digital repository
Description:: VIADAT-REPO is an additional module for the lindat-dspace platform that allows depositing data records in the field of oral history, including the field's specific metadata workflow; it was created within the VIADAT project and as such will be part of a "virtual assistant" for processing, annotating, enriching, and accessing audio and video recordings. This package contains the VIADAT-DEPOSIT module, bundled with VIADAT-REPO to ease integration.
Rights:: BSD 3-Clause "New" or "Revised" license, http://opensource.org/licenses/BSD-3-Clause, and PUB

96. VIADAT-SEARCH

Creator:: Böhm, Stanislav and Hajič, Jan
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: infrastructure and toolService
Subject:: oral history, speech, and search
Language:: Czech
Description:: VIADAT-SEARCH, in connection with VIADAT-REPO, enables searching transcripts of oral history recordings. The recordings were preprocessed with language analysis, which makes it possible to search the full text by multiple criteria, including names and different forms of the same word. Developed in cooperation with ÚSD AV ČR and NFA.
Rights:: BSD 3-Clause "New" or "Revised" license, http://opensource.org/licenses/BSD-3-Clause, and PUB

97. VIADAT-STAT

Creator:: Böhm, Stanislav, Hajič, Jan, Srdečný, Vojtěch, Toman, Josef, and Košarko, Ondřej
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: infrastructure and toolService
Subject:: oral history, speech, and search
Language:: Czech
Description:: A VIADAT module; the purpose of VIADAT-STAT is statistical analysis of recordings stored by the platform. Developed in cooperation with ÚSD AV ČR and NFA.
Rights:: BSD 3-Clause "New" or "Revised" license, http://opensource.org/licenses/BSD-3-Clause, and PUB

98. VIADAT-STAT (2019-12-31)

Creator:: Böhm, Stanislav, Hajič, Jan, Srdečný, Vojtěch, Toman, Josef, and Košarko, Ondřej
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: infrastructure and toolService
Subject:: oral history, speech, and search
Language:: Czech
Description:: A VIADAT module; the purpose of VIADAT-STAT is statistical analysis of recordings stored by the platform. Developed in cooperation with ÚSD AV ČR and NFA.
Rights:: BSD 3-Clause "New" or "Revised" license, http://opensource.org/licenses/BSD-3-Clause, and PUB

99. VIADAT-TEXT

Creator:: Böhm, Stanislav, Hajič, Jan, Srdečný, Vojtěch, Toman, Josef, and Košarko, Ondřej
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: infrastructure and toolService
Subject:: oral history, speech, and search
Language:: Czech
Description:: A VIADAT module; the purpose of VIADAT-TEXT is analysis of transcribed recordings. Developed in cooperation with ÚSD AV ČR and NFA.
Rights:: BSD 3-Clause "New" or "Revised" license, http://opensource.org/licenses/BSD-3-Clause, and PUB

100. WordSim353-cs: Evaluation Dataset for Lexical Similarity and Relatedness, based on WordSim353

Creator:: Cinková, Silvie, Straková, Jana, Hajič, Jakub, Hajič, Jan, Hajič, Jan, jr., Janoušková, Jolana, Straka, Milan, and Urešová, Miroslava
Publisher:: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Type:: text, wordList, and lexicalConceptualResource
Subject:: lexical semantics, similarity, relatedness, evaluation, and distributional semantics
Language:: Czech and English
Description:: Czech translation of WordSim353. The Czech translations of the English WordSim353 word pairs were obtained from four translators. All translation variants were scored by 25 Czech annotators according to the lexical similarity/relatedness annotation instructions for WordSim353 annotators. The resulting data set consists of two annotation files: "WordSim353-cs.csv" and "WordSim-cs-Multi.csv". Both files are encoded in UTF-8 and have a header; text is enclosed in double quotes, and columns are separated by commas. The rows are numbered. The WordSim-cs-Multi data set has rows numbered from 1 to 634, whereas the row indices in the WordSim353-cs data set reflect the corresponding row numbers in the WordSim-cs-Multi data set. The WordSim353-cs file contains a one-to-one selection of 353 Czech equivalent pairs whose judgments proved most similar to the judgments of their corresponding English originals (measured by the absolute value of the difference between the means over all annotators in each language). In one case ("psychology-cognition"), two Czech equivalent pairs had identical means as well as confidence intervals, so we randomly selected one. The "WordSim-cs-Multi.csv" file contains human judgments for all translation variants. In both data sets, we preserved all 25 individual scores. In the WordSim353-cs data set, we added a column with the Czech means and a column with the original English means, along with 95% confidence intervals in separate columns for each mean (computed with the CI function in the Rmisc R package). The WordSim-cs-Multi data set contains only the Czech means and confidence intervals. To make lexical search convenient, both data sets provide separate columns with the individual Czech and English words, the entire word pairs, and an English-Czech quadruple.
The data set also contains an xls table with the four translations and a preliminary selection of the best variants performed by an adjudicator.
Rights:: Creative Commons - Attribution 4.0 International (CC BY 4.0), http://creativecommons.org/licenses/by/4.0/, and PUB
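
The selection rule described for WordSim353-cs (keep, for each English pair, the Czech variant whose mean judgment is closest to the English mean) can be sketched as follows. This is only an illustration under stated assumptions: the column names (`en_pair`, `cs_pair`, `en_mean`, `s1` to `s3`) and all sample values are invented for the example and are not taken from the actual files, and three scores stand in for the 25 annotator scores. The parsing relies only on the layout the description states: UTF-8, a header row, double-quoted text, and comma-separated columns.

```python
import csv
import io
from statistics import mean

# Invented sample in the described layout (header, quoted text, commas).
# Column names and values are hypothetical, not those of the real files.
SAMPLE = '''"","en_pair","cs_pair","en_mean","s1","s2","s3"
"1","tiger-cat","tygr-kočka","7.4","7","8","6"
"2","tiger-cat","tygr-kocour","7.4","5","6","4"
"3","book-paper","kniha-papír","7.5","7","8","8"
'''


def select_closest_variants(rows, score_columns):
    """Pick, per English pair, the Czech variant with the smallest
    absolute difference between its mean score and the English mean."""
    best = {}
    for row in rows:
        cs_mean = mean(float(row[col]) for col in score_columns)
        diff = abs(cs_mean - float(row["en_mean"]))
        current = best.get(row["en_pair"])
        # Strict "<" keeps the first variant on a tie; the description
        # says the authors broke their single tie randomly instead.
        if current is None or diff < current[0]:
            best[row["en_pair"]] = (diff, row["cs_pair"], cs_mean)
    return {en: (cs, m) for en, (_, cs, m) in best.items()}


rows = list(csv.DictReader(io.StringIO(SAMPLE)))
selected = select_closest_variants(rows, ["s1", "s2", "s3"])
for en_pair, (cs_pair, cs_mean) in selected.items():
    print(en_pair, cs_pair, round(cs_mean, 2))
```

To run the same function on a real file, the rows would be read with `open(path, encoding="utf-8", newline="")` in place of the in-memory sample, after replacing the hypothetical column names with the ones found in the file's header.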

