
 
dc.contributor.author Waclawičová, Martina
dc.contributor.author Kopřivová, Marie
dc.contributor.author Křen, Michal
dc.contributor.author Válková, Lucie
dc.date.accessioned 2013-12-13T11:56:16Z
dc.date.available 2013-12-13T11:56:16Z
dc.date.issued 2008
dc.identifier.uri http://hdl.handle.net/11858/00-097C-0000-0023-119D-A
dc.description A balanced corpus of informal spoken Czech containing 1 million words (1 MW). It contains transcriptions of 297 recordings made in 2002–2007 throughout Bohemia. All the recordings were made in informal situations to ensure prototypically spontaneous spoken language: a private environment, the physical presence of speakers who know each other, unscripted speech, and a topic not given in advance. The total number of speakers is 995, and the corpus is balanced across their main sociolinguistic categories (gender, age group, education, region of childhood residence). The corpus is provided in a (semi-XML) vertical format used as input to the Manatee query engine. The data thus correspond exactly to the corpus available via the query interface to registered users of the CNC.
dc.description.sponsorship MSM0021620823 – Český národní korpus a korpusy dalších jazyků
dc.language.iso ces
dc.publisher Faculty of Arts, Institute of the Czech National Corpus, Charles University in Prague
dc.rights Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)
dc.rights.uri http://creativecommons.org/licenses/by-nc-sa/3.0/
dc.source.uri https://wiki.korpus.cz/doku.php/cnk:oral2008
dc.subject informal spoken language
dc.subject balanced corpus
dc.title ORAL2008: Balanced corpus of informal spoken Czech
dc.type corpus
metashare.ResourceInfo#ContactInfo#PersonInfo.surname Křen
metashare.ResourceInfo#ContactInfo#PersonInfo.givenName Michal
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo.organizationName Charles University in Prague, Faculty of Arts, Institute of the Czech National Corpus
metashare.ResourceInfo#DistributionInfo.availability restrictedUse
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse academic-nonCommercialUse
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse attribution
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse shareAlike
metashare.ResourceInfo#DistributionInfo#LicenseInfo.distributionAccessMedium downloadable
metashare.ResourceInfo#ValidationInfo.validated True
metashare.ResourceInfo#ResourceCreationInfo#FundingInfo#ProjectInfo.fundingType nationalFunds
metashare.ResourceInfo#ContentInfo.mediaType text
metashare.ResourceInfo#TextInfo#SizeInfo.size 1000000
metashare.ResourceInfo#TextInfo#SizeInfo.sizeUnit words
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo#CommunicationInfo.email michal.kren@ff.cuni.cz
dc.rights.label PUB
has.files yes
branding LINDAT / CLARIAH-CZ
sponsor Ministerstvo školství, mládeže a tělovýchovy České republiky MSM 0021620823 Český národní korpus a korpusy dalších jazyků nationalFunds
size.info 1000000 words
files.size 2707529
files.count 1
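
The description above says the data come in a (semi-XML) vertical format for Manatee. As a rough sketch only: vertical files generally hold one token per line with tab-separated attributes, and structure markup as XML-like tags on lines of their own. The function names, the UTF-8 encoding, and the absence of any named attribute columns are assumptions for illustration; this record does not document the actual column layout of oral2008.gz.

```python
import gzip


def iter_vertical(path, encoding="utf-8"):
    """Yield ("tag", line) for structure lines and ("token", [attrs]) for token lines.

    Assumes a gzip-compressed vertical file; the encoding is an assumption.
    """
    with gzip.open(path, "rt", encoding=encoding) as handle:
        for raw in handle:
            line = raw.rstrip("\r\n")
            if not line:
                continue  # skip empty lines
            if line.startswith("<") and line.endswith(">"):
                # XML-like structure markup on its own line
                yield ("tag", line)
            else:
                # token line: positional attributes separated by tabs
                yield ("token", line.split("\t"))


def count_tokens(path):
    """Count token lines, ignoring structure tags."""
    return sum(1 for kind, _ in iter_vertical(path) if kind == "token")
```

Calling `count_tokens("oral2008.gz")` on the downloaded file would give a token count that could be compared with the stated size, bearing in mind that tokens and words may be counted differently.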


 Files in this item

This item is Publicly Available and licensed under:
Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)
Distributed under Creative Commons: Attribution Required, Noncommercial, Share Alike
Name oral2008.gz
Size 2.58 MB
Format application/x-gzip
Description corpus data
MD5 4558c08d313f18a6a4c460acee3e1e4e
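
The MD5 value listed for oral2008.gz can be used to check that a download is intact. The following is a minimal sketch using Python's standard `hashlib`; the function names and the local file path are hypothetical, and only the checksum itself comes from this record.

```python
import hashlib

# Checksum published in this record for oral2008.gz.
EXPECTED_MD5 = "4558c08d313f18a6a4c460acee3e1e4e"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of the file at `path`, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_md5(path, expected=EXPECTED_MD5):
    """True if the file's MD5 digest equals the expected hex string."""
    return md5_of_file(path) == expected.lower()
```

For example, `matches_published_md5("oral2008.gz")` should return `True` for an uncorrupted copy of the file saved under that (assumed) local name.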
