dc.contributor.author | Veselovská, Kateřina |
dc.contributor.author | Bojar, Ondřej |
dc.date.accessioned | 2013-12-02T22:10:38Z |
dc.date.available | 2013-12-02T22:10:38Z |
dc.date.issued | 2013-12-02 |
dc.identifier.uri | http://hdl.handle.net/11858/00-097C-0000-0022-FF60-B |
dc.description | Czech subjectivity lexicon, i.e. a list of subjectivity clues for sentiment analysis in Czech. The list contains 4626 evaluative items (1672 positive and 2954 negative) together with their part of speech tags, polarity orientation and source information. The core of the Czech subjectivity lexicon has been gained by automatic translation of a freely available English subjectivity lexicon downloaded from http://www.cs.pitt.edu/mpqa/subj_lexicon.html. For translating the data into Czech, we used parallel corpus CzEng 1.0 containing 15 million parallel sentences (233 million English and 206 million Czech tokens) from seven different types of sources automatically annotated at surface and deep layers of syntactic representation. Afterwards, the lexicon has been manually refined by an experienced annotator. |
dc.description.sponsorship | The work on this project has been supported by the GAUK 3537/2011 grant and by SVV project number 267 314. |
dc.language.iso | ces |
dc.publisher | Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) |
dc.rights | Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0) |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/3.0/ |
dc.source.uri | http://ufal.mff.cuni.cz/seance |
dc.subject | subjectivity lexicon |
dc.subject | sentiment analysis |
dc.subject | opinion mining |
dc.subject | polarity clues |
dc.title | Czech SubLex 1.0 |
dc.type | lexicalConceptualResource |
metashare.ResourceInfo#ContactInfo#PersonInfo.surname | Veselovská |
metashare.ResourceInfo#ContactInfo#PersonInfo.givenName | Kateřina |
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo.organizationName | Charles University in Prague, UFAL |
metashare.ResourceInfo#DistributionInfo.availability | restrictedUse |
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse | academic-nonCommercialUse |
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse | attribution |
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse | shareAlike |
metashare.ResourceInfo#DistributionInfo#LicenseInfo.distributionAccessMedium | downloadable |
metashare.ResourceInfo#ResourceCreationInfo#FundingInfo#ProjectInfo.projectName | GAUK 3537/2011 grant and SVV project number 267 314. |
metashare.ResourceInfo#ResourceCreationInfo#FundingInfo#ProjectInfo.fundingType | nationalFunds |
metashare.ResourceInfo#ContentInfo.mediaType | text |
metashare.ResourceInfo#TextInfo#SizeInfo.size | 207 |
metashare.ResourceInfo#TextInfo#SizeInfo.sizeUnit | kb |
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo#CommunicationInfo.email | veselovska@ufal.mff.cuni.cz |
metashare.ResourceInfo#ContentInfo.detailedType | wordList |
dc.rights.label | PUB |
has.files | yes |
branding | LINDAT / CLARIAH-CZ |
sponsor | Grantová agentura Univerzity Karlovy v Praze GAUK 3537/2011 Detekce větné polarity v počítačovém korpusu nationalFunds |
sponsor | Univerzita Karlova v Praze (mimo GAUK) SVV 267 314 Teoretické základy informatiky a výpočetní lingvistiky nationalFunds |
size.info | 207 kb |
files.size | 381830 |
files.count | 3 |
Files in this item
Download all files in item (372.88 KB)This item is
Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)
Publicly Available
and licensed under:Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)
- Name
- sublex_1_0.csv
- Size
- 206.26 KB
- Format
- Unknown
- Description
- Czech SubLex 1.0
- MD5
- d2655e8b46d2a192f094d492c8d4ffdb
- Name
- README
- Size
- 745 bytes
- Format
- Unknown
- Description
- CSV README
- MD5
- ad9f39db98ec16c0bc6e3ad31040d657
- Name
- Czech subjectivity lexicon.pdf
- Size
- 165.89 KB
- Format
- Description
- Czech SubLex article
- MD5
- 946d3ea2a7bbcc0c5c2261c7286f2031