dc.contributor.author |
Barque, Lucie |
dc.contributor.author |
Candito, Marie |
dc.contributor.author |
Constant, Matthieu |
dc.contributor.author |
Cordeiro, Silvio Ricardo |
dc.contributor.author |
Crabbé, Benoît |
dc.contributor.author |
Fort, Karën |
dc.contributor.author |
Guillaume, Bruno |
dc.contributor.author |
Haas, Pauline |
dc.contributor.author |
Huyghe, Richard |
dc.contributor.author |
Perrier, Guy |
dc.contributor.author |
Ramisch, Carlos |
dc.contributor.author |
Ribeyre, Corentin |
dc.contributor.author |
Savary, Agata |
dc.contributor.author |
Seddah, Djamé |
dc.contributor.author |
Segonne, Vincent |
dc.contributor.author |
Tribout, Delphine |
dc.contributor.author |
Villemonte de la Clergerie, Eric |
dc.contributor.author |
Parmentier, Yannick |
dc.contributor.author |
Pasquer, Caroline |
dc.contributor.author |
Antoine, Jean-Yves |
dc.date.accessioned |
2021-01-21T10:36:00Z |
dc.date.available |
2021-01-21T10:36:00Z |
dc.date.issued |
2020-03 |
dc.identifier.uri |
http://hdl.handle.net/11234/1-3429 |
dc.description |
The Sequoia corpus is a set of 3,099 linguistically-annotated French sentences, originating from four sources (Europarl, European Agency Reports, French regional journal L'Est Républicain, and French wikipedia).
Several types of annotations were added over the years.
The current release comprises:
- parts-of-speech (SEQUOIA ANR-08-EMER-013 project)
- syntactic dependency trees
- deep syntactic dependency graphs (Deep sequoia project)
- multi-word expressions and named entities (PARSEME COST project and PARSEME-FR ANR-14-CERA-0001 project)
- coarse semantic tags for nouns (FrSemCor project)
See the deep sequoia page for a detailed description: https://deep-sequoia.inria.fr/ |
dc.language.iso |
fra |
dc.publisher |
ANR |
dc.rights |
Deep Sequoia Licence |
dc.rights.uri |
https://lindat.mff.cuni.cz/repository/xmlui/page/deep-sequoia-licence |
dc.source.uri |
https://deep-sequoia.inria.fr/ |
dc.subject |
morpho-syntactic annotations |
dc.subject |
treebank |
dc.subject |
dependency syntax |
dc.subject |
semantic tagging |
dc.subject |
multiword expressions |
dc.subject |
named entities |
dc.title |
Deep Sequoia corpus - PARSEME-FR corpus - FrSemCor |
dc.type |
corpus |
metashare.ResourceInfo#ContentInfo.mediaType |
text |
dc.rights.label |
PUB |
has.files |
yes |
branding |
LRT + Open Submissions |
contact.person |
Candito Marie marie.candito@gmail.com Université de Paris |
contact.person |
Seddah Djamé djame.seddah@gmail.com Paris-Sorbonne University |
contact.person |
Guillaume Bruno bruno.guillaume@loria.fr LORIA |
sponsor |
ANR (French National Research Agency) SEQUOIA ANR-08-EMER-01 SEQUOIA nationalFunds |
sponsor |
ANR (France) ANR-14-CERA-0001 PARSEME-FR nationalFunds |
sponsor |
LABEX Empirical Foundations of Linguistics ANR-10-LABX-0083 LABEX-EFL nationalFunds |
size.info |
3099 sentences |
files.size |
4577386 |
files.count |
1 |