SLäNDa 2.0
Please use the following text to cite this item or export to a predefined format:
Stymne, Sara and Östman, Carin, 2022,
SLäNDa 2.0, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL),
http://hdl.handle.net/11372/LRT-4739.
Authors
Item identifier
Date issued
2022-05-09
Size
255325 tokens,
3287 utterances
Language(s)
Description
SLäNDa, the Swedish literature corpus of narrative and dialogue, is a corpus made up of eight Swedish literary novels from the 19th and early 20th centuries, manually annotated mainly for different aspects of dialogue. The full annotation also contains other cited materials, like thoughts, signs and letters. The main motivation for including these categories as well, is to be able to identify the main narrative, which is all remaining unannotated text.
SLäNDa version 2.0 extends version 1.0 mainly by adding more data, but also by additional quality control, and a slight modification of the annotation scheme. In addition, the data is organized into test sets with different types of speech marking: quotation marks, dashes, and no marking.
Publisher
Acknowledgement
Vetenskapsrådet (Swedish Research Council)
Project code:2020-02617
Project name:Fictional prose and language change. The role of colloquialization in the history of Swedish 1830–1930
Vetenskapsrådet (Swedish Research Council)
Project code:2020-02617
Project name:Fictional prose and language change. The role of colloquialization in the history of Swedish 1830–1930}
Subject(s)
Collections
This item isPublicly Available
and licensed under:
Files in this item
- Name
- slanda-2.0.zip
- Size
- 9.1 MB
- Format
- application/zip
- Description
- Zip
- MD5
- 51509d53feadb901513160876c2307b3

- slanda-2.0
- LREC2022
- test2-none.orig.conll233 kB
- test1-none.orig.conll50 kB
- test1-quote.stripped.conll87 kB
- test2-dash.orig.conll356 kB
- train-dev.stripped.conll143 kB
- test1-dash.orig.conll127 kB
- test2-none.stripped.conll232 kB
- train-train.orig.conll1018 kB
- test1-none.stripped.conll50 kB
- train-dev.orig.conll146 kB
- train-train.stripped.conll993 kB
- test2-dash.stripped.conll349 kB
- test1-quote.orig.conll90 kB
- test1-dash.stripped.conll126 kB
- README10 kB
- TSV
- per-author
- train.ber5.tsv96 kB
- train.lev8.tsv99 kB
- test2d.roo7.tsv341 kB
- test.sod11.tsv14 kB
- train.sod29.tsv54 kB
- test.ber11.tsv260 kB
- train.hei20.tsv122 kB
- train.ber29.tsv77 kB
- test.ben11.tsv320 kB
- train.ryd5.tsv209 kB
- train.ryd11.tsv240 kB
- train.ber17.tsv120 kB
- test2n.nor32.tsv509 kB
- train.sod5.tsv38 kB
- train.ben8.tsv238 kB
- test2d.fly20.tsv252 kB
- train.mal18.tsv272 kB
- train.ber8.tsv193 kB
- test2d.kru10.tsv110 kB
- train.san5.tsv541 kB
- train.hei11.tsv163 kB
- test2d.kru29.tsv82 kB
- test2n.alm5.tsv147 kB
- train.sod8.tsv36 kB
- test2n.nor11.tsv74 kB
- test.lev11.tsv81 kB
- train.lag12.tsv82 kB
- test2d.kru13.tsv110 kB
- train.boy15.tsv142 kB
- train.ben12.tsv472 kB
- test.str11.tsv174 kB
- train.ben3.tsv188 kB
- train.sod20.tsv104 kB
- train.ber20.tsv33 kB
- test2d.pal19.tsv252 kB
- test2n.ced2.tsv60 kB
- train.str13.tsv204 kB
- train.boy18.tsv202 kB
- test.san2.tsv257 kB
- train.ben15.tsv326 kB
- train.lev2.tsv130 kB
- train.ben6.tsv741 kB
- test2d.roo11.tsv453 kB
- train.sod23.tsv78 kB
- test2n.pal16.tsv150 kB
- train.ber23.tsv87 kB
- test2d.kru32.tsv71 kB
- train.boy14.tsv113 kB
- train.ben2.tsv175 kB
- test2n.ced5.tsv288 kB
- train.hei5.tsv135 kB
- train.lag6.tsv409 kB
- train.ber2.tsv125 kB
- train.ben9.tsv293 kB
- train.lev5.tsv150 kB
- test.boy11.tsv213 kB
- train.ryd20.tsv259 kB
- train.sod26.tsv39 kB
- test2n.pal19.tsv154 kB
- train.ber26.tsv130 kB
- test2d.elk11.tsv346 kB
- train.sod14.tsv15 kB
- train.lag2.tsv142 kB
- train.ber14.tsv175 kB
- train.ben14.tsv296 kB
- test2d.kru23.tsv85 kB
- train.sod2.tsv68 kB
- train.ben5.tsv430 kB
- test.lag11.tsv185 kB
- per-dataset
- test1-dash.tsv726 kB
- test2-none.tsv1 MB
- test1-none.tsv260 kB
- test2-dash.tsv2 MB
- test1-quote.tsv521 kB
- train-all.tsv8 MB
- per-author
- IOB
- train-all.iob2 MB
- test2-none.iob491 kB
- test1-dash.iob245 kB
- test2-dash.iob722 kB
- test1-none.iob84 kB
- test1-quote.iob177 kB
- LREC2022

