dc.contributor.author Sellat, Hashem
dc.contributor.author Saleh, Shadi
dc.contributor.author Krubiński, Mateusz
dc.contributor.author Pospíšil, Adam
dc.contributor.author Zemánek, Petr
dc.contributor.author Pecina, Pavel
dc.date.accessioned 2023-03-14T09:01:49Z
dc.date.available 2023-03-14T09:01:49Z
dc.date.issued 2023-03-11
dc.identifier.uri http://hdl.handle.net/11234/1-5033
dc.description This is the first release of the UFAL Parallel Corpus of North Levantine, compiled by the Institute of Formal and Applied Linguistics (ÚFAL) at Charles University within the Welcome project (https://welcome-h2020.eu/). The corpus consists of 120,600 multiparallel sentences in English, French, German, Greek, Spanish, and Standard Arabic, selected from the OpenSubtitles2018 corpus [1] and manually translated into North Levantine Arabic. The corpus was created to train machine translation systems between North Levantine and the other languages.
dc.language.iso apc
dc.language.iso eng
dc.language.iso fra
dc.language.iso spa
dc.language.iso arb
dc.language.iso ell
dc.language.iso deu
dc.publisher Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
dc.relation info:eu-repo/grantAgreement/EC/H2020/870930
dc.rights Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
dc.rights.uri http://creativecommons.org/licenses/by-nc-sa/4.0/
dc.source.uri http://ufal.mff.cuni.cz/ufal-parallel-corpus-of-north-levantine
dc.subject multilingual
dc.subject machine translation
dc.subject parallel corpus
dc.subject north levantine
dc.subject corpus
dc.title UFAL Parallel Corpus of North Levantine 1.0
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
dc.rights.label PUB
has.files yes
branding LINDAT / CLARIAH-CZ
contact.person Hashem Sellat sellat@ufal.mff.cuni.cz Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
sponsor European Union EC/H2020/870930 WELCOME - Multiple Intelligent Conversation Agent Services for Reception, Management and Integration of Third Country Nationals in the EU euFunds info:eu-repo/grantAgreement/EC/H2020/870930
size.info 844200 sentences
size.info 6227225 words
files.size 87074303
files.count 13
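
The size fields above can be cross-checked against the description with a little arithmetic. The sketch below is only an illustration, not part of the record: it assumes that `size.info` counts every language version of each sentence (the seven `dc.language.iso` codes) and that the "MB" shown on the download link means 2^20 bytes. The constant and function names are made up for this example.

```python
# Values copied from the metadata fields of this record.
SENTENCES_PER_LANGUAGE = 120_600   # from dc.description
LANGUAGE_CODES = ["apc", "eng", "fra", "spa", "arb", "ell", "deu"]  # dc.language.iso
TOTAL_SENTENCES = 844_200          # size.info
TOTAL_WORDS = 6_227_225            # size.info
TOTAL_BYTES = 87_074_303           # files.size


def expected_total_sentences(per_language, codes):
    """Total sentence count if every language has the same number of sentences."""
    return per_language * len(codes)


def average_words_per_sentence(words, sentences):
    """Mean sentence length in words over all language versions."""
    return words / sentences


def bytes_to_mib(n_bytes):
    """Convert a byte count to mebibytes (assumption: 'MB' here means 2**20 bytes)."""
    return n_bytes / 2**20


if __name__ == "__main__":
    print(expected_total_sentences(SENTENCES_PER_LANGUAGE, LANGUAGE_CODES))
    print(round(average_words_per_sentence(TOTAL_WORDS, TOTAL_SENTENCES), 2))
    print(round(bytes_to_mib(TOTAL_BYTES), 2))
```

Under these assumptions the three figures are mutually consistent: 120,600 sentences in each of seven languages gives the reported 844,200, and 87,074,303 bytes is about 83.04 MB.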


Files in this item (83.04 MB in total)

Name                     Size     Format   Description                                    MD5
README.md                4.13 KB  Unknown  Readme                                         ca5874218369b39ecbca59c355c12ade
ufal-nla-v1.apc          6.57 MB  Unknown  North Levantine translations                   11e84548700efc58d9ea6e33bff220be
ufal-nla-v1.arb          7.25 MB  Unknown  Standard Arabic translations                   c0b2c0606045b5b944f39b3826cbef19
ufal-nla-v1.deu          5.45 MB  Unknown  German translations                            6afc8dff4c10e8d2c1dbbab02abfe680
ufal-nla-v1.ell          8.73 MB  Unknown  Greek translations                             0ee0e3aad10fd53c9cadf0e9dfa5d35b
ufal-nla-v1.eng          4.97 MB  Unknown  English translations                           f30a569d2efe070afea6f7b9ca196368
ufal-nla-v1.fra          5.13 MB  Unknown  French translations                            22ea41474e6f7ddbb346c0fc4da68a52
ufal-nla-v1.spa          5.09 MB  Unknown  Spanish translations                           21be7ab5448e6cfdd4882c4e4fef593e
ufal-nla-v1.arb-eng.ids  8.01 MB  Unknown  Standard Arabic-English OpenSubtitles2018 ids  c829727818dfec1fde15457674f8d745
ufal-nla-v1.deu-eng.ids  7.97 MB  Unknown  German-English OpenSubtitles2018 ids           29a443b68efac21f9c52d2f18dacecf6
ufal-nla-v1.ell-eng.ids  7.96 MB  Unknown  Greek-English OpenSubtitles2018 ids            c964ad29eceb23990f2c1a8535dea64a
ufal-nla-v1.eng-fra.ids  7.95 MB  Unknown  English-French OpenSubtitles2018 ids           6e5439fd076a9dbb94097853821fcbd9
ufal-nla-v1.eng-spa.ids  7.95 MB  Unknown  English-Spanish OpenSubtitles2018 ids          5e2006da9d4d3700f905102125fd7134
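
The MD5 values listed for each file can be used to check that a download completed intact. The following is a minimal sketch of such a check, not a tool shipped with the corpus: the function names and the `corpus` directory are hypothetical, and only the file names and checksums are taken from the listing above.

```python
import hashlib
from pathlib import Path

# File names and MD5 checksums copied from the file listing of this record.
EXPECTED_MD5 = {
    "README.md": "ca5874218369b39ecbca59c355c12ade",
    "ufal-nla-v1.apc": "11e84548700efc58d9ea6e33bff220be",
    "ufal-nla-v1.arb": "c0b2c0606045b5b944f39b3826cbef19",
    "ufal-nla-v1.deu": "6afc8dff4c10e8d2c1dbbab02abfe680",
    "ufal-nla-v1.ell": "0ee0e3aad10fd53c9cadf0e9dfa5d35b",
    "ufal-nla-v1.eng": "f30a569d2efe070afea6f7b9ca196368",
    "ufal-nla-v1.fra": "22ea41474e6f7ddbb346c0fc4da68a52",
    "ufal-nla-v1.spa": "21be7ab5448e6cfdd4882c4e4fef593e",
    "ufal-nla-v1.arb-eng.ids": "c829727818dfec1fde15457674f8d745",
    "ufal-nla-v1.deu-eng.ids": "29a443b68efac21f9c52d2f18dacecf6",
    "ufal-nla-v1.ell-eng.ids": "c964ad29eceb23990f2c1a8535dea64a",
    "ufal-nla-v1.eng-fra.ids": "6e5439fd076a9dbb94097853821fcbd9",
    "ufal-nla-v1.eng-spa.ids": "5e2006da9d4d3700f905102125fd7134",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in fixed-size chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_directory(directory, expected=EXPECTED_MD5):
    """Map each expected file name to 'ok', 'mismatch', or 'missing'."""
    results = {}
    for name, checksum in expected.items():
        path = Path(directory) / name
        if not path.is_file():
            results[name] = "missing"
        elif md5_of_file(path) == checksum:
            results[name] = "ok"
        else:
            results[name] = "mismatch"
    return results


if __name__ == "__main__":
    # "corpus" is a hypothetical directory holding the downloaded files.
    for name, status in verify_directory("corpus").items():
        print(f"{status:8}  {name}")
```

Reading in chunks keeps memory use flat regardless of file size. MD5 is used here only because it is the checksum the record publishes; it detects corrupted transfers but is not a safeguard against deliberate tampering.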
