dc.contributor.author | Sellat, Hashem |
dc.contributor.author | Saleh, Shadi |
dc.contributor.author | Krubiński, Mateusz |
dc.contributor.author | Pospíšil, Adam |
dc.contributor.author | Zemánek, Petr |
dc.contributor.author | Pecina, Pavel |
dc.date.accessioned | 2023-03-14T09:01:49Z |
dc.date.available | 2023-03-14T09:01:49Z |
dc.date.issued | 2023-03-11 |
dc.identifier.uri | http://hdl.handle.net/11234/1-5033 |
dc.description | This is the first release of the UFAL Parallel Corpus of North Levantine, compiled by the Institute of Formal and Applied Linguistics (ÚFAL) at Charles University within the Welcome project (https://welcome-h2020.eu/). The corpus consists of 120,600 multiparallel sentences in English, French, German, Greek, Spanish, and Standard Arabic selected from the OpenSubtitles2018 corpus [1] and manually translated into the North Levantine Arabic language. The corpus was created for the purpose of training machine translation for North Levantine and the other languages. |
dc.language.iso | apc |
dc.language.iso | eng |
dc.language.iso | fra |
dc.language.iso | spa |
dc.language.iso | arb |
dc.language.iso | ell |
dc.language.iso | deu |
dc.publisher | Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) |
dc.relation | info:eu-repo/grantAgreement/EC/H2020/870930 |
dc.rights | Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ |
dc.source.uri | http://ufal.mff.cuni.cz/ufal-parallel-corpus-of-north-levantine |
dc.subject | multilingual |
dc.subject | machine translation |
dc.subject | parallel corpus |
dc.subject | north levantine |
dc.subject | corpus |
dc.title | UFAL Parallel Corpus of North Levantine 1.0 |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
dc.rights.label | PUB |
has.files | yes |
branding | LINDAT / CLARIAH-CZ |
contact.person | Hashem Sellat sellat@ufal.mff.cuni.cz Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) |
sponsor | European Union EC/H2020/870930 WELCOME - Multiple Intelligent Conversation Agent Services for Reception, Management and Integration of Third Country Nationals in the EU euFunds info:eu-repo/grantAgreement/EC/H2020/870930 |
size.info | 844200 sentences |
size.info | 6227225 words |
files.size | 87074303 |
files.count | 13 |
Soubory tohoto záznamu
Stáhnout všechny soubory záznamu (83.04 MB)Licenční kategorie:
Licence: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
Publicly Available
Licence: Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
- Název
- README.md
- Velikost
- 4.13 KB
- Formát
- Neznámý
- Popis
- Readme
- MD5
- ca5874218369b39ecbca59c355c12ade
- Název
- ufal-nla-v1.apc
- Velikost
- 6.57 MB
- Formát
- Neznámý
- Popis
- North Levantine translations
- MD5
- 11e84548700efc58d9ea6e33bff220be
- Název
- ufal-nla-v1.arb
- Velikost
- 7.25 MB
- Formát
- Neznámý
- Popis
- Standard Arabic translations
- MD5
- c0b2c0606045b5b944f39b3826cbef19
- Název
- ufal-nla-v1.deu
- Velikost
- 5.45 MB
- Formát
- Neznámý
- Popis
- German translations
- MD5
- 6afc8dff4c10e8d2c1dbbab02abfe680
- Název
- ufal-nla-v1.ell
- Velikost
- 8.73 MB
- Formát
- Neznámý
- Popis
- Greek translations
- MD5
- 0ee0e3aad10fd53c9cadf0e9dfa5d35b
- Název
- ufal-nla-v1.eng
- Velikost
- 4.97 MB
- Formát
- Neznámý
- Popis
- English translations
- MD5
- f30a569d2efe070afea6f7b9ca196368
- Název
- ufal-nla-v1.fra
- Velikost
- 5.13 MB
- Formát
- Neznámý
- Popis
- French translations
- MD5
- 22ea41474e6f7ddbb346c0fc4da68a52
- Název
- ufal-nla-v1.spa
- Velikost
- 5.09 MB
- Formát
- Neznámý
- Popis
- Spanish translations
- MD5
- 21be7ab5448e6cfdd4882c4e4fef593e
- Název
- ufal-nla-v1.arb-eng.ids
- Velikost
- 8.01 MB
- Formát
- Neznámý
- Popis
- Standard Arabic-English OpenSubtitles2018 ids
- MD5
- c829727818dfec1fde15457674f8d745
- Název
- ufal-nla-v1.deu-eng.ids
- Velikost
- 7.97 MB
- Formát
- Neznámý
- Popis
- German-English OpenSubtitles2018 ids
- MD5
- 29a443b68efac21f9c52d2f18dacecf6
- Název
- ufal-nla-v1.ell-eng.ids
- Velikost
- 7.96 MB
- Formát
- Neznámý
- Popis
- Greek-English OpenSubtitles2018 ids
- MD5
- c964ad29eceb23990f2c1a8535dea64a
- Název
- ufal-nla-v1.eng-fra.ids
- Velikost
- 7.95 MB
- Formát
- Neznámý
- Popis
- English-French OpenSubtitles2018 ids
- MD5
- 6e5439fd076a9dbb94097853821fcbd9
- Název
- ufal-nla-v1.eng-spa.ids
- Velikost
- 7.95 MB
- Formát
- Neznámý
- Popis
- English-Spanish OpenSubtitles2018 ids
- MD5
- 5e2006da9d4d3700f905102125fd7134