This is a new version of the repository. Do let us know (lindat-help at ufal.mff.cuni.cz) if you encounter any issues.
 

MCSQ Translation Models (en-de) (v1.0)

Please use the following text to cite this item or export to a predefined format:
Variš, Dušan, 2022, MCSQ Translation Models (en-de) (v1.0), LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11234/1-4680.
Date issued
2022-03-15
Language(s)
Description
En-De translation models, exported via TensorFlow Serving, available in the Lindat translation service (https://lindat.mff.cuni.cz/services/translation/). The models were trained using the MCSQ social surveys dataset (available at https://repo.clarino.uib.no/xmlui/bitstream/handle/11509/142/mcsq_v3.zip). Their main use should be in-domain translation of social surveys. Models are compatible with Tensor2tensor version 1.6.6. For details about the model training (data, model hyper-parameters), please contact the archive maintainer. Evaluation on MCSQ test set (BLEU): en->de: 67.5 (train: genuine in-domain MCSQ data only) de->en: 75.0 (train: additional in-domain backtranslated MCSQ data) (Evaluated using multeval: https://github.com/jhclark/multeval)
Acknowledgement
 Files in this item
Name
mcsq.de-en.zip
Size
663.55 MB
Format
application/zip
Description
Zip
MD5
b11a7d80a17127637e6862bc80e0f748
Preview
  File Preview
Name
mcsq.en-de.zip
Size
671.32 MB
Format
application/zip
Description
Zip
MD5
f38e69751aadf0ea687b8d574b59223e
Preview
  File Preview