dc.contributor.author | Hardmeier, Christian |
dc.contributor.author | Tiedemann, Jörg |
dc.contributor.author | Nakov, Preslav |
dc.contributor.author | Stymne, Sara |
dc.contributor.author | Versley, Yannick |
dc.date.accessioned | 2016-01-31T19:41:09Z |
dc.date.available | 2016-01-31T19:41:09Z |
dc.date.issued | 2016-01-30 |
dc.identifier.uri | http://hdl.handle.net/11372/LRT-1611 |
dc.description | The data set includes training, development and test data from the shared tasks on pronoun-focused machine translation and cross-lingual pronoun prediction from the EMNLP 2015 workshop on Discourse in Machine Translation (DiscoMT2015). The release also contains the submissions to the pronoun-focused machine translation along with the manual annotations used for the official evaluation as well as gold-standard annotations of pronoun coreference for the shared task test set. |
dc.language.iso | eng |
dc.language.iso | fra |
dc.publisher | Uppsala University |
dc.rights | Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ |
dc.source.uri | https://www.idiap.ch/workshop/DiscoMT/shared-task |
dc.subject | machine translation |
dc.subject | coreference resolution |
dc.subject | anaphora resolution |
dc.subject | discourse |
dc.title | DiscoMT 2015 Shared Task on Pronoun Translation |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
dc.rights.label | PUB |
has.files | yes |
branding | LRT + Open Submissions |
demo.uri | https://www.idiap.ch/workshop/DiscoMT/shared-task |
contact.person | Jörg Tiedemann jorg.tiedemann@helsinki.fi University of Helsinki |
contact.person | Christian Hardmeier christian.hardmeier@lingfil.uu.se Uppsala University |
sponsor | Swedish Research Council 2012-916 Discourse-Oriented Machine Translation nationalFunds |
sponsor | European Association for Machine Translation (EAMT) xx EAMT Sponsorship of Activities Other |
files.size | 18924984561 |
files.count | 9 |
Soubory tohoto záznamu
Licenční kategorie:
Licence: Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
Publicly Available
Licence: Creative Commons - Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)
- Název
- README
- Velikost
- 989 bajtů
- Formát
- Neznámý
- Popis
- Overview of the release
- MD5
- 5be549b2465abb8dce823fcd2b54d70d
- Název
- Shared-Task.pdf
- Velikost
- 75.49 KB
- Formát
- Popis
- Description of the shared task (pdf)
- MD5
- fd742d3bd5df6a92bcd5bdff75501698
- Název
- Shared-Task.html
- Velikost
- 36.57 KB
- Formát
- HTML
- Popis
- Description of the shared task (html)
- MD5
- be04abe8ef63c154de8e2140ec61e5dd
- Název
- Shared-Task-Evaluation.tar.gz
- Velikost
- 3.09 MB
- Formát
- application/x-gzip
- Popis
- submissions to the pronoun-focused machine translation subtask
- MD5
- 5d30691ada7ab90bfe7e6460dbf498a3
- Shared-Task-Evaluation
- A3-108
- DiscoMT2015.test.low.fr.output.xml309 kB
- DiscoMT2015.test.low.en.xml263 kB
- alignment.txt6 kB
- README3 kB
- uu-hardmeier
- word-alignments245 kB
- target.xml0 B
- bilm-brown.ana0.03.033554432.xml801 kB
- source.xml263 kB
- DiscoMT2015-tiedemann
- DiscoMT2015.test.base.recased.xml721 kB
- DiscoMT2015.test.low.en.xml263 kB
- DiscoMT2015.test.tok.en.xml263 kB
- README490 B
- DiscoMT2015.test.DET+PRON-moses.alg243 kB
- DiscoMT2015.test.raw.en.xml256 kB
- DiscoMT2015.test.base.alg240 kB
- DiscoMT2015.test.DET+PRON-moses.recased.xml732 kB
- AnnotatorInstructions.pdf48 kB
- shared-task-results.pdf54 kB
- its2
- its2_english.xml281 kB
- its2_word_alignment.txt248 kB
- its2_french_trans.xml325 kB
- reference-translation
- DiscoMT2015.test.tok.fr244 kB
- DiscoMT2015.test.tok.en214 kB
- DiscoMT2015.test.tok.en.xml263 kB
- DiscoMT2015.test.tok.en-fr.alig243 kB
- DiscoMT2015.test.tok.fr.xml295 kB
- baseline-with-moses
- DiscoMT2015.test.base-with-moses.alg247 kB
- DiscoMT2015.test.base-with-moses.134217728.xml729 kB
- DiscoMT2015.test.base-with-moses.recased.xml729 kB
- pronoun-prediction-eval
- Geneva1.CONTRASTIVE.txt3 kB
- UoM_DiscoMT2015.test.data.mf.scaled.predict.2.txt3 kB
- whatelles.txt3 kB
- UoM_DiscoMT2015.test.data.mf.scaled.predict.1.txt3 kB
- tiedemann.DiscoMT2015.test.IWSLT14-trg2+3-d1p.liblinear.predicted.CONTRASTIVE.txt3 kB
- SharedTaskResults.pdf50 kB
- A3-108.txt3 kB
- uedin.dwetzel.allinone.txt3 kB
- Geneva2.PRIMARY.txt3 kB
- idiap.system1.PRIMARY.predictions.txt3 kB
- tiedemann.DiscoMT2015.test.IWSLT14-trg2+3-d1cp.liblinear.predicted.PRIMARY.txt3 kB
- baseline.np1_eval.txt3 kB
- uedin.dwetzel.postcombined.txt3 kB
- tuli.txt3 kB
- idiap.system2.CONTRASTIVE.predictions.txt3 kB
- annotations.tab196 kB
- IDIAP-SUBMISSION
- SECONDARY_SYSTEM
- DiscoMT2015.test.low.fr.secondary.xml285 kB
- DiscoMT2015.test.alignment.src-tgt.standard.converted240 kB
- DiscoMT2015.test.alignment.src-tgt.standard243 kB
- convert-alignment.awk184 B
- DiscoMT2015.test.low.en.xml263 kB
- README.alignments397 B
- README1 kB
- PRIMARY_SYSTEM
- DiscoMT2015.test.low.fr.primary.xml285 kB
- SECONDARY_SYSTEM
- shared-task-results.html25 kB
- auto-postEDIt
- DiscoMT.test.fr.xml293 kB
- alignment.finaltest.txt248 kB
- DiscoMT2015.test.low.en.xml263 kB
- A3-108
- Název
- Shared-Task.tar.gz
- Velikost
- 1.46 GB
- Formát
- application/x-gzip
- Popis
- training and test data
- MD5
- 7b5e0e0e79d1225b4e52da4404785077
- Shared-Task
- Test-Predict
- Train-Predict
- IWSLT14.en-fr.doc-ids.gz6 kB
- Europarl.classes111 B
- Makefile12 kB
- Europarl.data.gz313 MB
- Europarl.en-fr.doc-ids.gz12 MB
- IWSLT14.classes111 B
- NCv9.en-fr.doc-ids.gz258 kB
- NCv9.classes111 B
- IWSLT14.data.gz18 MB
- TEDdev.classes111 B
- NCv9.data.gz28 MB
- TEDdev.data.gz164 kB
- Tools
- Baseline-Predict
- README3 kB
- Test-MT
- Baseline-SMT
- Train-MT
- TEDdev.en-fr.doc-ids6 kB
- IWSLT14.TED.tst2010.en-fr.en.xml192 kB
- IWSLT14
- 1192.info533 B
- 1090.info640 B
- 1729.info695 B
- 1704.info492 B
- 1833.info540 B
- 206.info504 B
- 412.info403 B
- 335.info395 B
- 258.info518 B
- 104.info496 B
- 310.info388 B
- 233.info511 B
- 156.info379 B
- 362.info491 B
- 285.info485 B
- 1177.info568 B
- 1100.info683 B
- 131.info489 B
- 260.info415 B
- 183.info499 B
- 1152.info679 B
- 1075.info546 B
- 1050.info568 B
- 1281.info478 B
- 29.info481 B
- 1818.info642 B
- 1716.info936 B
- 1639.info580 B
- 31.info424 B
- 1768.info502 B
- 1614.info481 B
- 1229.info539 B
- 1537.info447 B
- 1820.info599 B
- 1743.info502 B
- 1204.info442 B
- 1512.info516 B
- 1127.info571 B
- 1435.info448 B
- 1795.info680 B
- 1487.info484 B
- 1410.info465 B
- 1641.info653 B
- 1256.info561 B
- 1564.info526 B
- 1770.info586 B
- 1693.info749 B
- 1462.info409 B
- 195.info472 B
- 170.info391 B
- 43.info466 B
- 426.info507 B
- 349.info453 B
- 118.info494 B
- 1549.info485 B
- 324.info484 B
- 247.info401 B
- 1678.info691 B
- 1601.info549 B
- 1216.info456 B
- 1524.info477 B
- 1139.info572 B
- 1447.info416 B
- 299.info416 B
- 222.info430 B
- 1730.info563 B
- 1653.info465 B
- 1345.info571 B
- 1576.info496 B
- 1499.info942 B
- 1114.info376 B
- 145.info430 B
- 1037.info446 B
- 351.info444 B
- 1243.info440 B
- 1551.info443 B
- 1166.info519 B
- 1089.info363 B
- 1397.info501 B
- 1012.info405 B
- 1320.info485 B
- 1141.info555 B
- 1064.info963 B
- 1372.info684 B
- 1295.info619 B
- 1270.info358 B
- 1193.info458 B
- 1091.info76 B
- 1807.info712 B
- 1859.info675 B
- 1834.info545 B
- 207.info368 B
- 413.info501 B
- 259.info944 B
- 105.info441 B
- 388.info416 B
- 234.info517 B
- 157.info425 B
- 363.info479 B
- 286.info477 B
- 1101.info519 B
- 261.info459 B
- 184.info427 B
- 1230.info490 B
- 1153.info502 B
- 1076.info593 B
- 1384.info394 B
- 390.info516 B
- 1051.info553 B
- 1282.info483 B
- 1180.info506 B
- 1717.info659 B
- 32.info430 B
- 1846.info686 B
- 1769.info718 B
- 1538.info587 B
- 1821.info576 B
- 1744.info471 B
- 1667.info574 B
- 1205.info471 B
- 1513.info683 B
- 1436.info556 B
- 1359.info482 B
- 1873.info696 B
- 1796.info615 B
- 1257.info489 B
- 1565.info458 B
- 1488.info604 B
- 1642.info737 B
- 1771.info437 B
- 1463.info462 B
- 273.info511 B
- 196.info439 B
- 171.info432 B
- 427.info495 B
- 119.info72 B
- 1627.info575 B
- 402.info497 B
- 325.info91 B
- 248.info433 B
- 1679.info514 B
- 1448.info468 B
- 1602.info473 B
- 1217.info467 B
- 377.info346 B
- 300.info479 B
- 223.info409 B
- 1885.info509 B
- 1346.info471 B
- 1269.info569 B
- 1577.info558 B
- 1500.info547 B
- 1115.info579 B
- 1423.info566 B
- 146.info387 B
- 1860.info611 B
- 1244.info522 B
- 1552.info539 B
- 1167.info618 B
- 1475.info652 B
- 1398.info528 B
- 1013.info529 B
- 1321.info564 B
- 4.info496 B
- 1142.info550 B
- 1450.info477 B
- 1065.info563 B
- 1373.info469 B
- 1296.info545 B
- 1194.info596 B
- 1040.info441 B
- 1271.info484 B
- 1092.info507 B
- 1808.info605 B
- 1706.info625 B
- 208.info434 B
- 414.info448 B
- 312.info469 B
- 235.info525 B
- 158.info87 B
- 287.info109 B
- 210.info439 B
- 1179.info709 B
- 1102.info558 B
- 185.info509 B
- 1000.info554 B
- 1231.info502 B
- 1154.info477 B
- 1077.info81 B
- 391.info456 B
- 1283.info497 B
- 1591.info623 B
- 1052.info84 B
- 1360.info541 B
- 1718.info392 B
- 33.info479 B
- 1847.info611 B
- 1308.info496 B
- 1616.info585 B
- 1822.info1 kB
- 1745.info534 B
- 1668.info653 B
- 1206.info389 B
- 1514.info642 B
- 1129.info533 B
- 1437.info405 B
- 1797.info530 B
- 1720.info632 B
- 1566.info491 B
- 1489.info939 B
- 1412.info541 B
- 1335.info472 B
- 1643.info626 B
- 1695.info561 B
- 1670.info743 B
- 274.info453 B
- 197.info426 B
- 172.info381 B
- 18.info511 B
- 428.info520 B
- 1628.info459 B
- 20.info458 B
- 403.info485 B
- 326.info482 B
- 249.info511 B
- 1757.info514 B
- 1449.info541 B
- 1603.info544 B
- 1218.info610 B
- 1526.info461 B
- 301.info458 B
- 224.info473 B
- 1886.info569 B
- 1732.info641 B
- 1655.info599 B
- 1039.info521 B
- 1501.info604 B
- 1116.info428 B
- 430.info421 B
- 353.info437 B
- 1861.info736 B
- 1784.info573 B
- 1399.info478 B
- 1014.info438 B
- 1322.info473 B
- 1630.info638 B
- 1168.info469 B
- 1476.info555 B
- 1682.info416 B
- 1297.info514 B
- 1220.info482 B
- 1143.info555 B
- 1451.info621 B
- 1066.info603 B
- 1374.info449 B
- 1195.info526 B
- 1041.info489 B
- 1272.info519 B
- 1580.info562 B
- 1093.info555 B
- 1170.info556 B
- 1809.info697 B
- 1707.info85 B
- 209.info417 B
- 313.info402 B
- 236.info498 B
- 159.info531 B
- 365.info491 B
- 288.info509 B
- 211.info534 B
- 1103.info495 B
- 340.info486 B
- 263.info387 B
- 1078.info608 B
- 1001.info455 B
- 1232.info492 B
- 1155.info498 B
- 392.info402 B
- 1284.info551 B
- 1130.info470 B
- 161.info431 B
- 1053.info569 B
- 1361.info977 B
- 1490.info614 B
- 1080.info788 B
- 1719.info603 B
- 34.info521 B
- 1848.info621 B
- 1309.info881 B
- 1823.info447 B
- 1746.info658 B
- 1669.info609 B
- 1207.info459 B
- 1515.info559 B
- 1438.info643 B
- 1798.info543 B
- 1721.info565 B
- 1336.info589 B
- 1644.info687 B
- 1567.info461 B
- 1850.info1 kB
- 1773.info106 B
- 1696.info590 B
- 1542.info723 B
- 1671.info657 B
- 198.info479 B
- 121.info535 B
- 250.info309 B
- 19.info431 B
- 429.info475 B
- 1629.info495 B
- 21.info550 B
- 404.info482 B
- 327.info440 B
- 1758.info877 B
- 1219.info457 B
- 1527.info502 B
- 1604.info456 B
- 379.info471 B
- 225.info441 B
- 1810.info613 B
- 1656.info528 B
- 1117.info495 B
- 1579.info611 B
- 431.info463 B
- 1862.info384 B
- 1785.info495 B
- 1015.info525 B
- 1323.info69 B
- 1631.info570 B
- 1246.info524 B
- 1554.info600 B
- 1760.info548 B
- 1683.info726 B
- 1298.info456 B
- 1221.info510 B
- 1144.info441 B
- 1452.info697 B
- 1067.info582 B
- 1375.info469 B
- 1581.info533 B
- 1196.info626 B
- 1042.info508 B
- 1350.info559 B
- 1171.info424 B
- 1094.info577 B
- 1708.info696 B
- 416.info409 B
- 339.info70 B
- 108.info397 B
- 237.info464 B
- 366.info452 B
- 212.info475 B
- 1104.info509 B
- 1258.info391 B
- 341.info538 B
- 264.info567 B
- 187.info409 B
- 110.info97 B
- 1079.info550 B
- 1002.info573 B
- 1310.info467 B
- 1233.info403 B
- 393.info403 B
- 1054.info589 B
- 1362.info467 B
- 1593.info564 B
- 1131.info587 B
- 162.info877 B
- 1260.info497 B
- 1491.info558 B
- 1081.info403 B
- 35.info416 B
- 1618.info486 B
- 10.info522 B
- 1824.info658 B
- 1747.info485 B
- 1208.info540 B
- 1516.info631 B
- 1876.info591 B
- 1799.info747 B
- 1722.info510 B
- 1337.info543 B
- 1645.info637 B
- 1568.info652 B
- 1851.info589 B
- 1774.info706 B
- 1697.info623 B
- 1620.info460 B
- 147.info390 B
- 276.info418 B
- 199.info70 B
- 122.info477 B
- 251.info387 B
- 174.info449 B
- 22.info466 B
- 405.info376 B
- 328.info478 B
- 1836.info615 B
- 1605.info506 B
- 1734.info603 B
- 1657.info531 B
- 1118.info570 B
- 1426.info448 B
- 1349.info496 B
- 1503.info619 B
- 432.info501 B
- 1786.info575 B
- 1401.info925 B
- 1016.info571 B
- 1632.info553 B
- 1247.info514 B
- 1555.info544 B
- 1478.info471 B
- 1684.info516 B
- 1068.info1 kB
- 1376.info426 B
- 1222.info530 B
- 1530.info577 B
- 1145.info490 B
- 1890.info584 B
- 1274.info510 B
- 1582.info581 B
- 1197.info600 B
- 1120.info459 B
- 1043.info475 B
- 1351.info574 B
- 1172.info555 B
- 1480.info522 B
- 1095.info450 B
- 109.info83 B
- 315.info470 B
- 213.info425 B
- 1259.info412 B
- 1105.info470 B
- 188.info72 B
- 1465.info625 B
- 1003.info608 B
- 1311.info494 B
- 1234.info532 B
- 394.info497 B
- 163.info481 B
- 1055.info584 B
- 1286.info488 B
- 1594.info579 B
- 1132.info587 B
- 1440.info411 B
- 292.info430 B
- 1261.info456 B
- 1184.info490 B
- 1492.info501 B
- 1030.info477 B
- 190.info476 B
- 1082.info570 B
- 1390.info377 B
- 36.info509 B
- 1619.info437 B
- 11.info534 B
- 1209.info507 B
- 1517.info591 B
- 1877.info607 B
- 1800.info875 B
- 1338.info521 B
- 1646.info518 B
- 1569.info557 B
- 1698.info542 B
- 1621.info508 B
- 1544.info462 B
- 148.info378 B
- 1750.info541 B
- 354.info422 B
- 200.info358 B
- 123.info477 B
- 252.info485 B
- 175.info467 B
- 381.info393 B
- 23.info487 B
- 406.info489 B
- 329.info514 B
- 1837.info619 B
- 1606.info587 B
- 1735.info509 B
- 1658.info644 B
- 1119.info545 B
- 433.info428 B
- 1787.info563 B
- 1710.info615 B
- 1479.info566 B
- 1402.info647 B
- 1017.info80 B
- 1633.info675 B
- 1248.info521 B
- 1556.info544 B
- 1762.info710 B
- 1069.info480 B
- 1377.info491 B
- 1223.info419 B
- 1531.info945 B
- 1146.info602 B
- 1454.info533 B
- 1891.info605 B
- 1660.info500 B
- 1583.info640 B
- 1198.info490 B
- 1121.info475 B
- 1044.info567 B
- 1352.info418 B
- 1250.info613 B
- 1173.info574 B
- 1481.info490 B
- 418.info461 B
- 316.info488 B
- 239.info490 B
- 214.info501 B
- 1106.info458 B
- 420.info492 B
- 343.info478 B
- 266.info463 B
- 189.info508 B
- 1158.info504 B
- 112.info416 B
- 1389.info761 B
- 1004.info557 B
- 1312.info454 B
- 1235.info488 B
- 395.info475 B
- 241.info482 B
- 1672.info620 B
- 1133.info625 B
- 1441.info383 B
- 164.info448 B
- 1056.info631 B
- 1364.info550 B
- 1287.info359 B
- 1595.info608 B
- 1210.info536 B
- 1031.info479 B
- 1262.info357 B
- 1570.info717 B
- 1185.info575 B
- 191.info470 B
- 1160.info645 B
- 1083.info479 B
- 1391.info483 B
- 37.info458 B
- 12.info411 B
- 1826.info585 B
- 1801.info724 B
- 1724.info637 B
- 1853.info574 B
- 1622.info675 B
- 1545.info602 B
- 149.info429 B
- 355.info468 B
- 278.info401 B
- 201.info457 B
- 330.info389 B
- 253.info436 B
- 176.info366 B
- 151.info955 B
- 280.info493 B
- 1070.info552 B
- 1709.info519 B
- 407.info353 B
- 1607.info581 B
- 1813.info586 B
- 1505.info474 B
- 434.info470 B
- 1788.info630 B
- 1711.info550 B
- 1403.info494 B
- 1018.info434 B
- 1326.info516 B
- 1249.info504 B
- 1557.info636 B
- 1763.info365 B
- 1147.info521 B
- 1455.info566 B
- 1378.info723 B
- 1301.info446 B
- 1224.info475 B
- 1661.info434 B
- 1045.info536 B
- 1353.info490 B
- 1276.info413 B
- 1584.info517 B
- 1199.info542 B
- 1122.info549 B
- 1790.info478 B
- 1251.info97 B
- 1482.info513 B
- 1380.info511 B
- 419.info491 B
- 215.info458 B
- 1107.info636 B
- 421.info503 B
- 344.info517 B
- 267.info387 B
- 1236.info427 B
- 1159.info455 B
- 1467.info527 B
- 113.info400 B
- 1005.info498 B
- 1313.info444 B
- 396.info500 B
- 242.info463 B
- 1134.info477 B
- 1442.info471 B
- 165.info85 B
- 1057.info732 B
- 1365.info456 B
- 1288.info472 B
- 1596.info513 B
- 1211.info403 B
- 371.info449 B
- 294.info508 B
- 140.info510 B
- 1032.info573 B
- 1571.info596 B
- 1186.info524 B
- 1494.info585 B
- 1161.info438 B
- 1084.info588 B
- 1290.info511 B
- 38.info490 B
- 13.info435 B
- 1519.info514 B
- 1879.info592 B
- 1802.info535 B
- 1725.info405 B
- 1648.info679 B
- 1700.info489 B
- 1623.info655 B
- 356.info356 B
- 279.info472 B
- 202.info463 B
- 125.info508 B
- 331.info711 B
- 254.info329 B
- 177.info501 B
- 152.info506 B
- 1096.info783 B
- 1071.info526 B
- 25.info541 B
- 408.info490 B
- 1839.info619 B
- 1608.info710 B
- 306.info503 B
- 1737.info503 B
- 1506.info623 B
- 1429.info488 B
- 1789.info917 B
- 1712.info605 B
- 1558.info536 B
- 1404.info462 B
- 1019.info491 B
- 1327.info429 B
- 1841.info553 B
- 1764.info578 B
- 1687.info552 B
- 1148.info603 B
- 1456.info432 B
- 1379.info386 B
- 1302.info509 B
- 1533.info465 B
- 1046.info529 B
- 1585.info664 B
- 1200.info443 B
- 1431.info469 B
- 1791.info624 B
- 1252.info428 B
- 1483.info656 B
- 1381.info477 B
- 318.info733 B
- 216.info467 B
- 1339.info497 B
- 1108.info603 B
- 139.info469 B
- 422.info449 B
- 345.info492 B
- 268.info486 B
- 1237.info552 B
- 1468.info474 B
- 114.info463 B
- 1006.info556 B
- 1314.info454 B
- 320.info501 B
- 243.info415 B
- 1674.info515 B
- 1212.info585 B
- 1135.info473 B
- 1058.info469 B
- 1366.info467 B
- 1289.info486 B
- 1597.info557 B
- 372.info455 B
- 1495.info705 B
- 1110.info563 B
- 141.info494 B
- 1033.info566 B
- 1264.info442 B
- 1572.info484 B
- 270.info97 B
- 1085.info939 B
- 1162.info89 B
- 1470.info553 B
- 1291.info529 B
- 1060.info504 B
- 39.info418 B
- 14.info448 B
- 1828.info599 B
- 1726.info439 B
- 1649.info560 B
- 1855.info642 B
- 1701.info631 B
- 1624.info490 B
- 228.info473 B
- 1753.info581 B
- 203.info344 B
- 126.info514 B
- 255.info464 B
- 178.info397 B
- 101.info480 B
- 230.info475 B
- 153.info481 B
- 282.info521 B
- 1020.info571 B
- 1072.info524 B
- 26.info393 B
- 409.info488 B
- 1609.info82 B
- 307.info521 B
- 1815.info614 B
- 1738.info541 B
- 1507.info590 B
- 1713.info547 B
- 1559.info533 B
- 1405.info428 B
- 1328.info507 B
- 1636.info691 B
- 1765.info671 B
- 1688.info649 B
- 1226.info621 B
- 1149.info579 B
- 1457.info531 B
- 1303.info597 B
- 1611.info576 B
- 1740.info88 B
- 1663.info451 B
- 1124.info534 B
- 1432.info586 B
- 1047.info1 kB
- 1355.info432 B
- 1586.info507 B
- 1201.info469 B
- 1792.info404 B
- 1330.info437 B
- 1253.info577 B
- 1561.info490 B
- 1176.info363 B
- 1484.info538 B
- 1690.info576 B
- 1382.info371 B
- 192.info483 B
- 319.info428 B
- 217.info439 B
- 1109.info598 B
- 40.info526 B
- 423.info500 B
- 346.info477 B
- 269.info84 B
- 1007.info598 B
- 1315.info376 B
- 1238.info437 B
- 1546.info624 B
- 1469.info332 B
- 115.info56 B
- 321.info418 B
- 1675.info861 B
- 167.info465 B
- 1598.info541 B
- 1213.info496 B
- 1521.info542 B
- 1136.info624 B
- 1444.info426 B
- 1059.info570 B
- 1367.info518 B
- 296.info88 B
- 1188.info658 B
- 1496.info523 B
- 1111.info459 B
- 142.info415 B
- 1034.info371 B
- 1342.info466 B
- 1265.info560 B
- 1573.info515 B
- 271.info482 B
- 1086.info551 B
- 1394.info676 B
- 1240.info498 B
- 1163.info637 B
- 1471.info331 B
- 1061.info423 B
- 1190.info484 B
- 1804.info707 B
- 1727.info669 B
- 1856.info71 B
- 1779.info710 B
- 1702.info707 B
- 1831.info532 B
- 435.info406 B
- 358.info451 B
- 204.info390 B
- 127.info518 B
- 410.info465 B
- 333.info385 B
- 385.info511 B
- 231.info762 B
- 000.info85 B
- 154.info345 B
- 360.info479 B
- 1098.info465 B
- 181.info492 B
- 1150.info363 B
- 1073.info483 B
- 27.info489 B
- 308.info477 B
- 1739.info530 B
- 1508.info499 B
- 1329.info443 B
- 1637.info645 B
- 1843.info492 B
- 1766.info617 B
- 1689.info594 B
- 1227.info617 B
- 1535.info362 B
- 1458.info396 B
- 1304.info456 B
- 1612.info514 B
- 1664.info611 B
- 1125.info578 B
- 1433.info432 B
- 1048.info456 B
- 1356.info570 B
- 1587.info651 B
- 1202.info589 B
- 1510.info545 B
- 1793.info710 B
- 1562.info584 B
- 1485.info550 B
- 1691.info465 B
- 1460.info577 B
- 193.info514 B
- 218.info477 B
- 41.info442 B
- 424.info431 B
- 347.info406 B
- 116.info395 B
- 1008.info503 B
- 1239.info761 B
- 1547.info852 B
- 399.info433 B
- 322.info481 B
- 245.info483 B
- 168.info506 B
- 1676.info988 B
- 1599.info565 B
- 1214.info637 B
- 1522.info579 B
- 1137.info631 B
- 1445.info472 B
- 1368.info460 B
- 374.info404 B
- 297.info433 B
- 220.info475 B
- 1651.info552 B
- 1266.info485 B
- 1574.info578 B
- 1189.info537 B
- 1112.info522 B
- 1420.info536 B
- 143.info446 B
- 1343.info599 B
- 272.info537 B
- 1780.info569 B
- 1164.info539 B
- 1472.info614 B
- 1087.info495 B
- 1395.info573 B
- 1010.info445 B
- 1241.info473 B
- 1.info556 B
- 1062.info526 B
- 1370.info445 B
- 1191.info579 B
- 16.info534 B
- 1805.info534 B
- 1728.info491 B
- 1857.info654 B
- 1703.info502 B
- 1626.info675 B
- 1832.info715 B
- 359.info494 B
- 128.info501 B
- 411.info453 B
- 334.info434 B
- 103.info943 B
- 386.info339 B
- 232.info488 B
- 155.info474 B
- 361.info525 B
- 130.info419 B
- 182.info477 B
- 1074.info539 B
- 1280.info514 B
- 28.info487 B
- 1817.info655 B
- 1509.info541 B
- 1715.info931 B
- 1638.info568 B
- 30.info472 B
- 1767.info475 B
- 1613.info500 B
- 1228.info481 B
- 1459.info492 B
- 1305.info504 B
- 1896.info504 B
- 1742.info549 B
- 1588.info540 B
- 1203.info498 B
- 1126.info380 B
- 1434.info543 B
- 1049.info503 B
- 1357.info402 B
- 1794.info617 B
- 1640.info551 B
- 1255.info374 B
- 1563.info568 B
- 1461.info635 B
- 194.info470 B
- 219.info450 B
- 42.info542 B
- 348.info440 B
- 117.info425 B
- 1009.info587 B
- 400.info435 B
- 323.info485 B
- 246.info553 B
- 1754.info457 B
- 1215.info554 B
- 1523.info594 B
- 1138.info480 B
- 375.info433 B
- 298.info85 B
- 221.info516 B
- 1883.info587 B
- 1652.info577 B
- 1267.info484 B
- 1575.info612 B
- 1498.info538 B
- 1113.info371 B
- 144.info450 B
- 1036.info467 B
- 1344.info425 B
- 350.info528 B
- 1165.info513 B
- 1473.info492 B
- 1088.info517 B
- 1396.info807 B
- 1011.info550 B
- 1140.info515 B
- 1371.info356 B
- 1294.info512 B
- IWSLT14.TED.tst2010.en-fr.fr.xml212 kB
- Makefile1 kB
- data
- Test-Predict
- ... too many files ...0 B
- Název
- ParCor-Annotations.tar.gz
- Velikost
- 495.02 KB
- Formát
- application/x-gzip
- Popis
- Pronoun Annotations for the DiscoMT 2015 Test Set
- MD5
- 36778068670a43338aaeeef5777ea046
- ParCor-Annotations
- DiscoMT2015.test.mmax
- 003_1894.mmax152 B
- Basedata
- 002_1825_words.xml87 kB
- 001_1819_words.xml94 kB
- words.dtd83 B
- 007_1953_words.xml143 kB
- 010_205_words.xml131 kB
- 008_1979_words.xml89 kB
- 011_2053_words.xml90 kB
- 003_1894_words.xml185 kB
- 004_1935_words.xml99 kB
- 000_1756_words.xml137 kB
- 005_1938_words.xml82 kB
- 006_1950_words.xml191 kB
- 009_2043_words.xml108 kB
- Schemes
- coref_scheme.xml4 kB
- checks_scheme.xml327 B
- sentence_scheme.xml585 B
- 008_1979.mmax152 B
- Styles
- just_text.xsl1 kB
- default_style.xsl1 kB
- with_handles.xsl1 kB
- muc_style.xsl5 kB
- generic_nongui_style.xsl690 B
- 010_205.mmax151 B
- 009_2043.mmax152 B
- common_paths.xml763 B
- 006_1950.mmax152 B
- enTEDChecks-mmax.py9 kB
- 011_2053.mmax152 B
- 000_1756.mmax152 B
- Customizations
- sentence_customization.xml72 B
- coref_customization.xml884 B
- markables
- 005_1938_coref_level.xml30 kB
- 000_1756_checks_level.xml130 B
- 003_1894_sentence_level.xml22 kB
- 004_1935_checks_level.xml130 B
- markables.dtd69 B
- 006_1950_sentence_level.xml23 kB
- 006_1950_coref_level.xml122 kB
- 003_1894_coref_level.xml153 kB
- 000_1756_coref_level.xml111 kB
- 005_1938_sentence_level.xml9 kB
- 007_1953_coref_level.xml125 kB
- 004_1935_sentence_level.xml13 kB
- 007_1953_sentence_level.xml23 kB
- 001_1819_sentence_level.xml14 kB
- 010_205_coref_level.xml121 kB
- 000_1756_sentence_level.xml17 kB
- 009_2043_coref_level.xml74 kB
- .000_1756_coref_level.xml.swp16 kB
- 002_1825_coref_level.xml47 kB
- 008_1979_coref_level.xml64 kB
- 008_1979_sentence_level.xml13 kB
- 011_2053_coref_level.xml51 kB
- 004_1935_coref_level.xml63 kB
- 011_2053_sentence_level.xml13 kB
- 010_205_sentence_level.xml27 kB
- 009_2043_sentence_level.xml15 kB
- 002_1825_sentence_level.xml11 kB
- 001_1819_coref_level.xml58 kB
- 004_1935.mmax152 B
- 005_1938.mmax152 B
- 001_1819.mmax152 B
- 007_1953.mmax152 B
- 002_1825.mmax152 B
- Documentation
- AnnotationGuidelines_v1.1.pdf178 kB
- ._AnnotationGuidelines_v1.1.pdf674 B
- README2 kB
- DiscoMT2015.test.mmax
- Název
- DiscoMT2015.SMT-baseline.tar.gz
- Velikost
- 4.8 GB
- Formát
- application/x-gzip
- Popis
- SMT baseline system
- MD5
- c60b18b674e1f84b6853924d4d0da9c6
- tools
- convert-nc9.pl813 B
- merge-word-alignments.pl~481 B
- clean-corpus-n.perl3 kB
- insert-placeholders_v1.pl5 kB
- europarl-markup.pl1 kB
- convert-iwslt.pl1 kB
- europarl-markup.pl~809 B
- convert-iwslt.pl~1 kB
- insert-placeholders.pl~7 kB
- insert-placeholders.pl7 kB
- convert-nc9.pl~131 B
- insert-placeholders_v3.pl6 kB
- tokenizer
- deescape-special-chars.perl466 B
- escape-special-chars.perl586 B
- replace-unicode-punctuation.perl609 B
- basic-protected-patterns178 B
- pre-tokenizer.perl801 B
- remove-non-printing-char.perl286 B
- normalize-punctuation.perl1 kB
- lowercase.perl120 B
- detokenizer.perl11 kB
- tokenizer.perl16 kB
- merge-word-alignments.pl516 B
- share
- nonbreaking_prefixes
- nonbreaking_prefix.nl1 kB
- nonbreaking_prefix.cs1 kB
- nonbreaking_prefix.lv1 kB
- nonbreaking_prefix.it1 kB
- nonbreaking_prefix.fr1008 B
- nonbreaking_prefix.is1 kB
- nonbreaking_prefix.ru1 kB
- nonbreaking_prefix.ta2 kB
- nonbreaking_prefix.ro104 B
- nonbreaking_prefix.fi1 kB
- nonbreaking_prefix.ca324 B
- nonbreaking_prefix.sv184 B
- nonbreaking_prefix.pt1 kB
- README.txt255 B
- nonbreaking_prefix.pl1 kB
- nonbreaking_prefix.sl356 B
- nonbreaking_prefix.sk2 kB
- nonbreaking_prefix.de1 kB
- nonbreaking_prefix.hu1 kB
- nonbreaking_prefix.es835 B
- nonbreaking_prefix.en1 kB
- nonbreaking_prefix.el16 kB
- nonbreaking_prefixes
- insert-placeholders_v2.pl5 kB
- workdir
- ted_ep_nc
- ted_ep_nc.ini1 kB
- model
- phrase-table-filtered.binphr.tgtvoc1 MB
- phrase-table-filtered.binphr.tgtdata.wa653 MB
- phrase-table-filtered.binphr.srctree.wa107 MB
- phrase-table-filtered.gz294 MB
- phrase-table-filtered.binphr.srcvoc1 MB
- phrase-table-filtered.binphr.idx573 kB
- phrase-table-filtered.d
- probing_hash.dat125 MB
- config18 B
- target_phrases1 MB
- binfile.dat333 MB
- source_vocabids2 MB
- Wall16 MB
- corpus.5.fr.trie.kenlm4 GB
- ted_ep_nc
- Makefile15 kB
- README1 kB
- Název
- DiscoMT2015.SMT-baseline-all.tar.gz
- Velikost
- 10.31 GB
- Formát
- application/x-gzip
- Popis
- SMT baseline system including intermediate files
- MD5
- e216c70373caf323326da1661a38afc5
- tools
- convert-nc9.pl813 B
- merge-word-alignments.pl~481 B
- clean-corpus-n.perl3 kB
- insert-placeholders_v1.pl5 kB
- europarl-markup.pl1 kB
- convert-iwslt.pl1 kB
- europarl-markup.pl~809 B
- convert-iwslt.pl~1 kB
- insert-placeholders.pl~7 kB
- insert-placeholders.pl7 kB
- convert-nc9.pl~131 B
- insert-placeholders_v3.pl6 kB
- tokenizer
- deescape-special-chars.perl466 B
- escape-special-chars.perl586 B
- replace-unicode-punctuation.perl609 B
- basic-protected-patterns178 B
- pre-tokenizer.perl801 B
- remove-non-printing-char.perl286 B
- normalize-punctuation.perl1 kB
- lowercase.perl120 B
- detokenizer.perl11 kB
- tokenizer.perl16 kB
- merge-word-alignments.pl516 B
- share
- nonbreaking_prefixes
- nonbreaking_prefix.nl1 kB
- nonbreaking_prefix.cs1 kB
- nonbreaking_prefix.lv1 kB
- nonbreaking_prefix.it1 kB
- nonbreaking_prefix.fr1008 B
- nonbreaking_prefix.is1 kB
- nonbreaking_prefix.ru1 kB
- nonbreaking_prefix.ta2 kB
- nonbreaking_prefix.ro104 B
- nonbreaking_prefix.fi1 kB
- nonbreaking_prefix.ca324 B
- nonbreaking_prefix.sv184 B
- nonbreaking_prefix.pt1 kB
- README.txt255 B
- nonbreaking_prefix.pl1 kB
- nonbreaking_prefix.sl356 B
- nonbreaking_prefix.sk2 kB
- nonbreaking_prefix.de1 kB
- nonbreaking_prefix.hu1 kB
- nonbreaking_prefix.es835 B
- nonbreaking_prefix.en1 kB
- nonbreaking_prefix.el16 kB
- nonbreaking_prefixes
- insert-placeholders_v2.pl5 kB
- workdir
- ted_ep_nc
- ted_ep_nc.IWSLT14.TED.tst2010.en.fr.log1 MB
- ted_ep_nc.IWSLT14.TED.tst2010.en.fr168 kB
- corpus.fr-en.align308 MB
- corpus.en-fr729 MB
- ted_ep_nc.IWSLT14.TED.tst2012.en.fr.eval88 B
- corpus.en-fr.lines17 MB
- corpus.en-fr.classification.data395 MB
- corpus.5.fr.trie.kenlm4 GB
- ted_ep_nc.IWSLT14.TED.tst2012.en.fr.log846 kB
- model
- phrase-table.gz3 GB
- phrase-table-filtered.binphr.tgtvoc1 MB
- phrase-table-filtered.d
- probing_hash.dat125 MB
- config18 B
- target_phrases1 MB
- binfile.dat333 MB
- source_vocabids2 MB
- Wall16 MB
- phrase-table-filtered.binphr.srctree.wa107 MB
- phrase-table-filtered.binphr.idx573 kB
- lex.e2f112 MB
- phrase-table-filtered.binphr.srcvoc1 MB
- aligned.grow-diag-final-and340 MB
- lex.f2e112 MB
- moses-binary.ini746 B
- moses.ini749 B
- phrase-table-filtered.binphr.tgtdata.wa653 MB
- phrase-table-filtered.gz294 MB
- moses-filtered-binary.ini755 B
- ted_ep_nc.IWSLT14.TED.tst2010.en.fr.eval88 B
- corpus.en-fr.classification.data.log12 kB
- corpus.fr4 GB
- tuned
- run4.weights.txt76 B
- run3.moses.ini1 kB
- run1.extract.out0 B
- run4.best200.out.gz8 MB
- run3.mert.log808 B
- run3.best200.out.gz8 MB
- run1.moses.ini1 kB
- run4.extract.out0 B
- run3.features.dat15 MB
- run2.init.opt109 B
- run2.best200.out.gz9 MB
- run1.out181 kB
- init.opt107 B
- run1.extract.err540 B
- run1.best200.out.gz7 MB
- run1.mert.out0 B
- run2.scores.dat7 MB
- run4.mert.log946 B
- run4.extract.err539 B
- run3.weights.txt76 B
- run4.scores.dat6 MB
- run3.init.opt107 B
- run4.out185 kB
- run2.features.dat16 MB
- run2.mert.out0 B
- weights.txt76 B
- run3.extract.out0 B
- run4.moses.ini1 kB
- mert.out0 B
- features.list171 B
- extract.out0 B
- run4.init.opt107 B
- run3.extract.err540 B
- run2.moses.ini1 kB
- run2.weights.txt76 B
- run3.mert.out0 B
- run3.out186 kB
- extractor.sh384 B
- run1.mert.log528 B
- run1.features.dat12 MB
- extract.err539 B
- run2.extract.out0 B
- run2.dense201 B
- run4.dense199 B
- run1.dense197 B
- run1.scores.dat5 MB
- run3.dense199 B
- moses.ini1 kB
- run3.scores.dat6 MB
- run4.mert.out0 B
- run2.mert.log664 B
- run2.extract.err541 B
- run4.features.dat15 MB
- run2.out185 kB
- mert.log946 B
- run1.weights.txt78 B
- run1.init.opt105 B
- finished_step.txt2 B
- corpus.en-fr.align331 MB
- ted_ep_nc.ini1 kB
- ted_ep_nc.IWSLT14.TED.tst2012.en.fr119 kB
- ted_ep_nc
- Makefile15 kB
- README1 kB
- Název
- DiscoMT2015.SMT-recaser.tar.gz
- Velikost
- 1.05 GB
- Formát
- application/x-gzip
- Popis
- SMT baseline system - recaser model
- MD5
- d65ff4955cef670b19341e5c7e961965
- recaser
- truecaser_model37 MB
- moses.ini736 B
- cased.kenlm1 GB
- phrase-table.gz21 MB