dc.contributor.author | Vodolán, Miroslav |
dc.contributor.author | Jurčíček, Filip |
dc.date.accessioned | 2016-04-04T08:02:54Z |
dc.date.available | 2016-04-04T08:02:54Z |
dc.date.issued | 2016 |
dc.identifier.uri | http://hdl.handle.net/11234/1-1670 |
dc.description | Dataset collected from natural dialogs which enables to test the ability of dialog systems to interactively learn new facts from user utterances throughout the dialog. The dataset, consisting of 1900 dialogs, allows simulation of an interactive gaining of denotations and questions explanations from users which can be used for the interactive learning. |
dc.language.iso | eng |
dc.publisher | Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) |
dc.rights | Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) |
dc.rights.uri | http://creativecommons.org/licenses/by-sa/4.0/ |
dc.subject | question dialogs |
dc.subject | interactive learning |
dc.title | Question Dialogs Dataset |
dc.type | lexicalConceptualResource |
metashare.ResourceInfo#ContentInfo.mediaType | text |
metashare.ResourceInfo#ContentInfo.detailedType | other |
dc.rights.label | PUB |
has.files | yes |
branding | LINDAT / CLARIAH-CZ |
contact.person | Miroslav Vodolán miravod@centrum.cz vodolan@ufal.mff.cuni.cz |
sponsor | Ministerstvo školství, mládeže a tělovýchovy České republiky LK11221 Vývoj metod pro návrh statistických mluvených dialogových systémů nationalFunds |
sponsor | Univerzita Karlova v Praze (mimo GAUK) SVV 260 224 Specifický vysokoškolský výzkum nationalFunds |
sponsor | Grantová agentura Univerzity Karlovy v Praze GAUK 1170516 Řízení dialogu v otevřených doménách s využitím znalostních grafů nationalFunds |
size.info | 1900 items |
size.info | 8533 turns |
files.size | 3639983 |
files.count | 7 |
Soubory tohoto záznamu
Stáhnout všechny soubory záznamu (3.47 MB)Licenční kategorie:
Licence: Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Publicly Available
Licence: Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
- Název
- question_dialogs-train.json
- Velikost
- 1.71 MB
- Formát
- Neznámý
- Popis
- Training part of the dataset.
- MD5
- d9c811d3a4067337ba1d1952f375fa83
- Název
- question_dialogs-dev.json
- Velikost
- 515.51 KB
- Formát
- Neznámý
- Popis
- Development part of the dataset.
- MD5
- 0540cbed16c80de5a72957fad5d46e53
- Název
- question_dialogs-test.json
- Velikost
- 1.22 MB
- Formát
- Neznámý
- Popis
- Test part of the dataset.
- MD5
- 9356c5058bc890e49b580f0c69f4c1e7
- Název
- README.txt
- Velikost
- 11.33 KB
- Formát
- Textový soubor
- Popis
- Readme describing the dataset.
- MD5
- 99f1ec5f83c43eece65d5ae7e2367665
========================== QUESTION DIALOGS DATASET ========================== For more details see the paper: "Data Collection for Interactive Learning through the Dialog", 2016, Vodolán Miroslav, Filip Jurčíček, http://arxiv.org/abs/1603.09631 The dataset consists of standard data split into training, development and test files: 1) question_dialogs-train.json 2) question_dialogs-dev.json 3) question_dialogs-test.json Dataset files contain one dialog per line. The dialogs are stored in json format. Three python scripts are released with the dataset: a) interactive_learning_evaluator.py - Evaluates given model in interactive manner on dialogs simmulated from conversations in dataset. b) interactive_model_base.py - Base class which simplifies developement of interactive models by providing standard routines for communication with interactive_learning_evaluator.py simulator. c) simple_interactive_model.py - Simple imp . . .
- Název
- interactive_model_base.py
- Velikost
- 3.95 KB
- Formát
- Neznámý
- Popis
- Base class for interactive model development.
- MD5
- 2d6b66a79ed16ba7df43049af6996ea1
- Název
- simple_interactive_model.py
- Velikost
- 3 KB
- Formát
- Neznámý
- Popis
- Simple implementation of interactive model.
- MD5
- fcf070af58c18b8fb78bf2b7172540a8
- Název
- interactive_learning_evaluator.py
- Velikost
- 19.71 KB
- Formát
- Neznámý
- Popis
- Script for evaluation of interactive models.
- MD5
- 54dba2acbee67b4b785bf1892624c5a7