Show simple item record

 
dc.contributor.author Vodolán, Miroslav
dc.contributor.author Jurčíček, Filip
dc.date.accessioned 2016-04-04T08:02:54Z
dc.date.available 2016-04-04T08:02:54Z
dc.date.issued 2016
dc.identifier.uri http://hdl.handle.net/11234/1-1670
dc.description Dataset collected from natural dialogs which enables to test the ability of dialog systems to interactively learn new facts from user utterances throughout the dialog. The dataset, consisting of 1900 dialogs, allows simulation of an interactive gaining of denotations and questions explanations from users which can be used for the interactive learning.
dc.language.iso eng
dc.publisher Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
dc.rights Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
dc.rights.uri http://creativecommons.org/licenses/by-sa/4.0/
dc.subject question dialogs
dc.subject interactive learning
dc.title Question Dialogs Dataset
dc.type lexicalConceptualResource
metashare.ResourceInfo#ContentInfo.mediaType text
metashare.ResourceInfo#ContentInfo.detailedType other
dc.rights.label PUB
has.files yes
branding LINDAT / CLARIAH-CZ
contact.person Miroslav Vodolán miravod@centrum.cz vodolan@ufal.mff.cuni.cz
sponsor Ministerstvo školství, mládeže a tělovýchovy České republiky LK11221 Vývoj metod pro návrh statistických mluvených dialogových systémů nationalFunds
sponsor Univerzita Karlova v Praze (mimo GAUK) SVV 260 224 Specifický vysokoškolský výzkum nationalFunds
sponsor Grantová agentura Univerzity Karlovy v Praze GAUK 1170516 Řízení dialogu v otevřených doménách s využitím znalostních grafů nationalFunds
size.info 1900 items
size.info 8533 turns
files.size 3639983
files.count 7


 Files in this item

 Download all files in item (3.47 MB)
This item is
Publicly Available
and licensed under:
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Distributed under Creative Commons Attribution Required Share Alike
Icon
Name
question_dialogs-train.json
Size
1.71 MB
Format
Unknown
Description
Training part of the dataset.
MD5
d9c811d3a4067337ba1d1952f375fa83
 Download file
Icon
Name
question_dialogs-dev.json
Size
515.51 KB
Format
Unknown
Description
Development part of the dataset.
MD5
0540cbed16c80de5a72957fad5d46e53
 Download file
Icon
Name
question_dialogs-test.json
Size
1.22 MB
Format
Unknown
Description
Test part of the dataset.
MD5
9356c5058bc890e49b580f0c69f4c1e7
 Download file
Icon
Name
README.txt
Size
11.33 KB
Format
Text file
Description
Readme describing the dataset.
MD5
99f1ec5f83c43eece65d5ae7e2367665
 Download file  Preview
 File Preview  
==========================
 QUESTION DIALOGS DATASET
==========================
For more details see the paper:
    "Data Collection for Interactive Learning through the Dialog", 2016,
        Vodolán Miroslav, Filip Jurčíček,  
        http://arxiv.org/abs/1603.09631


The dataset consists of standard data split into training, development and test files:
    1) question_dialogs-train.json
    2) question_dialogs-dev.json
    3) question_dialogs-test.json

Dataset files contain one dialog per line. The dialogs are stored in json format. 

Three python scripts are released with the dataset:
    a) interactive_learning_evaluator.py - Evaluates given model in interactive manner on dialogs simmulated from conversations in dataset.
    b) interactive_model_base.py - Base class which simplifies developement of interactive models by providing standard routines for communication with interactive_learning_evaluator.py simulator.
    c) simple_interactive_model.py - Simple imp . . .
                                            
Icon
Name
interactive_model_base.py
Size
3.95 KB
Format
Unknown
Description
Base class for interactive model development.
MD5
2d6b66a79ed16ba7df43049af6996ea1
 Download file
Icon
Name
simple_interactive_model.py
Size
3 KB
Format
Unknown
Description
Simple implementation of interactive model.
MD5
fcf070af58c18b8fb78bf2b7172540a8
 Download file
Icon
Name
interactive_learning_evaluator.py
Size
19.71 KB
Format
Unknown
Description
Script for evaluation of interactive models.
MD5
54dba2acbee67b4b785bf1892624c5a7
 Download file

Show simple item record