High-Coverage Multi-Level Text Corpus for Non-Professional Voice Conservation

Please use the following text to cite this item:
Jůzová, Markéta; Tihelka, Daniel and Matoušek, Jindřich, 2017, High-Coverage Multi-Level Text Corpus for Non-Professional Voice Conservation, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11234/1-2585.
Date issued
2017-12-12
Size
3500 sentences
Language(s)
Description
This text corpus contains a carefully optimized set of sentences that can be used when preparing a speech corpus for the development of a personalized text-to-speech system. It was designed primarily for the voice conservation procedure, which must be performed in the relatively short period before a person loses his/her own voice, typically because of a total laryngectomy. Total laryngectomy is a radical treatment procedure which is often unavoidable to save the lives of patients who were diagnosed with severe laryngeal cancer. In spite of being very effective with respect to the primary treatment, it significantly handicaps the patients due to the permanent loss of their ability to use their voice and produce speech. Fortunately, modern methods of computer text-to-speech (TTS) synthesis offer a possibility for "digital conservation" of the patient's original voice for his/her future speech communication -- a procedure called voice banking or voice conservation. Moreover, the banking procedure can be undertaken by any person facing voice degradation or loss in the more distant future, or who simply wishes to keep his/her voice print.
Acknowledgement
 Files in this item
Name
sentences.xml
Size
1.81 MB
Format
text/xml
Description
XML
MD5
ec174a721b658f5522252dd6455930ea
Name
README.txt
Size
6.46 KB
Format
text/plain
Description
Text
MD5
fc41091bebd67f3830f4053c1696c6c5
Preview
  File Preview
    Title: High-Coverage Multi-Level Text Corpus for Non-Professional Voice Conservation
    Authors: Markéta Jůzová, Daniel Tihelka, Jindřich Matoušek
    
    Note: This corpus constitutes the research outcome TH02010307-V2 of the project "Automatic voice banking and reconstruction for patients after total laryngectomy" (TH02010307).                           
    
    -----
    This text corpus contains a carefully optimized set of sentences that can be used when preparing a speech corpus for the development
    of a personalized text-to-speech system. It was designed primarily for the voice conservation procedure, which must be performed in the relatively short
    period before a person loses his/her own voice, typically because of a total laryngectomy.
    
    Total laryngectomy is a radical treatment procedure which is often unavoidable to save the lives of patients who were diagnosed with severe laryngeal cancer.
    In spite of being very effective with respect to the primary treatment, it significantly handicaps the patients due to the permanent loss of their
    ability to use their voice and produce speech. Fortunately, modern methods of computer text-to-speech (TTS) synthesis offer a possibility for "digital
    conservation" of the patient's original voice for his/her future speech communication -- a procedure called voice banking or voice conservation. Moreover,
    the banking procedure can be undertaken by any person facing voice degradation or loss in the more distant future, or who simply wishes to keep his/her
    voice print.
    
    The key aspect is the design of the speech recording process, since the speakers are required to record speech data suitable for a personalized
    TTS system with a reasonable level of quality. The fact that there can be very little time between the diagnosis and surgery in case of laryngectomy,
    together with the fact that a typical speaker is completely untrained in speech recording (sometimes even having limited computer skills), makes the recording
    conditions and speech cor . . .