High-Coverage Multi-Level Text Corpus for Non-Professional Voice Conservation
Please use the following text to cite this item:
Jůzová, Markéta; Tihelka, Daniel and Matoušek, Jindřich, 2017,
High-Coverage Multi-Level Text Corpus for Non-Professional Voice Conservation, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL),
http://hdl.handle.net/11234/1-2585.
Authors
Jůzová, Markéta; Tihelka, Daniel; Matoušek, Jindřich
Item identifier
http://hdl.handle.net/11234/1-2585
Date issued
2017-12-12
Size
3500 sentences
Language(s)
Description
This text corpus contains a carefully optimized set of sentences that can be used when preparing a speech corpus for the development of a personalized text-to-speech system. It was designed primarily for the voice conservation procedure, which must be performed in a relatively short period before a person loses his/her own voice, typically because of total laryngectomy.
Total laryngectomy is a radical treatment procedure that is often unavoidable to save the lives of patients diagnosed with severe laryngeal cancer. Although very effective as a primary treatment, it significantly handicaps patients through the permanent loss of their ability to use their voice and produce speech. Fortunately, modern methods of computer text-to-speech (TTS) synthesis offer a possibility for "digital conservation" of a patient's original voice for his/her future speech communication, a procedure called voice banking or voice conservation. Moreover, the banking procedure can be undertaken by any person facing voice degradation or loss in the more distant future, or who simply wishes to keep his/her voice print.
Acknowledgement
Technology Agency of the Czech Republic (TA CR)
Project code: TH02010307
Project name: Automatic Voice Banking and Reconstruction for Patients after Total Laryngectomy
Collections
This item is Publicly Available
and licensed under:
Files in this item
- Name: sentences.xml
- Size: 1.81 MB
- Format: text/xml
- Description: XML
- MD5: ec174a721b658f5522252dd6455930ea

- Name: README.txt
- Size: 6.46 KB
- Format: text/plain
- Description: Text
- MD5: fc41091bebd67f3830f4053c1696c6c5
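
The MD5 values listed for the two files can be used to check that a download is intact. The following is a minimal sketch in Python, assuming both files were saved under their listed names in one local directory; the function names `md5_of_file` and `verify_downloads` are illustrative and not part of the repository.

```python
import hashlib
from pathlib import Path

# Checksums copied from the file list on the item page.
EXPECTED_MD5 = {
    "sentences.xml": "ec174a721b658f5522252dd6455930ea",
    "README.txt": "fc41091bebd67f3830f4053c1696c6c5",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory="."):
    """Map each expected file name to True, False, or None (file not found)."""
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name
        # Assumption: files keep the names shown on the item page.
        results[name] = (md5_of_file(path) == expected) if path.is_file() else None
    return results


if __name__ == "__main__":
    labels = {True: "OK", False: "MISMATCH", None: "not found"}
    for name, ok in verify_downloads().items():
        print(f"{name}: {labels[ok]}")
```

A result of MISMATCH suggests an incomplete or altered download; fetching the file again is usually the first thing to try.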

Title: High-Coverage Multi-Level Text Corpus for Non-Professional Voice Conservation
Authors: Markéta Jůzová, Daniel Tihelka, Jindřich Matoušek
Note: This corpus constitutes the research outcome TH02010307-V2 of the project "Automatic voice banking and reconstruction for patients after total laryngectomy" (TH02010307).
-----
This text corpus contains a carefully optimized set of sentences that can be used when preparing a speech corpus for the development of a personalized text-to-speech system. It was designed primarily for the voice conservation procedure, which must be performed in a relatively short period before a person loses his/her own voice, typically because of total laryngectomy.

Total laryngectomy is a radical treatment procedure that is often unavoidable to save the lives of patients diagnosed with severe laryngeal cancer. Although very effective as a primary treatment, it significantly handicaps patients through the permanent loss of their ability to use their voice and produce speech. Fortunately, modern methods of computer text-to-speech (TTS) synthesis offer a possibility for "digital conservation" of a patient's original voice for his/her future speech communication, a procedure called voice banking or voice conservation. Moreover, the banking procedure can be undertaken by any person facing voice degradation or loss in the more distant future, or who simply wishes to keep his/her voice print.

The key aspect is the design of the speech recording process, since the speakers are required to record speech data suitable for a personalized TTS system with a reasonable level of quality. The fact that there can be very little time between diagnosis and surgery in the case of laryngectomy, and that a typical speaker is entirely untrained in speech recording (sometimes with limited computer skills), makes the recording conditions and speech cor . . .

