13.8 Hours - Chinese Mandarin Synthesis Corpus-Female, Emotional
Chinese
female
Emotional
tts
Synthesis
Corpus
This 13.8-hour emotional Mandarin Chinese speech synthesis corpus was recorded by a female native Chinese speaker reading emotional text. Syllables, phonemes, and tones are balanced, and a professional phonetician took part in the annotation. The corpus is designed to match the research and development needs of speech synthesis.
This is a paid dataset for commercial use, research, and more. Licensed, ready-made datasets help jump-start AI projects.
Specifications
Format: 48,000 Hz, 16-bit, uncompressed WAV, mono channel
Recording environment: professional recording studio
Recording content: six emotions (happiness, anger, sadness, surprise, fear, disgust)
Speaker: female, 20-30 years old, soft and friendly voice
Device: microphone
Language: Mandarin
Annotation: word and pinyin transcription, prosodic boundary annotation
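As a rough illustration of how the audio format in the specifications could be checked on receipt, the sketch below reads a WAV header with Python's standard `wave` module and compares it against the listed values (48,000 Hz, 16-bit, mono, uncompressed). The function names `matches_spec` and `estimated_pcm_bytes` are hypothetical helpers introduced here, not part of any vendor tooling, and the layout of the delivered files is not described on this page.

```python
import wave

# Values taken from the Specifications list above.
EXPECTED_RATE_HZ = 48_000
EXPECTED_SAMPLE_WIDTH_BYTES = 2  # 16-bit samples
EXPECTED_CHANNELS = 1            # mono


def matches_spec(path):
    """Return True if the WAV header at `path` matches the listed format.

    Hypothetical helper: checks sample rate, sample width, channel count,
    and that the file is uncompressed PCM.
    """
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == EXPECTED_RATE_HZ
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
            and wav.getnchannels() == EXPECTED_CHANNELS
            and wav.getcomptype() == "NONE"
        )


def estimated_pcm_bytes(hours):
    """Estimate raw PCM size in bytes for `hours` of audio, ignoring headers."""
    bytes_per_second = (
        EXPECTED_RATE_HZ * EXPECTED_SAMPLE_WIDTH_BYTES * EXPECTED_CHANNELS
    )
    return round(hours * 3600 * bytes_per_second)
```

Under these assumptions, `estimated_pcm_bytes(13.8)` gives 4,769,280,000 bytes, so the raw audio for this corpus would occupy roughly 4.77 GB before any annotation files are counted.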
10 Hours - Chinese Mandarin Synthesis Corpus-Female, Customer Service
A Chinese Mandarin synthesis corpus of female customer-service speech, recorded by Chinese native speakers with lively and friendly voices. Phoneme coverage is balanced, and a professional phonetician took part in the annotation. The corpus is designed to match the research and development needs of speech synthesis.
Synthesis Corpus, TTS, Mandarin, Female, Customer Service
50 People - Chinese-English Mixed Average Tone Speech Synthesis Corpus-Customer Service
A Chinese-English mixed average-tone speech synthesis corpus recorded by 50 Chinese native speakers reading customer-service text. Syllables, phonemes, and tones are balanced, and a professional phonetician took part in the annotation. The corpus is designed to match the research and development needs of speech synthesis.
TTS, Average Tone, Synthesis Corpus, Customer Service
150 People - Chinese Mandarin Average Tone Speech Synthesis Corpus-Customer Service
A Chinese Mandarin average-tone speech synthesis corpus recorded by 150 Chinese native speakers reading customer-service text. Syllables, phonemes, and tones are balanced, and a professional phonetician took part in the annotation. The corpus is designed to match the research and development needs of speech synthesis.
Mandarin, Customer Service, Synthesis Corpus
20.1 Hours - Chinese Mandarin Synthesis Corpus-Male, Customer Service
A Chinese Mandarin synthesis corpus of male customer-service speech, recorded by Chinese native speakers with magnetic voices. Phoneme coverage is balanced, and a professional phonetician took part in the annotation. The corpus is designed to match the research and development needs of speech synthesis.
TTS, Customer Service, Synthesis Corpus
26.1 Hours - Chinese Mandarin Synthesis Corpus-Female, Customer Service
A 26.1-hour Chinese Mandarin synthesis corpus of female customer-service speech, recorded by Chinese native speakers with lively and friendly voices. Phoneme coverage is balanced, and a professional phonetician took part in the annotation. The corpus is designed to match the research and development needs of speech synthesis.
Synthesis Corpus, TTS, Mandarin, Female, Customer Service
6.78 Hours - Chinese Mandarin Speech Synthesis Corpus-Female Imitating Children
Audio of adult female speakers imitating children: 6,599 sentences totaling 6.78 hours. It was recorded by Chinese native speakers with authentic accents and sweet voices. Phoneme coverage is balanced, and a professional phonetician took part in the annotation. The corpus is designed to match the research and development needs of speech synthesis.
TTS, Chinese, Children
19.46 Hours - American English Speech Synthesis Corpus-Female
Female American English audio, recorded by a native American English speaker with an authentic accent and a sweet voice. Phoneme coverage is balanced, and a professional phonetician took part in the annotation. The corpus is designed to match the research and development needs of speech synthesis.
TTS, American English, Female
4 People - Northeastern dialect Average Tone Speech Synthesis Corpus
A Northeastern dialect average-tone speech synthesis corpus recorded by 4 native speakers from Northeast China. About 40% of the corpus contains words unique to Northeast China, and the phonemes and tones are balanced. A professional phonetician took part in the annotation. The corpus is designed to match the research and development needs of speech synthesis.