12 Hours - Chinese Mandarin Synthesis Corpus-Female, Entertainment anchor Style, Multi-emotional
Chinese
Emotional
Multi-emotional
TTS
Synthesis
Corpus
12 Hours - Chinese Mandarin Entertainment Anchor Style Multi-emotional Synthesis Corpus. It is recorded by a native Chinese speaker. The text covers six emotions plus modal particles, and phonemes and tones are balanced. Professional phoneticians participate in the annotation. The corpus is designed to match the research and development needs of speech synthesis.
This is a paid dataset for commercial use, research purposes, and more. Licensed ready-made datasets help jump-start AI projects.
Specifications
Format
48,000 Hz, 24-bit, uncompressed WAV, mono channel
Recording environment
professional recording studio
Recording content
seven emotions (neutral, happiness, anger, sadness, surprise, fear, disgust) + sentences with filler words
Speaker
professional character voice actor; role: an 18-year-old girl who works as an entertainment anchor and enjoys singing and dancing
Device
microphone
Language
Mandarin
Annotation
word and pinyin transcription, prosodic boundary annotation, phoneme boundary annotation
The amount of data
At least 1.6 hours of neutral data, at least 0.4 hours of data with filler words, and at least 1.67 hours for each of the remaining six emotions
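The per-category minimums above can be added up to check them against the "12 Hours" in the title. The sketch below does that arithmetic and also estimates the raw audio payload implied by the stated format (48,000 Hz, 24-bit, mono). The function and constant names are illustrative, and the size figure is an estimate of sample data only: it ignores WAV headers and annotation files, and the actual delivery size is not stated in the listing.

```python
# Minimum durations taken from "The amount of data" in the specification.
NEUTRAL_HOURS = 1.6
FILLER_WORD_HOURS = 0.4
HOURS_PER_EMOTION = 1.67
EMOTIONS = ("happiness", "anger", "sadness", "surprise", "fear", "disgust")


def minimum_total_hours() -> float:
    """Sum the stated minimum duration over all categories."""
    return NEUTRAL_HOURS + FILLER_WORD_HOURS + HOURS_PER_EMOTION * len(EMOTIONS)


def raw_pcm_size_gb(hours: float, rate_hz: int = 48_000,
                    bytes_per_sample: int = 3, channels: int = 1) -> float:
    """Estimate uncompressed PCM payload in gigabytes (10**9 bytes).

    Defaults follow the "Format" entry: 48,000 Hz, 24-bit (3 bytes), mono.
    """
    seconds = hours * 3600
    return seconds * rate_hz * bytes_per_sample * channels / 1e9


if __name__ == "__main__":
    total = minimum_total_hours()
    print(f"minimum total: {total:.2f} hours")
    print(f"raw PCM payload: about {raw_pcm_size_gb(total):.2f} GB")
```

The minimums sum to roughly 12.02 hours, which is consistent with the 12 hours in the dataset name.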
Sample
希望#1能够#1呼吸#1新鲜#1空气#3而不是#1被污染#1物质#1包裹着#4。
xi1 wang4 neng2 gou4 hu1 xi1 xin1 xian1 kong1 qi4 er2 bu2 shi4 bei4 wu1 ran3 wu4 zhi4 bao1 guo3 zhe5
请不要#1太过分#3,我是#1有#1边界的#4。
qing3 bu2 yao4 tai4 guo4 fen4 wo3 shi4 you3 bian1 jie4 de5
我#1找不到#1任何#1颜色#1和#1乐趣#4。
wo6 zhao3 bu2 dao4 ren4 he2 yan2 se4 he2 le4 qu4
跟着#1我的#1节奏#2一起#1舞动吧#4!
gen1 zhe5 wo3 de5 jie2 zou4 yi4 qi6 wu3 dong4 ba5
仿佛有#1一只手#3正从#1我的#1后背#1伸出来#4。
fang3 fu2 you3 yi4 zhi1 shou3 zheng4 cong2 wo3 de5 hou4 bei4 shen1 chu1 lai5
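Each sample above pairs a Chinese sentence carrying `#` markers with a pinyin transcription that ends every syllable in a tone digit. A minimal sketch of reading that layout follows. It rests on several assumptions not confirmed by the listing: that `#1` to `#4` are prosodic boundary levels, that there is one pinyin syllable per Chinese character, and that the sentence and its pinyin are passed in as two separate strings. All function names are hypothetical. Tone digits are kept verbatim, since the listing does not define values such as `5` or `6`.

```python
import re

_BOUNDARY = re.compile(r"([^#]+)#(\d)")       # text chunk followed by "#<level>"
_NON_HANZI = re.compile(r"[^\u4e00-\u9fff]")  # anything outside the basic CJK block
_SYLLABLE = re.compile(r"([a-z]+)(\d)")       # letters followed by one tone digit


def parse_prosody(annotated_text):
    """Return (chunk, boundary_level) pairs, with punctuation stripped from chunks."""
    chunks = []
    for raw_chunk, level in _BOUNDARY.findall(annotated_text):
        hanzi = _NON_HANZI.sub("", raw_chunk)
        if hanzi:
            chunks.append((hanzi, int(level)))
    return chunks


def parse_pinyin(pinyin):
    """Return (syllable, tone_digit) pairs; the digit is not interpreted."""
    syllables = []
    for token in pinyin.split():
        match = _SYLLABLE.fullmatch(token)
        if match is None:
            raise ValueError(f"unexpected pinyin token: {token!r}")
        syllables.append((match.group(1), int(match.group(2))))
    return syllables


def align(annotated_text, pinyin):
    """Pair each character with its syllable, assuming a one-to-one mapping."""
    characters = "".join(hanzi for hanzi, _ in parse_prosody(annotated_text))
    syllables = parse_pinyin(pinyin)
    if len(characters) != len(syllables):
        raise ValueError("character and syllable counts differ")
    return list(zip(characters, syllables))


if __name__ == "__main__":
    text = "请不要#1太过分#3,我是#1有#1边界的#4。"
    pinyin = "qing3 bu2 yao4 tai4 guo4 fen4 wo3 shi4 you3 bian1 jie4 de5"
    print(parse_prosody(text))
    print(align(text, pinyin)[:3])
```

The length check in `align` is a cheap sanity test for the one-to-one assumption; lines where it fails would need separate handling rather than silent truncation.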