[{"@type":"PropertyValue","name":"Format","value":"48,000Hz, 24bit, uncompressed wav, mono channel;"},{"@type":"PropertyValue","name":"Recording environment","value":"professional recording studio;"},{"@type":"PropertyValue","name":"Recording content","value":"general corpus;"},{"@type":"PropertyValue","name":"Speaker","value":"professional Character Voice, 20-30 years old, Shantou dialect in Chaoshan;"},{"@type":"PropertyValue","name":"Device","value":"microphone;"},{"@type":"PropertyValue","name":"Language","value":"chaozhou;"},{"@type":"PropertyValue","name":"Annotation","value":"word and phoneme transcription, prosodic boundary annotation;"},{"@type":"PropertyValue","name":"Application scenarios","value":"speech synthesis."}]
{"id":1410,"datatype":"1","titleimg":"/shujutang/static/image/index/datatang_yuyin_default.webp","type1":"165","type1str":null,"type2":"165","type2str":null,"dataname":"10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female","datazy":[{"title":"Format","value":"48,000Hz, 24bit, uncompressed wav, mono channel;"},{"title":"Recording environment","value":"professional recording studio;"},{"title":"Recording content","value":"general corpus;"},{"title":"Speaker","value":"professional Character Voice, 20-30 years old, Shantou dialect in Chaoshan;"},{"title":"Device","value":"microphone;"},{"title":"Language","value":"chaozhou;"},{"title":"Annotation","value":"word and phoneme transcription, prosodic boundary annotation;"},{"title":"Application scenarios","value":"speech synthesis."}],"datatag":"Synthesis Corpus,Chaozhou,TTS,Chinese,Dialect","technologydoc":null,"downurl":null,"datainfo":"","standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":"","samplePresentation":[["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY240129001_demo1712743258731/APY240129001_demo/000020.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=8glEWRKf1G7FTpq4AaasYrTBYug%3D","/data/apps/damp/temp/ziptemp/APY240129001_demo1712743258731/APY240129001_demo/000020.wav","第一次#1销售#1例会#3,叫伊#1分享#3,伊#1分享了#1字个#4。doin6 ig4 ce3 siao1 ciu5 li7 huê6 gio3 i1 hung1 hiang2 i1 hung1 hiang2 liao3 ri7 gai7"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY240129001_demo1712743258731/APY240129001_demo/000044.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=u%2BL2mKje0h3UGEyX6IdokJZIp8o%3D","/data/apps/damp/temp/ziptemp/APY240129001_demo1712743258731/APY240129001_demo/000044.wav","富士山#2哈是#1好睇个#3,明年#1准备#1再去#1一次#4。bu3 se6 san1 hah4 si6 ho7 toin2 gai7 mêng5 ni5 zung2 bi6 zai3 ke3 zêg8 ce3"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY240129001_demo1712743258731/APY240129001_demo/000015.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=NviOvTnUfK8hBGdbLyHFo68bzSk%3D","/data/apps/damp/temp/ziptemp/APY240129001_demo1712743258731/APY240129001_demo/000015.wav","指个#1医院#1虽然#1级别#1过低#3,但是#2有#1糖尿病#1专科#4。zi2 gai7 ui1 in7 sui1 riang5 kib4 biag8 guê3 di1 dang6 si6 u6 teng5 rio7 bên7 zuang1 kuê1"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY240129001_demo1712743258731/APY240129001_demo/000016.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=vQipnYQ39ft28sN8rXTnMbrD6tA%3D","/data/apps/damp/temp/ziptemp/APY240129001_demo1712743258731/APY240129001_demo/000016.wav","我#1而是#1胶己#1百度#3,正知#2伊个#1原来#1而是#4。ua2 ru5 si6 ga1 gi7 bêh4 dou7 zian3 zai1 i1 gai7 nguang5 lai3 ru5 si6"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY240129001_demo1712743258731/APY240129001_demo/000011.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=WbS9ISjlgG%2BBhgyDax8XjGuCToI%3D","/data/apps/damp/temp/ziptemp/APY240129001_demo1712743258731/APY240129001_demo/000011.wav","卡#1紧张了#3,习惯#1而好#3,字种#1心态#3多补考#1挂次#3而好了#4。ka2 ging2 ziang1 liao3 sib8 guang3 ru5 ho7 ri7 zêng2 sim1 tai3 do1 bou2 kao2 gua3 ce3 ru5 ho7 liao3"]],"officialSummary":"10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female. It is recorded by Chaozhou-Shantou Pronunciation. the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.","dataexampl":"","datakeyword":["Synthesis Corpus","TTS","Female","General","Chaozhou","Dialect"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Voice Type,Language","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechSyn","BGimg":"brightSpot_audio","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"],"single":"no"}
[{"@type":"AudioObject","embedUrl":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY240129001_demo1712743258731/APY240129001_demo/000020.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=8glEWRKf1G7FTpq4AaasYrTBYug%3D"},{"@type":"AudioObject","embedUrl":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY240129001_demo1712743258731/APY240129001_demo/000044.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=u%2BL2mKje0h3UGEyX6IdokJZIp8o%3D"},{"@type":"AudioObject","embedUrl":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY240129001_demo1712743258731/APY240129001_demo/000015.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=NviOvTnUfK8hBGdbLyHFo68bzSk%3D"},{"@type":"AudioObject","embedUrl":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY240129001_demo1712743258731/APY240129001_demo/000016.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=vQipnYQ39ft28sN8rXTnMbrD6tA%3D"},{"@type":"AudioObject","embedUrl":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY240129001_demo1712743258731/APY240129001_demo/000011.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=WbS9ISjlgG%2BBhgyDax8XjGuCToI%3D"}]
10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female Synthesis Corpus
TTS
Female
General
Chaozhou
Dialect
10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female. It is recorded by Chaozhou-Shantou Pronunciation. the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis. This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Format
48,000Hz, 24bit, uncompressed wav, mono channel;
Recording environment
professional recording studio;
Recording content
general corpus;
Speaker
professional Character Voice, 20-30 years old, Shantou dialect in Chaoshan;
Annotation
word and phoneme transcription, prosodic boundary annotation;
Application scenarios
speech synthesis.
Sample
Audio 第一次#1销售#1例会#3,叫伊#1分享#3,伊#1分享了#1字个#4。doin6 ig4 ce3 siao1 ciu5 li7 huê6 gio3 i1 hung1 hiang2 i1 hung1 hiang2 liao3 ri7 gai7
Audio 富士山#2哈是#1好睇个#3,明年#1准备#1再去#1一次#4。bu3 se6 san1 hah4 si6 ho7 toin2 gai7 mêng5 ni5 zung2 bi6 zai3 ke3 zêg8 ce3
Audio 指个#1医院#1虽然#1级别#1过低#3,但是#2有#1糖尿病#1专科#4。zi2 gai7 ui1 in7 sui1 riang5 kib4 biag8 guê3 di1 dang6 si6 u6 teng5 rio7 bên7 zuang1 kuê1
Audio 我#1而是#1胶己#1百度#3,正知#2伊个#1原来#1而是#4。ua2 ru5 si6 ga1 gi7 bêh4 dou7 zian3 zai1 i1 gai7 nguang5 lai3 ru5 si6
Audio 卡#1紧张了#3,习惯#1而好#3,字种#1心态#3多补考#1挂次#3而好了#4。ka2 ging2 ziang1 liao3 sib8 guang3 ru5 ho7 ri7 zêng2 sim1 tai3 do1 bou2 kao2 gua3 ce3 ru5 ho7 liao3
Recommended Dataset
35 Hours - Mandarin Chinese(China) transcribed Pinyin for Audiobooks Microphone speech dataset
Mandarin Chinese(China) transcribed Pinyin for Audiobooks Microphone speech dataset, collected from monologue based on given scripts, with balanced gender distribution. Transcribed with Chinese characters, Pinyin and other attributes. Our dataset was collected from extensive and diversify speakers(5 speakers in total, 3 males and 2 females), enhancing model performance in real and complex tasks.rnQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
Phonetic Data Labeled by Phonetic Reading Text Pinyin
Details
Tell Us Your Special Needs
746f3122-8633-4c72-9d80-dac7aa9bec6a