2 People - Japanese Average Tone Speech Synthesis Corpus
Tags: TTS, Japanese, Average Tone
The 2 People - Japanese Average Tone Speech Synthesis Corpus was recorded by native Japanese speakers with authentic accents. It contains news text and colloquial-style general text, with balanced phoneme coverage. Professional phoneticians took part in the annotation. The corpus is intended for speech synthesis research and development.
This is a paid dataset, licensed for commercial use, research, and other purposes. Licensed, ready-made datasets help jump-start AI projects.
Specifications
Format: 48,000 Hz, 24-bit, uncompressed WAV, mono
Recording environment: professional recording studio
Recording content: news and general corpus
Speaker: professional voice actors, one male and one female, aged 25-35, 10 hours per speaker
Annotation: word and phoneme transcription, four-level prosodic boundary annotation
Device: microphone
Language: Japanese
Application scenarios: speech synthesis
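The Format entry implies a fixed storage cost: 48,000 samples per second at 3 bytes per sample on 1 channel is 144,000 bytes per second, which is about 518 MB per hour and roughly 10.4 GB of raw PCM for the 20 hours in the corpus (file headers excluded). The following is a minimal Python sketch, not part of any vendor tooling, that checks a file against those parameters using the standard-library `wave` module. The function names and the `EXPECTED` dictionary are illustrative assumptions, and the sketch assumes the files are plain PCM WAV.

```python
import wave

# Parameters taken from the Format entry: 48,000 Hz, 24-bit (3 bytes), mono.
EXPECTED = {"rate": 48_000, "sample_width_bytes": 3, "channels": 1}


def check_wav_format(path):
    """Return (mismatches, duration_seconds) for a PCM WAV file.

    mismatches maps each differing parameter to (expected, actual);
    an empty dict means the file matches the stated format.
    """
    with wave.open(path, "rb") as wav:
        actual = {
            "rate": wav.getframerate(),
            "sample_width_bytes": wav.getsampwidth(),
            "channels": wav.getnchannels(),
        }
        duration = wav.getnframes() / wav.getframerate()
    mismatches = {
        key: (EXPECTED[key], actual[key])
        for key in EXPECTED
        if actual[key] != EXPECTED[key]
    }
    return mismatches, duration


def pcm_bytes(hours):
    """Raw PCM size in bytes for the given duration at the stated format."""
    bytes_per_second = (
        EXPECTED["rate"] * EXPECTED["sample_width_bytes"] * EXPECTED["channels"]
    )
    return int(hours * 3600 * bytes_per_second)
```

Calling `check_wav_format` on a local copy of a delivered file (the path is whatever the file is called on disk) returns an empty mismatch dictionary when the header agrees with the specification, and `pcm_bytes(10)` gives the raw audio size for one speaker's 10 hours.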
Sample

Text: あなた 達#3、市役所 の 人#4!
Phonemes: a(L) . n a(H) . t a(L) # t a(L) . ch i(L) / sh i(L) . ya(H) . k u(H) . s yo(H) # n o(H) # h i(L) . t o(H)

Text: 何か#3、お経 みてえ な#1歌 だ な#4。
Phonemes: n a(H) . N(L) . k a(L) / o(L) . k yo:(HH) # m i(H) . t e:(LL) # n a(H) / u(L) . t a(H) # d a(L) # n a(L)

Text: この 人 の#1遺品 の#1中 から#3、父 の#1手帳 は#1見つかった の#4。
Phonemes: k o(L) . n o(H) # h i(L) . t o(L) # n o(L) / i(L) . h i(H) . N(H) # n o(H) / n a(H) . k a(L) # k a(L) . r a(L) / ch i(L) . ch i(H) # n o(L) / t e(L) . c yo:(HH) # w a(H) / m i(L) . ts u(H) . k a(H) . T(H) . t a(H) # n o(L)

Text: はい#3、こちら#3、お 願い します#4。
Phonemes: h a(H) . i(L) / k o(L) . ch i(H) . r a(H) / o(L) # n e(H) . g a(H) . i(H) # sh i(L) . m a(H) . s u(L)
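Each sample pairs an annotated text string with a phoneme string. The page does not define the notation, so the reading used here is inferred from the samples alone and should be treated as an assumption: `#1` to `#4` in the text are the four prosodic boundary levels; in the phoneme string, `.` separates pitch-labelled units, `#` separates words, `/` separates the chunks that end at a `#N` mark, and `(H)`/`(L)` are high and low pitch labels (long vowels carry two, as in `yo:(HH)`). A minimal Python sketch of a parser under those assumptions, with hypothetical function names:

```python
import re

BOUNDARY = re.compile(r"#([1-4])")  # assumed: prosodic boundary level marker
UNIT = re.compile(r"^(?P<phones>.+?)\((?P<pitch>[HL]+)\)$")  # e.g. "k yo:(HH)"


def split_text(annotated):
    """Split annotated text into (chunk, boundary_level) pairs."""
    pairs, pos = [], 0
    for match in BOUNDARY.finditer(annotated):
        # Drop punctuation that follows the previous boundary mark.
        chunk = annotated[pos:match.start()].strip("、。!!?? ")
        pairs.append((chunk, int(match.group(1))))
        pos = match.end()
    return pairs


def parse_phonemes(line):
    """Parse a phoneme string into chunks -> words -> (phones, pitch) units."""
    chunks = []
    for chunk in line.split("/"):
        words = []
        for word in chunk.split("#"):
            units = []
            for unit in word.split("."):
                match = UNIT.match(unit.strip())
                if match is None:
                    raise ValueError(f"unrecognised unit: {unit!r}")
                units.append((match.group("phones").split(), match.group("pitch")))
            words.append(units)
        chunks.append(words)
    return chunks


def pitch_patterns(chunks):
    """Collapse each word to its pitch string, e.g. 'LHL'."""
    return [["".join(pitch for _, pitch in word) for word in words] for words in chunks]


text = "あなた 達#3、市役所 の 人#4!"
phones = ("a(L) . n a(H) . t a(L) # t a(L) . ch i(L) / sh i(L) . ya(H) . "
          "k u(H) . s yo(H) # n o(H) # h i(L) . t o(H)")

print(split_text(text))
# [('あなた 達', 3), ('市役所 の 人', 4)]
print(pitch_patterns(parse_phonemes(phones)))
# [['LHL', 'LL'], ['LHHH', 'H', 'LH']]
```

In all four samples the number of `/`-separated chunks equals the number of `#N` marks, and the number of `#`-separated words in each chunk equals the number of space-separated words in the matching text chunk, which is what the inferred reading predicts.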