20 Hours Japanese TTS Dataset – Native Japanese Voice Corpus
Japanese speech dataset
Japanese TTS dataset
Japanese speech synthesis corpus
Japanese voice dataset for AI
native Japanese speech dataset
Japanese text-to-speech dataset
balanced phoneme Japanese corpus
This dataset contains recordings from two native Japanese speakers with authentic accents, each contributing 10 hours of audio. The scripts cover news and colloquial-style general text, with balanced phoneme coverage. Annotation is performed with professional phonetician involvement. The corpus is suited to building Japanese text-to-speech systems, speech synthesis research, and AI voice applications.
This is a paid dataset, available for commercial use, research, and other purposes. Licensed ready-made datasets help jump-start AI projects.
Specifications
Format: 48,000 Hz, 24-bit, uncompressed WAV, mono
Recording environment: professional recording studio
Recording content: news and general corpus
Speaker: professional voice actors, one male and one female, aged 25-35, 10 hours per person
Annotation: word and phoneme transcription, four-level prosodic boundary annotation
Device: microphone
Language: Japanese
Application scenarios: speech synthesis
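As a rough illustration of the Format line, the sketch below checks whether a WAV file has the listed sample rate, bit depth, and channel count, and estimates the raw audio size of 20 hours at that format. It uses only Python's standard `wave` module. The function names (`matches_spec`, `pcm_bytes`, `make_silence`) are hypothetical helpers, not part of any dataset tooling, and the size figure is an estimate that assumes exactly 20 hours of audio and ignores file headers; the page itself does not state a dataset size.

```python
import io
import wave

# Values taken from the Format line of the specifications.
EXPECTED_RATE_HZ = 48_000
EXPECTED_SAMPLE_WIDTH_BYTES = 3  # 24-bit PCM
EXPECTED_CHANNELS = 1            # mono


def matches_spec(wav_file) -> bool:
    """Return True if a WAV file (path or binary file object) has the listed format."""
    with wave.open(wav_file, "rb") as wav:
        return (
            wav.getframerate() == EXPECTED_RATE_HZ
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
            and wav.getnchannels() == EXPECTED_CHANNELS
        )


def pcm_bytes(hours: float) -> int:
    """Raw PCM payload size for a duration in hours, ignoring WAV headers."""
    bytes_per_second = (
        EXPECTED_RATE_HZ * EXPECTED_SAMPLE_WIDTH_BYTES * EXPECTED_CHANNELS
    )
    return int(hours * 3600 * bytes_per_second)


def make_silence(seconds: float) -> io.BytesIO:
    """Build an in-memory WAV of silence in the listed format (demo input only)."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(EXPECTED_CHANNELS)
        wav.setsampwidth(EXPECTED_SAMPLE_WIDTH_BYTES)
        wav.setframerate(EXPECTED_RATE_HZ)
        frame_count = int(seconds * EXPECTED_RATE_HZ)
        wav.writeframes(b"\x00" * (frame_count * EXPECTED_SAMPLE_WIDTH_BYTES))
    buffer.seek(0)
    return buffer


if __name__ == "__main__":
    print(matches_spec(make_silence(0.1)))  # a synthetic file, not a dataset file
    print(pcm_bytes(20))                    # 2 speakers x 10 hours each
```

At 48,000 samples per second and 3 bytes per sample, mono audio takes 144,000 bytes per second, so 20 hours (72,000 seconds) comes to about 10.4 GB of raw PCM. Note that 24-bit files written with an extensible WAV header may not open with `wave` on Python versions before 3.12.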
Sample
Audio (000007.wav)
Text: あなた 達#3、市役所 の 人#4!
Phonemes: a(L) . n a(H) . t a(L) # t a(L) . ch i(L) / sh i(L) . ya(H) . k u(H) . s yo(H) # n o(H) # h i(L) . t o(H)
Audio (000005.wav)
Text: 何か#3、お経 みてえ な#1歌 だ な#4。
Phonemes: n a(H) . N(L) . k a(L) / o(L) . k yo:(HH) # m i(H) . t e:(LL) # n a(H) / u(L) . t a(H) # d a(L) # n a(L)
Audio (000003.wav)
Text: この 人 の#1遺品 の#1中 から#3、父 の#1手帳 は#1見つかった の#4。
Phonemes: k o(L) . n o(H) # h i(L) . t o(L) # n o(L) / i(L) . h i(H) . N(H) # n o(H) / n a(H) . k a(L) # k a(L) . r a(L) / ch i(L) . ch i(H) # n o(L) / t e(L) . c yo:(HH) # w a(H) / m i(L) . ts u(H) . k a(H) . T(H) . t a(H) # n o(L)
Audio (000002.wav)
Text: はい#3、こちら#3、お 願い します#4。
Phonemes: h a(H) . i(L) / k o(L) . ch i(H) . r a(H) / o(L) # n e(H) . g a(H) . i(H) # sh i(L) . m a(H) . s u(L)
Audio (000001.wav)
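The sample annotations above can be read mechanically. The sketch below is one possible parser, written from the four samples shown rather than from any published format specification, so every reading of the notation here is an assumption: in the text, `#N` is taken as a prosodic boundary of level N after the preceding chunk; in the phoneme string, ` / ` is taken to separate phrases, ` # ` words, and ` . ` morae, with `(H)` or `(L)` read as a high or low pitch tag on each mora. The function names are hypothetical. It expects the Japanese text and the phoneme string as two separate inputs.

```python
import re

# One mora: space-separated phones followed by a pitch tag, e.g. "k yo:(HH)".
MORA = re.compile(r"(?P<phones>[^()]+)\((?P<pitch>[HL]+)\)")
# A chunk of text followed by a boundary marker such as "#3".
CHUNK = re.compile(r"([^#]+)#([1-4])")
# Punctuation and spaces trimmed from chunk edges (assumed set, from the samples).
TRIM = " 、。!!??"


def parse_boundaries(text: str) -> list[tuple[str, int]]:
    """Return (chunk, boundary_level) pairs from the annotated text."""
    return [(m.group(1).strip(TRIM), int(m.group(2))) for m in CHUNK.finditer(text)]


def parse_phonemes(phonemes: str) -> list[list[list[tuple[list[str], str]]]]:
    """Return phrases -> words -> morae, each mora as (phones, pitch)."""
    phrases = []
    for phrase in re.split(r"\s*/\s*", phonemes.strip()):
        words = []
        for word in re.split(r"\s*#\s*", phrase):
            morae = []
            for mora in re.split(r"\s*\.\s*", word):
                match = MORA.fullmatch(mora.strip())
                if match is None:
                    raise ValueError(f"unrecognised mora: {mora!r}")
                morae.append((match.group("phones").split(), match.group("pitch")))
            words.append(morae)
        phrases.append(words)
    return phrases


if __name__ == "__main__":
    # Text and phoneme string of sample 000002.wav, copied from the page.
    text = "はい#3、こちら#3、お 願い します#4。"
    phonemes = (
        "h a(H) . i(L) / k o(L) . ch i(H) . r a(H) / "
        "o(L) # n e(H) . g a(H) . i(H) # sh i(L) . m a(H) . s u(L)"
    )
    chunks = parse_boundaries(text)
    phrases = parse_phonemes(phonemes)
    for (chunk, level), words in zip(chunks, phrases):
        pitch = " ".join("".join(p for _, p in word) for word in words)
        print(chunk, level, pitch)
```

In all four transcribed samples, the number of `/`-separated phoneme phrases equals the number of `#N` markers in the text, which is why the usage example pairs them with `zip`. That alignment is an observation from these samples only and may not hold across the full corpus.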