[{"@type":"PropertyValue","name":"Format","value":"16kHz, 16 bit, wav, mono channel"},{"@type":"PropertyValue","name":"Content category","value":"Individuals naturally speaking, with no specific content limitations. Each speaker records 20 audios in each language (40 recordings per person), each recording lasting about 10-20 seconds"},{"@type":"PropertyValue","name":"Recording condition","value":"Quiet indoor environment, without echoes, background voices, obvious noises"},{"@type":"PropertyValue","name":"Recording device","value":"Android phone"},{"@type":"PropertyValue","name":"Speaker","value":"Total 300 contributors,40% males and 60% females. 83%contributors aged 18-37, 15% contributors aged 38-45, and 2% contributors aged 46-65"},{"@type":"PropertyValue","name":"Country","value":"China(CHN);"},{"@type":"PropertyValue","name":"Language","value":"Mandarin Chinese, English;"}]
{"id":1358,"datatype":"1","titleimg":"/shujutang/static/image/index/datatang_yuyin_default.webp","type1":"165","type1str":null,"type2":"166","type2str":null,"dataname":"300 People - Mandarin Chinese and English Bilingual Spotaneous Monologue Smartphone speech dataset","datazy":[{"title":"Format","desc":"Format","content":"16kHz, 16 bit, wav, mono channel"},{"title":"Content category","desc":"Content category","content":"Individuals naturally speaking, with no specific content limitations. Each speaker records 20 audios in each language (40 recordings per person), each recording lasting about 10-20 seconds"},{"title":"Recording condition","desc":"Recording condition","content":"Quiet indoor environment, without echoes, background voices, obvious noises"},{"title":"Recording device","desc":"Recording device","content":"Android phone"},{"title":"Speaker","desc":"Speaker","content":"Total 300 contributors,40% males and 60% females. 83%contributors aged 18-37, 15% contributors aged 38-45, and 2% contributors aged 46-65"},{"title":"Country","desc":"Country","content":"China(CHN);"},{"title":"Language","desc":"Language","content":"Mandarin Chinese, English;"}],"datatag":"Unscripted monologue,Natural Speech,Mandarin,English,Bilingual","technologydoc":null,"downurl":null,"datainfo":null,"standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":"","samplePresentation":[],"officialSummary":"Mandarin Chinese and English Bilingual Spotaneous Monologue Smartphone speech dataset, collected from dialogues based on given topics, covering generic domain. Our dataset was collected from extensive and diversify speakers(300 people in total, ages 18 to 65), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.","dataexampl":null,"datakeyword":["Unscripted monologue","Natural Speech","Mandarin","English","Bilingual"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Language,Data Type","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechRec","BGimg":"brightSpot_audio","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"]}
300 People - Mandarin Chinese and English Bilingual Spotaneous Monologue Smartphone speech dataset
Unscripted monologue
Natural Speech
Mandarin
English
Bilingual
Mandarin Chinese and English Bilingual Spotaneous Monologue Smartphone speech dataset, collected from dialogues based on given topics, covering generic domain. Our dataset was collected from extensive and diversify speakers(300 people in total, ages 18 to 65), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Format
16kHz, 16 bit, wav, mono channel
Content category
Individuals naturally speaking, with no specific content limitations. Each speaker records 20 audios in each language (40 recordings per person), each recording lasting about 10-20 seconds
Recording condition
Quiet indoor environment, without echoes, background voices, obvious noises
Recording device
Android phone
Speaker
Total 300 contributors,40% males and 60% females. 83%contributors aged 18-37, 15% contributors aged 38-45, and 2% contributors aged 46-65