[{"@type":"PropertyValue","name":"Format","value":"16k Hz, 16 bit, wav, mono channel"},{"@type":"PropertyValue","name":"Content category","value":"Individuals naturally speaking, with no specific content limitations. Each speaker records 20 audios in each language (40 recordings per person), each recording lasting about 10-20 seconds"},{"@type":"PropertyValue","name":"Recording condition","value":"Quiet indoor environment, without echoes, background voices, obvious noises"},{"@type":"PropertyValue","name":"Recording device","value":"Android phone, iPhone"},{"@type":"PropertyValue","name":"Speaker","value":"Total 302 contributors,45% male and 55% female. 291contributors aged 18-37, 10 contributors aged 38-45, and 1 contributor aged 46-65"},{"@type":"PropertyValue","name":"Country","value":"India(IND)"},{"@type":"PropertyValue","name":"Language","value":"Hindi,English"}]
{"id":1420,"datatype":"1","titleimg":"https://www.nexdata.ai/shujutang/static/image/index/datatang_yuyin_default.webp","type1":"165","type1str":null,"type2":"166","type2str":null,"dataname":"Hindi-English Bilingual Speech Dataset – 302 Speakers (Smartphone, Monologue)","datazy":[{"title":"Format","content":"16k Hz, 16 bit, wav, mono channel","desc":"Format"},{"title":"Content category","content":"Individuals naturally speaking, with no specific content limitations. Each speaker records 20 audios in each language (40 recordings per person), each recording lasting about 10-20 seconds","desc":"Content category"},{"title":"Recording condition","content":"Quiet indoor environment, without echoes, background voices, obvious noises","desc":"Recording condition"},{"title":"Recording device","content":"Android phone, iPhone","desc":"Recording device"},{"title":"Speaker","content":"Total 302 contributors,45% male and 55% female. 291contributors aged 18-37, 10 contributors aged 38-45, and 1 contributor aged 46-65","desc":"Speaker"},{"title":"Country","content":"India(IND)","desc":"Country"},{"title":"Language","content":"Hindi,English","desc":"Language"}],"datatag":"Spontaneous monologue,Natural Speech,Hindi,English,Bilingual","technologydoc":null,"downurl":null,"datainfo":null,"standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":"","samplePresentation":[{"name":"G00004S1021.wav","url":"https://storage-product.datatang.com/damp/product/samplePresentation_ipad/20250721152216/G00004S1021.wav?Expires=4102415999&OSSAccessKeyId=LTAI5tEBeSWUJiqjXvBMsxEu&Signature=jRqYujueWG1GwZHXDURjsXC%2BrI4%3D","intro":"","size":487620,"progress":100,"type":"mp3"},{"name":"G00004S1001.wav","url":"https://storage-product.datatang.com/damp/product/samplePresentation_ipad/20250721152216/G00004S1001.wav?Expires=4102415999&OSSAccessKeyId=LTAI5tEBeSWUJiqjXvBMsxEu&Signature=VZdf%2BKxijtgeFXcMD3u%2FGVYAkf8%3D","intro":"","size":364000,"progress":100,"type":"mp3"},{"name":"G00019S1023.wav","url":"https://storage-product.datatang.com/damp/product/samplePresentation_ipad/20250721152216/G00019S1023.wav?Expires=4102415999&OSSAccessKeyId=LTAI5tEBeSWUJiqjXvBMsxEu&Signature=NTW7R5ytHLsZ6eLeaDHdlkyoIz8%3D","intro":"","size":439086,"progress":100,"type":"mp3"},{"name":"G00019S1006.wav","url":"https://storage-product.datatang.com/damp/product/samplePresentation_ipad/20250721152216/G00019S1006.wav?Expires=4102415999&OSSAccessKeyId=LTAI5tEBeSWUJiqjXvBMsxEu&Signature=JcR90mFM0FNryXKERgYAT8sxUlM%3D","intro":"","size":530508,"progress":100,"type":"mp3"}],"officialSummary":"This dataset contains spontaneous bilingual speech in Hindi and English, collected from dialogues based on given topics, covering generic domain. Our dataset was collected from extensive and diversify speakers(302 people in total, ages 18 to 46), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.","dataexampl":null,"datakeyword":["Hindi English bilingual speech dataset","Hindi English speech dataset","Hindi English ASR dataset","Hindi English TTS dataset","Hindi English voice dataset","Hindi English audio dataset","Bilingual speech dataset Hindi English"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Data Type,Language","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechRec","BGimg":"brightSpot_audio","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"]}
This dataset contains spontaneous bilingual speech in Hindi and English, collected from dialogues based on given topics, covering generic domain. Our dataset was collected from extensive and diversify speakers(302 people in total, ages 18 to 46), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Format
16k Hz, 16 bit, wav, mono channel
Content category
Individuals naturally speaking, with no specific content limitations. Each speaker records 20 audios in each language (40 recordings per person), each recording lasting about 10-20 seconds
Recording condition
Quiet indoor environment, without echoes, background voices, obvious noises
Recording device
Android phone, iPhone
Speaker
Total 302 contributors,45% male and 55% female. 291contributors aged 18-37, 10 contributors aged 38-45, and 1 contributor aged 46-65