[{"@type":"PropertyValue","name":"Format","value":"16kHz, 16bit, uncompressed wav, mono channel;"},{"@type":"PropertyValue","name":"Content category","value":"Smart car; smart home; voice assistant;"},{"@type":"PropertyValue","name":"Recording condition","value":"Low background noise(indoor), without echo;"},{"@type":"PropertyValue","name":"Recording device","value":"Android smartphone; iPhone;"},{"@type":"PropertyValue","name":"Speaker","value":"155 Malays, 34% male and 66% female;"},{"@type":"PropertyValue","name":"Country","value":"Malaysia(MYS);"},{"@type":"PropertyValue","name":"Language(Region) Code","value":"ms-MY;"},{"@type":"PropertyValue","name":"Language","value":"Malay;"},{"@type":"PropertyValue","name":"Features of annotation","value":"Transcription text, 4 special identifiers;"},{"@type":"PropertyValue","name":"Accuracy Rate","value":"Sentence Accuracy Rate (SAR) 95%(noise symbols are excluded)"}]
{"id":172,"datatype":"1","titleimg":"https://res.datatang.com/asset/productNew/APY161101044_G.png?Expires=2007353629&OSSAccessKeyId=LTAI5tQwXnJZbubgVfVa1ep9&Signature=KdjDW%2BzNM/VXJ3C8PWFzpqt/9Dc%3D","type1":"165","type1str":null,"type2":"165","type2str":null,"dataname":"155 People - Malay(Malaysia) Scripted Monologue Smartphone speech dataset_Guiding","datazy":[{"title":"Format","value":"16kHz, 16bit, uncompressed wav, mono channel;"},{"title":"Content category","value":"Smart car; smart home; voice assistant;"},{"title":"Recording condition","value":"Low background noise(indoor), without echo;"},{"title":"Recording device","value":"Android smartphone; iPhone;"},{"title":"Speaker","value":"155 Malays, 34% male and 66% female;"},{"title":"Country","value":"Malaysia(MYS);"},{"title":"Language(Region) Code","value":"ms-MY;"},{"title":"Language","value":"Malay;"},{"title":"Features of annotation","value":"Transcription text, 4 special identifiers;"},{"title":"Accuracy Rate","value":"Sentence Accuracy Rate (SAR) 95%(noise symbols are excluded)"}],"datatag":"Malay,Malaysia,Smartphone,Guiding,Scripted Monologue","technologydoc":null,"downurl":null,"datainfo":"The product is recorded by 155 Malays native speakers with authentic accents. Among them, 102 are women ( 66% of the total ) , and each recorded about 50 sentences. The recorded text includes driving scenarios, smart home and intelligent voice assistant. The data can be used for tasks such as speech recognition, machine translation, and voiceprint recognition.","standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":["155 people","7 hours","50 sentences for each person"],"samplePresentation":[["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY161101044_G_demo1706954405325/APY161101044_G/T0206G0095Q0023.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=AbTpz0BcAFi3SilMO6EZA4ndpuM%3D","/data/apps/damp/temp/ziptemp/APY161101044_G_demo1706954405325/APY161101044_G/T0206G0095Q0023.wav","Semak nombor telefon."],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY161101044_G_demo1706954405325/APY161101044_G/T0205G0004Q0015.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=tW%2FBQhfKWSlKbO9Bp24DGOI9btw%3D","/data/apps/damp/temp/ziptemp/APY161101044_G_demo1706954405325/APY161101044_G/T0205G0004Q0015.wav","Sila bacakan komik Kembara Kembar Nakal keluaran terkini."],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY161101044_G_demo1706954405325/APY161101044_G/T0205G0016Q0015.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=fWc4O7p9nx6Uj919SM6fOiYDJVo%3D","/data/apps/damp/temp/ziptemp/APY161101044_G_demo1706954405325/APY161101044_G/T0205G0016Q0015.wav","Menggunakan [/Baidu ditu/] untuk mengetahui [~]R [/and/] [~]R yang seterusnya."],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY161101044_G_demo1706954405325/APY161101044_G/T0205G0016Q0011.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=vvDkEUVUCPyGWHZ7R6ntNxG8068%3D","/data/apps/damp/temp/ziptemp/APY161101044_G_demo1706954405325/APY161101044_G/T0205G0016Q0011.wav","Berapa lama untuk sampai ke [~]R [/and/] [~]R seterusnya?"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY161101044_G_demo1706954405325/APY161101044_G/T0205G0004Q0012.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=PEc8HkNmqFT4DkrgTrmGbfkTR5w%3D","/data/apps/damp/temp/ziptemp/APY161101044_G_demo1706954405325/APY161101044_G/T0205G0004Q0012.wav","Sila mainkan novel yang bertemakan cinta."]],"officialSummary":"Malay(Malaysia) Scripted Monologue Smartphone speech dataset_Guiding, collected from monologue based on given prompts, covering smart car, smart home, voice assistant domains. Transcribed with text content and other attributes. Our dataset was collected from extensive and diversify speakers(155 Malay people), geographicly speaking, enhancing model performance in real and complex tasks.Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.","dataexampl":"","datakeyword":["Malay data"," mobile phone collected voice data"," guide voice"," Malaysian voice"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Language,Data Type","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechRec","BGimg":"brightSpot_audio","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"],"single":"no"}
155 People - Malay(Malaysia) Scripted Monologue Smartphone speech dataset_Guiding
Malay data
mobile phone collected voice data
guide voice
Malaysian voice
Malay(Malaysia) Scripted Monologue Smartphone speech dataset_Guiding, collected from monologue based on given prompts, covering smart car, smart home, voice assistant domains. Transcribed with text content and other attributes. Our dataset was collected from extensive and diversify speakers(155 Malay people), geographicly speaking, enhancing model performance in real and complex tasks.Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Format
16kHz, 16bit, uncompressed wav, mono channel;
Content category
Smart car; smart home; voice assistant;
Recording condition
Low background noise(indoor), without echo;
Recording device
Android smartphone; iPhone;
Speaker
155 Malays, 34% male and 66% female;
Country
Malaysia(MYS);
Language(Region) Code
ms-MY;
Language
Malay;
Features of annotation
Transcription text, 4 special identifiers;
Accuracy Rate
Sentence Accuracy Rate (SAR) 95%(noise symbols are excluded)