[{"@type":"PropertyValue","name":"Format","value":"8kHz, 8bit, u-law/a-law wav, mono channel;"},{"@type":"PropertyValue","name":"Recording condition","value":"quiet indoor environment, without echo;"},{"@type":"PropertyValue","name":"Content category","value":"dozens of topics are specified, and the speakers make dialogue under those topics while the recording is performed;"},{"@type":"PropertyValue","name":"Speaker","value":"1,234 Vietnam native speakers in total, with 53% male and 47% female;"},{"@type":"PropertyValue","name":"Features of annotation","value":"transcription text, timestamp, speaker ID and gender;"},{"@type":"PropertyValue","name":"Recording device","value":"Telephony recording system;"},{"@type":"PropertyValue","name":"Language","value":"Vietnamese;"},{"@type":"PropertyValue","name":"Language(Region) Code","value":"vi-VN"},{"@type":"PropertyValue","name":"Country","value":"Vietnam(VNM)"},{"@type":"PropertyValue","name":"Application scenarios","value":"speech recognition; voiceprint recognition;"},{"@type":"PropertyValue","name":"Accuracy rate","value":"Word Accuracy Rate (WAR) 98%."}]
{"id":1408,"datatype":"1","titleimg":"/shujutang/static/image/index/datatang_yuyin_default.webp","type1":"165","type1str":null,"type2":"166","type2str":null,"dataname":"977 Hours - Vietnamese Spontaneous Dialogue Telephony speech dataset","datazy":[{"title":"Format","desc":"Format","content":"8kHz, 8bit, u-law/a-law wav, mono channel;"},{"title":"Recording condition","desc":"Recording condition","content":"quiet indoor environment, without echo;"},{"title":"Content category","desc":"Content category","content":"dozens of topics are specified, and the speakers make dialogue under those topics while the recording is performed;"},{"title":"Speaker","desc":"Speaker","content":"1,234 Vietnam native speakers in total, with 53% male and 47% female;"},{"title":"Features of annotation","desc":"Features of annotation","content":"transcription text, timestamp, speaker ID and gender;"},{"title":"Recording device","desc":"Recording device","content":"Telephony recording system;"},{"title":"Language","desc":"Language","content":"Vietnamese;"},{"title":"Language(Region) Code","desc":"Language(Region) Code","content":"vi-VN"},{"title":"Country","desc":"Country","content":"Vietnam(VNM)"},{"title":"Application scenarios","desc":"Application scenarios","content":"speech recognition; voiceprint recognition;"},{"title":"Accuracy rate","desc":"Accuracy rate","content":"Word Accuracy Rate (WAR) 98%."}],"datatag":"Vietnamese,Spontaneous,Dialogue,Telephony","technologydoc":null,"downurl":null,"datainfo":null,"standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":"","samplePresentation":[{"name":"/data/apps/damp/temp/ziptemp/APY230930006_demo1728640801276/0002_003_telephone_2.wav","url":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY230930006_demo1728640801276/0002_003_telephone_2.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=zP95dkIicYnqL2B6Pb4Zio6YB0I%3D","intro":"đến ngày sale lớn mười một tháng mười một đấy.","size":0,"progress":100,"type":"mp3"},{"name":"/data/apps/damp/temp/ziptemp/APY230930006_demo1728640801276/0002_003_telephone_4.wav","url":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY230930006_demo1728640801276/0002_003_telephone_4.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=aL7mP447OGauRQLpTp0XR%2FQxbFk%3D","intro":"Cháu ờ đang chờ ngày mai có lương ấy có lương xong rồi mới bắt đầu vào săn được ấy.","size":0,"progress":100,"type":"mp3"},{"name":"/data/apps/damp/temp/ziptemp/APY230930006_demo1728640801276/0002_003_telephone_3.wav","url":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY230930006_demo1728640801276/0002_003_telephone_3.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=e6otVfM3MuDUCC2hF4wF16kTeqA%3D","intro":"Thì đã chọn được nhiều đồ để cho vào vào giỏ hàng để mà săn sale chưa?","size":0,"progress":100,"type":"mp3"},{"name":"/data/apps/damp/temp/ziptemp/APY230930006_demo1728640801276/0002_003_telephone_5.wav","url":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY230930006_demo1728640801276/0002_003_telephone_5.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=i3GJ8iQw8txI%2BSWcCn4SNF4FXLU%3D","intro":"Lương á cần gì đợi lương trời ơi săn sale lại săn toàn những cái hàng không đồng á lại có khi chẳng cần mất đồng lương nào luôn.","size":0,"progress":100,"type":"mp3"},{"name":"/data/apps/damp/temp/ziptemp/APY230930006_demo1728640801276/0002_003_telephone_1.wav","url":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY230930006_demo1728640801276/0002_003_telephone_1.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=XeHXRQYOQzTn1XuTszE%2BUSg8yaE%3D","intro":"Ờ Vân ơi, dạo này có","size":0,"progress":100,"type":"mp3"}],"officialSummary":"Vietnamese Spontaneous Dialogue Telephony speech dataset, collected from dialogues based on given topics. Transcribed with text content, timestamp, speaker's ID, gender and other attributes. Our dataset was collected from extensive and diversify speakers(more than 1200 native speakers), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.","dataexampl":null,"datakeyword":["Conversational speech","Vietnamese asr data"," Vietnamese"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Language,Data Type","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechRec","BGimg":"brightSpot_audio","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"]}
Vietnamese Spontaneous Dialogue Telephony speech dataset, collected from dialogues based on given topics. Transcribed with text content, timestamp, speaker's ID, gender and other attributes. Our dataset was collected from extensive and diversify speakers(more than 1200 native speakers), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Format
8kHz, 8bit, u-law/a-law wav, mono channel;
Recording condition
quiet indoor environment, without echo;
Content category
dozens of topics are specified, and the speakers make dialogue under those topics while the recording is performed;
Speaker
1,234 Vietnam native speakers in total, with 53% male and 47% female;
Features of annotation
transcription text, timestamp, speaker ID and gender;
Recording device
Telephony recording system;
Language
Vietnamese;
Language(Region) Code
vi-VN
Country
Vietnam(VNM)
Application scenarios
speech recognition; voiceprint recognition;
Accuracy rate
Word Accuracy Rate (WAR) 98%.
Sample
Audio
đến ngày sale lớn mười một tháng mười một đấy.
Audio
Cháu ờ đang chờ ngày mai có lương ấy có lương xong rồi mới bắt đầu vào săn được ấy.
Audio
Thì đã chọn được nhiều đồ để cho vào vào giỏ hàng để mà săn sale chưa?
Audio
Lương á cần gì đợi lương trời ơi săn sale lại săn toàn những cái hàng không đồng á lại có khi chẳng cần mất đồng lương nào luôn.