{"id":1128,"datatype":"1","titleimg":"https://res.datatang.com/asset/productNew/APY211031005.png?Expires=2007353699&OSSAccessKeyId=LTAI5tQwXnJZbubgVfVa1ep9&Signature=H7KI6ubdlkG0op3KvwzS%2BcFsfaw%3D","type1":"165","type1str":null,"type2":"165","type2str":null,"dataname":"711 Hours - Vietnamese(Vietnam) Real-world Casual Conversation and Monologue speech dataset","datazy":[{"title":"Format","value":"16kHz, 16 bit, wav, mono channel"},{"title":"Content category","value":"including interview, self-meida,variety show, etc."},{"title":"Recording environment","value":"Low background noise"},{"title":"Country","value":"Vietnam(VNM)"},{"title":"Language(Region) Code","value":"vi-VN"},{"title":"Language","value":"Vietnamese"},{"title":"Features of annotation","value":"Transcription text, timestamp, speaker ID, gender"},{"title":"Accuracy","value":"Word Accuracy Rate (WAR) 98%"}],"datatag":"Vietnamese,Casual Conversation,Monologue,Asr","technologydoc":null,"downurl":null,"datainfo":"","standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":"","samplePresentation":[["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY211031005_demo1703757600163/0006_3.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=Aorj05msKCKn7et6seOZZ52sA9E%3D","/data/apps/damp/temp/ziptemp/APY211031005_demo1703757600163/0006_3.wav","xin mời quý vị và các bạn cùng theo dõi cuộc trò chuyện về một chủ đề mà Thảo Vân nghĩ rằng, ờ, luôn luôn mới"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY211031005_demo1703757600163/0006_5.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=4JsbstNTNk5kiKg6sCxTzKTAY%2F0%3D","/data/apps/damp/temp/ziptemp/APY211031005_demo1703757600163/0006_5.wav","Như ngày hôm nay, chúng tôi muốn mời quý vị, ờ không chỉ là chị em phụ nữ mà cả các anh, hãy cùng lắng nghe, cùng theo dõi với chúng tôi"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY211031005_demo1703757600163/0006_4.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=6DcKu1gDLauRJ5tBV2oJjFSvECU%3D","/data/apps/damp/temp/ziptemp/APY211031005_demo1703757600163/0006_4.wav","luôn luôn không bao giờ cũ, mặc dù nó là một chủ đề chúng ta vẫn nói nhiều lần, đó là chủ đề làm đẹp."],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY211031005_demo1703757600163/0006_2.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=1L0kRad%2FNFc1kmsNyYc9yb706Gc%3D","/data/apps/damp/temp/ziptemp/APY211031005_demo1703757600163/0006_2.wav","Thưa quý vị, trong câu chuyện à đêm muộn ngày hôm nay"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY211031005_demo1703757600163/0006_1.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=ggSUHnbjEWd1rpf4cjF1TWv%2BLWI%3D","/data/apps/damp/temp/ziptemp/APY211031005_demo1703757600163/0006_1.wav","Xin kính chào quý vị khán giả."]],"officialSummary":"Vietnamese(Vietnam) Real-world Casual Conversation and Monologue speech dataset, covers self-media, conversation, live and other generic domains, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.","dataexampl":"","datakeyword":["Vietnamese","Colloquial Video","text annotation"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Language,Data Type","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechRec","BGimg":"brightSpot_audio","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"],"single":"no"}
Vietnamese(Vietnam) Real-world Casual Conversation and Monologue speech dataset, covers self-media, conversation, live and other generic domains, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Format
16kHz, 16 bit, wav, mono channel
Content category
including interview, self-meida,variety show, etc.
Recording environment
Low background noise
Country
Vietnam(VNM)
Language(Region) Code
vi-VN
Language
Vietnamese
Features of annotation
Transcription text, timestamp, speaker ID, gender
Accuracy
Word Accuracy Rate (WAR) 98%
Sample
Audio
xin mời quý vị và các bạn cùng theo dõi cuộc trò chuyện về một chủ đề mà Thảo Vân nghĩ rằng, ờ, luôn luôn mới
Audio
Như ngày hôm nay, chúng tôi muốn mời quý vị, ờ không chỉ là chị em phụ nữ mà cả các anh, hãy cùng lắng nghe, cùng theo dõi với chúng tôi
Audio
luôn luôn không bao giờ cũ, mặc dù nó là một chủ đề chúng ta vẫn nói nhiều lần, đó là chủ đề làm đẹp.
Audio
Thưa quý vị, trong câu chuyện à đêm muộn ngày hôm nay