548 Hours - Taiwanese Accent Mandarin (China) Real-world Casual Conversation and Monologue Speech Dataset
The Taiwanese Accent Mandarin (China) Real-world Casual Conversation and Monologue speech dataset covers self-media, conversation, live streaming, and other generic domains, mirroring real-world interactions. It is transcribed with text content, speaker ID, gender, and other attributes. The data was collected from a large and geographically diverse pool of speakers, which is intended to improve model performance on real-world, complex tasks. Quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and use. All of our datasets comply with GDPR, CCPA, and PIPL.
This is a paid dataset for commercial use, research purposes, and more. Licensed, ready-made datasets help jump-start AI projects.
Specifications
Format
16 kHz, 16-bit, mono channel, WAV
Data Content
Spontaneous, colloquial speech data collected from public channels
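The Format specification above (16 kHz, 16-bit, mono, WAV) can be checked on delivered files before they enter a training pipeline. The following is a minimal sketch using Python's standard `wave` module; the function name `matches_spec` and the constant names are hypothetical, chosen for illustration, and are not part of any tooling shipped with the dataset.

```python
import wave

# Expected values taken from the Format specification above.
EXPECTED_SAMPLE_RATE = 16_000  # 16 kHz
EXPECTED_SAMPLE_WIDTH = 2      # 16 bits = 2 bytes per sample
EXPECTED_CHANNELS = 1          # mono


def matches_spec(path):
    """Return True if the WAV file at `path` is 16 kHz, 16-bit, mono.

    `matches_spec` is a hypothetical helper for illustration only.
    """
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == EXPECTED_SAMPLE_RATE
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH
            and wav.getnchannels() == EXPECTED_CHANNELS
        )
```

Calling `matches_spec("some_clip.wav")` on each file (the filename here is a placeholder) gives a quick way to flag clips that would need resampling or channel conversion.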