[{"@type":"PropertyValue","name":"Format","value":"44.1kHz, 16bit, wav, dual channel."},{"@type":"PropertyValue","name":"Recording environment","value":"Mixed"},{"@type":"PropertyValue","name":"Recording content","value":"lectures on science and technology, training, publicity, etc."},{"@type":"PropertyValue","name":"Device","value":"AU Center Console Mixer"},{"@type":"PropertyValue","name":"Country","value":"China(CHN)"},{"@type":"PropertyValue","name":"Language","value":"Mandarin"},{"@type":"PropertyValue","name":"Features of annotation","value":"annotating for the transcription text, speaker identification and gender"},{"@type":"PropertyValue","name":"Accuracy Rate","value":"Sentence Accuracy Rate(SAR) 97%"}]
{"id":1066,"datatype":"1","titleimg":"https://res.datatang.com/asset/productNew/APY200229003.png?Expires=2007353680&OSSAccessKeyId=LTAI5tQwXnJZbubgVfVa1ep9&Signature=lNR7zzSwafiD7FPaXwDbT9Yicy0%3D","type1":"165","type1str":null,"type2":"165","type2str":null,"dataname":"1,722 Hours - Mandarin(China) Near-field Conference speech dataset","datazy":[{"title":"Format","value":"44.1kHz, 16bit, wav, dual channel."},{"title":"Recording environment","value":"Mixed"},{"title":"Recording content","value":"lectures on science and technology, training, publicity, etc."},{"title":"Device","value":"AU Center Console Mixer"},{"title":"Country","value":"China(CHN)"},{"title":"Language","value":"Mandarin"},{"title":"Features of annotation","value":"annotating for the transcription text, speaker identification and gender"},{"title":"Accuracy Rate","value":"Sentence Accuracy Rate(SAR) 97%"}],"datatag":"Mandarin,Speech,Near field,Center Console","technologydoc":null,"downurl":null,"datainfo":"","standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":"","samplePresentation":[["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY200229003_demo1712743251148/APY200229003_demo/01.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=5iK%2BRv2bdt%2F0%2FEO4vDSi40%2FO19w%3D","/data/apps/damp/temp/ziptemp/APY200229003_demo1712743251148/APY200229003_demo/01.wav","我觉得呃我觉得我工作的一直非常的开心为什么开心呢因为其实有三点"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY200229003_demo1712743251148/APY200229003_demo/05.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=NHJo3x6EmLvBWCXChY8UIyQ0Onk%3D","/data/apps/damp/temp/ziptemp/APY200229003_demo1712743251148/APY200229003_demo/05.wav","但是我们每家我们都有我们自己的特点我们每家都能够呃做出自己的亮点我们都可以去互相的学习"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY200229003_demo1712743251148/APY200229003_demo/04.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=%2BSfn2DtikBBVHVnxacyfXPsmmTI%3D","/data/apps/damp/temp/ziptemp/APY200229003_demo1712743251148/APY200229003_demo/04.wav","因为不管我们行业发展到什么样的情况我们的企业在什么样的阶段我们的学习培训工作在什么样的阶段"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY200229003_demo1712743251148/APY200229003_demo/02.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=2LwBd3QdtDvZgZ%2BeW8X%2BrxcePsU%3D","/data/apps/damp/temp/ziptemp/APY200229003_demo1712743251148/APY200229003_demo/02.wav","我第一个是什么呢就是我以为我觉得这样的一个行业"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY200229003_demo1712743251148/APY200229003_demo/03.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=KEvdUIKlaAHvv6RmhJuC1fW1mNw%3D","/data/apps/damp/temp/ziptemp/APY200229003_demo1712743251148/APY200229003_demo/03.wav","是一个百花齐放嗯这个百家争鸣的行业"]],"officialSummary":"Mandarin(China) Near-field Conference speech dataset, collected the output by AU central console mixer in real speech scenes. It has a natural pronunciation without environmental noise almost, covers a variety of topics. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.","dataexampl":"","datakeyword":["Mandarin speech dataset"," Mandarin Near-field Conference dataset"," Mandarin speech data"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Language,Data Type","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechRec","BGimg":"brightSpot_audio","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"],"single":"no"}
Mandarin(China) Near-field Conference speech dataset, collected the output by AU central console mixer in real speech scenes. It has a natural pronunciation without environmental noise almost, covers a variety of topics. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Format
44.1kHz, 16bit, wav, dual channel.
Recording environment
Mixed
Recording content
lectures on science and technology, training, publicity, etc.
Device
AU Center Console Mixer
Country
China(CHN)
Language
Mandarin
Features of annotation
annotating for the transcription text, speaker identification and gender