{"id":939,"datatype":"1","titleimg":"https://res.datatang.com/asset/productNew/APY180915001.png?Expires=2007353648&OSSAccessKeyId=LTAI5tQwXnJZbubgVfVa1ep9&Signature=afFK9TFUMEZ/8Bjx3v6Gh1POdWA%3D","type1":"165","type1str":null,"type2":"165","type2str":null,"dataname":"1,535 Hours - Mandarin Chinese and English(China) Mix Scripted Monologue Smartphone speech dataset","datazy":[{"title":"Format","value":"16kHz, 16bit, uncompressed wav, mono channel;"},{"title":"Content category","value":"Generic domain, human-machine interaction;"},{"title":"Recording condition","value":"Low background noise (indoor), without echo;"},{"title":"Recording device","value":"Android smartphone, iPhone;"},{"title":"Speaker","value":"3,972 speakers in total, 43% male and 57% female. 68% speakers of all are in the age group of 12-25, 31% speakers of all in the age group of 26-45, 1% speakers of all are in the age group of 46-60;"},{"title":"Country","value":"China(CHN);"},{"title":"Language","value":"Mandarin Chinese, English;"},{"title":"Features of annotation","value":"Transcription text."},{"title":"Accuracy Rate","value":"Sentence Accuracy Rate (SAR) 97%"}],"datatag":"Mix, Mandarin Chinese,English,China,Smartphone,Reading","technologydoc":null,"downurl":null,"datainfo":"2,733 native Chinese speakers, covering seven dialect districts. Recording text: mixed Chinese& English sentences, includes common& human-computer interaction scenarios, rich content, accurate transcription. This data can be applied to impove the Speech recognition system's recognition of mixed Chinese&English","standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":["3972 people","seven main dialect zones","sentences with Chinese and English"],"samplePresentation":[["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY180915001_demo1712829638215/G05003S1010.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=anc0N%2F5CD6oOeVJkmf8tYMok7d8%3D","/data/apps/damp/temp/ziptemp/APY180915001_demo1712829638215/G05003S1010.wav","讲座最后一个分支是worm的形成"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY180915001_demo1712829638215/G05003S2309.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=WMixr6BHMDVVMxaHaxDtkEcLPLU%3D","/data/apps/damp/temp/ziptemp/APY180915001_demo1712829638215/G05003S2309.wav","定位@NIKEA总部位置"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY180915001_demo1712829638215/G00001S1002.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=LGtvUbPjNIdOcj5lLOkS%2F%2FNvwT8%3D","/data/apps/damp/temp/ziptemp/APY180915001_demo1712829638215/G00001S1002.wav","[N]新年第一天来了个reject扎心啊"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY180915001_demo1712829638215/G25216S2450.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=1mASEjK9gC%2F20WUrwyVjri2BFM4%3D","/data/apps/damp/temp/ziptemp/APY180915001_demo1712829638215/G25216S2450.wav","切换嘻哈风格的Code of Honor听。[N]"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY180915001_demo1712829638215/G25216S1252.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=VbiEDigyF6d%2BuVF5fpk6sFDyMlI%3D","/data/apps/damp/temp/ziptemp/APY180915001_demo1712829638215/G25216S1252.wav","[S]话说airmail就要洋气得多[N]"]],"officialSummary":"Mandarin Chinese and English(China) Mix Scripted Monologue Smartphone speech dataset, collected from monologue based on given Chinese and English Mixed prompts, covering general and human-computer interaction domains. Transcribed with text content and other attributes. Our dataset was collected from extensive and diversify speakers(3972 Chinese native speakers), geographicly speaking, enhancing model performance in real and complex tasks.rnQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.","dataexampl":"","datakeyword":["Chinese and English mixed reading voice"," mixed reading voice data"," mobile phone collection of voice data"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Language,Data Type","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechRec","BGimg":"brightSpot_audio","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"],"single":"no"}

en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

1,535 Hours - Mandarin Chinese and English(China) Mix Scripted Monologue Smartphone speech dataset

Chinese and English mixed reading voice

mixed reading voice data

mobile phone collection of voice data

Mandarin Chinese and English(China) Mix Scripted Monologue Smartphone speech dataset, collected from monologue based on given Chinese and English Mixed prompts, covering general and human-computer interaction domains. Transcribed with text content and other attributes. Our dataset was collected from extensive and diversify speakers(3972 Chinese native speakers), geographicly speaking, enhancing model performance in real and complex tasks.rnQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.

Specifications

Specifications

Format

16kHz, 16bit, uncompressed wav, mono channel;

Content category

Generic domain, human-machine interaction;

Recording condition

Low background noise (indoor), without echo;

Recording device

Android smartphone, iPhone;

Speaker

3,972 speakers in total, 43% male and 57% female. 68% speakers of all are in the age group of 12-25, 31% speakers of all in the age group of 26-45, 1% speakers of all are in the age group of 46-60;

Country

China(CHN);

Language

Mandarin Chinese, English;

Features of annotation

Transcription text.

Accuracy Rate

Sentence Accuracy Rate (SAR) 97%

Sample

Sample

Audio
讲座最后一个分支是worm的形成
Audio
定位@NIKEA总部位置
Audio
[N]新年第一天来了个reject扎心啊
Audio
切换嘻哈风格的Code of Honor听。[N]
Audio
[S]话说airmail就要洋气得多[N]

Recommended Datasets

Recommended Dataset

303 Hours - Mandarin Chinese and English(China) Mix Scripted Monologue Smartphone speech dataset

Mandarin Chinese and English(China) Mix Scripted Monologue Smartphone speech dataset, collected from monologue based on given Chinese and English Mixed prompts, covering general and human-computer interaction domains. Transcribed with text content and other attributes. Our dataset was collected from extensive and diversify speakers(1,113 speakers), geographicly speaking, enhancing model performance in real and complex tasks.rnQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

Chinese and English mixed reading voice mixed reading voice data mobile phone collection of voice data

Tell Us Your Special Needs

Full Name *

Contact Phone No. *

Company name *

Company Email *

Data Requirements *

By submitting, I agree to the Privacy Protection

Subscribe to our newsletter

Be the first to receive Nexdata latest product releases, data solutions and enterprise news.

Off-the-Shelf Datasets: All Category Datasets; Computer Vision Datasets; Speech Recognition Datasets; Speech Synthesis Datasets; OCR Datasets; Pronunciation Dictionary; NLU Datasets; LLM Datasets

Data Service: 3D Point Cloud Data; Street View Data; OCR Data; Behavior Recognition Data; Identity Recognition Data; Speech Recognition Data; Speech Synthesis Data; Multimodal Data

Industries: Autonomous Vehicles; AR/VR; Conversational AI; Smart Home; Retail; Intelligent Healthcare

Company: About Us; News; partners; Quality & Security; Event
Links: OPENMPD

Annotation Platform: Annotation Platform
Resources: Sponsored Datasets

Sharpen Your AI with Better Data

+1(626)594-5598

[email protected]

Copyright © 2023 NEXDATA TECHNOLOGY INC

Sitemap Terms and Conditions

We use cookies to enhance your browsing experience, serve personalized ads or content, and analyze our traffic. By clicking "Accept All", you consent to our use of cookies.

d49a319e-f2e9-4409-a234-a09894de7771

873998b4-b475-4e77-b53c-91cdfb1f668e