en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

1,722 Hours - Mandarin(China) Near-field Conference speech dataset

Mandarin speech dataset
Mandarin Near-field Conference dataset
Mandarin speech data

Mandarin(China) Near-field Conference speech dataset, collected the output by AU central console mixer in real speech scenes. It has a natural pronunciation without environmental noise almost, covers a variety of topics. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Format
44.1kHz, 16bit, wav, dual channel.
Recording environment
Mixed
Recording content
lectures on science and technology, training, publicity, etc.
Device
AU Center Console Mixer
Country
China(CHN)
Language
Mandarin
Features of annotation
annotating for the transcription text, speaker identification and gender
Accuracy Rate
Sentence Accuracy Rate(SAR) 97%
Sample Sample
  • Audio

    我觉得呃我觉得我工作的一直非常的开心为什么开心呢因为其实有三点

  • Audio

    但是我们每家我们都有我们自己的特点我们每家都能够呃做出自己的亮点我们都可以去互相的学习

  • Audio

    因为不管我们行业发展到什么样的情况我们的企业在什么样的阶段我们的学习培训工作在什么样的阶段

  • Audio

    我第一个是什么呢就是我以为我觉得这样的一个行业

  • Audio

    是一个百花齐放嗯这个百家争鸣的行业

Recommended DatasetsRecommended Dataset
Tell Us Your Special Needs

By submitting, I agree to the Privacy Protection

f666c658-ee13-4075-a185-91eb49e294fa

69723457-bca5-43d5-896d-ad5075486dfa