en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

Speech Recognition Datasets

Instantly enhance AI model performance with high quality off-the-shelf datasets.

Language

All
8
Arabic
2
Burmese
2
Chinese Dialects
23
English
47
French
7
German
7
Hindi
6
Indonesian
8
Italian
8
Japanese
6
Korean
11
Malay
5
Mandarin
32
Others
28
Portugese
4
Russian
5
Spanish
12
Thai
5
Vietnamese
4

Data Type

All
8
Dialogue
91
Read
129

96 Hours - Japanese(Japan) Children Real-world Casual Conversation and Monologue speech dataset

Japanese(Japan) Children Real-world Casual Conversation and Monologue speech dataset, covers self-media, conversation, live, lecture, variety show and other generic domains, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, age, accent and other attributes. Our dataset was collected from extensive and diversify speakers(12 years old and younger children), geographicly speaking, enhancing model performance in real and complex tasks.rnQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
Spontaneous Speech text annotation Japanese

513 Hours – Japanese Conversational Speech Data by Telephone

The 513 Hours - Japanese Conversational Speech of natural conversations collected by telephony involved more than 800 native speakers, developed with the proper balance of gender ratio, Speakers would choose a few familiar topics out of the given list and start conversations to ensure dialogues' fluency and naturalness. The recording devices is telephony recording system. The audio format is 8kHz, 8bit, uncompressed WAV, and all the speech data was recorded in quiet indoor environments. All the speech audio was manually transcribed with text content, the start and end time of each effective sentence, and speaker identification. The accuracy rate of sentences is ≥ 95%.
Japanese natural conversation speech data Japanese natural conversation speech Japanese natural conversation data Japanese conversation speech data

633 Hours - Japanese(Japan) Spontaneous Dialogue Smartphone speech dataset

Japanese(Japan) Spontaneous Dialogue Smartphone speech dataset, collected from dialogues based on given topics. Transcribed with text content, timestamp, speaker's ID, gender and other attributes. Our dataset was collected from extensive and diversify speakers(around 1000 native speakers), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
Japanese discuss data Japanese discuss dataset Japanese discuss collection Japanese small talk data Japanese small talk dataset

261 Hours - Japanese(Japan) Scripted Monologue Smartphone speech dataset

Japanese(Japan) Scripted Monologue Smartphone speech dataset, collected from monologue based on given scripts, covering generic domain. Transcribed with text content and other attributes. Our dataset was collected from extensive and diversify speakers( 1006 Japanese native speakers), geographicly speaking, enhancing model performance in real and complex tasks.nQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
Japanese data Japanese audio data basic recognition Japanese reading audio

474 Hours - Japanese(Japan) Scripted Monologue Smartphone speech dataset

Japanese(Japan) Scripted Monologue Smartphone speech dataset, collected from monologue based on given scripts, covering generic domain, human-machine interaction, smart home command and in-car command, numbers and other domains. Transcribed with text content and other attributes. Our dataset was collected from extensive and diversify speakers(1,245 speakers in total), geographicly speaking, enhancing model performance in real and complex tasks.rnQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
Japanese Scripted Monologue speech data

234.8 Hours - Japanese(Japan) Scripted Monologue Smartphone speech dataset

Japanese(Japan) Scripted Monologue Smartphone speech dataset, collected from monologue based on given scripts, covering 210,000 formal or informal expressions. Transcribed with text content and other attributes. Our dataset was collected from extensive and diversify speakers(799 Japanese recorded in mixed condition, such as indoor, roadside, restaurant, etc.), geographicly speaking, enhancing model performance in real and complex tasks.nQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
Japanese pronunciation mobile phone collecting voice data reading aloud voice

207 Hours - English(Japan) Scripted Monologue Smartphone speech dataset

English(Japan) Scripted Monologue Smartphone speech dataset, collected from monologue based on given scripts, covering generic domain, human-machine interaction, smart home command and control, in-car command and control, numbers and other domains. Transcribed with text content and other attributes. Our dataset was collected from extensive and diversify speakers(464 people in total), geographicly speaking, enhancing model performance in real and complex tasks.nQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
Accent English Japanese Japan English

593 Hours - English(China) Scripted Monologue Smartphone speech dataset

English(China) Scripted Monologue Smartphone speech dataset, collected from monologue based on given scripts, covering 100,000 common expressions. Transcribed with text content and other attributes. Our dataset was collected from extensive and diversify speakers(3,691 Chinese, covering domestic dialect zones like Jiangsu, Shandong, Beijing, He'nan, and meets the specific accents of Chinese speaking English), geographicly speaking, enhancing model performance in real and complex tasks.nQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
Chinese speak English English voice data and mobile phones collect voice data.china chinaman oriental taiwanese byzantine chink chinese woman formosan asian chinawoman japanese korean sinaean celestial chine chino chugoku non-chinese ovary pekin acupuncture airframe airframes all-china beijing patois vernacular language lingo speech idiom argot tongue accent jargon slang parlance cant patter brogue provincialism localism regional language locution terminology vocabulary local speech regionalism pidgin regionalisms colloquialism phraseology idioms mother tongue talk pronunciation local language creole langue lingua franca phrasing tongues idiolect lexicon vernacularism colloquial word brogues dialectal jive talk languages lingua localisms wording accents business language

loading

Tailor Your Data Now

Why off-the-shelf Datasets

  • Copyright

    Copyright

    Clear Coyright and Ready to Check
  • Security

    Security

    Properly Authorized Secure to Use
  • Professional

    Professional

    Designed and produced by AI data experts
  • Diversity

    Diversity

    Collected from a varity of real scenes
  • Cost Effective

    Cost Effective

    More Cost-Efficient Than Tailored Data
  • Efficiency

    Efficiency

    Ready-To-Go Deliver in Seconds
c8044156-6300-4d0f-ae2d-bd5546a80eea