Speech Recognition Datasets

Instantly enhance AI model performance with high-quality, off-the-shelf datasets.

Language

All
12
Arabic
2
Burmese
2
Chinese Dialects
25
English
47
French
8
German
8
Hindi
6
Indonesian
8
Italian
8
Japanese
6
Korean
11
Malay
5
Mandarin
32
Others
29
Portuguese
3
Russian
5
Spanish
12
Thai
5
Vietnamese
4

Data Type

All
12
Dialogue
95
Read
130

203 Hours - Korean(Korea) Medical Entities Real-world Casual Conversation and Monologue speech dataset

Korean(Korea) Medical Entities Real-world Casual Conversation and Monologue speech dataset, covering a wide range of professional medical terminology. It focuses primarily on medical consultations, medical education, and medical academic conferences and lectures, and mirrors real-world interactions. Transcribed with text content, speaker ID, gender, common entities and other attributes. The dataset was collected from a large and geographically diverse pool of speakers, which helps improve model performance on real-world, complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, protecting user privacy and legal rights throughout data collection, storage, and usage. All of our datasets comply with GDPR, CCPA, and PIPL.
Korean Entity Spontaneous Dialogue Medical

215 Hours - Korean(Korea) Financial Entities Real-world Casual Conversation and Monologue speech dataset

Korean(Korea) Financial Entities Real-world Casual Conversation and Monologue speech dataset, covering a wide range of professional financial terminology. It focuses primarily on macroeconomics and microeconomics and mirrors real-world interactions. Transcribed with text content, speaker ID, gender, common entities and other attributes. The dataset was collected from a large and geographically diverse pool of speakers, which helps improve model performance on real-world, complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, protecting user privacy and legal rights throughout data collection, storage, and usage. All of our datasets comply with GDPR, CCPA, and PIPL.
Korean Entity Spontaneous Dialogue Financial

93 Hours - Korean(Korea) Children Real-world Casual Conversation and Monologue speech dataset

Korean(Korea) Children Real-world Casual Conversation and Monologue speech dataset, covering the self-media, conversation, live, lecture, variety show and other generic domains, and mirroring real-world interactions. Transcribed with text content, speaker ID, gender, age, accent and other attributes. The dataset was collected from a large and geographically diverse pool of speakers (children aged 12 and under), which helps improve model performance on real-world, complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, protecting user privacy and legal rights throughout data collection, storage, and usage. All of our datasets comply with GDPR, CCPA, and PIPL.
Spontaneous Speech text annotation Korean

396 Hours - Korean(Korea) Real-world Casual Conversation and Monologue speech dataset

Korean(Korea) Real-world Casual Conversation and Monologue speech dataset, covering the live, variety show and speech domains, and mirroring real-world interactions. Transcribed with text content, speaker ID, gender and other attributes. The dataset was collected from a large and geographically diverse pool of speakers, which helps improve model performance on real-world, complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, protecting user privacy and legal rights throughout data collection, storage, and usage. All of our datasets comply with GDPR, CCPA, and PIPL.
Spontaneous Speech korean

136 Hours - Korean(Korea) Spontaneous Dialogue Telephony speech dataset

Korean(Korea) Spontaneous Dialogue Telephony speech dataset, collected from dialogues based on given topics and covering 20+ domains. Transcribed with text content, speaker ID, gender, age and other attributes. The dataset was collected from a large and geographically diverse pool of speakers (216 native speakers), which helps improve model performance on real-world, complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, protecting user privacy and legal rights throughout data collection, storage, and usage. All of our datasets comply with GDPR, CCPA, and PIPL.
Conversational telephone korean

393 Hours - Korean(Korea) Children Scripted Monologue Smartphone speech dataset

Korean(Korea) Children Scripted Monologue Smartphone speech dataset, collected from monologues based on given scripts and covering essay stories and numbers. Transcribed with text content and other attributes. The dataset was collected from a large and geographically diverse pool of speakers, which helps improve model performance on real-world, complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, protecting user privacy and legal rights throughout data collection, storage, and usage. All of our datasets comply with GDPR, CCPA, and PIPL.
Korean children's voice data Korean children's voice Korean collected voice voice data

290 Hours - Korean(Korea) Spontaneous Dialogue Smartphone speech dataset

Korean(Korea) Spontaneous Dialogue Smartphone speech dataset, collected from dialogues based on given topics and covering 20+ domains. Transcribed with text content, speaker ID, gender, age and other attributes. The dataset was collected from a large and geographically diverse pool of speakers (442 native speakers), which helps improve model performance on real-world, complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, protecting user privacy and legal rights throughout data collection, storage, and usage. All of our datasets comply with GDPR, CCPA, and PIPL.
Korean Conversational speech

516 Hours - Korean(Korea) Scripted Monologue Smartphone speech dataset

Korean(Korea) Scripted Monologue Smartphone speech dataset, collected from monologues based on given scripts and covering the generic domain, human-machine interaction, smart home command and control, in-car command and control, numbers and other domains. Transcribed with text content and other attributes. The dataset was collected from a large and geographically diverse pool of speakers (1,077 people in total), which helps improve model performance on real-world, complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, protecting user privacy and legal rights throughout data collection, storage, and usage. All of our datasets comply with GDPR, CCPA, and PIPL.
Korean audio data

357 Hours - Korean(Korea) Scripted Monologue Smartphone speech dataset

Korean(Korea) Scripted Monologue Smartphone speech dataset, collected from monologues based on given scripts and covering the generic domain. Transcribed with text content and other attributes. The dataset was collected from a large and geographically diverse pool of speakers (999 Korean speakers), which helps improve model performance on real-world, complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, protecting user privacy and legal rights throughout data collection, storage, and usage. All of our datasets comply with GDPR, CCPA, and PIPL.
Korean mobile phones collect voice data Korean identification data Korean collection Korean labeling

Tailor Your Data Now

Why off-the-shelf Datasets

  • Copyright

    Clear Copyright and Ready to Check
  • Security

    Properly Authorized, Secure to Use
  • Professional

    Designed and produced by AI data experts
  • Diversity

    Collected from a variety of real scenes
  • Cost Effective

    More Cost-Efficient Than Tailored Data
  • Efficiency

    Ready to Go, Delivered in Seconds