en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

High-Quality Training Datasets

Boost the performance of your AI models with our high-quality, ready-to-use training datasets.

Language

All

Data Type

All

9,000 Images of 180 People - Driver Gesture 21 Landmarks Annotation Data

9,000 Images of 180 People - Driver Gesture 21 Landmarks Annotation Data. This data diversity includes multiple age periods, multiple time periods, multiple gestures, multiple vehicle types, multiple time periods. For annotation, the vehicle type, gesture type, person nationality, gender, age and gesture 21 landmarks (each landmark includes the attribute of visible and invisible) were annotated. This data can be used for tasks such as driver gesture recognition, gesture landmarks detection and recognition.
DMS driver gesture gesture 21 landmarks static gesture dynamic gesture driver gesture recognition gesture landmarks detection gesture landmarks recognition

19 Hours Bus Scene Noise Data by Voice Recorder

Bus Scene Noise Data by Voice Recorder, including in-bus and bus platform scenes, recorded by Tascam DR-07x, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
Noise Voice recorder Bus Platform

10 million - English Test Questions Text Parsing And Processing Data

10 Million - English Test Questions Text Parsing And Processing Data, Each question contains title, answer, parse, subject, grade, question type; The educational stages cover primary, middle, high school, and university; Subjects cover mathmatics, biology, accounting, etc.The data are questions text under the Anglo-American system, which can be used to enhance the subject knowledge of large models
English test questions text data LLM Large Language Model Large Model chatgpt data

5,500,000 Groups - Turkish-English Parallel Corpus Data

The 5,500,000 English Turkish Parallel Corpus Data is a bilingual text is stored in text format. It covers multiple fields such as tourism, medical treatment, daily life, news, etc. The data desensitization and quality checking had been done. It can be used as a basic corpus for text data analysis in fields such as machine translation.
Parallel Corpus Tukish Engling

190 Hours - French(France) Gaming Real-world Casual Conversation and Monologue speech dataset

French(France) Gaming Real-world Casual Conversation and Monologue speech dataset, covers spontaneous dialogue about popular and evergreen games, including player discussions on battle strategies, social interactions, esports news, etc., mirrors real-world interactions. Transcribed with text content, offenssive expressions, speaker's ID, gender, accent and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
French Spontaneous Dialogue Gaming

217 Hours - Spanish Financial Entities Real-world Casual Conversation and Monologue speech dataset

Spanish Financial Entities Real-world Casual Conversation and Monologue speech dataset, covering various financial professional terminologies, primarily focuses on macroeconomics and microeconomics, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, common entities and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
Spanish Entity Spontaneous Dialogue Financial

200 Hours - Portuguese(Brazil) Financial Entities Real-world Casual Conversation and Monologue speech dataset

Portuguese(Brazil) Financial Entities Real-world Casual Conversation and Monologue speech dataset, covering various financial professional terminologies, primarily focuses on macroeconomics and microeconomics, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
Brazilian Portuguese Entity Spontaneous Dialogue Financial

203 Hours - German(Germany) Financial Entities Real-world Casual Conversation and Monologue speech dataset

German(Germany) Financial Entities Real-world Casual Conversation and Monologue speech dataset, covering various financial professional terminologies, primarily focuses on macroeconomics and microeconomics, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, common entities and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
German Entity Spontaneous Dialogue Financial

2 People - Korean Average Tone Speech Synthesis Corpus

2 People - Korean Average Tone Speech Synthesis Corpus. It is recorded by korean native , with authentic accent. Contains news and colloquial style general corpus,the phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
TTS Korean Average Tone

105 Hours - Italian(Italy) Gaming Real-world Casual Conversation and Monologue speech dataset

Italian(Italy) Gaming Real-world Casual Conversation and Monologue speech dataset, covers spontaneous dialogue about popular and evergreen games, including player discussions on battle strategies, social interactions, esports news, etc., mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, accent, offensive expression labeling and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
Italy Spontaneous Dialogue Gaming Italian

300 Hours - English(India) Spontaneous Dialogue Smartphone speech dataset

English(India) Spontaneous Dialogue Smartphone speech dataset, collected from dialogues based on given topics. Transcribed with text content, timestamp, speaker's ID, gender and other attributes. Our dataset was collected from extensive and diversify speakers(390 native speakers), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
English India Spontaneous Dialogue

206 Hours - English Financial Entities Real-world Casual Conversation and Monologue speech dataset

English Financial Entities Real-world Casual Conversation and Monologue speech dataset, covering various financial professional terminologies, primarily focuses on macroeconomics and microeconomics, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, common entities and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
English Entity Spontaneous Dialogue Financial
. . .
loading

loading

b8d01cd5-6792-439b-b058-c1355d28454b