NLU Datasets

Instantly enhance AI model performance with high-quality, off-the-shelf datasets.

Type

All (34)
Entity Identification (4)
Dialogue Text (1)
Intention Understanding (1)
Others (2)
Parallel Corpus (23)

850,000 Groups - English-Japanese Parallel Corpus Data

The 850,000 English-Japanese Parallel Corpus Data is bilingual text stored in text format. It covers multiple fields such as tourism, medical treatment, daily life, and news. English sentences average 23 words. Data desensitization and quality checking have been done. It can be used as a basic corpus for text data analysis in fields such as machine translation.
English - Japanese Parallel Corpus Data English -Japanese Parallel Corpus Parallel Corpus Data Alignment Corpus Data

380,000 Groups - Japanese-English Parallel Corpus Data

A Japanese-English parallel corpus of 380,000 groups in total. Political, pornographic, personal-information, and other sensitive vocabulary has been excluded. It can serve as a base corpus for text-based data analysis in machine translation and other fields.
Japanese and English parallel corpus data Japanese and English parallel corpus collection Alignment Corpus Parallel Corpus Data Alignment Corpus Data

9,830,000 Groups - Chinese-Japanese Parallel Corpus Data

The 9.83 million sentence pairs of Chinese-Japanese parallel corpus data are stored in txt format. The corpus covers multiple fields including general, IT, news, patent, and international engine. Data desensitization and quality checking have been done. It can be used as a basic corpus for text data analysis in fields such as machine translation.
Chinese-Japanese parallel corpus Chinese-Japanese alignment Parallel Corpus Data Alignment Corpus Data

10,000 Chinese News Events Annotation Data

10,000 annotated Chinese news items. The content is hot news from 2013. Each news item contains one or more events, and each event is annotated. The data is stored in XML and can be used for natural language understanding.
Chinese news corpus annotation corpus annotation news corpus corpus data

8,178 Chinese Social Comments Events Annotation Data

8,178 annotated Chinese social comments. The content is hot news from 2013. Each news item contains one or more events and is annotated with time, theme, cause, procedure, and result. The data is stored in XML and can be used for natural language understanding.
Social comment event annotation data event annotation comment annotation data event annotation data
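
As a rough illustration of how XML event annotations with the five fields named above (time, theme, cause, procedure, result) might be read, here is a minimal Python sketch. The element names, the nesting, and the sample record are all assumptions made for illustration; the actual schema of the dataset is not described on this page and may differ.

```python
import xml.etree.ElementTree as ET

# Made-up sample record. The tag names and layout are hypothetical;
# the real dataset's XML schema is not documented here.
SAMPLE = """<news id="1">
  <event>
    <time>2013-05-01</time>
    <theme>flood</theme>
    <cause>heavy rain</cause>
    <procedure>river overflowed</procedure>
    <result>roads closed</result>
  </event>
</news>"""

# The five annotation fields listed in the dataset description.
FIELDS = ("time", "theme", "cause", "procedure", "result")


def parse_events(xml_text):
    """Return one dict per <event> element, keyed by annotation field."""
    root = ET.fromstring(xml_text)
    events = []
    for event in root.iter("event"):
        # Missing fields become empty strings instead of raising.
        events.append({f: (event.findtext(f) or "").strip() for f in FIELDS})
    return events


if __name__ == "__main__":
    for ev in parse_events(SAMPLE):
        print(ev)
```

A flat dict per event keeps the sketch independent of any particular downstream toolkit; a real loader would need to follow the delivered schema.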

56,920 Car Fine Granularity Comments Annotation Data

Comments are collected from different car forums, and fine-grained annotation is carried out on the posts written by users. Annotations include labels for manufacturer, brand, model, attribute, description value, tendency, etc. The data can be used in fine-grained natural language understanding research, sentiment analysis, and other fields.
Fine-grained car comment annotation data car comment data annotation text data collection nlu data

687,694 Open Domain Intention Annotation Data

Annotations of 687,694 user-generated sentences from mobile phone scenarios, covering to-do, location, and schedule scenarios. The dataset can be used for natural language understanding tasks.
open domain data intent annotation data textual data annotation SMS text data nlu data Intention understanding data

28,237 Intent-Type Single-Sentence Annotation Data

Intent-type single-sentence annotated text data comprising 28,237 manually written sentences, annotated with intent classes and with slot and slot value information. The intent domains include music, weather, date, schedule, home equipment, etc. It is applicable to intent recognition research and related fields.
intent annotation data interactive intent annotation data intent recognition nlp intent recognition data NLU data

47,811 Sentences - Intention Annotation Data in Interactive Scenes

Intent-type single-sentence annotated text data comprising 47,811 sentences, annotated with intent classes and with slot and slot value information. The intent domains include music, weather, date, schedule, home equipment, etc. It is applicable to intent recognition research and related fields.
intent annotation data interactive intent annotation data intent recognition nlp intent recognition data NLU data
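
To make "intent class plus slot and slot value" concrete, the following Python sketch shows one possible in-memory representation of a single annotated sentence. The class names, field names, and the example sentence are hypothetical and chosen only for illustration; the delivered file format of these datasets is not specified on this page.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Slot:
    name: str   # slot type, e.g. "genre" (hypothetical label)
    value: str  # the text span that fills the slot


@dataclass
class IntentExample:
    text: str    # the user sentence
    intent: str  # intent class, e.g. "music"
    slots: List[Slot] = field(default_factory=list)


def slots_present(example: IntentExample) -> bool:
    """Sanity check: every slot value should occur verbatim in the sentence."""
    return all(slot.value in example.text for slot in example.slots)


# Invented example sentence, not taken from the dataset.
example = IntentExample(
    text="play some jazz music",
    intent="music",
    slots=[Slot(name="genre", value="jazz")],
)

if __name__ == "__main__":
    print(example.intent, [(s.name, s.value) for s in example.slots])
    print("slots consistent:", slots_present(example))
```

A check like `slots_present` is a cheap way to catch misaligned annotations when loading data of this kind, assuming slot values are stored as surface strings.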

Tailor Your Data Now

Why Off-the-Shelf Datasets

  • Copyright

    Clear Copyright and Ready to Check
  • Security

    Properly Authorized, Secure to Use
  • Professional

    Designed and Produced by AI Data Experts
  • Diversity

    Collected from a Variety of Real Scenes
  • Cost Effective

    More Cost-Efficient Than Tailored Data
  • Efficiency

    Ready to Go, Delivered in Seconds