en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

4,720,000 Groups - Chinese-Uighur Parallel Corpus Data

Chinese and Uygur Parallel Corpus Data
Alignment Corpus
Parallel Corpus Data
Alignment Corpus Data

4,720,000 sets of Chinese and Uighur language parallel translation corpus, data storage format is txt document. Data cleaning, desensitization, and quality inspection have been carried out, which can be used as a basic corpus for text data analysis and in fields such as machine translation.

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Storage format
TXT
Data content
Chinese-Uighur Parallel Corpus Data
Data size
4.72 million pairs of Chinese-Uighur Parallel Corpus Data. The Chinese sentences contain 22 characters on average
Language
Chinese, Uighur
Application scenario
machine translation
Accuracy rate
90%
Sample Sample
  • 4,720,000 Groups - Chinese-Uighur Parallel Corpus Data
Recommended DatasetsRecommended Dataset
Tell Us Your Special Needs

By submitting, I agree to the Privacy Protection

c3f4e9ef-7ecd-4f50-ad43-1921a7a9dd05

9fdcfb92-2e38-40fc-af73-7f42bf61af88