en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

m.nexdata.datatang.com

English-Japanese Parallel Corpus – 850,000 Sentence Pairs for Machine Translation

English Japanese parallel corpus
English Japanese translation dataset
English Japanese bilingual corpus
English Japanese parallel dataset
English Japanese text dataset
English Japanese MT dataset

This dataset contains 850,000 English-Japanese parallel sentences stored in TXT format. It covers multiple fields such as tourism, medical treatment, daily life, news, etc. average English sentence 23 words. The data desensitization and quality checking had been done. It can be used as a fundamental dataset for machine translation, bilingual NLP tasks, and other text processing applications.

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Storage format
TXT
Data content
English-Japanese Parallel Corpus Data
Data size
0.85 million pairs of English-Japanese Parallel Corpus Data. The English sentences contain 23 words on average.
Language
English, Japanese
Accuracy rate
90%
Application scenario
machine translation
Sample Sample
  • English-Japanese Parallel Corpus – 850,000 Sentence Pairs for Machine Translation
Recommended DatasetsRecommended Dataset
Tell Us Your Special Needs

By submitting, I agree to the Privacy Protection

6c62962e-99af-4a50-b960-b90356bfd9d6

a2ad5ba8-0c27-4aeb-bb80-429919c77c3a