[{"@type":"PropertyValue","name":"Storage format","value":"TXT"},{"@type":"PropertyValue","name":"Data content","value":"English-Japanese Parallel Corpus Data"},{"@type":"PropertyValue","name":"Data size","value":"0.85 million pairs of English-Japanese Parallel Corpus Data. The English sentences contain 23 words on average."},{"@type":"PropertyValue","name":"Language","value":"English, Japanese"},{"@type":"PropertyValue","name":"Accuracy rate","value":"90%"},{"@type":"PropertyValue","name":"Application scenario","value":"machine translation"}]
{"id":1186,"datatype":"1","titleimg":"https://res.datatang.com/asset/productNew/APY220720003.jpg?Expires=2007353710&OSSAccessKeyId=LTAI5tQwXnJZbubgVfVa1ep9&Signature=9MRdeR6cjnYd%2BlPzjuY/aeWZrq4%3D","type1":"183","type1str":null,"type2":"185","type2str":null,"dataname":"English-Japanese Parallel Corpus – 850,000 Sentence Pairs for Machine Translation","datazy":[{"title":"Storage format","content":"TXT","desc":"Storage format"},{"title":"Data content","content":"English-Japanese Parallel Corpus Data","desc":"Data content"},{"title":"Data size","content":"0.85 million pairs of English-Japanese Parallel Corpus Data. The English sentences contain 23 words on average.","desc":"Data size"},{"title":"Language","content":"English, Japanese","desc":"Language"},{"title":"Accuracy rate","content":"90%","desc":"Accuracy rate"},{"title":"Application scenario","content":"machine translation","desc":"Application scenario"}],"datatag":"English,Japanese,English-Japan,Parallel corpus","technologydoc":null,"downurl":null,"datainfo":null,"standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":"","samplePresentation":[{"name":"/data/apps/damp/temp/ziptemp/APY220720003_demo1711015209296/APY220720003-demo/en_ja ????.png","url":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY220720003_demo1711015209296/APY220720003-demo/en_ja%20%3F%3F%3F%3F.png?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=UCEqlC34Vc5x33Uwd%2FM9z2ObDkQ%3D","intro":"","size":0,"progress":100,"type":"jpg"}],"officialSummary":"This dataset contains 850,000 English-Japanese parallel sentences stored in TXT format. It covers multiple fields such as tourism, medical treatment, daily life, news, etc. average English sentence 23 words. The data desensitization and quality checking had been done. It can be used as a fundamental dataset for machine translation, bilingual NLP tasks, and other text processing applications.","dataexampl":null,"datakeyword":["English Japanese parallel corpus","English Japanese translation dataset","English Japanese bilingual corpus","English Japanese parallel dataset","English Japanese text dataset","English Japanese MT dataset"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Type","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"nlu","BGimg":"","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"]}
English-Japanese Parallel Corpus – 850,000 Sentence Pairs for Machine Translation
English Japanese parallel corpus
English Japanese translation dataset
English Japanese bilingual corpus
English Japanese parallel dataset
English Japanese text dataset
English Japanese MT dataset
This dataset contains 850,000 English-Japanese parallel sentences stored in TXT format. It covers multiple fields such as tourism, medical treatment, daily life, news, etc. average English sentence 23 words. The data desensitization and quality checking had been done. It can be used as a fundamental dataset for machine translation, bilingual NLP tasks, and other text processing applications.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Storage format
TXT
Data content
English-Japanese Parallel Corpus Data
Data size
0.85 million pairs of English-Japanese Parallel Corpus Data. The English sentences contain 23 words on average.