[{"@type":"PropertyValue","name":"Storage format","value":"TXT"},{"@type":"PropertyValue","name":"Data content","value":"Chinese-Korean Parallel Corpus Data"},{"@type":"PropertyValue","name":"Data size","value":"12.82 million pairs of Chinese-Korean Parallel Corpus Data. The Chinese sentences contain 25.7 characters on average."},{"@type":"PropertyValue","name":"Language","value":"Chinese, Korean"},{"@type":"PropertyValue","name":"Accuracy rate","value":"90%"},{"@type":"PropertyValue","name":"Application scenario","value":"machine translation"}]
{"id":1200,"datatype":"1","titleimg":"/shujutang/static/image/index/datatang_wenben_default.webp","type1":"183","type1str":null,"type2":"183","type2str":null,"dataname":"12,820,000 Groups - Chinese-Korean Parallel Corpus Data","datazy":[{"title":"Storage format","value":"TXT"},{"title":"Data content","value":"Chinese-Korean Parallel Corpus Data"},{"title":"Data size","value":"12.82 million pairs of Chinese-Korean Parallel Corpus Data. The Chinese sentences contain 25.7 characters on average."},{"title":"Language","value":"Chinese, Korean"},{"title":"Accuracy rate","value":"90%"},{"title":"Application scenario","value":"machine translation"}],"datatag":"Chinese,Korean,Chinese-Korean,Parallel Corpus","technologydoc":null,"downurl":null,"datainfo":"","standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":"","samplePresentation":["jpg","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY220720005_demo1711015209476/zh-ko%20%3F%3F%3F%3F.png?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=DiH301E2zIFDQhnNLMtcQ9OQwOs%3D","/data/apps/damp/temp/ziptemp/APY220720005_demo1711015209476/zh-ko ????.png",""],"officialSummary":"12,820,000 sets of parallel translation corpus between China and Korea, which are stored in txt files. It covers many fields including spoken language, traveling, news, and finance. Data cleaning, desensitization, and quality inspection have been carried out. It can be used as the basic corpus database in the text data files as well as used in machine translation.","dataexampl":"","datakeyword":["China and South Korea Parallel Corpus"," Corpus Data"," Alignment Corpus"," Parallel Corpus Data"," Alignment Corpus Data"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Type","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"nlu","BGimg":"","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"],"single":"yes"}
[{"@type":"VideoObject","embedUrl":"p"},{"@type":"VideoObject","embedUrl":"t"},{"@type":"VideoObject","embedUrl":"d"},{"@type":"VideoObject"}]
12,820,000 Groups - Chinese-Korean Parallel Corpus Data
China and South Korea Parallel Corpus
Corpus Data
Alignment Corpus
Parallel Corpus Data
Alignment Corpus Data
12,820,000 sets of parallel translation corpus between China and Korea, which are stored in txt files. It covers many fields including spoken language, traveling, news, and finance. Data cleaning, desensitization, and quality inspection have been carried out. It can be used as the basic corpus database in the text data files as well as used in machine translation.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Data content
Chinese-Korean Parallel Corpus Data
Data size
12.82 million pairs of Chinese-Korean Parallel Corpus Data. The Chinese sentences contain 25.7 characters on average.
Application scenario
machine translation
Sample
Recommended Dataset
Tell Us Your Special Needs
d4314d78-0a5a-4f56-8d2a-834251b507a3