[{"@type":"PropertyValue","name":"Storage format","value":"text"},{"@type":"PropertyValue","name":"Data content","value":"Chinese-Urdu Parallel Corpus Data"},{"@type":"PropertyValue","name":"Data size","value":"0.98 million pairs of Chinese-Urdu Parallel Corpus Data. The Chinese sentences contain 19.9 characters on average."},{"@type":"PropertyValue","name":"Language","value":"Chinese, Urdu"},{"@type":"PropertyValue","name":"Accuracy rate","value":"90%"},{"@type":"PropertyValue","name":"Application scenario","value":"machine translation"}]
{"id":1247,"datatype":"1","titleimg":"/shujutang/static/image/index/datatang_wenben_default.webp","type1":"183","type1str":null,"type2":"183","type2str":null,"dataname":"980,000 Groups - Chinese-Urdu Parallel Corpus Data","datazy":[{"title":"Storage format","value":"text"},{"title":"Data content","value":"Chinese-Urdu Parallel Corpus Data"},{"title":"Data size","value":"0.98 million pairs of Chinese-Urdu Parallel Corpus Data. The Chinese sentences contain 19.9 characters on average."},{"title":"Language","value":"Chinese, Urdu"},{"title":"Accuracy rate","value":"90%"},{"title":"Application scenario","value":"machine translation"}],"datatag":"Chinese,Urdu,Chinese-Urdu,Parallel Corpus","technologydoc":null,"downurl":null,"datainfo":"","standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":"","samplePresentation":["jpg","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY230328001_demo1729159200917/zh-ur-demo.png?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=nUPuvCfxkLg92E5Ay0vj079s53I%3D","/data/apps/damp/temp/ziptemp/APY230328001_demo1729159200917/zh-ur-demo.png",""],"officialSummary":"980,000 sets of Chinese and Urdu language parallel translation corpus, data storage format is txt document. Data cleaning, desensitization, and quality inspection have been carried out, which can be used as a basic corpus for text data analysis and in fields such as machine translation.","dataexampl":"","datakeyword":["Chinese and Urdu Parallel Corpus Data"," Alignment Corpus"," Parallel Corpus Data"," Alignment Corpus Data"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Type","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"nlu","BGimg":"","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"],"single":"yes"}
980,000 Groups - Chinese-Urdu Parallel Corpus Data
Chinese and Urdu Parallel Corpus Data
Alignment Corpus
Parallel Corpus Data
Alignment Corpus Data
980,000 sets of Chinese and Urdu language parallel translation corpus, data storage format is txt document. Data cleaning, desensitization, and quality inspection have been carried out, which can be used as a basic corpus for text data analysis and in fields such as machine translation.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Storage format
text
Data content
Chinese-Urdu Parallel Corpus Data
Data size
0.98 million pairs of Chinese-Urdu Parallel Corpus Data. The Chinese sentences contain 19.9 characters on average.