[{"@type":"PropertyValue","name":"Storage format","value":"TXT"},{"@type":"PropertyValue","name":"Data content","value":"Chinese-Uighur Parallel Corpus Data"},{"@type":"PropertyValue","name":"Data size","value":"0.1 million pairs of Chinese-Uighur Parallel Corpus Data"},{"@type":"PropertyValue","name":"Language","value":"Chinese, Uighur"},{"@type":"PropertyValue","name":"Application scenario","value":"machine translation"}]
{"id":149,"datatype":"1","titleimg":"https://res.datatang.com/asset/productNew/APY170101225.png?Expires=2007353638&OSSAccessKeyId=LTAI5tQwXnJZbubgVfVa1ep9&Signature=LCRNWY22zaL8D6zNzyBLJf7ofpE%3D","type1":"183","type1str":null,"type2":"183","type2str":null,"dataname":"100,000 Groups - Chinese-Uighur Parallel Corpus Data","datazy":[{"title":"Storage format","value":"TXT"},{"title":"Data content","value":"Chinese-Uighur Parallel Corpus Data"},{"title":"Data size","value":"0.1 million pairs of Chinese-Uighur Parallel Corpus Data"},{"title":"Language","value":"Chinese, Uighur"},{"title":"Application scenario","value":"machine translation"}],"datatag":"Chinese-Uighur,Parallel Corpus","technologydoc":null,"downurl":null,"datainfo":"It collected 100,000 sets of Chinese-Uighur intertranslation corpus data, which is cleaned, desensitized, and quality tested. It can be used as the basic corpus for text data analysis and used in machine translation.","standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":["100,999 pairs","Chinese, Uygur"],"samplePresentation":["jpg","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY170101225_demo1711015202833/HW01509034_demo/HW01509034_demo.jpg?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=x%2FUBRlWPRlYDQsi2b%2BrA9Go9QWQ%3D","/data/apps/damp/temp/ziptemp/APY170101225_demo1711015202833/HW01509034_demo/HW01509034_demo.jpg",""],"officialSummary":"100,000 sets of Chinese and Uighur language parallel translation corpus, data storage format is txt document, data fluency and loyalty is above 80%. Data cleaning, desensitization and quality inspection have been carried out, which can be used as a basic corpus for text data analysis and in fields such as machine translation.","dataexampl":"","datakeyword":["Chinese and Uygur Parallel Corpus Data"," Alignment Corpus"," Parallel Corpus Data"," Alignment Corpus Data"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Type","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"nlu","BGimg":"","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"],"single":"yes"}
[{"@type":"VideoObject","embedUrl":"p"},{"@type":"VideoObject","embedUrl":"t"},{"@type":"VideoObject","embedUrl":"d"},{"@type":"VideoObject"}]
100,000 Groups - Chinese-Uighur Parallel Corpus Data
Chinese and Uygur Parallel Corpus Data
Alignment Corpus
Parallel Corpus Data
Alignment Corpus Data
100,000 sets of Chinese and Uighur language parallel translation corpus, data storage format is txt document, data fluency and loyalty is above 80%. Data cleaning, desensitization and quality inspection have been carried out, which can be used as a basic corpus for text data analysis and in fields such as machine translation.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Data content
Chinese-Uighur Parallel Corpus Data
Data size
0.1 million pairs of Chinese-Uighur Parallel Corpus Data
Application scenario
machine translation
Sample
Recommended Dataset
Tell Us Your Special Needs
9fba21cd-280f-47ba-8a2b-6a3c835f2c76