[{"@type":"PropertyValue","name":"Data content","value":"Text pairs of original and corrected texts for four European languages"},{"@type":"PropertyValue","name":"Data volume","value":"480000 pairs"},{"@type":"PropertyValue","name":"Languages","value":"French, German, Spanish, Italian"},{"@type":"PropertyValue","name":"Field","value":"input,output"},{"@type":"PropertyValue","name":"Format","value":"JSON"}]
{"id":1515,"datatype":"1","titleimg":"","type1":"226","type1str":null,"type2":"227","type2str":null,"dataname":"480000 corrected texts in German, Spanish, French, Italian","datazy":[{"title":"Data content","desc":"Data content","content":"Text pairs of original and corrected texts for four European languages"},{"desc":"Data volume","content":"480000 pairs","title":"Data volume"},{"desc":"Languages","content":"French, German, Spanish, Italian","title":"Languages"},{"desc":"Field","content":"input,output","title":"Field"},{"desc":"Format","content":"JSON","title":"Format"}],"datatag":"German, French, Spanish, Italian, proofreading","technologydoc":null,"downurl":null,"datainfo":null,"standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":"","samplePresentation":[{"name":"1FR000977.json","url":"https://storage-product.datatang.com/damp/product/instructions_zh/20250407185849/1FR000977.json?Expires=4102415999&OSSAccessKeyId=LTAI5tEBeSWUJiqjXvBMsxEu&Signature=mY%2Fqqi96178exaUjE5HlR4YuA%2FM%3D","intro":"法语样例","size":240,"progress":100,"type":"mp4"},{"name":"1GE000008.json","url":"https://storage-product.datatang.com/damp/product/instructions_zh/20250407185849/1GE000008.json?Expires=4102415999&OSSAccessKeyId=LTAI5tEBeSWUJiqjXvBMsxEu&Signature=uusNeM%2BcS%2Bzpo6VfgCZFq0p9qx0%3D","intro":"德语样例","size":452,"progress":100,"type":"mp4"},{"name":"1IT001477.json","url":"https://storage-product.datatang.com/damp/product/instructions_zh/20250407185849/1IT001477.json?Expires=4102415999&OSSAccessKeyId=LTAI5tEBeSWUJiqjXvBMsxEu&Signature=qJi6PUpx8ZKMxvqy2Q8tTebxgPA%3D","intro":"意大利语样例","size":271,"progress":100,"type":"mp4"}],"officialSummary":"This dataset focuses on the four major European languages (French, German, Spanish, Italian) and contains 480000 pairs of original and corrected text pairs. Each piece of data is presented in JSON format, including two fields: input (raw text) and output (corrected text), which can assist in natural language processing, machine translation, and language teaching research.","dataexampl":null,"datakeyword":["German"," French"," Spanish"," Italian"," proofreading"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Type","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"llm","BGimg":"","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"],"firstList":[{"name":"1SP000218.json","url":"https://storage-product.datatang.com/damp/product/instructions_zh/20250407185849/1SP000218.json?Expires=4102415999&OSSAccessKeyId=LTAI5tEBeSWUJiqjXvBMsxEu&Signature=a%2B5EpPmkcjDtVcV35lfSPY7jZoI%3D","intro":"西班牙语样例","size":370,"progress":100,"type":"mp4"}]}
480000 corrected texts in German, Spanish, French, Italian
German
French
Spanish
Italian
proofreading
This dataset focuses on the four major European languages (French, German, Spanish, Italian) and contains 480000 pairs of original and corrected text pairs. Each piece of data is presented in JSON format, including two fields: input (raw text) and output (corrected text), which can assist in natural language processing, machine translation, and language teaching research.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Data content
Text pairs of original and corrected texts for four European languages