1 Million Pairs Image Caption Data Of General Scenes
Text description
multi-modality
general scene data set
English caption
Chinese caption
The dataset contains 1 million pairs of images and descriptions. The pictures cover various categories, including landscapes, animals, flowers and trees, people, cars, sports, industry, and architecture, along with an aesthetic subset. The descriptions cover the overall scene of the image, the details within the scene, and the emotions conveyed by the image, and are provided in both English and Chinese.
This is a paid dataset for commercial use, research, and more. Licensed, ready-made datasets help jump-start AI projects.
Specifications
Data size: 1 million pairs of images and descriptions
Image type: landscapes, animals, flowers and trees, people, cars, sports, industry, and architecture
Data format: images are .jpg, text is .txt
Text length: in principle, each description is no less than 200 Chinese characters
Main description content: overall scene of the picture, detailed description of the elements within the scene, and the emotions conveyed by the picture
Accuracy rate: the proportion of correctly labeled images is not less than 95%
Image resolution: no less than 2 million pixels; most images are higher than 5 million pixels
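
The specifications above (.jpg images, .txt descriptions, at least 200 Chinese characters per description, at least 2 million pixels per image) can be turned into a simple sanity check on a delivered copy. The following is a minimal Python sketch, not a vendor tool. It assumes that each image and its description share a basename (for example 1.jpg and 1.txt), which the page does not state, and that the caller obtains the image width and height with an image library of their choice. The function names are hypothetical.

```python
from pathlib import Path

MIN_PIXELS = 2_000_000      # "no less than 2 million pixels"
MIN_CAPTION_CHARS = 200     # "no less than 200 Chinese characters"


def pair_by_stem(root):
    """Map each shared basename to its (image_path, caption_path) pair.

    Assumption: an image and its description share a basename.
    """
    root = Path(root)
    images = {p.stem: p for p in root.rglob("*.jpg")}
    captions = {p.stem: p for p in root.rglob("*.txt")}
    shared = sorted(images.keys() & captions.keys())
    return {stem: (images[stem], captions[stem]) for stem in shared}


def count_cjk(text):
    """Count characters in the CJK Unified Ideographs block (U+4E00 to U+9FFF)."""
    return sum(1 for ch in text if "\u4e00" <= ch <= "\u9fff")


def meets_spec(width, height, caption):
    """Check one sample against the resolution and caption-length thresholds."""
    enough_pixels = width * height >= MIN_PIXELS
    long_enough = count_cjk(caption) >= MIN_CAPTION_CHARS
    return enough_pixels and long_enough
```

The length check counts only Chinese characters, so it applies to the Chinese description. An English description would need a separate threshold, which the page does not specify.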