[{"@type":"PropertyValue","name":"Data size","value":"10,000 images"},{"@type":"PropertyValue","name":"Collection environment","value":"including natural scenes, urban street scenes, shopping mall scenes, exhibitions, family environment, displays and other scenes"},{"@type":"PropertyValue","name":"Acquisition equipment","value":"various brands of cameras"},{"@type":"PropertyValue","name":"Collection diversity","value":"multiple scenes, multiple time periods, multiple shooting angles"},{"@type":"PropertyValue","name":"Data format","value":"image format is .jpg, text format is .txt"},{"@type":"PropertyValue","name":"Description language","value":"English, Chinese"},{"@type":"PropertyValue","name":"Text length","value":"in principle, 30~60 words, usually 3-5 sentences"},{"@type":"PropertyValue","name":"Main description content","value":"the main scene in the image, usually including foreground and background description"},{"@type":"PropertyValue","name":"Accuracy rate","value":"the proportion of correctly labeled images is not less than 97%"}]
{"id":1283,"datatype":"1","titleimg":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/asset/productNew/nexdata/APY231231002.jpg?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=qgHfjQHlnXW%2Fkj1lS%2FgZGCGo6qE%3D","type1":"226","type1str":null,"type2":"226","type2str":null,"dataname":"10,000 Image Caption Data of Diverse Scenes","datazy":[{"title":"Data size","value":"10,000 images"},{"title":"Collection environment","value":"including natural scenes, urban street scenes, shopping mall scenes, exhibitions, family environment, displays and other scenes"},{"title":"Acquisition equipment","value":"various brands of cameras"},{"title":"Collection diversity","value":"multiple scenes, multiple time periods, multiple shooting angles"},{"title":"Data format","value":"image format is .jpg, text format is .txt"},{"title":"Description language","value":"English, Chinese"},{"title":"Text length","value":"in principle, 30~60 words, usually 3-5 sentences"},{"title":"Main description content","value":"the main scene in the image, usually including foreground and background description"},{"title":"Accuracy rate","value":"the proportion of correctly labeled images is not less than 97%"}],"datatag":"AIGC,English caption,Scene caption,Multiple scenes,Multiple shooting angles,Multiple lighting conditions","technologydoc":null,"downurl":null,"datainfo":"","standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":"","samplePresentation":[["jpg","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY231231002_demo1727344800226/%3F%3F2.png?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=5qbztRq1aDZVyC7UB6BpZg%2B6%2F1Y%3D","/data/apps/damp/temp/ziptemp/APY231231002_demo1727344800226/??2.png",""],["jpg","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY231231002_demo1727344800226/%3F%3F5.png?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=z%2BmW6brPv48W7%2FPSNRgxpWDyU18%3D","/data/apps/damp/temp/ziptemp/APY231231002_demo1727344800226/??5.png",""],["jpg","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY231231002_demo1727344800226/%3F%3F3.png?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=iEEfl7TPykgzYNMCOaOz49zbbg8%3D","/data/apps/damp/temp/ziptemp/APY231231002_demo1727344800226/??3.png",""]],"officialSummary":"10,000 Image caption data of diverse scenes including natural scenes, urban street scenes, exhibitions, family environments and other scenes, shot with different brands of cameras, including multiple time periods, multiple shooting angles, description language is English, mainly describes the main scenes in the image, usually including foreground and background description.","dataexampl":"","datakeyword":["multi-modality"," natural scene data set"," scene information data"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Type","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"llm","BGimg":"","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"],"single":"no","firstList":[["jpg","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY231231002_demo1727344800226/%3F%3F1.png?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=NQa95mWerjnOMxeKlDNjnVShawM%3D","/data/apps/damp/temp/ziptemp/APY231231002_demo1727344800226/??1.png",""]]}
10,000 Image caption data of diverse scenes including natural scenes, urban street scenes, exhibitions, family environments and other scenes, shot with different brands of cameras, including multiple time periods, multiple shooting angles, description language is English, mainly describes the main scenes in the image, usually including foreground and background description.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Data size
10,000 images
Collection environment
including natural scenes, urban street scenes, shopping mall scenes, exhibitions, family environment, displays and other scenes
Acquisition equipment
various brands of cameras
Collection diversity
multiple scenes, multiple time periods, multiple shooting angles
Data format
image format is .jpg, text format is .txt
Description language
English, Chinese
Text length
in principle, 30~60 words, usually 3-5 sentences
Main description content
the main scene in the image, usually including foreground and background description
Accuracy rate
the proportion of correctly labeled images is not less than 97%