222,289 Images – OCR Data in Natural Scenes. The dataset was collected in both indoor and outdoor scenes and covers multiple scenes and multiple shooting angles. Text is annotated at the line, word, and character level, and each annotation includes a transcription that matches the text content. The dataset can be used for OCR tasks in natural scenes.
This is a paid dataset for commercial use, research purposes, and more. Licensed, ready-made datasets help jump-start AI projects.
Specifications
Data size: 222,289 images
Collecting environment: indoor and outdoor scenes
Data diversity: multiple scenes, multiple shooting angles
Device: cellphone, camera
Shooting angle: looking up, looking down, eye level
Data format: images in .jpg, .jpeg, and .png; annotation files in .json
Annotation content: rectangular bounding box annotation and text transcription at the line level, word level, and character level
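
The listing states the file formats and the three annotation levels but does not publish the annotation schema. The following is a minimal sketch of how such data might be loaded, under assumptions that are not from the listing: the keys `line`, `word`, `character`, `bbox`, and `text`, the `[x, y, width, height]` box layout, and the convention that each image has a same-named .json file next to it are all hypothetical. Check them against the real files before use.

```python
from __future__ import annotations

import json
from dataclasses import dataclass
from pathlib import Path

# Image extensions named in the specifications.
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png"}

# Annotation levels named in the specifications. The JSON key names
# are an ASSUMPTION; the real schema is not documented in the listing.
LEVELS = ("line", "word", "character")


@dataclass
class TextBox:
    level: str    # "line", "word", or "character"
    x: float      # left edge of the rectangle (assumed layout)
    y: float      # top edge of the rectangle (assumed layout)
    width: float
    height: float
    text: str     # transcription of the text inside the box


def parse_annotation(record: dict) -> list[TextBox]:
    """Flatten one annotation record into a list of TextBox objects.

    Assumes a hypothetical shape:
    {"line": [{"bbox": [x, y, w, h], "text": "..."}], "word": [...], ...}
    """
    boxes = []
    for level in LEVELS:
        for item in record.get(level, []):
            x, y, w, h = item["bbox"]
            boxes.append(TextBox(level, x, y, w, h, item["text"]))
    return boxes


def pair_images_with_annotations(root: Path) -> list[tuple[Path, Path]]:
    """Pair each image with a same-named .json file (assumed convention)."""
    pairs = []
    for image in sorted(root.rglob("*")):
        if image.suffix.lower() in IMAGE_EXTENSIONS:
            annotation = image.with_suffix(".json")
            if annotation.exists():
                pairs.append((image, annotation))
    return pairs


def load_boxes(annotation_path: Path) -> list[TextBox]:
    """Read one annotation file and parse it with the assumed schema."""
    with annotation_path.open(encoding="utf-8") as handle:
        return parse_annotation(json.load(handle))
```

A typical use would be to call `pair_images_with_annotations(Path("dataset_root"))` (the directory name is a placeholder) and pass each annotation path to `load_boxes`. Keeping the level on each `TextBox` makes it easy to filter to, for example, only word-level boxes when training a word recognizer.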