en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

1 Million Pairs Image Caption Data Of General Scenes

Text description
multi-modality
general scene data set
English caption
Chinese caption

1 million pairs of images and descriptions, the pictures cover various categories, including landscapes, animals, flowers and trees, people, cars, sports, industry, and architecture, along with an aesthetic subset. They depict the overall scene of the image, the details within the scene, and the emotions conveyed by the image. The description is provided in both English and Chinese languages.

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Data size
1 million pairs of images and descriptions
Image type
covers landscapes, animals, flowers and trees, people, cars, sports, industry, and architecture
Data format
image format is .jpg, text format is .txt
Text length
in principle, the description should be no less than 200 Chinese characters
Main description content
overall scene of the picture, detailed description of the elements within the scene, and the emotions conveyed by the picture
Accuracy rate
the proportion of correctly labeled images is not less than 95%
Image Resolution
no less than 2 million pixels, most of them are higher than 5 million pixels
Sample Sample
  • 1 Million Pairs Image Caption Data Of General Scenes
  • 1 Million Pairs Image Caption Data Of General Scenes
Recommended DatasetsRecommended Dataset
Tell Us Your Special Needs

By submitting, I agree to the Privacy Protection

ab26ca71-7140-4efe-beeb-940c70f95cc5

bd4b0b53-cfa7-4d69-a9c3-fd2ff136a6f8