Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Datasets filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Reset Tasks
Multimodal
Visual Question Answering
Video-Text-to-Text
Computer Vision
Depth Estimation
Image Classification
Object Detection
Image Segmentation
Text-to-Image
Image-to-Text
Image-to-Image
Image-to-Video
Unconditional Image Generation
Video Classification
Text-to-Video
Zero-Shot Image Classification
Mask Generation
Zero-Shot Object Detection
Text-to-3D
Image-to-3D
Image Feature Extraction
Natural Language Processing
Text Classification
Token Classification
Table Question Answering
Question Answering
Zero-Shot Classification
Translation
Summarization
Feature Extraction
Text Generation
Text2Text Generation
Fill-Mask
Sentence Similarity
Table to Text
Multiple Choice
Text Ranking
Text Retrieval
Audio
Text-to-Speech
Text-to-Audio
Automatic Speech Recognition
Audio-to-Audio
Audio Classification
Voice Activity Detection
Tabular
Tabular Classification
Tabular Regression
Tabular to Text
Time Series Forecasting
Reinforcement Learning
Reinforcement Learning
Robotics
Other
Graph Machine Learning
Apply filters
Datasets
1,617
Full-text search
Edit filters
Sort: Trending
Active filters:
image-to-text
Clear all
allegro/ConECT
Viewer
•
Updated
7 days ago
•
11.4k
•
154
•
4
visual-layer/imagenet-1k-vl-enriched
Viewer
•
Updated
Sep 16, 2024
•
1.33M
•
889
•
21
UniDataPro/synthetic-passports
Viewer
•
Updated
22 days ago
•
15
•
120
•
5
huggan/wikiart
Viewer
•
Updated
Mar 22, 2023
•
11.3k
•
5.49k
•
152
olivierdehaene/xkcd
Viewer
•
Updated
Oct 25, 2022
•
2.63k
•
129
•
10
poloclub/diffusiondb
Updated
Jan 22, 2024
•
8.1k
•
501
MMInstruction/M3IT
Updated
Nov 24, 2023
•
2.18k
•
127
Alex5666/Military-Aircraft-Recognition-dataset
Viewer
•
Updated
Sep 28, 2023
•
3.84k
•
140
•
3
ckandemir/amazon-products
Viewer
•
Updated
Nov 21, 2023
•
33.3k
•
411
•
15
TIGER-Lab/M-BEIR
Viewer
•
Updated
Aug 7, 2024
•
2.86M
•
1.5k
•
23
jovianzm/Pexels-400k
Viewer
•
Updated
Mar 25
•
400k
•
254
•
53
pixparse/cc3m-wds
Viewer
•
Updated
Dec 15, 2023
•
2.93M
•
6.67k
•
31
pixparse/idl-wds
Viewer
•
Updated
Mar 29, 2024
•
3.41M
•
2.77k
•
181
KBlueLeaf/danbooru2023-metadata-database
Viewer
•
Updated
Nov 1, 2024
•
7.83M
•
201
•
78
ys-zong/VLGuard
Viewer
•
Updated
Jan 19
•
3k
•
144
•
8
visualwebbench/VisualWebBench
Viewer
•
Updated
Apr 11, 2024
•
1.54k
•
271
•
14
rootsautomation/ScreenSpot
Viewer
•
Updated
Apr 10, 2024
•
1.27k
•
2.94k
•
29
BUAADreamer/llava-med-zh-instruct-60k
Viewer
•
Updated
May 21, 2024
•
56.6k
•
719
•
23
vera365/lexica_dataset
Viewer
•
Updated
May 16, 2024
•
61.5k
•
305
•
5
linxy/LaTeX_OCR
Viewer
•
Updated
Dec 29, 2024
•
269k
•
921
•
90
mlfoundations/DataComp-12M
Preview
•
Updated
Jun 26, 2024
•
275
•
9
DonkeySmall/OCR-English-Printed-12
Preview
•
Updated
Aug 3, 2024
•
8
•
3
OpenGVLab/OmniCorpus-CC-210M
Viewer
•
Updated
Mar 20
•
208M
•
704
•
26
philschmid/amazon-product-descriptions-vlm
Viewer
•
Updated
Sep 30, 2024
•
1.35k
•
1.36k
•
18
MMInstruction/VL-RewardBench
Viewer
•
Updated
25 days ago
•
1.25k
•
698
•
12
OpenFace-CQUPT/HumanCaption-HQ-311K
Viewer
•
Updated
4 days ago
•
313k
•
126
•
15
OpenFace-CQUPT/FaceCaptionHQ-4M
Viewer
•
Updated
4 days ago
•
2.96M
•
219
•
4
generalagents/showdown-clicks
Viewer
•
Updated
Mar 31
•
557
•
185
•
12
ieasybooks-org/waqfeya-library-compressed
Viewer
•
Updated
Apr 25
•
10.2k
•
720
•
5
gajeshladhar/core-five
Updated
23 days ago
•
9.64k
•
29
Previous
1
2
3
...
54
Next