Skip to the content.

Public Datasets

Dataset Finder

Image Datasets

Facial Recognition

Action Recognition

Object Detection and Recognition

Handwriting and character recognition

Aerial images

Thermal images

Underwater datasets

Document analysis datasets

Other images

Text data

Reviews

News Articles

Messages

Twitter and tweets

Dialogues

Question Answering

Other text

Medical Datasets

Audio Datasets

IoT Datasets

Recommender Datasets

Book

Dating

E-commerce

Music

Movies

Games

Jokes

Food

Anime

Scholarly Paper

Healthcare

Others

Anomaly Data

Text Classification

Machine Translation

Plant disease

Multivariate data

Financial

Demand and Sales forecasting

Other Multivariate

Time Series

Graphs

Document Analysis

General Classifications

​ - CelebFaces Attributes (CelebA) Dataset: A popular one to use over 200k images of celebrities and use Computer vision concepts for implementing facial recognition. Format:
Default task:

General Regression

eSports Datasets

In this Kaggle Dataset, I provide just over 1400 competitive matchmaking matches from Valve’s game Counter-strike: Global Offensive (CS:GO). The data was extracted from competitive matchmaking replays submitted to csgo-stats. I intend for this data-set to be purely exploratory, however users are free to create their own predictive models they see fit.
Format:
Default task:

After a series of adventures, we’re happy to announce that we’re finally ready to release a second data dump of over a billion matches, this time with information ranging from March 2011 to March 2016! Format:
Default task:

Synthetic Datasets

The dataset was used for neural rendering research at Google that takes advantage of rasterized image buffers and converts them into high quality raytraced fur renders. We believe that this dataset can contribute to the computer graphics and machine learning community to develop more advanced techniques with fur rendering.
Format:
Default task: