Open datasets

Discover the best free datasets for machine learning

Browse all the open and public datasets hosted on and off Segments.ai. You can easily clone and adapt them to your own needs.

Did you create a public dataset that you want to share?
Reach out to us.

  • BDD 100K

    Image

    A diverse driving dataset for heterogeneous multitask learning

    100M

    frames

    40

    classes

    100%

    labeled

  • nuPlan

    Multi-sensor

    nuPlan is a large-scale planning benchmark for autonomous driving.

    38.8M

    frames

    7

    classes

    100%

    labeled

  • Argoverse 2 Sensor

    Multi-sensor

    A collection of open-source autonomous driving data from six U.S. cities.

    6M

    frames

    30

    classes

    100%

    labeled

  • nuScenes

    Multi-sensor

    nuScenes is a dataset for autonomous driving. It includes street scenes from Singapore and Boston, U.S.

    1.8M

    frames

    32

    classes

    100%

    labeled

  • A2D2

    Multi-sensor

    A2D2 is the Audi Autonomous Driving Dataset. The data was captured in three German cities.

    400K

    frames

    38

    classes

    10%

    labeled

  • Waymo Perception

    Multi-sensor

    Diverse autonomous driving dataset with scenes captured in 6 U.S. areas with different environments and weather conditions.

    390K

    frames

    28

    classes

    100%

    labeled

  • Zenseact

    Multi-sensor

    A large multi-modal autonomous driving dataset, created by researchers at Zenseact.

    100K

    frames

    30

    classes

    100%

    labeled

  • JackRabbot Dataset and Benchmark

    Multi-sensor

    Indoor and outdoor data from a human perspective.

    60K

    frames

    1

    classes

    100%

    labeled

  • PandaSet

    Multi-sensor

    High-quality dataset by lidar producer Hesai. Its 100+ scenes are selected from two routes in Silicon Valley.

    48K

    frames

    37

    classes

    100%

    labeled

  • Leddar PixSet

    Multi-sensor

    The PixSet dataset contains 97 sequences for a total of roughly 29k frames using the AV sensor suite.

    29K

    frames

    22

    classes

    100%

    labeled

  • Mapillary Vistas

    Image > Segmentation

    A street-level imagery dataset with pixel‑accurate and instance‑specific human annotations.

    2.5K

    frames

    124

    classes

    100%

    labeled

  • COCO

    Image > Segmentation

    200K labeled images of objects such as different kinds of animals, appliances, food, and much more.

    20K

    frames

    80

    classes

    100%

    labeled

  • Rooms

    Image > Segmentation

    A dataset of rooms

    12.9K

    frames

    58

    classes

    0%

    labeled

  • DENSE Seeing Through Fog

    Multi-sensor > Bounding box

    Data captured in different weather conditions, such as fog, snow, and rain, in northern Europe.

    12K

    frames

    4

    classes

    100%

    labeled

  • saptadeb – Soy

    Image > Segmentation

    A dataset of soy fields

    11.2K

    frames

    1

    classes

    2%

    labeled

  • comma10k

    Image > Segmentation

    10,000 PNGs of real driving captured from the comma fleet, semantically labeled by the public.

    10K

    frames

    5

    classes

    93%

    labeled

  • KITTI

    Multi-sensor

    KITTI is a dataset of lidar sequences of street scenes in Karlsruhe, Germany.

    7.5K

    frames

    8

    classes

    100%

    labeled

  • saptadeb – Corn

    Image > Segmentation

    A dataset of corn fields

    6.4K

    frames

    1

    classes

    32%

    labeled

  • Cityscapes

    Image > Segmentation

    The Cityscapes Dataset focuses on semantic understanding of urban street scenes.

    5K

    frames

    classes

    100%

    labeled

  • Hallway Gyeonggibuk Science High

    Image > Segmentation

    Hallway on the 3rd floor of Gyeonggibuk Science High School.

    3K

    frames

    1

    classes

    60%

    labeled

  • Panoptic – Visible

    Image > Segmentation

    A panoptic road segmentation dataset

    2K

    frames

    133

    classes

    100%

    labeled

  • Lawn mower

    Image > Segmentation

    Driveable area on lawns

    1.3K

    frames

    1

    classes

    100%

    labeled

  • Sidewalk Segmentation

    Image > Segmentation

    A dataset of sidewalk images gathered in Belgium in the summer of 2021

    1K

    frames

    34

    classes

    100%

    labeled

  • Manthan

    Image > Segmentation

    A dataset consisting of road scenes

    641

    frames

    4

    classes

    48%

    labeled

  • Cars background removal

    Image > Segmentation

    A dataset of cars

    617

    frames

    3

    classes

    57%

    labeled

  • Class

    Image > Segmentation

    A dataset of a hallway with objects obstructing the way.

    451

    frames

    3

    classes

    100%

    labeled

  • Vienna Automotive Dataset

    Image > Segmentation

    Road scene dataset from Vienna

    295

    frames

    29

    classes

    8%

    labeled

  • Lane lines

    Image > Segmentation

    Lane line segmentation on a road

    189

    frames

    1

    classes

    100%

    labeled

  • Youbot Semantics

    Image > Segmentation

    Indoor segmentation from the viewpoint of the YouBot robot

    140

    frames

    13

    classes

    100%

    labeled

  • Blackberry2Right

    Image > Segmentation

    Segmentation of driveable space in a field of blackberries

    100

    frames

    1

    classes

    98%

    labeled

  • Road satelite images

    Image > Segmentation

    Road segmentation for satellite images

    97

    frames

    1

    classes

    100%

    labeled

  • Potholes

    Image > Segmentation

    A dataset of potholes

    81

    frames

    1

    classes

    100%

    labeled