Papers
arxiv:2606.02956

The Road Ahead in Autonomous Driving: The KITScenes Multimodal Dataset

Published on Jun 1
· Submitted by
Fabian Immel
on Jun 5

Abstract

KITScenes Multimodal dataset provides high-fidelity European driving data with comprehensive 3D maps and diverse urban environments for embodied AI research.

Existing autonomous driving datasets have enabled major progress, but fall short in sensor fidelity, map completeness, or geographic diversity. We present KITScenes Multimodal, a European dataset built around high-fidelity sensors and maps. Our fully synchronized sensor suite combines high-resolution global-shutter cameras, long-range lidar beyond 400m, 4D imaging radar, and redundant GNSS/INS localization. Our HD maps are, to our knowledge, the most complete of any sensor dataset, validated through autonomous driving trials on open-source software. For the first time in a public dataset, all driving-relevant traffic elements, such as traffic lights, are mapped in 3D to a reprojection-accurate level with full topological connectivity. Recorded in cities with irregular street layouts and mixed traffic modes, our dataset complements existing datasets by broadening the available geographic diversity. We also introduce four benchmarks, each advancing spatial learning for embodied AI: online HD map construction, long-range depth estimation, novel view synthesis, and end-to-end driving. Project page: https://kitscenes.com/

Community

Paper author Paper submitter
edited about 9 hours ago

We present KITScenes Multimodal, a European dataset built around a high-fidelity sensor suite together with the most complete HD maps for autonomous driving ever released.

Highlights

  • European urban focus — recordings from Karlsruhe, Frankfurt, and Sindelfingen
  • High-fidelity sensor suite — 72.5 MPix global-shutter surround cameras, 7 LiDARs (~906k points/frame on average), 3 Continental ARS548 4D imaging radars, and redundant GNSS/INS
  • Long-range sensing — effective LiDAR range beyond 400 m with substantially higher return density than common public driving datasets
  • Production-grade HD maps — Lanelet2 maps with lane topology, regulatory elements, 29 road-feature classes, 120 traffic-sign classes (GTSIGN-220 taxonomy), and 3D-localized signs, traffic lights, and poles
  • Research benchmarks — relational HD map perception, long-range monocular depth estimation, novel view synthesis, and end-to-end / world-model research

Paper author Paper submitter

We present KITScenes Multimodal, a European dataset built around a high-fidelity sensor suite together with the most complete HD maps for autonomous driving ever released.

Highlights

  • European urban focus — recordings from Karlsruhe, Frankfurt, and Sindelfingen
  • High-fidelity sensor suite — 72.5 MPix global-shutter surround cameras, 7 LiDARs (~906k points/frame on average), 3 Continental ARS548 4D imaging radars, and redundant GNSS/INS
  • Long-range sensing — effective LiDAR range beyond 400 m with substantially higher return density than common public driving datasets
  • Production-grade HD maps — Lanelet2 maps with lane topology, regulatory elements, 29 road-feature classes, 120 traffic-sign classes (GTSIGN-220 taxonomy), and 3D-localized signs, traffic lights, and poles
  • Research benchmarks — relational HD map perception, long-range monocular depth estimation, novel view synthesis, and end-to-end / world-model research

This is an automated message from the Librarian Bot. I found the following papers similar to this paper.

The following papers were recommended by the Semantic Scholar API

Please give a thumbs up to this comment if you found it helpful!

If you want recommendations for any Paper on Hugging Face checkout this Space

You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend

Sign up or log in to comment

Get this paper in your agent:

hf papers read 2606.02956
Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2606.02956 in a model README.md to link it from this page.

Datasets citing this paper 1

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2606.02956 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.