Hugging Face and Yaak Unveil Massive Open-Source Self-Driving Dataset

Max Carter

Max Carter

March 11, 2025 · 3 min read
Hugging Face and Yaak Unveil Massive Open-Source Self-Driving Dataset

Hugging Face, a prominent AI development platform, has teamed up with AI startup Yaak to expand its LeRobot collection with a massive open-source dataset for autonomous vehicles. The new dataset, called Learning to Drive (L2D), is designed to support the development of end-to-end self-driving models and boasts an impressive size of over 1 petabyte.

L2D is unique in its focus on capturing real-world driving scenarios, with data collected from sensors installed on cars in German driving schools. The dataset includes camera, GPS, and "vehicle dynamics" data from driving instructors and students navigating various environments, such as city streets, construction zones, intersections, and highways. This comprehensive approach enables the development of more accurate and robust self-driving models.

In contrast to other open self-driving training sets, such as those from Alphabet's Waymo and Comma AI, L2D is designed to support end-to-end learning. This approach enables the prediction of actions directly from sensor inputs, such as camera footage, rather than relying on high-quality annotations. According to L2D's creators, this makes it more scalable and accessible to the AI community.

The launch of L2D is a significant milestone in the development of autonomous vehicles, as it provides a massive, open-source dataset for researchers and developers to train and test their models. Hugging Face and Yaak are calling on the AI community to submit models and tasks they'd like to be evaluated on, such as navigating roundabouts and parking spaces. The companies plan to conduct real-world "closed-loop" testing of models trained using L2D and LeRobot this summer, deployed on a vehicle with a safety driver.

The implications of L2D are far-reaching, with the potential to accelerate the development of autonomous vehicles and robotics systems. By providing a massive, open-source dataset, Hugging Face and Yaak are democratizing access to self-driving technology and enabling a wider range of researchers and developers to contribute to the field. As the AI community begins to explore the possibilities of L2D, we can expect to see significant advancements in the years to come.

In conclusion, the launch of L2D marks a significant step forward in the development of autonomous vehicles and robotics systems. With its massive size, real-world focus, and end-to-end learning approach, L2D is poised to become a game-changer in the field. As the AI community begins to explore the possibilities of this new dataset, we can expect to see significant advancements in the years to come.

Similiar Posts

Copyright © 2024 Starfolk. All rights reserved.