SynthBAD – Synthetic batch data generation for active learning and domain adaptation

In order to avoid the costly collection of real data, the project is developing an innovative tool chain for the synthetic generation of realistic camera data. This data supports AI training in automated road traffic and robotics, tested at the institute's ViewCar II and FASCar facilities.

The collection of extensive real data plays an essential role in the development of AI systems and is often associated with a high level of time and financial expenditure that is no longer economically justifiable. For this reason, there is an urgent need to generate the data required for AI training synthetically. However, there are still various challenges, such as the imbalance and redundancy of the training data. For example, the rare occurrence of certain critical events (e.g. near misses) must not lead to incorrect system behaviour. A frequently used technique for data generation is domain adaptation, with which existing data from a source domain (e.g. path and obstacle detection) is adapted to a specific target domain (e.g. path and obstacle detection in bad weather) without having to annotate this data again. Therefore, the aim of the project is to develop a tool chain to synthetically generate realistic camera data for applications in automated road traffic and robotics and to apply and demonstrate it.

Conversion of synthetic data (left) into domain-adapted data required for AI (right).
Credit:

links/left ©Rockstar Games (aus/from GTA) (Playing for Data: Ground Truth from Computer Games (tu-darmstadt.de)), rechts/right ©DLR

The project makes a research contribution to automated driving by modelling the tool chain on the sensors of the institute's ViewCar II and FASCar systems and thus generating synthetic data for AI training with our vehicles. The DLR Institute of Transportation Systems defines the requirements for such a tool chain and designs, implements and tests it.

Initial process chain

Project title:
SynthBAD - Synthetic batch data generation for active learning and domain adaptation

Duration:
01/2023 to 12/2023

Project volume:
€ 165.902,26

This project is managed by the department:

Contact

Dr. Sascha Knake-Langhorst

Head of Department
German Aerospace Center (DLR)
Institute of Transportation Systems
Information Acquisition and Model Design
Lilienthalplatz 7, 38108 Braunschweig