Team: Data Science & Services

Figure: Illustration of the group's work topics (data archiving, data processing, data analysis, and in-situ data acquisition), shown against the Neustrelitz ground station and the Total Electron Content (TEC) product.

The Data Science & Services working group develops and operates research data infrastructure with a strong focus on the processing, the management, and the calibration and validation of remote sensing data. Further areas of expertise include

  • the design of, and responsibility for, operational processing chains for remote sensing data products,
  • research into mathematical data analysis methods with the aim of advancing ground station development,
  • the IT management of the Neustrelitz ground station,
  • the development of calibration and validation methods for remote sensing data, and
  • the department-wide project management.

Data Processing

An important part of our task portfolio is the generation of high-quality remote sensing data products from received raw data. These so-called ARD products (ARD: Analysis Ready Data) form the basis for all further processing steps.

Our expertise lies in the development, implementation, operation, and maintenance of the associated processing chains. In doing so, we take into account mission-specific requirements such as product parameters or time limits for product delivery.

An essential prerequisite for the efficient implementation and operation of processing chains is a suitable software framework. We have been developing this framework in coordination with colleagues at the German Remote Sensing Data Center.

To accommodate the dynamics and the necessary scaling of our processing chains, we rely on modern container-based data processing approaches such as Kubernetes.

We look back on many years of experience in the field of remote sensing data processing and operate a large number of different processing chains. These go hand in hand with diverse requirement profiles which range from many small processing jobs with products of small file size and near real-time requirements (e.g. product provision in less than 5 minutes) to the creation of complex products with long processing times (processing times of approx. 3 days).
We are currently developing and operating processing chains for the EnMAP and DESIS missions as well as the Ionosphere Monitoring and Prediction Center (IMPC).
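
The two extremes of the requirement profiles described above can be made concrete with a small sketch. It is not taken from our software framework; it merely assumes that each processing job is submitted as a Kubernetes Job whose time limit is expressed via `activeDeadlineSeconds`. The profile names, resource values, job name, and container image are hypothetical and chosen for illustration only.

```python
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class Profile:
    """Hypothetical requirement profile of a processing chain."""
    name: str
    deadline_seconds: int  # maximum wall-clock time allowed for one job
    cpu: str               # Kubernetes CPU request, e.g. "500m" or "8"
    memory: str            # Kubernetes memory request, e.g. "512Mi"


# Illustrative values only; the time limits mirror the two extremes in the text
# (less than 5 minutes and approx. 3 days). Resource figures are assumptions.
NEAR_REAL_TIME = Profile("nrt", deadline_seconds=5 * 60, cpu="500m", memory="512Mi")
LONG_RUNNING = Profile("long", deadline_seconds=3 * 24 * 3600, cpu="8", memory="32Gi")


def build_job(job_name: str, image: str, profile: Profile) -> dict:
    """Return a Kubernetes batch/v1 Job manifest as a plain dict."""
    return {
        "apiVersion": "batch/v1",
        "kind": "Job",
        "metadata": {"name": job_name, "labels": {"profile": profile.name}},
        "spec": {
            # Kubernetes terminates the job once this time limit is exceeded.
            "activeDeadlineSeconds": profile.deadline_seconds,
            "backoffLimit": 0,  # no automatic retries in this sketch
            "template": {
                "spec": {
                    "restartPolicy": "Never",
                    "containers": [
                        {
                            "name": "processor",
                            "image": image,
                            "resources": {
                                "requests": {
                                    "cpu": profile.cpu,
                                    "memory": profile.memory,
                                }
                            },
                        }
                    ],
                }
            },
        },
    }


if __name__ == "__main__":
    # Hypothetical job name and image, for illustration only.
    job = build_job("tec-map-0001", "registry.example/tec-processor:latest", NEAR_REAL_TIME)
    print(json.dumps(job, indent=2))
```

The printed JSON could be passed to `kubectl apply -f -`, since Kubernetes accepts manifests in JSON as well as YAML. Keeping the time limit and resource requests in one profile object is just one possible design; it lets many small near real-time jobs and a few long-running jobs share the same submission code.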

Data Archiving

We develop and operate one of two sites of the German Satellite Data Archive (D-SDA). Our top priority here is to store remote sensing products from a wide range of missions reliably over the long term and to ensure access to them. Our infrastructure is designed to manage data volumes in the petabyte range.

  

Role        System
1st Copy    IBM TS-4500 tape library with LTO-9 tape drives
            FastLTA virtual tape library
2nd Copy    Oracle SL-8500 tape library with LTO-6 tape drives
Cache       NetApp E2824 storage system

Table: Overview of D-SDA systems at DLR site Neustrelitz.

By organizing annual conferences on current trends in data storage and hierarchical storage management, we promote professional exchange between users, manufacturers, developers, and service providers in the context of data management systems.

Data Analysis

Before, during and after satellite communication, a large amount of information – in addition to the payload data – is recorded at the Neustrelitz ground station. With the aim of optimizing and automating ground station processes, we have been researching methods for the mathematical analysis of this data. Based on our research, we develop and implement prototypes as well as operational services for the acquisition and processing of ground station data.

Together with our colleagues in the System Development Ground Stations and Software Systems working groups, we create the necessary foundations for this. This includes the following areas of responsibility:

  • identification and integration of data sources,
  • design and implementation of data models for ground stations,
  • planning and implementation of frameworks for ground station data management.

Data Calibration and Validation

We have been developing and operating the large-scale research facility DEMMIN as a measurement and calibration site for remote sensing data since 2004. Our tasks include the maintenance of the measurement infrastructure, the storage and provision of in-situ data, and the organization and implementation of measurement campaigns.

Among others, we are part of the TERENO and JECAM networks.

Figure: DEMMIN test site in Mecklenburg-Western Pomerania. Map, logo, and Landsat 8 (2014), RapidEye (2014), and Ikonos (2006) images of the area around the city of Demmin.

IT Management

To ensure the operation of the Neustrelitz ground station, we focus on planning, implementing, and monitoring the necessary fail-safe IT infrastructure. In addition, the following activities fall within our area of responsibility:

  • design, implementation, and operation of computing clusters (VMware vSphere / Proxmox VE),
  • network design and security,
  • automated resource provisioning (e.g. virtual machines),
  • design and management of communication interfaces,
  • backup strategy and implementation,
  • project support (e.g. IT security, feasibility analysis, resource estimation),
  • coordination of the department-wide IT infrastructure (workstations).

Project Management

As part of the department-wide project management, we initiate and support a large number of national and international projects (e.g. ESA, USGS).

These projects typically fall into the following areas:

  • data reception,
  • data processing,
  • data archiving,
  • data delivery,
  • construction of research data infrastructure.