Bookmark
Scientific activities / projects

Storage structures for large-scale, multi-dimensional data using modern hardware

Starting date

immediately

Duration of contract

3 years

Remuneration

according to the German TVöD 14

Type of employment

Full-time

"Cutting-edge research requires excellent minds – particularly more females – at all levels. Launch your mission with us and send in your application now!" Prof. Pascale Ehrenfreund - Chair of the DLR Executive Board

Your mission:

For the Institute of Data Science in Jena we are establishing a new research group on „Data Management Technologies“. In this group, we will develop novel methods and techniques for storing, managing, and processing large amounts of multi-dimensional data in distributed storage systems. The work will be conducted in close collaboration with the German Remote Sensing Data Center in Oberpfaffenhofen and external partners at the Ilmenau University of Technology and the Friedrich-Schiller-University Jena.

You will develop the foundations of database management systems of tomorrow in cooperation with our internal and external partners. You are responsible for the in-depth analysis of requirements for the storage and management of large volumes (in the PB range) of scientific, multi-dimensional data. Based on these requirements you design novel methods and techniques for scalable storage management of large scientific data sets and efficient data access paths and processing capabilities for several application scenarios. It is a challenging research task, which requires a detailed analysis of requirements and scientific workloads, detailed knowledge about modern IT infrastructures & hardware and platforms, and expert knowledge in large-scale data management. The to-be-developed methods have to be robust and scalable to be integrated into real-world platform environments. In addition it requires a regular, technical exchange with partners and stake holders on defining interfaces and efficient data access paths.

The emphasis of your activities will be on:

  • analysis of core requirements for the management of large, complex volumes of data
  • literature research in the area of large-scale data management for multi-dimensional data (raster data) with the goal to compare existing and develop new methods and techniques
  • analysis of current IT architectures and systems for their usage to storage very large volumes of multi-dimensional data
  • development of innovative concepts for the storage and access to large volumes of multi-dimensional data
  • implementation of concepts in efficient data structures, algorithms, and modules
  • development of efficient, scalable algorithms and data structures for storing and processing large data volumes
  • planning and selection of to-be-leveraged hardware (e.g., SSDs, NVM, FPGAs, ASICs)
  • setup of collaborations with selected hardware partners
  • integration and testing of modules in platform environments
  • scientific evaluation, documentation, and publication of results in scientific journals, workshops, or conferences
  • presentation of achieved results at national and international conferences and workshops
  • technical supervision of PhD students and PostDocs
  • reviewing of student theses (M.Sc./B.Sc)
  • onboarding of students into complex software developed within the group
  • contribution to the management, documentation, and maintenance of IT platforms
  • establishing new research fields by extensive exchange with other application domains

Your qualifications:

  • completed academic degree in computer science, (applied) mathematics, or physics (university diploma / master’s degree)
  • independent and autonomous work on complex research questions
  • deep knowledge of database implementation techniques, such as operators and tree data structures
  • excellent knowledge in performance-oriented programming (C/C++ or similar) by leveraging modern hardware
  • fluency in written and oral English
  • practical experience with the implementation of large software in complex computer architectures (e.g., Multi-core, distributed, NUMA) is an asset
  • scientific publications in the mentioned topic areas at national and international conferences/journals/Workshops are a plus
  • multiple years of experience in the management of large data volumes, specifically in query processing, indexing, transaction management, storage management is of advantage

Your benefits:

Look forward to a fulfilling job with an employer who appreciates your commitment and supports your personal and professional development. Our unique infrastructure offers you a working environment in which you have unparalled scope to develop your creative ideas and accomplish your professional objectives. Our human resources policy places great value on a healthy family and work-life-balance as well as equal opportunities for persons of all genders (m/f/non-binary). Individuals with disabilities will be given preferential consideration in the event their qualifications are equivalent to those of other candidates.

  • Apply online now
  • You can send this job advertisement via e-mail and complete your application on a personal computer or laptop.

    We need your digital application documents (PDF). The document upload function is not supported by all mobile devices. Please complete your application on a PC/laptop.

    Complete application on PC

Technical contact

Dr. Marcus Paradies
Institute of Data Science

Phone: +49 3641 30960-103

Send message

Vacancy 29485

HR department Berlin

Send message

DLR site Jena

DLR Institute of Data Science

To institute