About
The group actively researches and shapes the digitalization and opening up of science, with a core focus on data infrastructures and tools for interdisciplinary, data-intensive research in the era of generative AI.
Our interdisciplinary research group is uniquely situated across three key institutions:
- Technical University of Berlin (Research Data Infrastructure group)
- Fraunhofer FOKUS (Research Data Infrastructure group)
- Weizenbaum Institute (Digitalisation and Opening up of Science group)
Our researchers actively contribute their expertise to several high-profile, third-party funded projects. To this end, the team is significantly involved and deeply integrated into the National Research Data Infrastructure (NFDI), the Berlin University Alliance (BUA), and the European Open Science Cloud (EOSC).
News
The Weizenbaum Library, the open repository at the Weizenbaum Institute, has officially relaunched with a fresh design and enhanced functionality.
Designed as a powerful infrastructure for digitalization research, the repository provides free, sustainable access to scientific findings.
Following a successful development phase guided by the research group, the Weizenbaum Institute has now taken over full operation of the repository.
Explore the new repository here: Weizenbaum Library
2026-01-19
1 min read
Congratulations to Angelie Kraft, Judith Simon, and Sonja Schimmler for receiving an “Honorable Mention” at the 14th International Joint Conference on Natural Language Processing (IJCNLP-AACL 2025).
Their paper, “Social Bias in Popular Question-Answering Benchmarks,” investigates how social biases infiltrate AI benchmarks. Their findings emphasize that building transparent, diversity-sensitive datasets is essential for responsible AI development.
Find the paper here.
2025-12-20
1 min read
At the Open Science Conference in Hamburg, Sonja Schimmler delivered a keynote titled Data Infrastructures and Data Competencies as a Foundation for AI Projects.
This presentation examined the critical components that drive AI progress, with a focus on the need for large-scale, machine-interpretable data that adapts to the unique demands of different sectors. Key topics included transparency and reproducibility in AI projects, as well as the importance of making essential resources, publications, data, models, and code, available and interconnected.
2025-10-09
1 min read