Max Waidhas - b-it Center

AI models in the natural sciences: From explaining predictions to capturing causal relationships

Life Science Informatics & Data ScienceBy Max Waidhas April 7, 2025

New publication of b-it Professor Dr. Jürgen Bajorath in the journal “Cell Reports Physical Science” shows how science can benefit from AI and what scientist need to look out for.

Efficient Language Model Adaptation: Bridging the Gap with Limited Resources

NLP ColloquiumBy Max Waidhas March 25, 2025

Large language models (LLMs) have demonstrated remarkable capabilities, but their high computational costs and reliance on extensive labeled data limit their practical deployment in resource-constrained settings. This talk explores strategies for efficiently adapting and leveraging smaller, more deployable models while minimizing reliance on human annotations.

Dynamic Personalization from Cross-model Consistencies

NLP ColloquiumBy Max Waidhas March 18, 2025

Scaling up Language Models has led to increasingly advanced capabilities for those who can afford to train them. In order to enable community-tailored models for the rest of us, we will examine cross-model consistencies in how LMs acquire their linguistic knowledge-from fundamental syntax and semantics up to higher-level pragmatic features, such as culture. By identifying these consistencies across different models, we highlight opportunities for how they can enable dynamic personalization approaches that improve the accessibility of language technologies for underserved communities, in which collecting sufficient training data is physically impossible.

Context-Aware Retrieval Augmented Generation Framework

NLP ColloquiumBy Max Waidhas March 12, 2025

In this talk, / will present CARAG, a Context-Aware Retrieval Augmented Generation framework that improves Automated Fact Verification (AFV) by incorporating both local and global explanations. Unlike traditional factchecking methods that focus on isolated claims, CARAG leverages thematic embedding aggregation to verify claims in a broader contextual landscape. I will also introduce CARAG-u, an unsupervised extension that eliminates the need for predefined thematic annotations, dynamically deriving contextually relevant evidence clusters from unstructured data. CARAG-u maintains strong performance while increasing adaptability and scalability. Through benchmarks on the FactVer dataset, / will demonstrate how these frameworks enhance explainability and thematic coherence, advancing the role of Al in trustworthy, transparent fact verification.

The Altre and the Challenges of NLG Evaluation

NLP ColloquiumBy Max Waidhas February 5, 2025

In the first part of my talk, I will discuss the joys and challenges of my master’s research on generating the script of a full-length play using GPT-2. Namely, I will share some of the strategies we used to navigate around the limited context length of the model, getting the characters to have a consistent persona, and above everything else, making the play interesting to watch for the audience. In the second part, / will share my ongoing doctoral research on evaluating natural language generation. / will discuss our work on data contamination, present an overview of how NG is evaluated across different specific tasks, and share my challenges of evaluating the semantic accuracy of summarization at a scale when no reference is available.

Al Agents From Foundation to Application

NLP ColloquiumBy Max Waidhas January 24, 2025

In this lecture, we will journey through the core principles of Al agents, building a conceptual bridge from foundational theories to cutting-edge practical implementations. Attendees will gain insights into how autonomous agents operate, starting with basic Al agent architectures and evolving into sophisticated web automation systems. Highlighting our latest research with WebPilot, the lecture will showcase how integrating Monte Carlo Tree Search with a dual optimization strategy addresses the complexities of dynamic web tasks-mitigating vast action spaces and uncertainty through strategic exploration and adaptive decision-making.

How To Train A Multilingual Large Language Model?

NLP ColloquiumBy Max Waidhas January 9, 2025

The Teuken 7B model, a large language model for *European languages*, has recently made the news. If you’re interested in knowing how such models are trained, this week’s speaker is one of the lead scientists who’s done it.
As part of the Lamarr NLP monthly meetings, this week we have the pleasure to host Dr. Mehdi Ali from the Fraunhofer IAIS who will give a guest lecture on How To Train A Multilingual Large Language Model?.

Celebrating our Achievements in 2024: A Year of Innovation, Collaboration, and Growth

Data Science & Language TechnologiesBy Max Waidhas December 30, 2024

The past decade of AI was largely driven by one question: how to make large language models work at all. How to scale them, stabilize them, and push their capabilities far enough to be usable.

Reliable Evaluation of Interactive LLM Agents in a World of Apps and People: AppWorld

NLP ColloquiumBy Max Waidhas December 11, 2024

We envision a world where Al agents (assistants) are widely used for complex tasks in our digital and physical worlds and are broadly integrated into our society. To move towards such a future, we need an environment for a robust evaluation of agents’ capability, reliability, and
trustworthiness.

Large language models for drug discovery

Life Science Informatics & Data ScienceBy Max Waidhas October 24, 2024

New study conducted by Prof. Dr. Bajorath and Sanjana Srinivasan at b-it and the Lamarr-Institute at the University of Bonn show the potential of language models in finding new medications. The researchers have created a chemical language model comparable to ChatGPT to predict potential active ingredients with special properties. Following a training phase, the AI was able to exactly reproduce the chemical structures of compounds with known dual-target activity that may be particularly effective medications.