Scalable Trustworthy AI

Creating scalable and trustworthy AI with human guidance

Overview

AI is no longer a research curiosity. It is reshaping how we live and work. To fully exploit its benefits, we must address critical gaps in trustworthiness.

Current foundation models such as LLMs have critical trustworthiness problems: they hallucinate false information, struggle with continual learning, resist knowledge editing (making GDPR compliance impractical), leak private information embedded in their parameters, and require prohibitive compute for training and personalisation. These issues block the widespread adoption of AI and the productivity revolution it promises.

Our approach: Knowledge-Intelligence Separation. Just as the separation of code and data in the 1960s enabled the modern software industry, we believe this separation is the key to unlocking AI’s full potential. When knowledge is stored in interpretable, editable external modules while intelligence (reasoning, generalisation) remains in the model, we enable faster customisation, training data attribution by design, and knowledge editing and unlearning.
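The idea can be illustrated with a minimal toy sketch (our own illustrative example, not the lab's implementation): knowledge lives in an editable external store, while the "model" only reasons over whatever the store returns. Editing or unlearning a fact then amounts to mutating the store, with no retraining of the model.

```python
class KnowledgeStore:
    """External, interpretable, editable knowledge module (hypothetical toy)."""

    def __init__(self):
        self.facts = {}  # subject -> (fact, source), keeping sources for attribution

    def add(self, subject, fact, source):
        self.facts[subject] = (fact, source)

    def edit(self, subject, new_fact, source):
        self.facts[subject] = (new_fact, source)   # knowledge editing in place

    def unlearn(self, subject):
        self.facts.pop(subject, None)              # e.g. honouring a GDPR deletion request

    def retrieve(self, subject):
        return self.facts.get(subject)


class ReasoningModel:
    """Stands in for the 'intelligence' part: it holds no facts itself."""

    def answer(self, subject, store):
        hit = store.retrieve(subject)
        if hit is None:
            return "I don't know."                 # abstain instead of hallucinating
        fact, source = hit
        return f"{fact} (source: {source})"        # training data attribution by design


store = KnowledgeStore()
store.add("capital:France", "The capital of France is Paris.", "doc-17")
model = ReasoningModel()
print(model.answer("capital:France", store))

store.unlearn("capital:France")
print(model.answer("capital:France", store))       # the fact is gone after unlearning
```

Because the model never memorises facts in its parameters, each trustworthiness operation listed above reduces to a cheap store update rather than retraining.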

Our work spans a range of interconnected areas.

We are not alone in this effort; many research labs worldwide contribute to Trustworthy AI. What sets our group apart is a focus on working solutions that are widely applicable and can be deployed at scale, hence our name, Scalable Trustworthy AI. For impact at scale, we commit ourselves to a set of shared principles.

For prospective students: You might be interested in our internal curriculum and guidelines for a PhD program: Principles for a PhD Program.

Members

Seong Joon Oh

Associate Professor

Elisa Nguyen

PhD Student

Arnas Uselis

PhD Student

Sohyung Kim

PhD Student

Stefano Woerner

PhD Student

Yejin Kim

Research Intern

Yunjae Won

Collaborating PhD Student

Hoyeon Chang

Collaborating PhD Student

Ankit Sonthalia

PhD Student

Bryan Truong

PhD Student

Lennart Bramlage

Collaborating PhD Student

Jihyeok Jung

MSc Student

Seokwon Jung

MSc Student

Alumni

Elif Akata

PhD Student

Michael Kirchhof

Collaborating PhD Student

Evgenii Kortukov

MSc Student

Johannes Bertram

Research Assistant

Bora Kargi

MSc Student

Philipp Davydov

MSc Student

Luca Füger

MSc Student

Fabian Morelli

MSc Student

Publications

MASEval: Extending Multi-Agent Evaluation from Models to Systems

arXiv 2026

Half-Truths Break Similarity-Based Retrieval

arXiv 2026

Compositional Generalization Requires Linear, Orthogonal Representations in Vision Embedding Models

arXiv 2026

Dynamics Reveals Structure: Challenging the Linear Propagation Assumption

arXiv 2026

CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally

ICLR 2026

DISCO: Diversifying Sample Condensation for Efficient Model Evaluation

ICLR 2026

Dr.LLM: Dynamic Layer Routing for LLMs

ICLR 2026

Enhancing Multi-Image Understanding through Delimiter Token Scaling

ICLR 2026

SelfReflect: Can LLMs Communicate Their Internal Answer Distribution?

ICLR 2026

Privacy Collapse: Benign Fine-Tuning Can Break Contextual Privacy in Language Models

arXiv 2026

LLM generation novelty through the lens of semantic similarity

arXiv 2025

Diffusion Classifiers Understand Compositionality, but Conditions Apply

NeurIPS D&B 2025

On the Rankability of Visual Embeddings

NeurIPS 2025

OVS Meets Continual Learning: Towards Sustainable Open-Vocabulary Segmentation

NeurIPS 2025

Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers

EMNLP 2025

C-SEO Bench: Does Conversational SEO Work?

NeurIPS D&B 2025

Does Data Scaling Lead to Visual Compositional Generalization?

ICML 2025

Do Deep Neural Network Solutions Form a Star Domain?

ICLR 2025

Intermediate Layer Classifiers for OOD Generalization

ICLR 2025

Decoupled Finetuning for Domain Generalizable Semantic Segmentation

ICLR 2025

Are We Done with Object-Centric Learning?

SCSL @ ICLR 2025

DiCoTTA: Domain-invariant Learning for Continual Test-time Adaptation

arXiv 2025

Mitigating Shortcut Learning with Diffusion Counterfactuals and Diverse Ensembles

SCSL @ ICLR 2025

Playing repeated games with Large Language Models

Nature Human Behaviour 2025

Benchmarking Uncertainty Disentanglement: Specialized Uncertainties for Specialized Tasks

NeurIPS D&B (Spotlight) 2024

Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models

NAACL Findings 2025

Studying Large Language Model Behaviors Under Realistic Knowledge Conflicts

CoLM 2024

Towards User-Focused Research in Training Data Attribution for Human-Centered Explainable AI

arXiv 2024

Scalable Ensemble Diversification for OOD Generalization and Detection

arXiv 2024

Calibrating Large Language Models Using Their Generations Only

ACL 2024

Pretrained Visual Uncertainties

arXiv 2024

TRAP: Targeted Random Adversarial Prompt Honeypot for Black-Box Identification

ACL Findings 2024

A Bayesian Perspective On Training Data Attribution

NeurIPS 2023

Exploring Practitioner Perspectives On Training Data Attribution Explanations

NeurIPS XAI in Action Workshop 2023

ID and OOD Performance Are Sometimes Inversely Correlated on Real-world Datasets

NeurIPS 2023

URL: A Representation Learning Benchmark for Transferable Uncertainty Estimates

NeurIPS D&B 2023

Neglected Free Lunch -- Learning Image Classifiers Using Annotation Byproducts

ICCV 2023

Scratching Visual Transformer's Back with Uniform Attention

ICCV 2023

Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguous Inputs

ICML 2023

URL: A Representation Learning Benchmark for Transferable Uncertainty Estimates

UAI-EAI Best Student Paper 2023

ProPILE: Probing Privacy Leakage in Large Language Models

NeurIPS (Spotlight) 2023

ECCV Caption: Correcting False Negatives by Collecting Machine-and-Human-verified Image-Caption Associations for MS-COCO

ECCV 2022

Dataset Condensation via Efficient Synthetic-Data Parameterization

ICML 2022

Weakly Supervised Semantic Segmentation Using Out-of-Distribution Data

CVPR 2022

Which Shortcut Cues Will DNNs Choose? A Study from the Parameter-Space Perspective

ICLR 2022

Openings

PhD

Expectations

We expect PhD students to run their own first-author projects, with possible collaborations with both senior and junior members inside and outside the lab.

Application process

PhD application process

  1. Email stai.there@gmail.com with your CV and research statement attached
  2. Coffee chat with Seong Joon to figure out initial fit
  3. Half-day interview
    • Job talk: Present your prior work to the entire lab (30 minutes + discussion)
    • 1-on-1 interviews: Meet individually with 2 lab members (we will connect you via email to arrange times)
    • Interview with Seong Joon: Discuss research directions
  4. Offer
  5. Apply to the grad school with Seong Joon Oh’s supervision intent via KAIST Graduate Admissions

Timeline

Steps 1-4 must be completed at least one week before the group offer announcement dates below. Please reach out well in advance.

Group offer announcement (step 4)

  • International spring: 20 August
  • International autumn (early track): 20 November
  • International autumn (regular track): 20 February
  • Domestic spring: 20 June
  • Domestic autumn: 20 March

MSc

Expectations

We expect MSc students to run their own first-author projects, with possible collaborations with both senior and junior members inside and outside the lab.

Application process

MSc application process

  1. Email stai.there@gmail.com with your CV and research statement attached
  2. Coffee chat with Seong Joon to figure out initial fit
  3. Interview: 30 min + 30 min with Seong Joon
    • First half: Present your prior work (aim for 10 minutes, leaving 20 minutes for discussion)
    • Second half: Discuss future research ideas at the intersection of your expertise and our vision
  4. Offer
  5. Apply to the grad school with Seong Joon Oh’s supervision intent via KAIST Graduate Admissions

Timeline

Steps 1-4 must be completed at least one week before the group offer announcement dates below. Please reach out well in advance.

Group offer announcement (step 4)

  • International spring: 20 August
  • International autumn (early track): 20 November
  • International autumn (regular track): 20 February
  • Domestic spring: 20 June
  • Domestic autumn: 20 March

Internship

Expectations

We expect interns to participate in a predefined research agenda as a co-author, working closely with their PhD student host.

Supervision

Your day-to-day supervisor is the PhD student you apply to work with. They define the research direction, set milestones, and provide regular feedback. Joon is available for broader guidance but does not manage the internship on a daily basis. Choose your PhD student host carefully: your internship experience depends largely on this match.

Application process

Internship application process

  1. Send an email to the relevant PhD student (cc: stai.there@gmail.com) with your CV and research statement attached
  2. Coffee chat with the PhD student
  3. Interview: 30 min + 30 min with the PhD student
    • First half: Present your prior work (aim for 10 minutes, leaving 20 minutes for discussion)
    • Second half: Discuss future research ideas at the intersection of your expertise and our vision
  4. Offer