Research group logo
  • Home
  • Overview
  • Members
  • Alumni
  • Publications
  • Courses
  • Openings
  • Contact

Trustworthy Machine Learning

Spring Semester 2026, KAIST AI

Why does AI solve the Math Olympiad but fail to manage your calendar? General AI is not trustworthy in private settings because of three broken communication channels. Human to AI: underspecification (it doesn't know what I want). AI to Human: unexplainability and overconfidence (I don't know why it did that or if it's guessing). Environment: hostility (privacy leaks and security attacks). This course covers theoretical and technical background for these key topics in Trustworthy Machine Learning (TML). We conduct a critical review of classical and contemporary research papers and provide hands-on practicals.

1. Goal

  1. Students will be able to critically read, assess, and discuss research work in Trustworthy Machine Learning (TML).
  2. Students will gain the technical background to implement basic TML techniques in a deep learning framework.
  3. Students will be ready to conduct their own research in TML and make contributions to the research community.

2. Prerequisites

  • Familiarity with Python and PyTorch coding.
  • A pass grade from the Deep Learning Course (or equivalent).
  • Basic knowledge of machine learning concepts.
  • Basic maths: multivariate calculus, linear algebra, probability, statistics, and optimisation.

3. TML Book

Previous course materials are available as a book: https://trustworthyml.io/ (also on arXiv).

The book will be useful for the course. However, the course materials are updated yearly to stay aligned with the latest research, so it won’t cover new topics.

4. Schedule

#DateContentQuizProject
L1Mar 06Orientation (14:30 - 16:00)—Interest survey
L2Mar 13I. Human → AI: Generalisation via DataQuiz 0—
L3Mar 20Modularity & Composition—Team formation due
L4Mar 27II. AI → Human: Training Data Attribution IQuiz 1—
L5Apr 03Training Data Attribution II——
L6Apr 10Explainability——
L7Apr 17Uncertainty IQuiz 2—
L8Apr 24Proposal presentations—Proposal due
L9May 01Uncertainty II——
L10May 08III. Privacy & Security: Contextual PrivacyQuiz 3—
L11May 15Parametric Privacy——
L12May 22Security & JailbreakingQuiz 4—
L13May 29IV. The Future: Agents & Synthesis——
L14Jun 05Course Wrap-upQuiz 5—
L15Jun 12Project Co-working——
L16Jun 19Final Presentations—Final report due

5. Assessment

5.1 Grading

ComponentWeight
Quizzes 1-5 (6% each)30%
Proposal presentation10%
Proposal report10%
Final presentation25%
Final report25%
Total100%

Quiz 0 is a trial run and does not count towards the grade.

5.2 Quizzes

  • Format: Google Forms, accessible via QR code in class (link shared on Slack simultaneously).
  • Timing: Strict 10-minute window at the start of class.
  • Structure: 3 questions covering material from previous lectures. Attending lectures is essential.

5.3 Team projects

  • Team size: 3 students per team.
  • Formation:
    • Week 1: Interest survey.
    • Week 3: Teams finalised.
  • Deliverable: 4-page report + 10 min presentation.
  • Peer evaluation: Mandatory form at the end. Distribute 100 points among your team members based on contribution. Unequal splits will affect individual grades.

Example project topics

Coming soon.

6. Generative AI Policies

Students may use generative AI tools (e.g. LLMs, VLMs, image generators). However, you are solely responsible for all outputs you submit. We will apply heavy penalties for:

  • Hallucinated or factually incorrect outputs.
  • Unsound or fabricated citations.
  • Plagiarised materials.
  • AI slop (low-effort, generic AI-generated content).

Severe cases may be reported to the university for disciplinary action.

You must be ready to answer clarification requests from the lecturer or tutors at any point. Inability to explain your own work will be treated as evidence of academic misconduct.

We do not tolerate very similar creative work among class members. AI tends to produce similar outputs across sessions and model families. Diversify your answers, especially for creative work. Suspicion of copied work will be penalised.

7. Communication & Logistics

Lecturer: Seong Joon Oh

Tutors: Bryan Truong, Seokwon Jung

When: Fridays 13:00-16:00

Where: 양재산학캠퍼스 대강의실 (Hybrid)

Email: stai.there@gmail.com for submissions, questions, and feedback.

Slack: Email us your name and preferred email address to be added. Use it for questions, announcements, and finding team members.

© Seong Joon Oh, 2024 · Partially powered by the Academic theme for Hugo.