
Previous course materials are available as a book: https://trustworthyml.io/ (also on arXiv).
The book will be useful for the course. However, the course materials are updated yearly to stay aligned with the latest research, so it won’t cover new topics.
| # | Date | Content | Project |
|---|---|---|---|
| L1 | Mar 06 | Orientation (short session) · Video | — |
| L2 | Mar 13 | I. Human → AI: ML Foundations & Generalisation Primer · Video | — |
| L3 | Mar 20 | Underspecification & Cues · Video | Team formation due (23:59) |
| L4 | Mar 27 | LLM Communication & Modularity · Video | — |
| L5 | Apr 03 | II. AI → Human: Explanation & XAI · Video | — |
| L6 | Apr 10 | Attribution Methods · Video | — |
| L7 | Apr 17 | Training Data Attribution · Video | — |
| L8 | Apr 24 | Proposal Presentations (Midterm Week) · Video | Proposal report due (23:59); Proposal presentation |
| L9 | May 08 | Uncertainty I (Aleatoric) | — |
| L10 | May 15 | Uncertainty II (Epistemic) | — |
| L11 | May 22 | Uncertainty III (LLMs) | — |
| L12 | May 29 | III. Privacy & Security: Privacy & Data Protection | — |
| L13 | Jun 05 | Security & Adversarial Robustness | — |
| L14 | Jun 12 | IV. Final Presentations (Part 1) | Final report due (11 June, 23:59) |
| L15 | Jun 19 | Final Presentations (Part 2) | Peer eval due (23:59) |
| Component | Weight |
|---|---|
| Proposal presentation | 15% |
| Proposal report | 15% |
| Final presentation | 35% |
| Final report | 35% |
| Total | 100% |
Late submissions are not accepted. A missed deadline counts as a zero.
All criteria use a 5-point scale: 1 (Poor) / 2 (Below expectations) / 3 (Meets expectations) / 4 (Good) / 5 (Excellent). Criteria within each rubric are equally weighted.
Due L8 (Apr 24, 23:59). 1-2 pages, ICML 2026 format.
| Criterion | Description | 1 (Poor) | 5 (Excellent) |
|---|---|---|---|
| Problem definition | Is the research question stated clearly? Does the report specify what problem the project addresses and why it matters? | Problem is vague or absent. | Reader immediately understands the gap and the question. |
| Related work awareness | Does the team cite key references and position their project relative to existing work? A full literature review is not expected, but awareness of the landscape is. | No references or awareness of prior work. | Clear positioning with relevant citations. |
| Proposed approach | Is there a concrete plan? This includes model choice, data, metrics, and experimental design. | Approach is missing or hand-wavy. | Detailed, actionable plan. |
| Writing quality | Is the report well-structured, concise, and free of major errors? AI slop (generic, low-effort AI-generated text) will be penalised. | Disorganised or incomprehensible. | Polished writing. |
L8 (Apr 24). 5-minute lightning talk per team.
| Criterion | Description | 1 (Poor) | 5 (Excellent) |
|---|---|---|---|
| Clarity of problem and motivation | Can the audience understand what the project is about and why it matters within the first two minutes? | Audience is lost. | Immediately clear. |
| Plan communication | Does the team convey a credible plan? This includes approach, data, and expected outcomes. | No plan is communicated. | Plan is convincing and concrete. |
| Slide quality | Are slides readable, well-designed, and not overloaded? | Unreadable walls of text or irrelevant graphics. | Clean, effective visuals. |
| Time management | Does the team stay within the 5-minute limit and pace themselves well? | Severely over or under time. | Smooth pacing with a natural conclusion. |
Due L15 (Jun 12, 23:59). 4 pages excluding references, ICML 2026 format.
| Criterion | Description | 1 (Poor) | 5 (Excellent) |
|---|---|---|---|
| Problem formulation | Is the research question clearly stated and well-motivated? Compared to the proposal, this should now be refined and precise. | Question remains vague. | Crisp, well-scoped question. |
| Technical depth | Does the report demonstrate understanding of the methods used? Are technical choices justified? | Superficial or incorrect technical content. | Command of relevant techniques with principled decisions. |
| Experimental design and results | Are experiments well-designed with appropriate baselines? Are results presented clearly (tables, figures, error bars where applicable)? | Missing or poorly designed experiments. | Rigorous experiments with clear presentation. |
| Analysis and discussion | Does the team interpret their results, discuss limitations, and reflect on what worked and what did not? | No interpretation. | Thoughtful analysis that goes beyond “method X got accuracy Y”. |
| Writing quality | Is the report well-organised, clearly written, and properly formatted? Are figures and tables captioned and referenced? AI slop will be penalised. | Disorganised or hard to follow. | Publication-ready writing. |
L16 (Jun 19). 10 minutes per team + Q&A.
| Criterion | Description | 1 (Poor) | 5 (Excellent) |
|---|---|---|---|
| Clarity and structure | Is the presentation logically structured? Does the audience follow the narrative from problem to method to results to takeaway? | Incoherent structure. | Compelling, well-organised talk. |
| Technical communication | Can the team explain their technical approach at the right level of detail? | Audience cannot understand the method. | Complex ideas made accessible without oversimplification. |
| Results presentation | Are results communicated effectively? Are key findings highlighted with readable figures and tables? | Results are buried or absent. | Audience clearly sees what was achieved. |
| Slide quality | Are slides clean, readable, and well-designed? | Unreadable walls of text or irrelevant graphics. | Clean, effective visuals. |
| Time management | Does the team stay within the 10-minute limit and pace themselves well? | Severely over or under time. | Smooth pacing with a natural conclusion. |
#team-formation Slack channel to find team members. Teams finalised by L3 (Mar 20, 23:59).Topics are open; students choose direction and methods. Examples of the kind of project that fits the course:
Students may use generative AI tools (e.g. LLMs, VLMs, image generators). However, you are solely responsible for all outputs you submit. We will apply heavy penalties for:
Severe cases may be reported to the university for disciplinary action.
You must be ready to answer clarification requests from the lecturer or tutors at any point. Inability to explain your own work will be treated as evidence of academic misconduct.
We do not tolerate very similar creative work among class members. AI tends to produce similar outputs across sessions and model families. Diversify your answers, especially for creative work. Suspicion of copied work will be penalised.
Language: English
Lecturer: Seong Joon Oh
Tutors: Myungkyu Koo (jameskoo0503@kaist.ac.kr), Gyouk Chu (kyouwook@kaist.ac.kr), Kyuyoung Kim (kykim@kaist.ac.kr), Sangwon Jang (sangwon.jang@kaist.ac.kr)
When: Fridays 13:00-15:30 (1st session 13:00-14:10, break 14:10-14:20, 2nd session 14:20-15:30)
Where: Online (Zoom)
Email: stai.there@gmail.com for submissions, questions, and feedback.
Slack: Email us your name and preferred email address to be added. Use it for questions, announcements, and finding team members.