Schedule

This schedule is tentative and subject to change. Readings will be assigned research papers and supplementary material. Key deadlines are included below.

Week	Class	Assessments and tasks
1 (Sep 2, Sep 4)	Topic: Course introduction, Introduction to AI Agents & RL Slides: W1-L1-P1-Intro.pdf, W1-L1-P2-RL.pdf, W1-L2-RL.pdf Reading: Prerequisites, Sutton and Barto - Reinforcement Learning	Join the slack server, find potential project partners
2 (Sep 9, Sep 11)	Topic: Foundations of RLHF – DPO, GRPO Slides: W2-L1-RL.pdf, W2-L1-dpo.pdf, W2-L2-grpo.pdf Reading: DPO, GRPO	Sign up for projects
3 (Sep 16, Sep 18)	Topic: Bilevel optimization (PARL), Alignment Challenges – MaxMin-RLHF, Test-time Alignment (Transfer-Q, GenARM, Collab) Slides: W3-L1&2.pdf Reading: PARL, MaxMin-RLHF, Transfer-Q, GenARM, Collab	TBA
4 (Sep 23, Sep 25)	Topic: Reasoning Models – Chain-of-thought, MCTS, ThinkLite-VL Slides: W4-L1&2.pdf Reading: MCTS, ThinkLite-VL	Review Slides for September Assignment
5 (Sep 30, Oct 2)	Topic: TA-led Q&A; Self-Improvement – EnsemW2S, SoTA with Less, MORSE-500 Reading: EnsemW2S, SoTA with Less, MORSE-500	Complete September Assignment (Due Sep 30 11:59pm)
6 (Oct 7, Oct 9)	Topic: Agentive Workflows – Design patterns, communication graphs, role optimization Slides: W5-L1&2.pdf Reading: Web-Agent Vulnerability Analysis, AegisLLM, AdvBDGen, RLHFPoisoning	TBA
7 (Oct 14, Oct 16)	Topic: Overflow / Project check-in Slides: W6-L2.pdf Reading: Language Agents Tutorial: Reasoning, Memory, and Planning, WebDreamer	TBA
8 (Oct 21, Oct 23)	Topic: Web Agents – Architectures, capabilities, vulnerabilities Slides: W7-L1.pdf, W7-L2.pdf Reading: WebArena, VisualWebAgent, WorkArena++, WebLINX, AgentOccam, AgentSandbox, Language Agents Tutorial Application & Data Evaluation, Black Hat EU 2023, Black Hat USA 2025, AWS re:Invent 2024, Scale/BrowserART	Complete Midterm Report (Due Oct 15 11:59pm)
9 (Oct 28, Oct 30)	Topic: Code Agents – Code generation, debugging Slides: W8-CodeAgents_Lecture1.pdf Reading: SWE-bench, SWE-bench Verified, SWE-agent, OpenHands, AutoCodeRover, Self-Debug, Reflexion, DebugBench, DebugEval	Complete October Assignment (Due Oct 31 11:59pm)
10 (Nov 4, Nov 6)	Topic: Code Agents - Security Slides: W9-CodeAgents_Lecture2.pdf Reading: OWASP Top 10, CSET 2024, Do Users Write More Insecure Code with AI Assistants?, Veracode 2025, Apiiro 2025 risk study	TBA
11 (Nov 11, Nov 13)	Topic: World Models – for web, robotics, and simulation-based agents Slides: W10-FoundationModel4Robotics.pdf Reading: TACO, Premium-TACO, FLARE, TraceVLA, IVE	TBA
12 (Nov 18, Nov 20)	Topic: AI-Generated Content Detection – watermarking, detectors Reading: WAVES	TBA
13 (Nov 25, Nov 27)	Topic: No Class - Thanksgiving Holiday Reading: TBA	TBA
14 (Dec 2, Dec 4)	Topic: Final Presentations Reading: TBA	TBA
15 (Dec 9, Dec 11, Dec 12)	Topic: Final Presentations Reading: TBA	TBA