Schedule

  • Event
    Date
    Description
    Course Material
  • Lecture
    01/26/2026
    Monday
    January 26, 2026

    Introduction to Senior Project I.

  • Lecture
    01/28/2026
    Wednesday
    January 28, 2026

    Overview of course expectations, deliverables, and grading criteria.

  • Lecture
    02/02/2026
    Monday
    February 2, 2026

    Suggested Readings:

    Overview of research methods, how to read research papers, and an introduction to AI evaluation metrics (Precision, Recall, F-Measure).

  • Lecture
    02/04/2026
    Wednesday
    February 4, 2026

    Introduction to Single-Agent vs. Multi-Agent systems, GridWorld examples, Markov Decision Processes (MDP), and the curse of dimensionality in multi-agent environments.

  • Lecture
    02/09/2026
    Monday
    February 9, 2026

    Overview of Generalizability in AI Agents versus simple memorization. The lecture covers Reward Modeling strategies, including the trade-offs between Dense (hackable) and Sparse (slow) rewards, as well as advanced techniques like Curriculum Learning and Inverse Reinforcement Learning (IRL).

  • Lecture
    02/11/2026
    Wednesday
    February 11, 2026

    Overview of the hierarchy of decision-making models, moving from simple MDPs to Partially Observable Stochastic Games (POSG). The lecture also explores the necessity of distributed processing for AI agents, covering Data versus Model Parallelism and the differences between Synchronous and Asynchronous execution.

  • Lecture
    02/18/2026
    Wednesday
    February 18, 2026

    Exploration of the physical limits of AI agents regarding latency, bandwidth, and throughput. The lecture contrasts Cloud versus Edge architectures and discusses engineering solutions for system constraints, such as model compression, quantization, and activation offloading, particularly in latency-critical scenarios like autonomous braking.

  • Lecture
    02/23/2026
    Monday
    February 23, 2026

    The Transformer architecture and its applications in Deep Reinforcement Learning (RL) and Multi-Agent Deep Reinforcement Learning (MARL). The lecture covers training and finetuning AI models, the Bellman Equation, Q Learning, and Deep Q Networks, providing insights into their roles in decision-making processes.

  • Lecture
    02/25/2026
    Wednesday
    February 25, 2026

    Walkthroughs on DQL, Experience Buffers, and comparison of Q-Learning with REINFORCE.

  • Lecture
    03/02/2026
    Monday
    March 2, 2026

    Discussion on Sample Complexity, Generalizability, Independent Q Learning (IQL), Actor-Critic methods, Multi-Agent Deep Deterministic Policy Gradient (MADDPG), and Imitation Learning methods (ehavioral Cloning, DAgger) in the context of Reinforcement Learning (RL) and Multi-Agent Deep Reinforcement Learning (MARL).

  • Lecture
    03/04/2026
    Wednesday
    March 4, 2026

    Decision Making Algorithms for AI Agents like AlphaGo (RL + MCTS), TRPO, PPO, DPO, GRPO, StrateGo, and Intelligent AI Delegation

  • Lecture
    03/09/2026
    Monday
    March 9, 2026

    Discussion on Robustness, Consistency and Risk Mitigation in AI Agents, Offline Reinforcement Learning, and Inverse Reinforcement Learning (IRL) in the context of Reinforcement Learning (RL) and Multi-Agent Deep Reinforcement Learning (MARL).

  • Lecture
    03/11/2026
    Wednesday
    March 11, 2026

    Discussion on Model Predictive Control (MPC): Real-time constraints, mistake correction and daily control tasks by AI Agents in the context of Reinforcement Learning (RL) and Multi-Agent Deep Reinforcement Learning (MARL).

  • Lecture
    03/16/2026
    Monday
    March 16, 2026

    Discussion on Game Theory for AI Agents in the context of Reinforcement Learning (RL) and Multi-Agent Deep Reinforcement Learning (MARL).

  • Lecture
    03/18/2026
    Wednesday
    March 18, 2026

    Discussion on Evolutionary Learning, Mean-Field Learning, Self-play for self-improvement and Continual Learning: Communication paradigms for AI Agents to Adapt in Real-Time Decentralized AI: When and How to Scale AI Agents in the context of Reinforcement Learning (RL) and Multi-Agent Deep Reinforcement Learning (MARL).

  • Lecture
    03/23/2026
    Monday
    March 23, 2026

    Discussion on implementing Multi-Agent Deep Reinforcement Learning algorithms.

  • Lecture
    03/25/2026
    Wednesday
    March 25, 2026

    Discussion on optimizing agentic systems for efficiency in terms of energy, network bandwidth, memory, chip utilization, and power.

  • Lecture
    03/30/2026
    Monday
    March 30, 2026

    Discussion on evaluating multimodal agentic systems post-training, including inference, qualitative and quantitative decision-making assessment, with a focus on trustworthy and explainable agents.

  • Lecture
    04/13/2026
    Monday
    April 13, 2026

    Exploration of the role of AI agents in the context of 6G and 7G networks, discussing the opportunities they present as well as the challenges that need to be addressed for successful integration and deployment.