Materials - CSc 59866-E: Senior Project I - AI Agents for Decision Making in the Real World / Spring 2026

Students should complete the following finetuning tutorials by Feb 11:

Torch-tune: PyTorch Native Finetuning - https://pytorch.org/blog/torchtune-fine-tune-llms/
Unsloth AI: Faster/Memory-Efficient Finetuning - https://docs.unsloth.ai/get-started/fine-tuning-llms-guide

Multi-Agent Reinforcement Learning Book by Stefano Albrecht, 2024 https://www.marl-book.com/download/marl-book.pdf
Reinforcement Learning by Dimitri P. Bertsekas, 2025 and 2019 (including video lectures) https://web.mit.edu/dimitrib/www/RLbook.html + https://web.mit.edu/dimitrib/www/RLbook.html
Rollout, Policy Iteration, and Distributed Reinforcement Learning by Dimitri P. Bertsekas, 2020 https://web.mit.edu/dimitrib/www/dp_rollout_book.html + https://web.mit.edu/dimitrib/www/Rollout_Complete%20Book.pdf
Parallel and Distributed Computation: Numerical Methods by Dimitri P. Bertsekas and John N. Tsitsiklis, 2018 https://web.mit.edu/dimitrib/www/pdc.html http://www.athenasc.com/pdcbook.pdf
Deep Learning by Ian Goodfellow, 2012 https://www.deeplearningbook.org/
An Introduction to Multi-Agent Systems by Michael Wooldridge, 2001 https://uranos.ch/research/references/Wooldridge_2001/TLTK.pdf
6G Flagship Book, 2023 https://www.6gflagship.com/news/unveiling-the-digital-horizon-new-book-on-5g-6g-and-future-digital-services-released/
Selected open-source research papers will be provided by the Professor

Probability: Introduction to Probability for Computing by Mor Harchol Balter, 2024 https://www.cs.cmu.edu/~harchol/Probability/chapters/HarcholBalterWholeBook.pdf
Mathematics for Machine Learning by Marc Peter Deisenroth, A. Aldo Faisal, Cheng Soon Ong, 2020 https://mml-book.github.io/book/mml-book.pdf
Convex Optimization for Statistics and Machine Learning, Volume 1: Analysis by Ryan Tibshirani 2025 https://github.com/ryantibs/convexopt-book1/blob/main/book1.pdf
An Introduction to Statistical Learning with Python by Garreth James, Daniela Witten, Trevor Hastie, Robert Tibshirani, 2023 https://www.statlearning.com/