
Christopher Agostino
Founder and Research Scientist at NPC Worldwide
This course teaches you to build adaptive AI agents. You'll learn to transform static, "frozen" LLMs into dynamic systems that can learn, reason, and act. We cover two key fine-tuning methods: Supervised Fine-Tuning (SFT) for reliable, structured outputs and Parameter-Efficient Fine-Tuning (PEFT) for building specialized models efficiently. You'll design agent "brains" using ReAct reasoning loops and learn to generate training data using a "Teacher-Student" workflow. Finally, you'll tackle advanced AI alignment, learning to prevent "specification gaming" and use Direct Preference Optimization (DPO) to teach agents complex human preferences.

Subscription · Monthly
5 skills
5 prerequisites
Prior to enrolling, you should have the following knowledge:
You will also need to be able to communicate fluently and professionally in written and spoken English.
1 instructor
Unlike typical professors, our instructors come from Fortune 500 and Global 2000 companies and have demonstrated leadership and expertise in their professions:

Christopher Agostino
Founder and Research Scientist at NPC Worldwide
Go beyond prompting. Learn to train and align LLM agents using reinforcement learning and DPO for real-world, multi-step tasks.

Subscription · Monthly