
The AI Concepts Podcast


By: Sheetal ’Shay’ Dhar
Listen for free

About this title

The AI Concepts Podcast is my attempt to turn the complex world of artificial intelligence into bite-sized, easy-to-digest episodes. Imagine a space where you can pick any AI topic and immediately grasp it, like flipping through an audio lexicon - but even better! Using vivid analogies and storytelling, I guide you through intricate ideas, helping you create mental images that stick. Whether you’re a tech enthusiast, business leader, technologist, or just curious, my episodes bridge the gap between cutting-edge AI and everyday understanding. Dive in and let your imagination bring these concepts to life! Copyright 2024 All rights reserved.
  • Module 3: Reinforcement Learning from Human Feedback
    Feb 20 2026

    This episode addresses how Reinforcement Learning from Human Feedback (RLHF) adds the final layer of alignment after supervised fine-tuning, shifting the training signal from “right vs wrong” to “better vs worse.” We explore how preference rankings create a reward signal (reward models plus PPO) and the newer shortcut (DPO) that learns preferences directly, then connect RLHF to safety through the Helpful, Honest, Harmless goal. We also unpack the “alignment tax,” the trade-off between being safe and being genuinely useful, and close by setting up the next module on running models at scale, starting with GPU memory limits, plus a personal reflection on starting later without being behind.

    10 min
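The summary above contrasts the reward-model-plus-PPO pipeline with the DPO shortcut that learns preferences directly. As a rough illustration of that shortcut, here is a minimal sketch of the per-pair DPO loss; the function name and the numbers are illustrative, not from the episode:

```python
import math

def dpo_loss(logp_chosen, logp_rejected,
             ref_logp_chosen, ref_logp_rejected, beta=0.1):
    """Direct Preference Optimization loss for one preference pair.

    The loss falls when the policy raises the log-probability of the
    preferred response (relative to a frozen reference model) and
    lowers it for the rejected one -- no separate reward model or PPO
    loop is needed.
    """
    margin = ((logp_chosen - ref_logp_chosen)
              - (logp_rejected - ref_logp_rejected))
    # Negative log-sigmoid of the scaled margin.
    return -math.log(1.0 / (1.0 + math.exp(-beta * margin)))

# A policy that clearly prefers the chosen answer gets a low loss,
# one that prefers the rejected answer gets a high loss:
low = dpo_loss(-5.0, -20.0, -10.0, -10.0)
high = dpo_loss(-20.0, -5.0, -10.0, -10.0)
```

With a zero margin the loss is exactly log 2, the "no preference yet" starting point; training pushes it toward zero on each ranked pair.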
  • Module 3: Supervised Fine Tuning
    Feb 20 2026

This episode addresses how we turn a raw base model into something that behaves like a real assistant using Supervised Fine-Tuning (SFT). We explore instruction-response training pairs, why SFT makes behaviors consistent beyond prompting, and the practical engineering choices that keep fine-tuning efficient and safe, including low learning rates and LoRA-style adapters. By the end, you will understand what SFT solves, and why the next layer (RLHF) is needed to add human preference and nuance.

    9 min
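The summary above mentions LoRA-style adapters as a way to keep fine-tuning efficient. A quick back-of-the-envelope sketch of why, under the standard LoRA setup (a low-rank pair of matrices added beside a frozen weight); the dimensions below are illustrative, not from the episode:

```python
def lora_param_count(d_in, d_out, rank):
    """Trainable parameters for a LoRA adapter on a d_in x d_out weight:
    a down-projection A (d_in x rank) plus an up-projection B (rank x d_out)
    are trained, instead of updating all d_in * d_out original weights."""
    return d_in * rank + rank * d_out

# Example: one 4096 x 4096 attention weight matrix.
full = 4096 * 4096                       # parameters full fine-tuning would update
lora = lora_param_count(4096, 4096, 16)  # a rank-16 adapter beside it
# The adapter trains well under 1% of the original matrix's parameters.
```

Because the base weights stay frozen, the same small adapter trick also limits how far fine-tuning can drift from the pretrained model, which is part of the "safe" in "efficient and safe."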
  • Module 3: Context Windows & Attention Complexity
    Jan 26 2026

This episode addresses the physical and mathematical limits of a model’s “short-term memory.” We explore the context window and the engineering trade-offs required to process long documents. You will learn about the quadratic cost of attention, where doubling the input length quadruples the computational work, and why this creates a massive bottleneck for long-form reasoning. We also introduce architectural tricks like Flash Attention that allow us to push these limits further. By the end, you will understand why context is the most expensive real estate in the generative stack.

    11 min
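The "doubling the input quadruples the work" claim from the last summary is just the arithmetic of the attention score matrix, which a two-line sketch makes concrete (the sequence lengths are illustrative):

```python
def attention_score_count(seq_len):
    """Number of pairwise query-key scores self-attention computes:
    every token attends to every token, so the score matrix has
    seq_len * seq_len entries."""
    return seq_len * seq_len

# Doubling the context quadruples the work, as the episode notes:
assert attention_score_count(8192) == 4 * attention_score_count(4096)
```

Techniques like Flash Attention do not change this quadratic count; they reorganize the computation to avoid materializing the full score matrix in slow GPU memory, which is what makes longer contexts practical.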
No reviews yet