[YouTube Lecture Summary] Andrej Karpathy - Deep Dive into LLMs like ChatGPT

Introduction

Pre-Training

Step 1: Download and preprocess the internet

Step 2: Tokenization

Step 3: Neural network training

Step 4: Inference

Base model

Post-Training: Supervised Finetuning

Conversations

Hallucinations

Knowledge of Self

Models need tokens to think

Things the model cannot do well

Post-Training: Reinforcement Learning

Reinforcement learning

DeepSeek-R1

AlphaGo

Reinforcement learning from human feedback (RLHF)

Preview of things to come

Keeping track of LLMs

Where to find LLMs

Keeping track of LLMs

🔍 How to understand LLM trends 🧠✨

Introducing a method to effectively track the latest large language model (LLM) information


1️⃣ El Marina: LLM Leaderboard 🏆

📌 El Marina ranks the best LLM models and evaluates them by comparing their responses directly with humans.

🔹 Ranking System :

  • Anonymous human evaluators compare the models' responses and choose the better model 👀

  • This provides an objective ranking .

🔹 Current top models (as of 2025)
🥇 Google Gemini
🥈 OpenAI GPT
🥉 DeepSeek (MIT Open License 🎉)
👉 DeepSeek is gaining attention as a powerful open-weight model that is free to use !

⚠️ In recent months, there have been concerns raised about possible ranking manipulation. It is important to use it in real life and check its performance for yourself!


2️⃣ AI Newsletter "AI News" 📩

🔍 AI News

  • AI-related newsletter run by Swix & Team

  • New information almost every day

  • Some are auto-generated by LLM , some are curated by humans

📌 Follow AI News so you don't miss important news! 👍


3️⃣ Use Twitter(X) 📢

🔥 The place where AI experts are most active is X (Twitter) !

  • Get the latest AI news & analysis in real time

  • Recommended to follow trusted AI researchers and experts