Introduction
Pre-Training
Step 1: Download and preprocess the internet
Step 2: Tokenization
Step 3: Neural network training
Step 4: Inference
Base model
Post-Training: Supervised Finetuning
Conversations
Hallucinations
Knowledge of Self
Models need tokens to think
Things the model cannot do well
Post-Training: Reinforcement Learning
Reinforcement learning
DeepSeek-R1
AlphaGo
Reinforcement learning from human feedback (RLHF)
Preview of things to come
Keeping track of LLMs
Where to find LLMs
Introducing a method to effectively track the latest large language model (LLM) information
📌 El Marina ranks the best LLM models and evaluates them by comparing their responses directly with humans.
🔹 Ranking System :
Anonymous human evaluators compare the models' responses and choose the better model 👀
This provides an objective ranking .
🔹 Current top models (as of 2025)
🥇 Google Gemini
🥈 OpenAI GPT
🥉 DeepSeek (MIT Open License 🎉)
👉 DeepSeek is gaining attention as a powerful open-weight model that is free to use !
⚠️ In recent months, there have been concerns raised about possible ranking manipulation. It is important to use it in real life and check its performance for yourself!
🔍 AI News
AI-related newsletter run by Swix & Team
New information almost every day
Some are auto-generated by LLM , some are curated by humans
📌 Follow AI News so you don't miss important news! 👍
🔥 The place where AI experts are most active is X (Twitter) !
Get the latest AI news & analysis in real time
Recommended to follow trusted AI researchers and experts