Subscribe
Sign in
Home
Notes
Chat
Support
LLMs From Scratch Book
Reasoning From Scratch Book
Archive
About
Latest
Top
Discussions
From DeepSeek V3 to V3.2: Architecture, Sparse Attention, and RL Updates
Understanding How DeepSeek's Flagship Open-Weight Models Evolved
Dec 3
•
Sebastian Raschka, PhD
176
7
23
November 2025
Beyond Standard LLMs
Linear Attention Hybrids, Text Diffusion, Code World Models, and Small Recursive Transformers
Nov 4
•
Sebastian Raschka, PhD
327
23
34
October 2025
Understanding the 4 Main Approaches to LLM Evaluation (From Scratch)
Multiple-Choice Benchmarks, Verifiers, Leaderboards, and LLM Judges with Code Examples
Oct 5
•
Sebastian Raschka, PhD
336
25
32
September 2025
Understanding and Implementing Qwen3 From Scratch
A Detailed Look at One of the Leading Open-Source LLMs
Sep 6
•
Sebastian Raschka, PhD
105
4
10
August 2025
From GPT-2 to gpt-oss: Analyzing the Architectural Advances
And How They Stack Up Against Qwen3
Aug 9
•
Sebastian Raschka, PhD
610
46
54
July 2025
The Big LLM Architecture Comparison
From DeepSeek-V3 to Kimi K2: A Look At Modern LLM Architecture Design
Jul 19
•
Sebastian Raschka, PhD
1,465
73
132
LLM Research Papers: The 2025 List (January to June)
A topic-organized collection of 200+ LLM research papers from 2025
Jul 1
•
Sebastian Raschka, PhD
86
5
9
June 2025
Understanding and Coding the KV Cache in LLMs from Scratch
KV caches are one of the most critical techniques for efficient inference in LLMs in production.
Jun 17
•
Sebastian Raschka, PhD
420
35
33
May 2025
Coding LLMs from the Ground Up: A Complete Course
Why build LLMs from scratch? It's probably the best and most efficient way to learn how LLMs really work. Plus, many readers have told me they had a lot…
May 10
•
Sebastian Raschka, PhD
241
4
18
April 2025
The State of Reinforcement Learning for LLM Reasoning
Understanding GRPO and New Insights from Reasoning Model Papers
Apr 19
•
Sebastian Raschka, PhD
453
31
40
March 2025
First Look at Reasoning From Scratch: Chapter 1
Welcome to the next stage of large language models (LLMs): reasoning. LLMs have transformed how we process and generate text, but their success has been…
Mar 29
•
Sebastian Raschka, PhD
61
15
8
The State of LLM Reasoning Model Inference
Inference-Time Compute Scaling Methods to Improve Reasoning Models
Mar 8
•
Sebastian Raschka, PhD
400
10
32
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts