The Big LLM Architecture Comparison
From DeepSeek-V3 to Kimi K2: A Look At Modern LLM Architecture Design
Jul 19 • Sebastian Raschka, PhD
Most Popular
Understanding Reasoning LLMs
Feb 5 • Sebastian Raschka, PhD
Understanding and Coding Self-Attention, Multi-Head Attention, Causal-Attention, and Cross-Attention in LLMs
Jan 14, 2024
Understanding Large Language Models
Apr 16, 2023 • Sebastian Raschka, PhD
From GPT-2 to gpt-oss: Analyzing the Architectural Advances
Aug 9 • Sebastian Raschka, PhD
Recent posts
From DeepSeek V3 to V3.2: Architecture, Sparse Attention, and RL Updates
Understanding How DeepSeek's Flagship Open-Weight Models Evolved
Dec 3 • Sebastian Raschka, PhD
Beyond Standard LLMs
Linear Attention Hybrids, Text Diffusion, Code World Models, and Small Recursive Transformers
Nov 4 • Sebastian Raschka, PhD
Understanding the 4 Main Approaches to LLM Evaluation (From Scratch)
Multiple-Choice Benchmarks, Verifiers, Leaderboards, and LLM Judges with Code Examples
Oct 5 • Sebastian Raschka, PhD
Understanding and Implementing Qwen3 From Scratch
A Detailed Look at One of the Leading Open-Source LLMs
Sep 6 • Sebastian Raschka, PhD
From GPT-2 to gpt-oss: Analyzing the Architectural Advances
And How They Stack Up Against Qwen3
Aug 9 • Sebastian Raschka, PhD
LLM Research Papers: The 2025 List (January to June)
A topic-organized collection of 200+ LLM research papers from 2025
Jul 1 • Sebastian Raschka, PhD
Understanding and Coding the KV Cache in LLMs from Scratch
KV caches are one of the most critical techniques for efficient inference in LLMs in production.
Jun 17 • Sebastian Raschka, PhD
Coding LLMs from the Ground Up: A Complete Course
Why build LLMs from scratch? It's probably the best and most efficient way to learn how LLMs really work. Plus, many readers have told me they had a lot…
May 10 • Sebastian Raschka, PhD
The State of Reinforcement Learning for LLM Reasoning
Understanding GRPO and New Insights from Reasoning Model Papers
Apr 19 • Sebastian Raschka, PhD
First Look at Reasoning From Scratch: Chapter 1
Welcome to the next stage of large language models (LLMs): reasoning. LLMs have transformed how we process and generate text, but their success has been…
Mar 29 • Sebastian Raschka, PhD
The State of LLM Reasoning Model Inference
Inference-Time Compute Scaling Methods to Improve Reasoning Models
Mar 8 • Sebastian Raschka, PhD
Understanding Reasoning LLMs
Methods and Strategies for Building and Refining Reasoning Models
Feb 5 • Sebastian Raschka, PhD