Subscribe
Sign in
Home
Notes
LLM Gallery
Support
LLMs From Scratch Book
Reasoning From Scratch Book
Archive
About
Latest
Top
Discussions
The Big LLM Architecture Comparison
From DeepSeek-V3 to Kimi K2: A Look At Modern LLM Architecture Design
Jul 19, 2025
•
Sebastian Raschka, PhD
1,964
96
173
Understanding Reasoning LLMs
Methods and Strategies for Building and Refining Reasoning Models
Feb 5, 2025
•
Sebastian Raschka, PhD
1,321
46
124
Understanding and Coding Self-Attention, Multi-Head Attention, Causal-Attention, and Cross-Attention in LLMs
This article will teach you about self-attention mechanisms used in transformer architectures and large language models (LLMs) such as GPT-4 and Llama.
Jan 14, 2024
505
41
20
Components of A Coding Agent
How coding agents use tools, memory, and repo context to make LLMs work better in practice
Apr 4
•
Sebastian Raschka, PhD
863
63
88
Understanding Large Language Models
A Cross-Section of the Most Relevant Literature To Get Up to Speed
Apr 16, 2023
•
Sebastian Raschka, PhD
963
52
51
From GPT-2 to gpt-oss: Analyzing the Architectural Advances
And How They Stack Up Against Qwen3
Aug 9, 2025
•
Sebastian Raschka, PhD
629
47
55
Understanding Multimodal LLMs
An introduction to the main techniques and latest models
Nov 3, 2024
•
Sebastian Raschka, PhD
656
58
41
The State Of LLMs 2025: Progress, Problems, and Predictions
A 2025 review of large language models, from DeepSeek R1 and RLVR to inference-time scaling, benchmarks, architectures, and predictions for 2026.
Dec 30, 2025
•
Sebastian Raschka, PhD
525
38
55
The State of Reinforcement Learning for LLM Reasoning
Understanding GRPO and New Insights from Reasoning Model Papers
Apr 19, 2025
•
Sebastian Raschka, PhD
515
35
40
Coding LLMs from the Ground Up: A Complete Course
Why build LLMs from scratch? It's probably the best and most efficient way to learn how LLMs really work. Plus, many readers have told me they had a lot…
May 10, 2025
•
Sebastian Raschka, PhD
259
4
18
Practical Tips for Finetuning LLMs Using LoRA (Low-Rank Adaptation)
Things I Learned From Hundreds of Experiments
Nov 19, 2023
•
Sebastian Raschka, PhD
321
51
21
Building LLMs from the Ground Up: A 3-hour Coding Workshop
If your weekend plans include catching up on AI developments and understanding Large Language Models (LLMs), I've prepared a 1-hour presentation on the…
Aug 31, 2024
•
Sebastian Raschka, PhD
445
16
28
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts