Subscribe
Sign in
Home
Notes
Chat
LLM Gallery
Support
LLMs From Scratch Book
Reasoning From Scratch Book
Archive
About
The Big LLM Architecture Comparison
From DeepSeek-V3 to Kimi K2: A Look At Modern LLM Architecture Design
Jul 19, 2025
•
Sebastian Raschka, PhD
1,968
97
174
Most Popular
View all
Understanding Reasoning LLMs
Feb 5, 2025
•
Sebastian Raschka, PhD
1,322
47
124
Understanding and Coding Self-Attention, Multi-Head Attention, Causal-Attention, and Cross-Attention in LLMs
Jan 14, 2024
508
41
20
Components of A Coding Agent
Apr 4
•
Sebastian Raschka, PhD
872
63
89
Latest
Top
Discussions
LLM Research Papers: The 2026 List (January to May)
A curated roundup of notable LLM research papers that came out this year
Jun 6
•
Sebastian Raschka, PhD
43
3
10
Recent Developments in LLM Architectures: KV Sharing, mHC, and Compressed Attention
From Gemma 4 to DeepSeek V4, How New Open-Weight LLMs Are Reducing Long-Context Costs
May 16
•
Sebastian Raschka, PhD
323
18
32
My Workflow for Understanding LLM Architectures
A learning-oriented workflow for understanding new open-weight model releases
Apr 18
•
Sebastian Raschka, PhD
69
4
5
Components of A Coding Agent
How coding agents use tools, memory, and repo context to make LLMs work better in practice
Apr 4
•
Sebastian Raschka, PhD
872
63
89
A Visual Guide to Attention Variants in Modern LLMs
From MHA and GQA to MLA, sparse attention, and hybrid architectures
Mar 22
•
Sebastian Raschka, PhD
411
14
34
A Dream of Spring for Open-Weight LLMs: 10 Architectures from Jan-Feb 2026
A Round Up And Comparison of 10 Open-Weight LLM Releases in Spring 2026
Feb 25
•
Sebastian Raschka, PhD
213
13
20
Categories of Inference-Time Scaling for Improved LLM Reasoning
And an Overview of Recent Inference-Scaling Papers
Jan 24
•
Sebastian Raschka, PhD
45
3
See all
Ahead of AI
Ahead of AI focuses on machine learning and AI research and is read by more than 150,000 researchers and practitioners who want to stay ahead in a rapidly evolving field.
Subscribe
Ahead of AI
Subscribe
About
Archive
Recommendations
Sitemap
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts