Understanding Reasoning LLMs
Methods and Strategies for Building and Refining Reasoning Models
Feb 5 • Sebastian Raschka, PhD
962
36
Understanding and Coding Self-Attention, Multi-Head Attention, Causal-Attention, and Cross-Attention in LLMs
This article will teach you about self-attention mechanisms used in transformer architectures and large language models (LLMs) such as GPT-4 and Llama.
Jan 14, 2024
360
41
Understanding Large Language Models
A Cross-Section of the Most Relevant Literature To Get Up to Speed
Apr 16, 2023 • Sebastian Raschka, PhD
899
54
Building LLMs from the Ground Up: A 3-hour Coding Workshop
If your weekend plans include catching up on AI developments and understanding Large Language Models (LLMs), I've prepared a 1-hour presentation on the…
Aug 31, 2024 • Sebastian Raschka, PhD
427
16
Practical Tips for Finetuning LLMs Using LoRA (Low-Rank Adaptation)
Things I Learned From Hundreds of Experiments
Nov 19, 2023 • Sebastian Raschka, PhD
277
48
Coding LLMs from the Ground Up: A Complete Course
Why build LLMs from scratch? It's probably the best and most efficient way to learn how LLMs really work. Plus, many readers have told me they had a lot…
May 10 • Sebastian Raschka, PhD
209
4
Understanding Multimodal LLMs
An introduction to the main techniques and latest models
Nov 3, 2024 • Sebastian Raschka, PhD
471
51
The State of Reinforcement Learning for LLM Reasoning
Understanding GRPO and New Insights from Reasoning Model Papers
Apr 19 • Sebastian Raschka, PhD
389
30
The State of LLM Reasoning Model Inference
Inference-Time Compute Scaling Methods to Improve Reasoning Models
Mar 8 • Sebastian Raschka, PhD
374
9
Finetuning Large Language Models
An introduction to the core ideas and approaches
Apr 22, 2023 • Sebastian Raschka, PhD
311
25
Understanding Encoder And Decoder LLMs
Several people asked me to dive a bit deeper into large language model (LLM) jargon and explain some of the more technical terms we nowadays take for…
Jun 17, 2023 • Sebastian Raschka, PhD
170
5
Ten Noteworthy AI Research Papers of 2023
This year has felt distinctly different. I've been working in, on, and with machine learning and AI for over a decade, yet I can't recall a time when…
Dec 30, 2023 • Sebastian Raschka, PhD
353
31