Sitemap - 2024 - Ahead of AI

Noteworthy AI Research Papers of 2024 (Part One)

LLM Research Papers: The 2024 List

Understanding Multimodal LLMs

Building A GPT-Style LLM Classifier From Scratch

Building LLMs from the Ground Up: A 3-hour Coding Workshop

New LLM Pre-training and Post-training Paradigms

Instruction Pretraining LLMs

Developing an LLM: Building, Training, Finetuning

LLM Research Insights: Instruction Masking and New LoRA Finetuning Experiments

How Good Are the Latest Open LLMs? And Is DPO Better Than PPO?

Using and Finetuning Pretrained Transformers

Tips for LLM Pretraining and Evaluating Reward Models

A LoRA Successor, Small Finetuned LLMs Vs Generalist LLMs, and Transparent LLM Research

Improving LoRA: Implementing Weight-Decomposed Low-Rank Adaptation (DoRA) from Scratch

Support Independent AI Research

Model Merging, Mixtures of Experts, and Towards Smaller LLMs

Understanding and Coding Self-Attention, Multi-Head Attention, Causal-Attention, and Cross-Attention in LLMs

#nojs-banner { position: fixed; bottom: 0; left: 0; padding: 16px 16px 16px 32px; width: 100%; box-sizing: border-box; background: red; color: white; font-family: -apple-system, "Segoe UI", Roboto, Helvetica, Arial, sans-serif, "Apple Color Emoji", "Segoe UI Emoji", "Segoe UI Symbol"; font-size: 13px; line-height: 13px; } #nojs-banner a { color: inherit; text-decoration: underline; } This site requires JavaScript to run correctly. Please turn on JavaScript or unblock scripts