Tips

The LLM Alignment Frontier A Deep Dive into PPO, DPO, GRPO, DAPO, and GSPO
Group Relative Policy Optimization (GRPO)
High-Performance Image & Video Inference Frameworks
The Unified Architecture of Large Language Models
Stop Wasting GPUs Implementing the vLLM Mixture-of-Models Router
FineWeb Dataset
Diffusion Transformer (DiT)
DeepSeek V3.2 Crushing Long-Context Costs with Sparse Attention (DSA)
How Thinking AI Models Are Rewriting Inference Scaling Laws