Authors: || Published: 2025-01-29T11:25:00 || Updated: 2025-03-13T18:43:00
Categories: || Tags: || Post-format: link
Recent AI Reading [29 January 2025]
Papers
Agentic AI
Large Language Models
- Foundations of Large Language Models
- Large Language Models: A Survey
- Reasoning Language Models: A Blueprint
- Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models
- DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
- Bite: How Deepseek R1 was trained
- Breaking down the DeepSeek-R1 training process—no PhD required
- Understanding DeepSeek-R1 paper: Beginner’s guide
- DeepSeek-R1 Paper Explained – A New RL LLMs Era in AI?
- Morgan Brown (on X) - Finally had a chance to dig into DeepSeek’s r1…
- DeepSeek-V3 Technical Report
- DeepSeek R1 and R1-Zero Explained
- DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
- Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
- Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models
- s1: Simple test-time scaling
Technical Reports
- Qwen2.5-1M Technical Report
- DeepSeek R1 Read Teaming Report from Enkrypt AI.