Authors: || Published: 2025-01-29T11:25:00 || Updated: 2025-02-06T22:25:00
Categories: || Tags: || Post-format: link
Recent AI Reading [29 January 2025]
Papers
Agentic AI
Large Language Models
- Foundations of Large Language Models
- Large Language Models: A Survey
- Reasoning Language Models: A Blueprint
- Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models
- DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
- Bite: How Deepseek R1 was trained
- Breaking down the DeepSeek-R1 training process—no PhD required
- DeepSeek-V3 Technical Report
- DeepSeek R1 and R1-Zero Explained
- DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
- Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
- Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models
- s1: Simple test-time scaling
Technical Reports
- Qwen2.5-1M Technical Report
- DeepSeek R1 Read Teaming Report from Enkrypt AI.