[Paper Review] DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek was released not long ago, and it came as a shock to those in the AI industry. I was not familiar with the algorithms behind LLMs or with how much computing resource they use. All I had been doing was utilising LLM APIs for deve...
Apr 20, 2025 · 6 min read
![[Paper Review] DeepSeek-R1 Incentivizing Reasoning Capability in LLMs via Reinforcement Learning](/_next/image?url=https%3A%2F%2Fcdn.hashnode.com%2Fres%2Fhashnode%2Fimage%2Fupload%2Fv1745151315420%2F5c785e0c-f3b5-4df1-97fa-a420779b9422.png&w=3840&q=75)