#deepseek

[Paper Review] DeepSeek-R1 Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

It has not been a long time since DeepSeek was released. It was indeed a shock to those who are in AI industry. I was not familiar with LLM’s algorithm and the computing resource usage of the LLMs. All I was doing was to utilise the LLM APIs for deve...

Apr 20, 20256 min read30

[Paper Review] DeepSeek-R1 Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Command Palette