Explore other topics:deepseek 671b vramdeepseek-r1 incentivizing reasoning capability in llms via reinforcement learningdeepseek cost to builddeepseek reasoning examplefei-fei li deepseek