
Roosmikx
Add a review FollowOverview
-
Founded Date October 15, 2005
-
Sectors Writing
-
Posted Jobs 0
-
Viewed 14
Company Description
DeepSeek’s First-generation Reasoning Models
DeepSeek’s first-generation reasoning designs, accomplishing performance equivalent to OpenAI-o1 throughout math, code, and thinking jobs.
Models
DeepSeek-R1
Distilled models
DeepSeek group has actually demonstrated that the thinking patterns of bigger designs can be distilled into smaller sized designs, leading to much better performance compared to the reasoning found through RL on little models.
Below are the designs created by means of fine-tuning versus a number of dense models commonly utilized in the research study neighborhood using thinking data generated by DeepSeek-R1. The evaluation results demonstrate that the distilled smaller dense designs perform extremely well on criteria.
DeepSeek-R1-Distill-Qwen-1.5 B
DeepSeek-R1-Distill-Qwen-7B
DeepSeek-R1-Distill-Llama-8B
DeepSeek-R1-Distill-Qwen-14B
DeepSeek-R1-Distill-Qwen-32B
DeepSeek-R1-Distill-Llama-70B
License
The model weights are licensed under the MIT License. DeepSeek-R1 series support industrial usage, permit any modifications and acquired works, consisting of, but not limited to, distillation for training other LLMs.