r1
Here are 55 public repositories matching this topic...
🚀🚀🚀A collection of some wesome public projects about Large Language Model(LLM), Vision Language Model(VLM), Vision Language Action(VLA), AI Generated Content(AIGC), the related Datasets and Applications.
-
Updated
Apr 14, 2025
Explore the Multimodal “Aha Moment” on 2B Model
-
Updated
Mar 18, 2025 - Python
Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".
-
Updated
Feb 19, 2025 - Python
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization
-
Updated
Apr 16, 2025
Latest Advances on Long Chain-of-Thought Reasoning
-
Updated
Apr 13, 2025
Model Context Protocol server for DeepSeek's advanced language models
-
Updated
Mar 27, 2025 - JavaScript
😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond
-
Updated
Apr 22, 2025
Doge Family of Small Language Model
-
Updated
Apr 22, 2025 - Python
Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"
-
Updated
Apr 22, 2025 - Python
Code for "UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning"
-
Updated
Apr 8, 2025 - Python
A comprehensive collection of process reward models.
-
Updated
Apr 6, 2025
Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".
-
Updated
Apr 18, 2025 - JavaScript
Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents
-
Updated
Apr 20, 2025 - Python
使用langchain进行任务规划,构建子任务的会话场景资源,通过MCTS任务执行器,来让每个子任务通过在上下文中资源,通过自身反思探索来获取自身对问题的最优答案;这种方式依赖模型的对齐偏好,我们在每种偏好上设计了一个工程框架,来完成自我对不同答案的奖励进行采样策略
-
Updated
Apr 11, 2025 - Jupyter Notebook
Auto-generate fallback and meter display from existing group info in d&b audiotechnik's R1 and ArrayCalc software.
-
Updated
Mar 29, 2025 - Python
Recreating the minimal training methods of DeepSeek-R1 for small langauge models.
-
Updated
Feb 10, 2025 - Python
Improve this page
Add a description, image, and links to the r1 topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the r1 topic, visit your repo's landing page and select "manage topics."