UbeCc

Follow

Haoran Wang UbeCc

Follow

I am not a beast of burden. I am a LLaMA! 不是牛马是拉马（我不是奶龙） (Junior@Tsinghua University)

48 followers · 123 following

Tsinghua University
Beijing, China
03:47 (UTC +08:00)
[email protected]
@UbecWang

Achievements

Achievements

Highlights

Pro

Organizations

Pinned Loading

OpenRLHF/OpenRLHF OpenRLHF/OpenRLHF Public

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & LoRA & vLLM & RFT)

Python 6.4k 628
THUDM/SWE-Dev THUDM/SWE-Dev Public

SWE-Dev is an open-source SWE agent with a scalable test case construction pipeline. This pipeline synthesizes test cases through a two-step process: generating Gherkin descriptions and correspondi…

Python 11
Generalization-of-Transformers Generalization-of-Transformers Public

[ICLR'25] Understanding the Generalization of In-Context Learning in Transformers: An Empirical Study

Python 2
Shape-Control-of-DLO Shape-Control-of-DLO Public

Deep Reinforcement Learning spring 24, Tsinghua Univ.

Python 4