lafmdp

Follow

🎯

Focusing

Jing-Cheng Pang lafmdp

🎯

Focusing

Follow

Senior researcher at HUAWEI. Interested in reinforcement learning and large language models.

45 followers · 15 following

Huawei Technologies Ltd.
NanJing, Jiangsu, China
00:16 (UTC +08:00)
https://jingchengpang.github.io

Achievements

Achievements

Highlights

Pro

Pinned Loading

Awesome-Papers-Autonomous-Agent Awesome-Papers-Autonomous-Agent Public

A collection of recent papers on building autonomous agent. Two topics included: RL-based / LLM-based agents.

733 58
CharlieBrown-v1/KALM CharlieBrown-v1/KALM Public

[NeurIPS'24] KALM: Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model Rollouts

Python 9 3
RLC RLC Public

[ICLR'24] Official code for "Language Model Self-improvement by Reinforcement Learning Contemplation".

Jupyter Notebook 7
LAMDA-RL/ImagineBench LAMDA-RL/ImagineBench Public

A benchmark for evaluating reinforcement learning algorithms that train the policies using imaginary rollouts from LLMs.

Python 13 1
ReViWo ReViWo Public

Forked from Trevor-emt/Reviwo

Code for ICLR 25 paper: Learning View-invariant World Models for Visual Robotic Manipulation

Python 7 1
HIDIL HIDIL Public

[NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"

Python 12 1