A Framework for LLM-based Multi-Agent Reinforced Training and Inference
camel llama gemma multi-agent-systems autogen multi-agent-reinforcement-learning large-language-models qwen large-reasoning-models deepseek-r1 verl openrlhf
Updated Oct 27, 2025 - Python