A Toolkit for Distributional Control of Generative Models
machine-learning ai alignment language-models monte-carlo-sampling generative-models fine-tuning human-preferences distributional-policy-gradients
-
Updated
Jul 31, 2025 - Python