Distributed Training
Coordinate Ray actor groups for diffusion and multimodal RL workloads.
UniRL combines Ray actor groups, Hydra recipes, composable training stacks, and pluggable rollout engines for diffusion and autoregressive generative models.
Launch UniRL Experiments
Hydra configs, Ray scheduling, and rollout engines
Quick start
python -m unirl.train_diffusion \
--config-name=diffusion/sd3_trainside \
num_devices=8
Coordinate Ray actor groups for diffusion and multimodal RL workloads.
Compose reproducible experiments from typed configs and focused overrides.
Swap rollout engines, rewards, and policy logic without changing the entrypoint.
Documentation Entrypoints
Recommended Path
Install dependencies, then launch a first single-node recipe.
Choose an experiment recipe and inspect the resolved Hydra config.
Adapt recipes for multinode runs and cluster-specific runtime paths.
Agent-Readable Endpoints