Go to file
2023-06-28 10:42:47 -07:00
config Minor changes; add train_timestep_fraction 2023-06-27 22:17:56 -07:00
ddpo_pytorch Fix aesthetic scorer 2023-06-28 10:42:30 -07:00
scripts Add reward to image caption 2023-06-28 10:42:47 -07:00
.gitignore Working non-lora training; other changes 2023-06-25 11:28:42 -07:00
README.md Initial commit 2023-06-23 19:25:54 -07:00
setup.py Working on DGX 2023-06-24 00:07:55 -07:00

ddpo-pytorch