Commit Graph

  • 1958463f02 Reformat main Kevin Black 2023-11-16 22:36:46 +0000
  • 378dd18298
    Merge pull request #16 from sayakpaul/patch-1 Kevin Black 2023-10-06 16:46:20 -0700
  • bfcba5e28e
    Update README.md to include a note about the trl integration Sayak Paul 2023-09-30 15:07:48 +0200
  • b590ec0a7c
    Fix accelerate version Kevin Black 2023-09-15 22:54:51 -0700
  • 500edd2b53
    Update README.md Kevin Black 2023-09-11 16:10:03 -0700
  • e17ecd265d
    Update README.md Kevin Black 2023-09-11 16:01:38 -0700
  • 5955244f37 Fix gradient sync for lora Kevin Black 2023-08-22 16:18:49 -0700
  • d7a63516cb
    Merge pull request #9 from desaixie/main Kevin Black 2023-08-22 11:54:52 -0700
  • 3130ddfaff
    Only log rewards from process 0 Desai Xie 2023-08-21 15:10:45 -0700
  • 173b2bb6e0
    Update README.md (add reward curves) Kevin Black 2023-07-13 12:37:22 -0700
  • c67c2adfee Enforce python version Kevin Black 2023-07-06 10:28:54 -0700
  • 64a20bc01d
    Update README.md Kevin Black 2023-07-04 01:29:50 -0700
  • 8c45353cce
    Update README.md Kevin Black 2023-07-04 01:28:40 -0700
  • 1f067b16c8 Add teaser image Kevin Black 2023-07-04 01:22:08 -0700
  • b14022ea92
    Update README.md Kevin Black 2023-07-04 01:21:46 -0700
  • 26177ccf40
    Create LICENSE Kevin Black 2023-07-04 01:19:47 -0700
  • c65dd3a39c Update README Kevin Black 2023-07-04 01:15:16 -0700
  • 953d59eb70 Fix pydantic issue in setup Kevin Black 2023-07-04 00:40:42 -0700
  • 10fbec322a Add activities asset Kevin Black 2023-07-04 00:27:04 -0700
  • beb8c2f86d Update configs Kevin Black 2023-07-04 00:25:37 -0700
  • ec499edf84 Fix aesthetic score (again), add llava reward Kevin Black 2023-07-04 00:23:33 -0700
  • c0bc708549 Commenting pass Kevin Black 2023-06-29 00:51:38 -0700
  • 8779f62a1c Adding checkpointing and resuming Kevin Black 2023-06-28 17:58:25 -0700
  • ad28862b48 Add reward to image caption Kevin Black 2023-06-28 10:42:47 -0700
  • fe9ed8a25f Fix aesthetic scorer Kevin Black 2023-06-28 10:42:30 -0700
  • 28d2d8c40e Minor changes; add train_timestep_fraction Kevin Black 2023-06-27 22:17:32 -0700
  • bae3f43f5f Add aesthetic scorer reward function Kevin Black 2023-06-27 10:40:36 -0700
  • 8cab96dea4 Minor changes, add assets Kevin Black 2023-06-27 10:20:03 -0700
  • 4c5322ca85 Device specific seed Kevin Black 2023-06-26 22:35:24 -0700
  • 1ce0994c8a Fix stat tracking bug Kevin Black 2023-06-26 22:25:43 -0700
  • 5c16a90ceb Move config out of module Kevin Black 2023-06-25 21:02:27 -0700
  • 269615a35e Working non-lora training; other changes Kevin Black 2023-06-25 11:28:42 -0700
  • c680890d5c Working on DGX Kevin Black 2023-06-24 00:07:55 -0700
  • 92fc030123 Continue implementation Kevin Black 2023-06-23 21:08:32 -0700
  • 6d848c3cdc Remove pycache Kevin Black 2023-06-23 21:08:19 -0700
  • 2fda3d4e78 Initial commit Kevin Black 2023-06-23 19:25:54 -0700