36 Commits

Author SHA1 Message Date
Kevin Black
1958463f02 Reformat 2023-11-16 22:37:08 +00:00
Kevin Black
378dd18298 Merge pull request #16 from sayakpaul/patch-1: Update README.md to include a note about the `trl` integration 2023-10-06 16:46:20 -07:00
Sayak Paul
bfcba5e28e Update README.md to include a note about the trl integration 2023-09-30 15:07:48 +02:00
Kevin Black
b590ec0a7c Fix accelerate version 2023-09-15 22:54:51 -07:00
Kevin Black
500edd2b53 Update README.md 2023-09-11 16:10:03 -07:00
Kevin Black
e17ecd265d Update README.md 2023-09-11 16:01:38 -07:00
Kevin Black
5955244f37 Fix gradient sync for lora 2023-08-22 23:34:40 +00:00
Kevin Black
d7a63516cb Merge pull request #9 from desaixie/main: Only log rewards from process 0 2023-08-22 11:54:52 -07:00
Desai Xie
3130ddfaff Only log rewards from process 0 2023-08-21 15:10:45 -07:00
Kevin Black
173b2bb6e0 Update README.md (add reward curves) 2023-07-13 12:37:22 -07:00
Kevin Black
c67c2adfee Enforce python version 2023-07-06 10:29:12 -07:00
Kevin Black
64a20bc01d Update README.md 2023-07-04 01:29:50 -07:00
Kevin Black
8c45353cce Update README.md 2023-07-04 01:28:40 -07:00
Kevin Black
1f067b16c8 Add teaser image 2023-07-04 01:22:28 -07:00
Kevin Black
b14022ea92 Update README.md 2023-07-04 01:21:46 -07:00
Kevin Black
26177ccf40 Create LICENSE 2023-07-04 01:19:47 -07:00
Kevin Black
c65dd3a39c Update README 2023-07-04 01:15:16 -07:00
Kevin Black
953d59eb70 Fix pydantic issue in setup 2023-07-04 00:40:42 -07:00
Kevin Black
10fbec322a Add activities asset 2023-07-04 00:27:04 -07:00
Kevin Black
beb8c2f86d Update configs 2023-07-04 00:25:37 -07:00
Kevin Black
ec499edf84 Fix aesthetic score (again), add llava reward 2023-07-04 00:23:33 -07:00
Kevin Black
c0bc708549 Commenting pass 2023-06-29 00:51:38 -07:00
Kevin Black
8779f62a1c Adding checkpointing and resuming 2023-06-28 17:58:25 -07:00
Kevin Black
ad28862b48 Add reward to image caption 2023-06-28 10:42:47 -07:00
Kevin Black
fe9ed8a25f Fix aesthetic scorer 2023-06-28 10:42:30 -07:00
Kevin Black
28d2d8c40e Minor changes; add train_timestep_fraction 2023-06-27 22:17:56 -07:00
Kevin Black
bae3f43f5f Add aesthetic scorer reward function 2023-06-27 10:40:36 -07:00
Kevin Black
8cab96dea4 Minor changes, add assets 2023-06-27 10:20:03 -07:00
Kevin Black
4c5322ca85 Device specific seed 2023-06-26 22:35:24 -07:00
Kevin Black
1ce0994c8a Fix stat tracking bug 2023-06-26 22:25:43 -07:00
Kevin Black
5c16a90ceb Move config out of module 2023-06-25 21:02:27 -07:00
Kevin Black
269615a35e Working non-lora training; other changes 2023-06-25 11:28:42 -07:00
Kevin Black
c680890d5c Working on DGX 2023-06-24 00:07:55 -07:00
Kevin Black
92fc030123 Continue implementation 2023-06-23 21:08:32 -07:00
Kevin Black
6d848c3cdc Remove pycache 2023-06-23 21:08:19 -07:00
Kevin Black
2fda3d4e78 Initial commit 2023-06-23 19:25:54 -07:00