Kevin Black
|
1958463f02
|
Reformat
|
2023-11-16 22:37:08 +00:00 |
|
Kevin Black
|
378dd18298
|
Merge pull request #16 from sayakpaul/patch-1
Update README.md to include a note about the `trl` integration
|
2023-10-06 16:46:20 -07:00 |
|
Sayak Paul
|
bfcba5e28e
|
Update README.md to include a note about the trl integration
|
2023-09-30 15:07:48 +02:00 |
|
Kevin Black
|
b590ec0a7c
|
Fix accelerate version
|
2023-09-15 22:54:51 -07:00 |
|
Kevin Black
|
500edd2b53
|
Update README.md
|
2023-09-11 16:10:03 -07:00 |
|
Kevin Black
|
e17ecd265d
|
Update README.md
|
2023-09-11 16:01:38 -07:00 |
|
Kevin Black
|
5955244f37
|
Fix gradient sync for lora
|
2023-08-22 23:34:40 +00:00 |
|
Kevin Black
|
d7a63516cb
|
Merge pull request #9 from desaixie/main
Only log rewards from process 0
|
2023-08-22 11:54:52 -07:00 |
|
Desai Xie
|
3130ddfaff
|
Only log rewards from process 0
|
2023-08-21 15:10:45 -07:00 |
|
Kevin Black
|
173b2bb6e0
|
Update README.md (add reward curves)
|
2023-07-13 12:37:22 -07:00 |
|
Kevin Black
|
c67c2adfee
|
Enforce python version
|
2023-07-06 10:29:12 -07:00 |
|
Kevin Black
|
64a20bc01d
|
Update README.md
|
2023-07-04 01:29:50 -07:00 |
|
Kevin Black
|
8c45353cce
|
Update README.md
|
2023-07-04 01:28:40 -07:00 |
|
Kevin Black
|
1f067b16c8
|
Add teaser image
|
2023-07-04 01:22:28 -07:00 |
|
Kevin Black
|
b14022ea92
|
Update README.md
|
2023-07-04 01:21:46 -07:00 |
|
Kevin Black
|
26177ccf40
|
Create LICENSE
|
2023-07-04 01:19:47 -07:00 |
|
Kevin Black
|
c65dd3a39c
|
Update README
|
2023-07-04 01:15:16 -07:00 |
|
Kevin Black
|
953d59eb70
|
Fix pydantic issue in setup
|
2023-07-04 00:40:42 -07:00 |
|
Kevin Black
|
10fbec322a
|
Add activities asset
|
2023-07-04 00:27:04 -07:00 |
|
Kevin Black
|
beb8c2f86d
|
Update configs
|
2023-07-04 00:25:37 -07:00 |
|
Kevin Black
|
ec499edf84
|
Fix aesthetic score (again), add llava reward
|
2023-07-04 00:23:33 -07:00 |
|
Kevin Black
|
c0bc708549
|
Commenting pass
|
2023-06-29 00:51:38 -07:00 |
|
Kevin Black
|
8779f62a1c
|
Adding checkpointing and resuming
|
2023-06-28 17:58:25 -07:00 |
|
Kevin Black
|
ad28862b48
|
Add reward to image caption
|
2023-06-28 10:42:47 -07:00 |
|
Kevin Black
|
fe9ed8a25f
|
Fix aesthetic scorer
|
2023-06-28 10:42:30 -07:00 |
|
Kevin Black
|
28d2d8c40e
|
Minor changes; add train_timestep_fraction
|
2023-06-27 22:17:56 -07:00 |
|
Kevin Black
|
bae3f43f5f
|
Add aesthetic scorer reward function
|
2023-06-27 10:40:36 -07:00 |
|
Kevin Black
|
8cab96dea4
|
Minor changes, add assets
|
2023-06-27 10:20:03 -07:00 |
|
Kevin Black
|
4c5322ca85
|
Device specific seed
|
2023-06-26 22:35:24 -07:00 |
|
Kevin Black
|
1ce0994c8a
|
Fix stat tracking bug
|
2023-06-26 22:25:43 -07:00 |
|
Kevin Black
|
5c16a90ceb
|
Move config out of module
|
2023-06-25 21:02:27 -07:00 |
|
Kevin Black
|
269615a35e
|
Working non-lora training; other changes
|
2023-06-25 11:28:42 -07:00 |
|
Kevin Black
|
c680890d5c
|
Working on DGX
|
2023-06-24 00:07:55 -07:00 |
|
Kevin Black
|
92fc030123
|
Continue implementation
|
2023-06-23 21:08:32 -07:00 |
|
Kevin Black
|
6d848c3cdc
|
Remove pycache
|
2023-06-23 21:08:19 -07:00 |
|
Kevin Black
|
2fda3d4e78
|
Initial commit
|
2023-06-23 19:25:54 -07:00 |
|