|
94fe13756f
|
try to update reward func
|
2024-09-14 23:56:36 +02:00 |
|
|
2ac17caa3c
|
need to update the model
|
2024-09-12 23:40:42 +02:00 |
|
|
0c60171c71
|
need to update the model
|
2024-09-10 16:57:42 +02:00 |
|
|
97fbdf91c7
|
try to deploy PPO policy
|
2024-09-09 23:50:10 +02:00 |
|
|
5dccf590e7
|
add sample phase and try to get log prob
|
2024-09-08 23:26:49 +02:00 |
|
|
3950a8438d
|
set batch_y to 1 and want to test 15625
|
2024-08-20 22:15:25 +02:00 |
|
|
01c5c277be
|
add the results
|
2024-08-20 09:11:10 +02:00 |
|
|
205f43291b
|
update some score function
|
2024-08-05 21:45:15 +02:00 |
|
|
5e66aa74e7
|
add best score part
|
2024-07-30 00:08:11 +02:00 |
|
|
55ff19421d
|
add the idea of guidance
|
2024-07-25 22:09:03 +02:00 |
|
|
fcdd8efc4f
|
find the guidance part
|
2024-07-16 13:27:44 +02:00 |
|
|
0b9da26eda
|
add some shape commits
|
2024-07-15 22:06:05 +02:00 |
|
|
d57575586d
|
make the metrics code back
|
2024-06-30 16:43:08 +02:00 |
|
|
82299e5213
|
try to run the graph, commented sampling codes
|
2024-06-25 00:09:27 +02:00 |
|
Hanzhang Ma
|
4f8945ca07
|
add some comments
|
2024-06-08 21:35:35 +02:00 |
|
gang liu
|
2c00828630
|
update_name
|
2024-05-25 15:32:36 -04:00 |
|