gongjy
|
4ef9c41563
|
fix deepspeed local_rank bug
|
2024-09-27 23:25:56 +08:00 |
|
gongjy
|
75753ea765
|
Update data preprocessing methods
|
2024-09-27 17:19:03 +08:00 |
|
gongjy
|
a8ae342775
|
Update data preprocessing methods
|
2024-09-27 16:19:30 +08:00 |
|
gongjy
|
d57037624b
|
update batchsize
|
2024-09-25 12:35:29 +08:00 |
|
gongjy
|
89d260145f
|
fix dtype bug
|
2024-09-25 10:07:30 +08:00 |
|
jingyaogong
|
5dd4e15aa2
|
Merge branch 'master' into wandb
|
2024-09-24 14:09:54 +08:00 |
|
|
51dcf51c5d
|
添加了argparse,方便命令行输入参数
|
2024-09-24 12:41:58 +08:00 |
|
|
ef9a592d14
|
修复了wandb的bug,避免了多次产生项目
|
2024-09-24 11:43:30 +08:00 |
|
gongjy
|
7947fa17fb
|
update wandb monitor
|
2024-09-23 22:16:21 +08:00 |
|
gongjy
|
235b6c6fd3
|
update wandb monitor
|
2024-09-23 22:14:52 +08:00 |
|
|
06a66d88c9
|
添加了wandb
|
2024-09-23 20:11:45 +08:00 |
|
gongjy
|
9093519c37
|
Updated some explanations
|
2024-09-20 17:07:51 +08:00 |
|
gongjy
|
ee218402cd
|
update some explain of the code
|
2024-09-20 17:04:16 +08:00 |
|
Ben
|
2dceaf4a92
|
添加注释,方便学习者快速理解
|
2024-09-18 21:53:39 +08:00 |
|
gongjy
|
c404941677
|
add accumulation_grad for pretrain
|
2024-09-16 20:58:46 +08:00 |
|
gongjy
|
f3f1cc5fac
|
update config
|
2024-09-15 15:08:04 +08:00 |
|
gongjy
|
dd52733d6f
|
update readme
|
2024-09-13 14:16:10 +08:00 |
|
gongjy
|
8be42693f6
|
MiniMind first open source
|
2024-08-28 16:41:44 +08:00 |
|