Commit Graph

  • 3ff66f7221 update model gongjy 2024-10-20 15:13:58 +08:00
  • 00e09efb5b update num_workers gongjy 2024-10-19 06:31:15 +08:00
  • f16991d7ec update sft gongjy 2024-10-16 22:58:06 +08:00
  • 6861d1af56 update rlhf gongjy 2024-10-15 15:20:31 +08:00
  • c59b8b3e26 update rlhf gongjy 2024-10-15 15:04:38 +08:00
  • 11d5cadb9c update rlhf gongjy 2024-10-15 10:30:32 +08:00
  • 02adb7bc0d update trl gongjy 2024-10-13 22:44:28 +08:00
  • 5c4b34bbe3 update inference gongjy 2024-10-13 13:11:41 +08:00
  • ad6fc92399 update inference gongjy 2024-10-13 13:09:52 +08:00
  • 135421690e update data_process gongjy 2024-10-12 19:46:08 +08:00
  • 2698f6b57d update data_process gongjy 2024-10-12 18:47:08 +08:00
  • 36fadc7ef1 update lora-sft gongjy 2024-10-11 17:43:52 +08:00
  • 3a034a47c8 update readme gongjy 2024-10-10 22:13:10 +08:00
  • 5dbd6174b3 update readme gongjy 2024-10-09 09:12:55 +08:00
  • 4542ecf858 update readme gongjy 2024-10-09 00:03:07 +08:00
  • 772834148e update readme gongjy 2024-10-08 23:40:29 +08:00
  • 000b0a496b update readme gongjy 2024-10-08 23:16:29 +08:00
  • 5929d0f8b1 update readme gongjy 2024-10-07 18:42:53 +08:00
  • e4b8789d8c update readme gongjy 2024-10-05 22:59:00 +08:00
  • eb875da306 update readme gongjy 2024-10-05 00:35:54 +08:00
  • 1b864453fa update requirements.txt gongjy 2024-10-03 14:45:58 +08:00
  • a87f628400 update model (fix loss bug) gongjy 2024-09-29 16:58:48 +08:00
  • 4ef9c41563 fix deepspeed local_rank bug gongjy 2024-09-27 23:25:56 +08:00
  • 2981d3ea86 Update data preprocessing methods gongjy 2024-09-27 22:34:30 +08:00
  • 75753ea765 Update data preprocessing methods gongjy 2024-09-27 17:19:03 +08:00
  • 1cc73836d4 update readme info gongjy 2024-09-27 16:38:18 +08:00
  • a8ae342775 Update data preprocessing methods gongjy 2024-09-27 16:19:30 +08:00
  • d57037624b update batchsize gongjy 2024-09-25 12:35:29 +08:00
  • 89d260145f fix dtype bug gongjy 2024-09-25 10:07:30 +08:00
  • 13105cfa0c Merge pull request #44 from iomgaa-ycz/wandb jingyaogong 2024-09-24 14:10:05 +08:00
  • 5dd4e15aa2 Merge branch 'master' into wandb jingyaogong 2024-09-24 14:09:54 +08:00
  • d7a056a545 Updated the ReadMe Yu Chengzhang 2024-09-24 12:45:21 +08:00
  • 51dcf51c5d Added argparse to make it easier to pass parameters on the command line Yu Chengzhang 2024-09-24 12:41:58 +08:00
  • ef9a592d14 Fixed a wandb bug so the project is no longer created multiple times Yu Chengzhang 2024-09-24 11:43:30 +08:00
  • 7947fa17fb update wandb monitor gongjy 2024-09-23 22:16:21 +08:00
  • 235b6c6fd3 update wandb monitor gongjy 2024-09-23 22:14:52 +08:00
  • 15f8242ba7 Merge pull request #43 from iomgaa-ycz/wandb jingyaogong 2024-09-23 21:54:48 +08:00
  • 06a66d88c9 Added wandb Yu Chengzhang 2024-09-23 20:11:45 +08:00
  • b4359b3335 fix data_process bug gongjy 2024-09-23 20:11:19 +08:00
  • 5f8279f661 Merge pull request #42 from iomgaa-ycz/wandb jingyaogong 2024-09-23 20:08:13 +08:00
  • 0fa4d17d26 Updated the ignore list Yu Chengzhang 2024-09-23 18:36:04 +08:00
  • 6fb569abd8 Fixed a write-error bug in process_seq_monkey Yu Chengzhang 2024-09-23 18:34:58 +08:00
  • bf64ffb056 update data_process gongjy 2024-09-22 21:17:05 +08:00
  • 8ff2ccae4c update readme gongjy 2024-09-22 14:49:05 +08:00
  • d41b39c88c update readme gongjy 2024-09-22 14:45:37 +08:00
  • cbdea6bd9f update readme gongjy 2024-09-21 23:11:02 +08:00
  • 7115a98453 update data_process Chunk_size Read gongjy 2024-09-21 22:57:22 +08:00
  • 6759da45c1 update model mask gongjy 2024-09-21 20:00:25 +08:00
  • 02297df3c1 Efficient implementation of Inference KV cache gongjy 2024-09-21 00:01:05 +08:00
  • 0cd7d4b2c2 update readme gongjy 2024-09-20 21:21:25 +08:00
  • b969382f0a update readme gongjy 2024-09-20 21:19:59 +08:00
  • 6c1118ab86 update readme gongjy 2024-09-20 21:15:55 +08:00
  • 19523a121f update readme gongjy 2024-09-20 21:15:14 +08:00
  • 1c493e8c2a update readme gongjy 2024-09-20 20:33:46 +08:00
  • 9093519c37 Updated some explanations gongjy 2024-09-20 17:07:51 +08:00
  • 39423c6a14 Updated some explanations gongjy 2024-09-20 17:05:50 +08:00
  • ee218402cd update some explain of the code gongjy 2024-09-20 17:04:16 +08:00
  • b4170e3766 Merge pull request #34 from chuanzhubin/master jingyaogong 2024-09-20 16:51:32 +08:00
  • 95b1ee8175 Merge branch 'master' into master jingyaogong 2024-09-20 16:51:03 +08:00
  • c81c17dab7 update others gongjy 2024-09-19 09:35:02 +08:00
  • 48ea6a4cbf update readme gongjy 2024-09-19 09:32:31 +08:00
  • c28664dac8 update minimind-v1-moe gongjy 2024-09-18 23:51:56 +08:00
  • 2dceaf4a92 Added comments to help learners understand the code quickly Ben 2024-09-18 21:53:39 +08:00
  • 61cb61a46a update minimind-v1-moe gongjy 2024-09-17 11:33:31 +08:00
  • c404941677 add accumulation_grad for pretrain gongjy 2024-09-16 20:58:46 +08:00
  • 8c18b324d0 update model gongjy 2024-09-16 16:59:52 +08:00
  • e4ad822c40 update model gongjy 2024-09-16 15:29:57 +08:00
  • e1f68b5e37 update readme gongjy 2024-09-16 09:31:44 +08:00
  • 3db31aeba3 update readme gongjy 2024-09-16 09:29:40 +08:00
  • 07150c7b57 update readme gongjy 2024-09-15 22:58:40 +08:00
  • 5000e2e30a update contributor gongjy 2024-09-15 20:59:52 +08:00
  • 846b37d6f4 update contributor gongjy 2024-09-15 18:13:52 +08:00
  • 30b498f0c3 Merge pull request #29 from chuanzhubin/master jingyaogong 2024-09-15 17:34:57 +08:00
  • f5966fe69e Update my_openai_api.py: ignore unknown fields sent by the client Ben 2024-09-15 07:37:24 +00:00
  • 16928c1231 update some config gongjy 2024-09-15 15:12:47 +08:00
  • aa5d70321f update config gongjy 2024-09-15 15:09:21 +08:00
  • f3f1cc5fac update config gongjy 2024-09-15 15:08:04 +08:00
  • b043ec996b update my_openai_api.py gongjy 2024-09-15 12:07:55 +08:00
  • 13e791e516 update export_model gongjy 2024-09-15 11:39:33 +08:00
  • 2c22f1bb26 update readme gongjy 2024-09-14 19:16:36 +08:00
  • 326ec8852a update readme gongjy 2024-09-14 19:11:46 +08:00
  • 3068e5efcc update model/dataset.py gongjy 2024-09-14 16:09:42 +08:00
  • a8405b08a2 update 0-eval_pretrain.py gongjy 2024-09-14 14:14:12 +08:00
  • b00a3b5ff7 update 0-eval_pretrain.py gongjy 2024-09-14 14:09:29 +08:00
  • ecf6d44133 update model/dataset.py gongjy 2024-09-14 14:05:41 +08:00
  • 8a407ad1c6 update readme gongjy 2024-09-13 22:10:07 +08:00
  • 3696d3bf80 update requirements.txt gongjy 2024-09-13 16:43:06 +08:00
  • 0fb22635a1 update requirements.txt gongjy 2024-09-13 16:09:42 +08:00
  • 9f21ba1578 update readme gongjy 2024-09-13 15:36:02 +08:00
  • dd52733d6f update readme gongjy 2024-09-13 14:16:10 +08:00
  • 56c6139896 update data_process & full_sft gongjy 2024-09-13 13:32:24 +08:00
  • 41b474e2bf update images gongjy 2024-09-13 10:23:36 +08:00
  • d4acd94beb update readme gongjy 2024-09-12 23:28:36 +08:00
  • 01e91e9a58 update readme gongjy 2024-09-12 23:15:28 +08:00
  • 98600070cd update seq-monkey dataset baidu-download gongjy 2024-09-12 22:10:18 +08:00
  • 4ceaa04d05 update readme gongjy 2024-09-12 22:07:41 +08:00
  • edb26ced7b update readme gongjy 2024-09-12 09:23:06 +08:00
  • 57ef4a4bfc update readme gongjy 2024-09-12 08:09:28 +08:00
  • be164565f3 update readme gongjy 2024-09-11 11:12:06 +08:00
  • 49465cfc61 update readme gongjy 2024-09-11 11:11:38 +08:00