Commit Graph

  • 30f8614c97 update online demo gongjy 2025-02-09 23:55:59 +08:00
  • 29d6cd9cd2 update scripts file gongjy 2025-02-09 23:52:50 +08:00
  • 58e3af0359 add minimind2 gongjy 2025-02-09 23:49:47 +08:00
  • 6e9cd28ef9 update readme gongjy 2024-12-13 20:08:59 +08:00
  • f7127e4310 update lora-sft gongjy 2024-11-10 20:40:49 +08:00
  • 1240829c89 update train_tokenizer gongjy 2024-11-06 17:48:33 +08:00
  • 7c67ba0b92 update fast_inference gongjy 2024-10-30 15:26:28 +08:00
  • db39571493 update readme gongjy 2024-10-30 10:39:29 +08:00
  • 0bce9e5a31 update eval-chat gongjy 2024-10-28 12:57:25 +08:00
  • 6d7a988365 update readme gongjy 2024-10-27 21:25:55 +08:00
  • 0d10efeeed update process gongjy 2024-10-27 17:21:29 +08:00
  • 517e0a4e7c update readme gongjy 2024-10-26 14:35:22 +08:00
  • d63c525d04 update readme gongjy 2024-10-26 14:32:55 +08:00
  • a9e65969c8 update sft-lr gongjy 2024-10-24 08:58:31 +08:00
  • 42bd06e55d update data_process gongjy 2024-10-23 12:25:45 +08:00
  • 69bcb8dc90 update data_process gongjy 2024-10-23 12:02:28 +08:00
  • 3ff66f7221 update model gongjy 2024-10-20 15:13:58 +08:00
  • 00e09efb5b update num_workers gongjy 2024-10-19 06:31:15 +08:00
  • f16991d7ec update sft gongjy 2024-10-16 22:58:06 +08:00
  • 6861d1af56 update rlhf gongjy 2024-10-15 15:20:31 +08:00
  • c59b8b3e26 update rlhf gongjy 2024-10-15 15:04:38 +08:00
  • 11d5cadb9c update rlhf gongjy 2024-10-15 10:30:32 +08:00
  • 02adb7bc0d update trl gongjy 2024-10-13 22:44:28 +08:00
  • 5c4b34bbe3 update inference gongjy 2024-10-13 13:11:41 +08:00
  • ad6fc92399 update inference gongjy 2024-10-13 13:09:52 +08:00
  • 135421690e update data_process gongjy 2024-10-12 19:46:08 +08:00
  • 2698f6b57d update data_process gongjy 2024-10-12 18:47:08 +08:00
  • 36fadc7ef1 update lora-sft gongjy 2024-10-11 17:43:52 +08:00
  • 3a034a47c8 update readme gongjy 2024-10-10 22:13:10 +08:00
  • 5dbd6174b3 update readme gongjy 2024-10-09 09:12:55 +08:00
  • 4542ecf858 update readme gongjy 2024-10-09 00:03:07 +08:00
  • 772834148e update readme gongjy 2024-10-08 23:40:29 +08:00
  • 000b0a496b update readme gongjy 2024-10-08 23:16:29 +08:00
  • 5929d0f8b1 update readme gongjy 2024-10-07 18:42:53 +08:00
  • e4b8789d8c update readme gongjy 2024-10-05 22:59:00 +08:00
  • eb875da306 update readme gongjy 2024-10-05 00:35:54 +08:00
  • 1b864453fa update requirements.txt gongjy 2024-10-03 14:45:58 +08:00
  • a87f628400 update model (fix loss bug) gongjy 2024-09-29 16:58:48 +08:00
  • 4ef9c41563 fix deepspeed local_rank bug gongjy 2024-09-27 23:25:56 +08:00
  • 2981d3ea86 Update data preprocessing methods gongjy 2024-09-27 22:34:30 +08:00
  • 75753ea765 Update data preprocessing methods gongjy 2024-09-27 17:19:03 +08:00
  • 1cc73836d4 update readme info gongjy 2024-09-27 16:38:18 +08:00
  • a8ae342775 Update data preprocessing methods gongjy 2024-09-27 16:19:30 +08:00
  • d57037624b update batchsize gongjy 2024-09-25 12:35:29 +08:00
  • 89d260145f fix dtype bug gongjy 2024-09-25 10:07:30 +08:00
  • 13105cfa0c
    Merge pull request #44 from iomgaa-ycz/wandb jingyaogong 2024-09-24 14:10:05 +08:00
  • 5dd4e15aa2
    Merge branch 'master' into wandb jingyaogong 2024-09-24 14:09:54 +08:00
  • d7a056a545 更新了ReadMe Yu Chengzhang 2024-09-24 12:45:21 +08:00
  • 51dcf51c5d 添加了argparse,方便命令行输入参数 Yu Chengzhang 2024-09-24 12:41:58 +08:00
  • ef9a592d14 修复了wandb的bug,避免了多次产生项目 Yu Chengzhang 2024-09-24 11:43:30 +08:00
  • 7947fa17fb update wandb monitor gongjy 2024-09-23 22:16:21 +08:00
  • 235b6c6fd3 update wandb monitor gongjy 2024-09-23 22:14:52 +08:00
  • 15f8242ba7
    Merge pull request #43 from iomgaa-ycz/wandb jingyaogong 2024-09-23 21:54:48 +08:00
  • 06a66d88c9 添加了wandb Yu Chengzhang 2024-09-23 20:11:45 +08:00
  • b4359b3335 fix data_process bug gongjy 2024-09-23 20:11:19 +08:00
  • 5f8279f661
    Merge pull request #42 from iomgaa-ycz/wandb jingyaogong 2024-09-23 20:08:13 +08:00
  • 0fa4d17d26 更新了忽视列表 Yu Chengzhang 2024-09-23 18:36:04 +08:00
  • 6fb569abd8 修复了process_seq_monkey写入错误的bug Yu Chengzhang 2024-09-23 18:34:58 +08:00
  • bf64ffb056 update data_process gongjy 2024-09-22 21:17:05 +08:00
  • 8ff2ccae4c update readme gongjy 2024-09-22 14:49:05 +08:00
  • d41b39c88c update readme gongjy 2024-09-22 14:45:37 +08:00
  • cbdea6bd9f update readme gongjy 2024-09-21 23:11:02 +08:00
  • 7115a98453 update data_process Chunk_size Read gongjy 2024-09-21 22:57:22 +08:00
  • 6759da45c1 update model mask gongjy 2024-09-21 20:00:25 +08:00
  • 02297df3c1 Efficient implementation of Inference KV cache gongjy 2024-09-21 00:01:05 +08:00
  • 0cd7d4b2c2 update readme gongjy 2024-09-20 21:21:25 +08:00
  • b969382f0a update readme gongjy 2024-09-20 21:19:59 +08:00
  • 6c1118ab86 update readme gongjy 2024-09-20 21:15:55 +08:00
  • 19523a121f update readme gongjy 2024-09-20 21:15:14 +08:00
  • 1c493e8c2a update readme gongjy 2024-09-20 20:33:46 +08:00
  • 9093519c37 Updated some explanations gongjy 2024-09-20 17:07:51 +08:00
  • 39423c6a14 Updated some explanations gongjy 2024-09-20 17:05:50 +08:00
  • ee218402cd update some explain of the code gongjy 2024-09-20 17:04:16 +08:00
  • b4170e3766
    Merge pull request #34 from chuanzhubin/master jingyaogong 2024-09-20 16:51:32 +08:00
  • 95b1ee8175
    Merge branch 'master' into master jingyaogong 2024-09-20 16:51:03 +08:00
  • c81c17dab7 update others gongjy 2024-09-19 09:35:02 +08:00
  • 48ea6a4cbf update readme gongjy 2024-09-19 09:32:31 +08:00
  • c28664dac8 update minimind-v1-moe gongjy 2024-09-18 23:51:56 +08:00
  • 2dceaf4a92 添加注释,方便学习者快速理解 Ben 2024-09-18 21:53:39 +08:00
  • 61cb61a46a update minimind-v1-moe gongjy 2024-09-17 11:33:31 +08:00
  • c404941677 add accumulation_grad for pretrain gongjy 2024-09-16 20:58:46 +08:00
  • 8c18b324d0 update model gongjy 2024-09-16 16:59:52 +08:00
  • e4ad822c40 update model gongjy 2024-09-16 15:29:57 +08:00
  • e1f68b5e37 update readme gongjy 2024-09-16 09:31:44 +08:00
  • 3db31aeba3 update readme gongjy 2024-09-16 09:29:40 +08:00
  • 07150c7b57 update readme gongjy 2024-09-15 22:58:40 +08:00
  • 5000e2e30a update contributor gongjy 2024-09-15 20:59:52 +08:00
  • 846b37d6f4 update contributor gongjy 2024-09-15 18:13:52 +08:00
  • 30b498f0c3
    Merge pull request #29 from chuanzhubin/master jingyaogong 2024-09-15 17:34:57 +08:00
  • f5966fe69e
    Update my_openai_api.py 忽略客户端传来的未知字段 Ben 2024-09-15 07:37:24 +00:00
  • 16928c1231 update some config gongjy 2024-09-15 15:12:47 +08:00
  • aa5d70321f update config gongjy 2024-09-15 15:09:21 +08:00
  • f3f1cc5fac update config gongjy 2024-09-15 15:08:04 +08:00
  • b043ec996b update my_openai_api.py gongjy 2024-09-15 12:07:55 +08:00
  • 13e791e516 update export_model gongjy 2024-09-15 11:39:33 +08:00
  • 2c22f1bb26 update readme gongjy 2024-09-14 19:16:36 +08:00
  • 326ec8852a update readme gongjy 2024-09-14 19:11:46 +08:00
  • 3068e5efcc update model/dataset.py gongjy 2024-09-14 16:09:42 +08:00
  • a8405b08a2 update 0-eval_pretrain.py gongjy 2024-09-14 14:14:12 +08:00
  • b00a3b5ff7 update 0-eval_pretrain.py gongjy 2024-09-14 14:09:29 +08:00