From 8ff2ccae4c1502f667381588ae64c67791092032 Mon Sep 17 00:00:00 2001 From: gongjy <2474590974@qq.com> Date: Sun, 22 Sep 2024 14:49:05 +0800 Subject: [PATCH] update readme --- README_en.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/README_en.md b/README_en.md index fadc9fe..7c88224 100644 --- a/README_en.md +++ b/README_en.md @@ -479,8 +479,8 @@ better with the scaling law for small models. | Model Name | params | Config | pretrain_model | single_sft_model | multi_sft_model | |-------------------|--------|-----------------------------|-----------------------------------------------------------------|----------------------------------------------------------------|----------------------------------------------------------------| | minimind-v1-small | 26M | d_model=512
n_layers=8 | [URL](https://pan.baidu.com/s/1wP_cAIc8cgaJ6CxUmR9ECQ?pwd=6666) | [URL](https://pan.baidu.com/s/1_COe0FQRDmeapSsvArahCA?pwd=6666) | [URL](https://pan.baidu.com/s/1GsGsWSL0Dckl0YPRXiBIFQ?pwd=6666) | -| minimind-v1-moe | 4×26M | d_model=512
n_layers=8 | [URL](https://pan.baidu.com/s/1IZdkzPRhbZ_bSsRL8vInjg?pwd=6666) | [URL](https://pan.baidu.com/s/1tqB-GMvuiGQBvEl-yZ-oBw?pwd=6666) | [URL](https://pan.baidu.com/s/1GHJ2T4904EcT1u8l1rVqtg?pwd=6666) | -| minimind-v1 | 108M | d_model=768
n_layers=16 | - | [URL](https://pan.baidu.com/s/1p713loS7EfwHQf3G9eYI3Q?pwd=6666) | [URL](https://pan.baidu.com/s/12iHGpAs6R0kqsOnGtgK6vQ?pwd=6666) | +| minimind-v1-moe | 4×26M | d_model=512
n_layers=8 | [URL](https://pan.baidu.com/s/1IZdkzPRhbZ_bSsRL8vInjg?pwd=6666) | [URL](https://pan.baidu.com/s/1tqB-GMvuiGQBvEl-yZ-oBw?pwd=6666) | [URL](https://pan.baidu.com/s/1GHJ2T4904EcT1u8l1rVqtg?pwd=6666) | +| minimind-v1 | 108M | d_model=768
n_layers=16 | [URL](https://pan.baidu.com/s/1B60jYo4T8OmJI0ooqsixaA?pwd=6666) | [URL](https://pan.baidu.com/s/1p713loS7EfwHQf3G9eYI3Q?pwd=6666) | [URL](https://pan.baidu.com/s/12iHGpAs6R0kqsOnGtgK6vQ?pwd=6666) | ---