From 48ea6a4cbf312d3b551b9ee7c76f3a52e9f0c1dc Mon Sep 17 00:00:00 2001
From: gongjy <2474590974@qq.com>
Date: Thu, 19 Sep 2024 09:32:31 +0800
Subject: [PATCH] update readme
---
README.md | 5 ++++-
1 file changed, 4 insertions(+), 1 deletion(-)
diff --git a/README.md b/README.md
index 2d8e777..a2ad982 100644
--- a/README.md
+++ b/README.md
@@ -419,7 +419,7 @@ MobileLLM提出架构的深度比宽度更重要,「深而窄」的「瘦长
|-------------------|--------|-----------------------------|----------------------------------------------------------------|----------------------------------------------------------------|----------------------------------------------------------------|
| minimind-v1-small | 26M | d_model=512
n_layers=8 | [链接](https://pan.baidu.com/s/1wP_cAIc8cgaJ6CxUmR9ECQ?pwd=6666) | [链接](https://pan.baidu.com/s/1_COe0FQRDmeapSsvArahCA?pwd=6666) | [链接](https://pan.baidu.com/s/1GsGsWSL0Dckl0YPRXiBIFQ?pwd=6666) |
| minimind-v1-moe | 4×26M | d_model=512
n_layers=8 | [链接](https://pan.baidu.com/s/1IZdkzPRhbZ_bSsRL8vInjg?pwd=6666) | [链接](https://pan.baidu.com/s/1tqB-GMvuiGQBvEl-yZ-oBw?pwd=6666) | [链接](https://pan.baidu.com/s/1GHJ2T4904EcT1u8l1rVqtg?pwd=6666) |
-| minimind-v1 | 108M | d_model=768
n_layers=16 | - | [链接](https://pan.baidu.com/s/1p713loS7EfwHQf3G9eYI3Q?pwd=6666) | [链接](https://pan.baidu.com/s/12iHGpAs6R0kqsOnGtgK6vQ?pwd=6666) |
+| minimind-v1 | 108M | d_model=768
n_layers=16 | [链接](https://pan.baidu.com/s/1B60jYo4T8OmJI0ooqsixaA?pwd=6666) | [链接](https://pan.baidu.com/s/1p713loS7EfwHQf3G9eYI3Q?pwd=6666) | [链接](https://pan.baidu.com/s/12iHGpAs6R0kqsOnGtgK6vQ?pwd=6666) |
---
@@ -688,6 +688,9 @@ minimind模型本身没有使用较大的数据集训练,也没有针对回答
@ipfgao:
🔗训练步骤记录
+@chuanzhubin:
+🔗代码逐行注释
+
## 🫶支持者