DataSci Ocean
Posts
Tags
Categories
English
繁體中文
English
Light
Dark
Auto
DataSci Ocean
Cancel
Posts
Tags
Categories
Light
Dark
Auto
English
繁體中文
English
Mixture of Experts
2024
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM
04-24
Sparse Upcycling: Training Mixture-of-Experts from Dense Checkpoints
04-10