- [P] Don't have enough GPUs to train Mixtral? Why not try LLaMA-MoE~ https://github.com/pjlab-sys4nlp/llama-moe 6 comments machinelearning
Linked pages
- [1701.06538] Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer https://arxiv.org/abs/1701.06538 125 comments
- SlimPajama: A 627B token cleaned and deduplicated version of RedPajama - Cerebras https://www.cerebras.net/blog/slimpajama-a-627b-token-cleaned-and-deduplicated-version-of-redpajama 7 comments
- [2101.03961] Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity https://arxiv.org/abs/2101.03961 4 comments
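
The papers linked above both center on sparsely-gated, top-k routed Mixture-of-Experts layers, the same building block LLaMA-MoE constructs from LLaMA's feed-forward weights. Below is a minimal PyTorch sketch of such a layer; the class, parameter names, and MLP expert shape are illustrative assumptions, not code taken from the LLaMA-MoE repository.

```python
# Minimal sketch of a top-k sparsely-gated MoE layer in the spirit of the
# linked papers (Shazeer et al. 2017; Switch Transformer). Names are
# illustrative, not from pjlab-sys4nlp/llama-moe.
import torch
import torch.nn as nn
import torch.nn.functional as F


class TopKMoE(nn.Module):
    def __init__(self, d_model: int, d_ff: int, n_experts: int = 8, k: int = 2):
        super().__init__()
        self.k = k
        # Router: scores each token against every expert.
        self.gate = nn.Linear(d_model, n_experts, bias=False)
        # Experts: independent feed-forward networks (simple 2-layer MLPs here).
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.SiLU(), nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq, d_model) -> flatten tokens for routing.
        tokens = x.reshape(-1, x.shape[-1])
        scores = self.gate(tokens)                            # (n_tokens, n_experts)
        topk_scores, topk_idx = scores.topk(self.k, dim=-1)   # keep only k experts per token
        weights = F.softmax(topk_scores, dim=-1)              # renormalize over the chosen experts
        out = torch.zeros_like(tokens)
        for e, expert in enumerate(self.experts):
            # Tokens whose top-k choices include expert e.
            token_ids, slot = (topk_idx == e).nonzero(as_tuple=True)
            if token_ids.numel() == 0:
                continue
            out[token_ids] += weights[token_ids, slot].unsqueeze(-1) * expert(tokens[token_ids])
        return out.reshape_as(x)


if __name__ == "__main__":
    layer = TopKMoE(d_model=64, d_ff=256)
    y = layer(torch.randn(2, 10, 64))
    print(y.shape)  # torch.Size([2, 10, 64])
```

Because each token only runs through k of the n_experts feed-forward networks, parameter count grows with the number of experts while per-token compute stays roughly constant, which is why MoE models of this kind can be continually pre-trained on comparatively modest GPU budgets.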