DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
A library for easily merging multiple LLM experts and efficiently training the merged LLM.
Repository for our paper "See More Details: Efficient Image Super-Resolution by Experts Mining"
MoE Decoder Transformer implementation with MLX
PyTorch library for cost-effective, fast and easy serving of MoE models.
[arXiv'24] Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization
The idea for the best LLM currently possible came to me while watching a YouTube video on GaLore, the successor to LoRA, and realizing how groundbreaking that technique is. I had been daydreaming about pretraining my own model; this (probably impossible to implement) concept is a refined version of that idea.
Surrogate Modeling Toolbox
Efficient global optimization toolbox in Rust: Bayesian optimization, mixture of Gaussian processes, sampling methods
[SIGIR'24] The official implementation code of MOELoRA.
[Paper][Preprint 2024] Mixture of Modality Knowledge Experts for Robust Multi-modal Knowledge Graph Completion
an LLM toolkit
Mistral and Mixtral (MoE) from scratch
Simplified Implementation of SOTA Deep Learning Papers in Pytorch
RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models
Early release of the official implementation for "GraphMETRO: Mitigating Complex Graph Distribution Shifts via Mixture of Aligned Experts"
A curated reading list of research in Adaptive Computation, Dynamic Compute & Mixture of Experts (MoE).
Fast Inference of MoE Models with CPU-GPU Orchestration
Implementation of "the first large-scale multimodal mixture of experts models" from the paper "Multimodal Contrastive Learning with LIMoE: the Language-Image Mixture of Experts"
Implementation of Switch Transformers from the paper "Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity" (a minimal routing sketch follows below)
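For readers new to the topic, here is a minimal sketch of the top-1 (Switch-style) expert routing that many of the repositories above build on. It is illustrative only and not taken from any listed project; the class and parameter names are assumptions, and it assumes PyTorch.

```python
# Minimal sketch of Switch-style top-1 expert routing (illustrative only;
# not the implementation of any repository listed above). Assumes PyTorch.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SwitchMoE(nn.Module):
    def __init__(self, d_model: int, d_ff: int, num_experts: int):
        super().__init__()
        self.router = nn.Linear(d_model, num_experts)  # gating network
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.ReLU(), nn.Linear(d_ff, d_model))
            for _ in range(num_experts)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (tokens, d_model); each token is routed to exactly one expert.
        gate_probs = F.softmax(self.router(x), dim=-1)     # (tokens, num_experts)
        gate_weight, expert_idx = gate_probs.max(dim=-1)   # top-1 routing
        out = torch.zeros_like(x)
        for e, expert in enumerate(self.experts):
            mask = expert_idx == e
            if mask.any():
                # Scale the expert output by its gate probability, as in Switch Transformers.
                out[mask] = gate_weight[mask].unsqueeze(-1) * expert(x[mask])
        return out

# Usage: route 10 tokens of width 16 through 4 experts.
layer = SwitchMoE(d_model=16, d_ff=32, num_experts=4)
y = layer(torch.randn(10, 16))
print(y.shape)  # torch.Size([10, 16])
```

Only one expert runs per token, so compute stays roughly constant as the number of experts (and total parameters) grows; production implementations add load-balancing losses and capacity limits omitted here.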