Repositories
Showing 10 of 317 repositories
- LOLA-Megatron-DeepSpeed (Public, forked from microsoft/Megatron-DeepSpeed)
  Ongoing research training transformer language models at scale, including BERT and GPT-2