-
Notifications
You must be signed in to change notification settings - Fork 240
Pull requests: NVIDIA/TransformerEngine
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[JAX] Splitting cpp_extensions.py
enhancement
New feature or request
jax
#899
opened Jun 7, 2024 by
phu0ngng
Loading…
5 of 11 tasks
Make transformer_engine::getenv arguments independent of C++ ABI version
#896
opened Jun 7, 2024 by
ksivaman
Loading…
8 of 11 tasks
[PyTorch] Refine definition of sliding window size based on attention mask
#895
opened Jun 7, 2024 by
cyanguwa
Loading…
3 tasks
[PyTorch] Disabling TorchDynamo for TE activation checkpoint wrapper
#894
opened Jun 7, 2024 by
denera
Loading…
5 of 11 tasks
Add documentation for dot product attention
#889
opened Jun 4, 2024 by
cyanguwa
Loading…
2 of 4 tasks
Use unoptimized RMSNorm kernel if pointers are not aligned
bug
Something isn't working
#886
opened Jun 3, 2024 by
timmoon10
Loading…
4 of 11 tasks
[PyTorch] Add support for cuDNN FusedAttention + THD + CP
#885
opened Jun 3, 2024 by
xrennvidia
Loading…
6 tasks
[JAX] Splitting CPP extensions by category
#883
opened Jun 1, 2024 by
phu0ngng
Loading…
5 of 11 tasks
[JAX] Added unit tests for distributed LayernormMLP
#878
opened May 29, 2024 by
phu0ngng
Loading…
4 of 9 tasks
[PyTorch] Avoid select op in PyTorch extensions
enhancement
New feature or request
#865
opened May 24, 2024 by
timmoon10
Loading…
6 of 11 tasks
[Common/PyTorch] Grouped GEMM via multi-stream cuBLAS
#853
opened May 17, 2024 by
yaox12
Loading…
8 of 11 tasks
[JAX] Rewrite the Format of FP8 Meta and Remove unused ShardingTypes.
#842
opened May 13, 2024 by
mingxu1067
Loading…
8 of 11 tasks
[Pytorch] Implement fp32 accumulation for attention with context parallel in both forward and backward pass.
#821
opened Apr 28, 2024 by
Yuxin-CV
Loading…
[PyTorch] Fix minor bug in computing num_gqa_groups_per_partition
bug
Something isn't working
#777
opened Apr 13, 2024 by
knowlsie
Loading…
[C/PyTorch] Refactor and move userbuffers into TE/common
#760
opened Apr 8, 2024 by
denera
Loading…
7 of 13 tasks
[PyTorch] Sequential fuser
enhancement
New feature or request
#707
opened Mar 9, 2024 by
timmoon10
Loading…
2 of 6 tasks
Remove now useless padding as it is now down automatically.
jax
#680
opened Feb 25, 2024 by
nouiz
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.