Pull requests: pytorch/FBGEMM
Pull requests list
Add CUTLASS INT4 weight-only GEMM
cla signed
fb-exported
#2643
opened May 30, 2024 by
jiawenliu64
Create an abstract class for EmbeddingKVDB
cla signed
fb-exported
#2642
opened May 29, 2024 by
sryap
update cmake for dense TBE VBE support
cla signed
fb-exported
#2641
opened May 29, 2024 by
joshuadeng
Add a check on grid before launching cuda kernels
cla signed
fb-exported
#2639
opened May 29, 2024 by
jingsh
Add FP16 weight and output support
cla signed
fb-exported
#2638
opened May 29, 2024 by
sryap
Add VBE to Dense TBE frontend
cla signed
fb-exported
#2628
opened May 23, 2024 by
joshuadeng
[ROCm] enable experimental gen_ai build
cla signed
module: rocm
#2610
opened May 20, 2024 by
jeffdaily
all_to_one cuda support non-2d inputs
cla signed
fb-exported
#2575
opened May 9, 2024 by
IvanKobzarev
add max norm support to PARTIAL_ROWWISE_ADAM
cla signed
fb-exported
#2567
opened May 7, 2024 by
zainhuda
Pyre Configurationless migration for] [batch:9/28]
cla signed
fb-exported
#2557
opened May 3, 2024 by
connernilsen
Pyre Configurationless migration for] [batch:6/29]
cla signed
#2548
opened Apr 29, 2024 by
connernilsen
Integrate triton row and blockwise fp8 gemm to llm inference.
cla signed
fb-exported
#2547
opened Apr 29, 2024 by
choutim
Make CowClipDefinition and CounterBasedRegularizationDefinition hashable
cla signed
#2539
opened Apr 27, 2024 by
csmiler
Pyre Configurationless migration for] [batch:6/29]
cla signed
#2538
opened Apr 25, 2024 by
connernilsen