pytorch / FBGEMM Public

Fork 426
Star 1.1k

Pull requests: pytorch/FBGEMM

286 Open 2,213 Closed

Pull requests list

Add CUTLASS INT4 weight-only GEMM (labels: cla signed, fb-exported)

#2643 opened May 30, 2024 by jiawenliu64

Create an abstract class for EmbeddingKVDB (labels: cla signed, fb-exported)

#2642 opened May 29, 2024 by sryap

update cmake for dense TBE VBE support (labels: cla signed, fb-exported)

#2641 opened May 29, 2024 by joshuadeng

Add feature table map support (labels: cla signed, fb-exported)

#2640 opened May 29, 2024 by sryap

Add a check on grid before launching cuda kernels (labels: cla signed, fb-exported)

#2639 opened May 29, 2024 by jingsh

Add FP16 weight and output support (labels: cla signed, fb-exported)

#2638 opened May 29, 2024 by sryap

Add -lrt to asmjit bazel build (labels: cla signed)

#2634 opened May 25, 2024 by cyyever

[fbgemm_gpu] Enable NCCL code (labels: cla signed)

#2631 opened May 24, 2024 by q10

Add VBE to Dense TBE frontend (labels: cla signed, fb-exported)

#2628 opened May 23, 2024 by joshuadeng

[ROCm] enable experimental gen_ai build (labels: cla signed, module: rocm)

#2610 opened May 20, 2024 by jeffdaily

FP32 Autovec Final Optimization (labels: cla signed)

#2586 opened May 13, 2024 by crystalrchen

all_to_one cuda support non-2d inputs (labels: cla signed, fb-exported)

#2575 opened May 9, 2024 by IvanKobzarev

add max norm support to PARTIAL_ROWWISE_ADAM (labels: cla signed, fb-exported)

#2567 opened May 7, 2024 by zainhuda

Pyre Configurationless migration for] [batch:9/28] (labels: cla signed, fb-exported)

#2557 opened May 3, 2024 by connernilsen

Fp8 updated (labels: cla signed)

#2550 opened Apr 30, 2024 by elopez0409

Change the caller (labels: cla signed)

#2549 opened Apr 30, 2024 by jianyuh

Pyre Configurationless migration for] [batch:6/29] (labels: cla signed)

#2548 opened Apr 29, 2024 by connernilsen

Integrate triton row and blockwise fp8 gemm to llm inference. (labels: cla signed, fb-exported)

#2547 opened Apr 29, 2024 by choutim

Add fp8 row/block-wise scaled GEMMs (labels: cla signed)

#2546 opened Apr 29, 2024 by choutim

Revert D56685840: Multisect successfully blamed "D56685840: [fbgemm] Change model transform fp8 linear op to fbgemm quantize ops" for one test failure (labels: cla signed)

#2545 opened Apr 29, 2024 by jianyuh

Refactor fbgemm / llama csrc code base (labels: cla signed)

#2544 opened Apr 29, 2024 by jianyuh

Make CowClipDefinition and CounterBasedRegularizationDefinition hashable (labels: cla signed)

#2539 opened Apr 27, 2024 by csmiler

Pyre Configurationless migration for] [batch:6/29] (labels: cla signed)

#2538 opened Apr 25, 2024 by connernilsen

Fix hip imports in fbgemm (labels: cla signed)

#2536 opened Apr 25, 2024 by xw285cornell

FP32 Autovec Optimization (labels: cla signed)

#2535 opened Apr 24, 2024 by NathanielIskandar
