-
Notifications
You must be signed in to change notification settings - Fork 48
Pull requests: intel/xFasterTransformer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Layers] Increased the threshold for enabling flashAttn
performance
performance related.
#428
opened Jun 3, 2024 by
abenmao
Loading…
[Kernel] Add GPU kernels.
enhancement
New feature or request
gpu
Related to GPU
#372
opened May 7, 2024 by
changqi1
Loading…
[Eval] Add eval test with opencompass.
benchmark
performance or accuracy benchmark
enhancement
New feature or request
Update AWQ GPTQ quantization guide
documentation
Improvements or additions to documentation
#306
opened Apr 10, 2024 by
miaojinc
Loading…
ProTip!
Mix and match filters to narrow down what you’re looking for.