awq

Here are 7 public repositories matching this topic...

intel / neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

sparsity pruning quantization knowledge-distillation auto-tuning int8 low-precision quantization-aware-training post-training-quantization awq int4 large-language-models gptq smoothquant sparsegpt fp4 mxformat

Updated May 28, 2024
Python

modelscope / swift

ms-swift: Use PEFT or full-parameter training to fine-tune 200+ LLMs or 15+ MLLMs

agent deploy llama lora finetune peft multimodal sft dpo pre-training awq llm modelscope llava qwen galore unsloth llama3 pissa

Updated May 29, 2024
Python

intel / auto-round

SOTA weight-only quantization algorithm for LLMs. This is the official implementation of "Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs"

rounding quantization awq int4 gptq neural-compressor weight-only

Updated May 29, 2024
Python

GURPREETKAURJETHRA / Quantize-LLM-using-AWQ

Quantize LLM using AWQ

quantize awq large-language-models llms generative-ai llm-training

Updated Apr 26, 2024
Jupyter Notebook

glurp / rfilter

Programmable filter, like POSIX awk, with Ruby syntax and embeddable functions

ruby bash filter plotting awq

Updated Apr 25, 2022
Ruby

vpgits / sdgp-ml

This repository contains notebooks and resources related to the Software Development Group Project (SDGP) machine learning component. Specifically, it includes two notebooks used for creating a dataset and fine-tuning a Mistral-7B-v0.1-Instruct model.

machine-learning transformers pytorch peft awq qlora autoawq

Updated Mar 21, 2024
Jupyter Notebook

FireStrike1010 / artificial_personality

Artificial Personality is a text2text AI chatbot that can use character cards

ai chatbot transformers neural-networks chatbot-framework awq tavernai

Updated May 28, 2024
Python
