Second Brain
Search
Search
Dark mode
Light mode
Explorer
Tag: quantization
4 items with this tag.
Mar 26, 2024
The Unreasonable Ineffectiveness of the Deeper Layers
transformers
efficient_dl
pruning
quantization
Jan 01, 2024
PyTorch Quantization for TensorRT
quantization
efficient_dl
Jun 01, 2023
AWQ - Activation-aware Weight Quantization for LLM Compression and Acceleration
efficient_dl
quantization
Jun 07, 2017
Training quantized nets - A deeper understanding
quantization