Second Brain

Tag: quantization

4 items with this tag.

  • Mar 26, 2024

    The Unreasonable Ineffectiveness of the Deeper Layers

    • transformers
    • efficient_dl
    • pruning
    • quantization
  • Jan 01, 2024

    PyTorch Quantization for TensorRT

    • quantization
    • efficient_dl
  • Jun 01, 2023

    AWQ - Activation-aware Weight Quantization for LLM Compression and Acceleration

    • efficient_dl
    • quantization
  • Jun 07, 2017

    Training quantized nets - A deeper understanding

    • quantization

Created with Quartz v4.5.2 © 2025