About 291,000 results
Open links in new tab
  1. Quantization (signal processing) - Wikipedia

    In mathematics and digital signal processing, quantization is the process of mapping input values from a large set (often a continuous set) to output values in a (countable) smaller set, often …

  2. What is Quantization - GeeksforGeeks

    Nov 6, 2025 · Quantization is a model optimization technique that reduces the precision of numerical values such as weights and activations in models to make them faster and more …

  3. Model Quantization: Concepts, Methods, and Why It Matters

    1 day ago · Quantization reduces the precision of model parameters and activations (for example, from FP32/FP16 to FP8) to shrink memory footprint, improve inference speed, and lower …

  4. What Is Quantization? | How It Works & Applications

    Quantization is the process of mapping continuous infinite values to a smaller set of discrete finite values. In the context of simulation and embedded computing, it is about approximating real …

  5. Quantization Explained: Why the Same LLM Gives Better Results ...

    Nov 11, 2025 · Quantization: the same model with lower precision A quantized model is the exact same model — with the same architecture and the same number of parameters — stored and …

  6. What is quantization in machine learning? - Cloudflare

    What is quantization in machine learning? Quantization is a technique for lightening the load of executing machine learning and artificial intelligence (AI) models. It aims to reduce the …

  7. What Is Quantization? Optimizing Data Compression - Coursera

    Oct 16, 2025 · Quantization converts high-precision data into lower-precision data by compressing it to reduce data loss. By optimizing quantization, you can reduce your model's …