TR2001-41

Quantization-based language model compression


    •  Edward Whittaker, "Quantization-based language model compression", Tech. Rep. TR2001-41, Mitsubishi Electric Research Laboratories, Cambridge, MA, December 2001.
      BibTeX:

      @techreport{MERL_TR2001-41,
        author      = {Edward Whittaker},
        title       = {Quantization-based language model compression},
        institution = {MERL - Mitsubishi Electric Research Laboratories},
        address     = {Cambridge, MA 02139},
        number      = {TR2001-41},
        month       = dec,
        year        = 2001,
        url         = {https://www.merl.com/publications/TR2001-41/}
      }
  • Research Area: Speech & Audio

Abstract:

This paper describes two techniques for reducing the size of statistical back-off N-gram language models in computer memory. Compression is achieved through a combination of quantizing the language model probabilities and back-off weights, and pruning parameters that are determined to be unnecessary after quantization. The recognition performance of the original and compressed language models is evaluated across three different language models and two different recognition tasks. The results show that the language models can be compressed by up to 60% of their original size with no significant loss in recognition performance. Moreover, the techniques described provide a principled method for compressing language models further while minimising the degradation in recognition performance.
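
To make the general idea concrete, below is a minimal Python sketch of quantization followed by post-quantization pruning. The uniform 256-level binning, the bigram-only pruning rule, and the function names are illustrative assumptions for clarity, not the exact algorithm of TR2001-41.

    # Illustrative sketch: quantize log-probabilities and back-off weights
    # into a small codebook, then prune entries that become redundant.
    # Uniform binning and the bigram-level criterion are assumptions,
    # not the report's exact method.

    def build_codebook(values, n_bins=256):
        """Uniformly bin a list of log-probabilities (or back-off weights);
        return the codebook of bin centres and a quantizer function."""
        lo, hi = min(values), max(values)
        step = (hi - lo) / n_bins if hi > lo else 1.0
        codebook = [lo + (i + 0.5) * step for i in range(n_bins)]

        def quantize(v):
            # Map a value to its bin index; store the index and decode
            # via codebook[index] at lookup time.
            return min(int((v - lo) / step), n_bins - 1)

        return codebook, quantize

    def prune_after_quantization(bigrams, unigrams, backoffs, quantize):
        """Drop bigram entries whose quantized log-probability equals the
        quantized value the back-off path would give anyway, so removing
        them costs nothing at the quantized resolution."""
        kept = {}
        for (w1, w2), logp in bigrams.items():
            backed_off = backoffs.get(w1, 0.0) + unigrams[w2]
            if quantize(logp) != quantize(backed_off):
                kept[(w1, w2)] = quantize(logp)
        return kept

Under these assumptions, the memory saving comes from storing 8-bit codebook indices in place of full-precision floats and from discarding the N-gram entries that the pruning step identifies as redundant.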