Optimal vector quantization: from signal processing to clustering and numerical probability

By Gilles Pagès

Appears in collections: CEMRACS - Summer school: Numerical methods for stochastic models: control, uncertainty quantification, mean-field; Ecoles de recherche

Optimal vector quantization was originally introduced in signal processing as a discretization method for random signals, leading to an optimal trade-off between the speed of transmission and the quality of the transmitted signal. In machine learning, similar methods applied to a dataset are the historical core of the unsupervised classification methods known as "clustering". In both cases it appears as an optimal way to produce a set of weighted prototypes (or codebook) which makes up a kind of skeleton of a dataset, of a signal and, more generally, from a mathematical point of view, of a probability distribution. Quantization has encountered in recent years a renewed interest in various application fields such as automatic classification, learning algorithms, optimal stopping and stochastic control, backward SDEs and, more generally, numerical probability. In all these applications, practical implementations of such clustering/quantization methods more or less rely on two procedures (and their countless variants): Competitive Learning Vector Quantization (CLVQ), which appears as a stochastic gradient descent derived from the so-called distortion potential, and the (randomized) Lloyd's procedure (also known as the k-means algorithm, or nuées dynamiques), which is nothing but a fixed-point search procedure. Batch versions of these procedures can also be implemented when dealing with a dataset (or, more generally, a discrete distribution). More formally, if $\mu$ is a probability distribution on a Euclidean space $\mathbb{R}^d$, the optimal quantization problem at level $N$ boils down to exhibiting an $N$-tuple $(x_{1}^{*}, \dots, x_{N}^{*})$, solution to

$\mathrm{argmin}_{(x_1,\dots,x_N)\in(\mathbb{R}^d)^N} \int_{\mathbb{R}^d} \min_{1\le i\le N} |x_i-\xi|^2 \,\mu(d\xi)$

and its distribution, i.e. the weights $(\mu(C(x_{i}^{*})))_{1\le i\le N}$, where $(C(x_{i}^{*}))_{1\le i\le N}$ is a (Borel) partition of $\mathbb{R}^d$ satisfying

$C(x_{i}^{*})\subset \lbrace\xi\in\mathbb{R}^d : |x_{i}^{*} -\xi|\le \min_{1\le j\le N} |x_{j}^{*}-\xi|\rbrace$.
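
To make the objective above concrete, here is a minimal NumPy sketch (an illustration, not code from the talk; the function names and the finite-sample setting are my own) of the distortion being minimized and of the Voronoi weights $\mu(C(x_{i}^{*}))$ for an empirical sample:

```python
import numpy as np

def distortion(x, xi):
    """Level-N distortion: mean squared distance from the samples
    xi (shape (n, d)) to their nearest prototype among x (shape (N, d))."""
    d2 = ((xi[:, None, :] - x[None, :, :]) ** 2).sum(axis=-1)  # (n, N)
    return d2.min(axis=1).mean()

def voronoi_weights(x, xi):
    """Empirical weights mu(C(x_i)): the fraction of samples falling
    into each Voronoi cell (ties broken by argmin)."""
    d2 = ((xi[:, None, :] - x[None, :, :]) ** 2).sum(axis=-1)
    idx = d2.argmin(axis=1)                      # nearest-prototype index
    return np.bincount(idx, minlength=len(x)) / len(xi)
```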

To produce an unsupervised classification (or clustering) of a (large) dataset $(\xi_k)_{1\le k\le n}$, one considers its empirical measure

$\mu=\frac{1}{n}\sum_{k=1}^{n}\delta_{\xi_k}$

whereas in numerical probability $\mu = \mathcal{L}(X)$, where $X$ is an $\mathbb{R}^d$-valued simulatable random vector. In both situations, the CLVQ and Lloyd's procedures rely on massive sampling of the distribution $\mu$. As for clustering, the classification into $N$ clusters is produced by the partition of the dataset induced by the Voronoi cells $C(x_{i}^{*})$, $i = 1, \dots, N$, of the optimal quantizer. In the second case, which is of interest for solving nonlinear problems like optimal stopping problems (variational inequalities in terms of PDEs) or stochastic control problems (HJB equations) in medium dimensions, the idea is to produce a quantization tree optimally fitting the dynamics of (a time discretization of) the underlying structure process. We will (briefly) explore this vast panorama, with a focus on the algorithmic aspects, where a few theoretical results coexist with many heuristics in a burgeoning literature. We will present a few simulations in two dimensions.
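
As a rough illustration of the two procedures named above, the following hedged NumPy sketch implements one CLVQ run and one Lloyd iteration; the names `clvq` and `lloyd_step`, the `sampler` interface and the step-size schedule `gamma` are assumptions made for the example, not specifications from the talk. CLVQ moves only the "winning" prototype toward each freshly sampled point, whereas a Lloyd iteration replaces every prototype by the mean of its Voronoi cell:

```python
import numpy as np

def clvq(sampler, x0, n_steps, gamma=lambda k: 1.0 / (k + 100)):
    """Competitive Learning Vector Quantization: a stochastic gradient
    descent on the distortion, moving only the winning prototype."""
    x = np.array(x0, dtype=float)
    for k in range(n_steps):
        xi = sampler()                              # draw xi ~ mu
        i = ((x - xi) ** 2).sum(axis=1).argmin()    # winning prototype
        x[i] -= gamma(k) * (x[i] - xi)              # local gradient step
    return x

def lloyd_step(x, xi):
    """One (randomized) Lloyd / k-means iteration on a sample xi from mu:
    each prototype is replaced by the mean of its Voronoi cell."""
    d2 = ((xi[:, None, :] - x[None, :, :]) ** 2).sum(axis=-1)
    idx = d2.argmin(axis=1)
    new_x = x.copy()
    for i in range(len(x)):
        cell = xi[idx == i]
        if len(cell):                               # leave empty cells unchanged
            new_x[i] = cell.mean(axis=0)
    return new_x

# Usage on a 2-D Gaussian, echoing the two-dimensional simulations mentioned:
rng = np.random.default_rng(0)
x = clvq(lambda: rng.standard_normal(2), rng.standard_normal((20, 2)), 50_000)
```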

Information about the video

Citation data

  • DOI 10.24350/CIRM.V.19199603
  • Cite this video Pagès, Gilles (19/07/2017). Optimal vector quantization: from signal processing to clustering and numerical probability. CIRM. Audiovisual resource. DOI: 10.24350/CIRM.V.19199603
  • URL https://dx.doi.org/10.24350/CIRM.V.19199603
