Understanding geometric attributes with autoencoders | Vidéo | Carmin.tv

00:00:00 / 00:00:00

Understanding geometric attributes with autoencoders

By Alasdair Newson

Appears in collection : 2019 - T1 - WS3 - Imaging and machine learning

Autoencoders are neural networks which project data to and from a lower dimensional latent space, the projection being learned via training on the data. While these networks produce impressive results, there is as yet little understanding of the internal mechanisms which allow autoencoders to produce such results. In this work, we aim to describe how an autoencoder is able to process certain fundamental image attributes. We analyse two of these attributes in particular : size and position. For the former, we study the case of binary images of disks, and describe the encoding and decoding processes, and in particular that the optimal decoder in the case of a network without biases can be described precisely. In the case of position, we describe how the encoder can extract the position of a Dirac impulse. Finally, we present ongoing work into an approach to create a PCA-like autoencoder, that is to say an autoencoder which presents similar characteristics to the PCA in terms of the interpretability of the latent space. We shall show preliminary experimental results on synthetic data.

Information about the video

Date of recording 03/04/2019
Date of publication 09/05/2019
Institution IHP
Licence CC BY-NC-ND
Language English
Format MP4
Venue Institut Henri Poincaré

Domain(s)

Computer Science

Last related questions on MathOverflow

You have to connect your Carmin.tv account with mathoverflow to add question

Ask a question on MathOverflow

Copyright Carmin.tv 2025

Give feedback