Mathematical Methods of Modern Statistics 2 / Méthodes mathématiques en statistiques modernes 2

Collection Mathematical Methods of Modern Statistics 2 / Méthodes mathématiques en statistiques modernes 2

Organizer(s) Bogdan, Malgorzata ; Graczyk, Piotr ; Panloup, Fabien ; Proïa, Frédéric ; Roquain, Etienne

Date(s) 15/06/2020 - 19/06/2020

linked URL https://www.cirm-math.com/cirm-virtual-event-2146.html

The price of competition: effect size heterogeneity matters in high dimensions!

In high-dimensional regression, the number of explanatory variables with nonzero effects, often referred to as sparsity, is an important measure of the difficulty of the variable selection problem. As a complement to sparsity, this paper introduces a new measure, termed effect size heterogeneity, for a finer-grained understanding of the trade-off between type I and type II errors or, equivalently, between false and true positive rates when using the Lasso. Roughly speaking, a regression coefficient vector has higher effect size heterogeneity than another vector (of the same sparsity) if the nonzero entries of the former are more heterogeneous in magnitude than those of the latter. From the perspective of this new measure, we prove that, in a regime of linear sparsity, false and true positive rates achieve the optimal trade-off uniformly along the Lasso path when this measure is maximal, in the sense that all nonzero effect sizes have very different magnitudes, and that the worst-case trade-off is achieved when it is minimal, in the sense that all nonzero effect sizes are about equal. Moreover, we demonstrate that the Lasso path produces an optimal ranking of explanatory variables, in terms of the rank of the first false variable, when the effect size heterogeneity is maximal, and vice versa. Metaphorically, these two findings suggest that variables with comparable effect sizes, no matter how large they are, compete with each other along the Lasso path, making the variable selection problem harder. Our proofs use techniques from approximate message passing theory as well as a novel argument for estimating the rank of the first false variable.
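The quantities named in the abstract (true positive rate, false positive rate, and the rank of the first false variable) can be illustrated with a minimal sketch. It is not taken from the talk: the function names `path_metrics` and `first_false_rank`, the toy entry orders, and the exact conventions are assumptions. In particular, it assumes the true positive proportion is the number of selected true variables divided by the sparsity, the false discovery proportion is the number of selected false variables divided by the number selected, and the rank is the 1-indexed position at which the first false variable enters the path.

```python
from typing import List, Sequence, Set, Tuple


def path_metrics(entry_order: Sequence[int],
                 true_support: Set[int]) -> List[Tuple[float, float]]:
    """Return (TPP, FDP) after each variable enters, in order of entry.

    entry_order is a hypothetical input: the indices of variables in the
    order they become active along a selection path (for example a Lasso path).
    """
    sparsity = max(len(true_support), 1)  # guard against an empty support
    metrics = []
    true_hits = 0
    for n_selected, j in enumerate(entry_order, start=1):
        if j in true_support:
            true_hits += 1
        tpp = true_hits / sparsity                   # true positive proportion
        fdp = (n_selected - true_hits) / n_selected  # false discovery proportion
        metrics.append((tpp, fdp))
    return metrics


def first_false_rank(entry_order: Sequence[int], true_support: Set[int]) -> int:
    """1-indexed position at which the first false variable enters.

    Assumed convention: returns len(entry_order) + 1 if no false variable
    enters at all.
    """
    for rank, j in enumerate(entry_order, start=1):
        if j not in true_support:
            return rank
    return len(entry_order) + 1


# Toy illustration with made-up entry orders (not simulation results).
support = {0, 1, 2, 3}
clean_order = [0, 1, 2, 3, 7, 9]   # all true variables enter first
mixed_order = [0, 7, 1, 9, 2, 3]   # false variables interleave early

print(first_false_rank(clean_order, support))  # 5
print(first_false_rank(mixed_order, support))  # 2
print(path_metrics(mixed_order, support)[1])   # (0.25, 0.5)
```

In this toy example the first ordering is the more favourable one: every true variable enters before any false one, so the first false variable has the largest possible rank and the false discovery proportion stays at zero until full power is reached.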

Information about the video

Date of recording 02/06/2020
Date of publication 15/06/2020
Institution CIRM
Licence CC BY NC ND
Language English
Audience Researchers
Director(s) Guillaume Hennenfent
Format MP4

Citation data

DOI 10.24350/CIRM.V.19644303
Cite this video Wang, Hua (02/06/2020). The price of competition: effect size heterogeneity matters in high dimensions!. CIRM. Audiovisual resource. DOI: 10.24350/CIRM.V.19644303
URL https://dx.doi.org/10.24350/CIRM.V.19644303

Document(s)

https://www.cirm-math.fr/RepOrga/2146/Slides/Wang.pdf

All the collection videos

01:08:34

published on June 15, 2020

Consistent model selection criteria and goodness-of-fit test for common time series models

By Jean-Marc Bardet

01:01:17

published on June 15, 2020

Isotonic Distributional Regression (IDR) - leveraging monotonicity, uniquely so!

By Tilmann Gneiting

57:07

published on June 15, 2020

Quasi logistic distributions and Gaussian scale mixing

By Gerard Letac

35:09

published on June 15, 2020

Scaling of scoring rules

By Jonas Wallin

34:07

published on June 15, 2020

The price of competition: effect size heterogeneity matters in high dimensions!

By Hua Wang

36:40

published on June 15, 2020

How to estimate a density on a spider web?

By Dominique Picard

39:06

published on June 15, 2020

High-dimensional classification by sparse logistic regression

By Felix Abramovich

38:07

published on June 15, 2020

Optimal control of false discovery criteria in the general two-group model

By Ruth Heller

32:25

published on June 15, 2020

Sparse multiple testing: can one estimate the null distribution?

By Etienne Roquain

50:46

published on June 15, 2020

Hierarchical Bayes modeling for large-scale inference

By Daniel Yekutieli

29:01

published on June 15, 2020

On Cholesky structures on real symmetric matrices and their applications

By Hideyuki Ishi

29:04

published on June 15, 2020

Treatment effect estimation with missing attributes

By Julie Josse

56:18

published on June 15, 2020

Knockoff genotypes: value in counterfeit

By Chiara Sabatti

50:05

published on June 15, 2020

The smoothed multivariate square-root Lasso: an optimization lens on concomitant estimation

By Joseph Salmon

39:43

published on June 15, 2020

High-dimensional, multiscale online changepoint detection

By Richard Samworth

46:32

published on June 15, 2020

De-biasing arbitrary convex regularizers and asymptotic normality

By Pierre C. Bellec

48:44

published on June 15, 2020

Floodgate: inference for model-free variable importance

By Lucas Janson

15:53

published on June 15, 2020

Shrinkage estimation of mean for complex multivariate normal distribution with unknown covariance when p > n

By Yoshihiko Konno

42:35

published on June 15, 2020

Structure learning for CTBN's

By Błażej Miasojedow

34:30

published on June 15, 2020

Universal inference using the split likelihood ratio test

By Aaditya K. Ramdas

46:04

published on June 15, 2020

Optimal and maximin procedures for multiple testing problems

By Saharon Rosset

34:46

published on June 15, 2020

Post hoc bounds on false positives using reference families

By Pierre Neuvial

38:00

published on June 15, 2020

Change: detection, estimation, segmentation

By David Siegmund

32:29

published on June 15, 2020

Bayesian spatial adaptation

By Veronika Rockova

54:34

published on June 15, 2020

Experimenting in equilibrium

By Stefan Wager
