SIMAI 2025

Bi- and Multi-Level Optimization Strategies for Sparse and Interpretable Learning in NMF

Selicato, Laura (National Research Council (CNR) - Water Resea)
Ang, Andersen (University of Southamptopn)
Del Buono, Nicoletta (University of Bari Aldo Moro)
Esposito, Flavia (University of Bari Aldo Moro)
Zdunek, Rafal (Politechnika Wrocławska)

In session: MS001 - Advanced Numerical Methods and Machine Learning Techniques in Applied Science

Please login to view abstract download link

Learning algorithms often rely on effective hyperparameter tuning, particularly in penalized optimization problems where constraints aid in extracting interpretable, low-dimensional data representations. This work explores bi-level and multi-level optimization strategies for Nonnegative Matrix Factorization (NMF), a powerful tool for uncovering latent structures while enforcing nonnegativity. We introduce a family of novel algorithms that jointly address factorization and hyperparameter tuning across different divergence measures and domains. Our first contribution, AltBi, formulates Kullback-Leibler NMF as a bi-level optimization problem where penalty hyperparameters are adaptively updated through alternating minimization steps. This algorithm demonstrates enhanced interpretability and reconstruction performance on signal-domain datasets, as shown in [1]. Building on this, AltBi-J integrates a diversity-based penalty into the Frobenius norm framework, yielding solutions with significantly higher sparsity and better components disentanglement, particularly suited for structured data [2]. Extending our approach to the spectral domain, we propose SHINBO, a bi-level optimization algorithm for Itakura-Saito NMF. SHINBO introduces adaptive, row-wise penalty tuning that effectively isolates periodic components from noisy signals, with effective application to fault detection in mechanical systems [3]. Motivated by learning applications involving multiple constraints, we generalize bi-level methodology into a multi-level optimization paradigm capable of handling nested, nonconvex problems. This hierarchical perspective offers a robust theoretical and algorithmic foundation for advancing optimization-driven learning in structured and high-dimensional environments. This is a joint work with Flavia Esposito and Nicoletta Del Buono from the University of Bari Aldo Moro, Andersen Ang from the University of Southamptopn, and Rafal Zdunek from Politechnika Wrocławska.