Poster
Yarden Cohen · Alexandre Wu Navarro · Jes Frellsen · Richard Turner · Raziel Riemer · Ari Pakman
[ Hall A-E ]
Abstract
The need for regression models to predict circular values arises in many scientific fields. In this work we explore a family of expressive and interpretable distributions over circle-valued random functions related to Gaussian processes targeting two Euclidean dimensions conditioned on the unit circle. The probability model has connections with continuous spin models in statistical physics. Moreover, its density is very simple and has maximum entropy, unlike previous Gaussian process-based approaches, which use wrapping or radial marginalization. For posterior inference, we introduce a new Stratonovich-like augmentation that lends itself to fast Gibbs sampling. We argue that transductive learning in these models favors a Bayesian approach to the parameters and apply our sampling scheme to the Double Metropolis-Hastings algorithm. We present experiments applying this model to the prediction of (i) wind directions and (ii) the percentage of the running gait cycle as a function of joint angles.
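A minimal sketch of the general idea of circle-valued random functions derived from a two-dimensional Gaussian process (illustration only; the paper's model conditions the two Euclidean dimensions on the unit circle rather than projecting samples onto it):

```python
# Illustrative sketch only: draw two independent GP components over an input grid
# and map each 2D value to an angle, giving a circle-valued sample path.
import numpy as np

def rbf_kernel(x, lengthscale=0.3):
    d = x[:, None] - x[None, :]
    return np.exp(-0.5 * (d / lengthscale) ** 2)

rng = np.random.default_rng(0)
x = np.linspace(0.0, 1.0, 200)
K = rbf_kernel(x) + 1e-8 * np.eye(len(x))
L = np.linalg.cholesky(K)

u = L @ rng.standard_normal(len(x))   # first Euclidean GP component
v = L @ rng.standard_normal(len(x))   # second Euclidean GP component
theta = np.arctan2(v, u)              # circle-valued function values in (-pi, pi]
print(theta[:5])
```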
Poster
Tong Yang · Jincheng Mei · Hanjun Dai · Zixin Wen · Shicong Cen · Dale Schuurmans · Yuejie Chi · Bo Dai
[ Hall A-E ]
Abstract
Recent advances in aligning large language models with human preferences have corroborated the growing importance of best-of-$N$ distillation (BOND). However, the iterative BOND algorithm is prohibitively expensive in practice due to its sample and computational inefficiency. This paper addresses the problem by revealing a unified game-theoretic connection between iterative BOND and self-play alignment, which unifies seemingly disparate algorithmic paradigms. Based on this connection, we establish a novel framework, \textbf{WIN} rate \textbf{D}ominance~(WIND), with a series of efficient algorithms for regularized win rate dominance optimization that approximates iterative BOND in the parameter space. We provide a provable sample efficiency guarantee for one of the WIND variants with the square loss objective. The experimental results confirm that our algorithm not only accelerates the computation, but also achieves superior sample efficiency compared to existing methods.
Poster
Emanuele Troiani · Yatin Dandi · Leonardo Defilippis · Lenka Zdeborova · Bruno Loureiro · Florent Krzakala
[ Hall A-E ]
Abstract
Multi-index models - functions which only depend on the covariates through a non-linear transformation of their projection on a subspace - are a useful benchmark for investigating feature learning with neural networks. This paper examines the theoretical boundaries of efficient learnability in this hypothesis class, focusing particularly on the minimum sample complexity required for weakly recovering their low-dimensional structure with first-order iterative algorithms, in the high-dimensional regime where the number of samples $n=\alpha d$ is proportional to the covariate dimension $d$. Our findings unfold in three parts: (i) first, we identify under which conditions a \textit{trivial subspace} can be learned with a single step of a first-order algorithm for any $\alpha>0$; (ii) second, in the case where the trivial subspace is empty, we provide necessary and sufficient conditions for the existence of an {\it easy subspace} consisting of directions that can be learned only above a certain sample complexity $\alpha>\alpha_c$. The critical threshold $\alpha_{c}$ marks the presence of a computational phase transition, in the sense that it is conjectured that no efficient iterative algorithm can succeed for $\alpha<\alpha_c$. In a limited but interesting set of really hard directions - akin to the parity problem - $\alpha_c$ is found …
Poster
Alex Kulesza · Ananda Theertha Suresh · Yuyan Wang
[ Hall A-E ]
Abstract
We derive the optimal differentially private additive noise mechanism for queries in $\mathbb{R}^d$ when sensitivity and error are defined by an arbitrary norm $||\cdot||_K$. The optimal mechanism is a generalization of the staircase mechanism, which is known to be optimal under the $\ell_1$ norm when $d \leq 2$; we extend the mechanism and its guarantee to arbitrary norms and dimensions, proving a conjecture of Geng et al. [2015] along the way. The generalized staircase mechanism we derive can be viewed as a refinement of the $K$-norm mechanism of Hardt and Talwar [2010], with improvements particularly evident in the low-privacy regime as $\epsilon \to \infty$. We show how to implement the generalized staircase mechanism efficiently, given an efficient algorithm for sampling the unit $K$-norm ball, and demonstrate that it significantly reduces error in realistic settings, including under non-standard norms that are common in practice, and across a range of error metrics.
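For context, the following sketch implements the classical $K$-norm mechanism of Hardt and Talwar that the generalized staircase mechanism refines, in the special case where $K$ is the Euclidean unit ball so that sampling is straightforward; the staircase refinement itself is not shown.

```python
# Sketch of the classical K-norm mechanism in the l2 case: noise density
# proportional to exp(-eps * ||z||_2 / Delta), sampled as a uniform direction
# times a Gamma-distributed radius.
import numpy as np

def l2_knorm_noise(d, eps, sensitivity=1.0, rng=None):
    rng = rng if rng is not None else np.random.default_rng()
    direction = rng.standard_normal(d)
    direction /= np.linalg.norm(direction)
    # radial density r^{d-1} exp(-eps * r / Delta)  ->  Gamma(shape=d, scale=Delta/eps)
    radius = rng.gamma(shape=d, scale=sensitivity / eps)
    return radius * direction

query_answer = np.array([3.0, -1.0, 0.5])
private_answer = query_answer + l2_knorm_noise(d=3, eps=1.0)
print(private_answer)
```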
Poster
Yang Yang · Kai Zhang · Ping-Shou Zhong
[ Hall A-E ]
Abstract
This paper focuses on testing conditional independence between two random variables ($X$ and $Y$) given a set of high-dimensional confounding variables ($Z$). The high dimensionality of these confounding variables presents a challenge, often resulting in inflated type-I errors or insufficient power in many existing tests. To address this issue, we leverage the power of Deep Neural Networks (DNNs) to handle complex, high-dimensional data while mitigating the curse of dimensionality. We propose a novel test procedure, DeepBET. First, a DNN is used on part of the data to estimate the conditional means of $X$ and $Y$ given $Z$. Then, binary expansion testing (BET) is applied to the prediction errors from the remaining data. Additionally, we implement a multiple-split procedure to further enhance the power of the test. DeepBET is computationally efficient and robust to the tuning parameters in DNNs. Interestingly, the DeepBET statistic converges at a root-$n$ rate despite the nonparametric and high-dimensional nature of the confounding effects. Our numerical results demonstrate that the proposed method controls type-I error under various scenarios and enhances both power and interpretability for conditional dependence when present, making it a robust alternative for testing conditional independence in high-dimensional settings. When applied to dry eye disease …
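A rough illustration of the split-and-test idea (not the full DeepBET procedure): regress $X$ and $Y$ on $Z$ with a small network on one half of the data, then apply a crude depth-one sign test, a simplistic stand-in for BET, to the residuals on the other half.

```python
# Rough illustration only: sample splitting, neural regression of X and Y on Z,
# then a depth-one sign-based independence test on the residuals.
import numpy as np
from scipy.stats import chi2_contingency
from sklearn.neural_network import MLPRegressor

rng = np.random.default_rng(0)
n, p = 2000, 20
Z = rng.standard_normal((n, p))
X = Z[:, 0] + 0.5 * rng.standard_normal(n)
Y = Z[:, 0] - Z[:, 1] + 0.5 * rng.standard_normal(n)   # X independent of Y given Z

fit, test = slice(0, n // 2), slice(n // 2, n)
mx = MLPRegressor(hidden_layer_sizes=(64,), max_iter=500, random_state=0).fit(Z[fit], X[fit])
my = MLPRegressor(hidden_layer_sizes=(64,), max_iter=500, random_state=0).fit(Z[fit], Y[fit])
rx = X[test] - mx.predict(Z[test])
ry = Y[test] - my.predict(Z[test])

# 2x2 contingency table of residual signs: a crude stand-in for binary expansion testing
table = np.histogram2d(rx > np.median(rx), ry > np.median(ry), bins=2)[0]
print("p-value:", chi2_contingency(table)[1])
```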
Poster
Ferdinando Fioretto · Diptangshu Sen · Juba Ziani
[ Hall A-E ]
Abstract
Networks in sectors like telecommunications and transportation often contain sensitive user data, requiring privacy-enhancing technologies during data release to ensure privacy. While Differential Privacy (DP) is recognized as the leading standard for privacy preservation, its use comes with new challenges, as the noise added for privacy introduces inaccuracies or biases. DP techniques have also been found to distribute these biases disproportionately across different populations, inducing fairness issues. This paper investigates the effects of DP on bias and fairness when releasing network edge weights. We specifically examine how these privacy measures affect decision-making tasks, such as computing shortest paths, which are crucial for routing in transportation and communications networks, and provide both theoretical insights and empirical evidence on the inherent trade-offs between privacy, accuracy, and fairness for network data release.
Poster
Yucheng Liu · Xiaodong Li
[ Hall A-E ]
Abstract
We investigate how to select the number of communities for weighted networks without full likelihood modeling. First, we propose a novel weighted degree-corrected stochastic block model (DCSBM), where the mean adjacency matrix is modeled in the same way as in the standard DCSBM, while the variance profile matrix is assumed to be related to the mean adjacency matrix through a given variance function. Our method of selecting the number of communities is based on a sequential testing framework. In each step, the weighted DCSBM is fitted via some spectral clustering method. A key component of our method is matrix scaling on the estimated variance profile matrix. The resulting scaling factors can be used to normalize the adjacency matrix, from which the test statistic is then obtained. Under mild conditions on the weighted DCSBM, our proposed procedure is shown to be consistent in estimating the true number of communities. Numerical experiments on both simulated and real-world network data demonstrate the desirable empirical properties of our method.
Poster
Sangil Han · Kyoowon Kim · Sungkyu Jung
[ Hall A-E ]
Abstract
In this paper, we explore the theoretical properties of subspace recovery using Winsorized Principal Component Analysis (WPCA), utilizing a common data transformation technique that caps extreme values to mitigate the impact of outliers. Despite the widespread use of winsorization in various tasks of multivariate analysis, its theoretical properties, particularly for subspace recovery, have received limited attention. We provide a detailed analysis of the accuracy of WPCA, showing that increasing the number of samples while decreasing the proportion of outliers guarantees the consistency of the sample subspaces from WPCA with respect to the true population subspace. Furthermore, we establish perturbation bounds that ensure the WPCA subspace obtained from contaminated data remains close to the subspace recovered from pure data. Additionally, we extend the classical notion of breakdown points to subspace-valued statistics and derive lower bounds for the breakdown points of WPCA. Our analysis demonstrates that WPCA exhibits strong robustness to outliers while maintaining consistency under mild assumptions. A toy example is provided to numerically illustrate the behavior of the upper bounds for perturbation bounds and breakdown points, emphasizing winsorization's utility in subspace recovery.
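A minimal sketch of the winsorize-then-PCA pipeline, here with coordinate-wise quantile capping as one common winsorization variant (the paper's exact scheme may differ):

```python
# Minimal sketch of Winsorized PCA: cap extreme values coordinate-wise at given
# quantiles, then run standard PCA on the transformed data.
import numpy as np

def winsorized_pca(X, lower=0.05, upper=0.95, n_components=2):
    lo = np.quantile(X, lower, axis=0)
    hi = np.quantile(X, upper, axis=0)
    Xw = np.clip(X, lo, hi)                      # winsorization step
    Xc = Xw - Xw.mean(axis=0)
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Vt[:n_components]                     # estimated principal subspace

rng = np.random.default_rng(0)
X = rng.standard_normal((500, 10))
X[:10] *= 50.0                                   # a few gross outliers
print(winsorized_pca(X).shape)
```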
Poster
Mathieu Besançon · Sebastian Pokutta · Elias Wirth
[ Hall A-E ]
Abstract
We propose the pivoting meta algorithm (PM) to enhance optimization algorithms that generate iterates as convex combinations of vertices of a feasible region $C\subseteq \mathbb{R}^n$, including Frank-Wolfe (FW) variants. PM guarantees that the active set (the set of vertices in the convex combination) of the modified algorithm remains as small as $\dim(C)+1$, as stipulated by Carathéodory's theorem. PM achieves this by reformulating the active set expansion task into an equivalent linear program, which can be efficiently solved using a single pivot step akin to the primal simplex algorithm; the convergence rates of the original algorithms are maintained. Furthermore, we establish the connection between PM and active set identification, in particular showing under mild assumptions that PM applied to the away-step Frank-Wolfe algorithm or the blended pairwise Frank-Wolfe algorithm bounds the active set size by the dimension of the optimal face plus $1$. We provide numerical experiments to illustrate practicality and efficacy on active set size reduction.
Poster
Haolin Zou · Arnab Auddy · Kamiar Rahnama Rad · Arian Maleki
[ Hall A-E ]
Abstract
Despite a large and significant body of recent work focusing on the hyperparameter tuning of regularized models in the high dimensional regime, a theoretical understanding of this problem for non-differentiable penalties such as generalized LASSO and nuclear norm is missing. In this paper we resolve this challenge. We study the hyperparameter tuning problem in the proportional high dimensional regime where both the sample size $n$ and number of features $p$ are large, and $n/p$ and the signal-to-noise ratio (per observation) remain finite. To achieve this goal, we first provide finite-sample upper bounds on the expected squared error of leave-one-out cross-validation (LO) in estimating the out-of-sample risk. Building on this result, we establish the consistency of the hyperparameter tuning method that is based on minimizing LO's estimate. Our simulation results confirm the accuracy and sharpness of our theoretical results.
Poster
Arkapal Panda · Utpal Garain
[ Hall A-E ]
Abstract
A key challenge in calibrating Multi-Label Classification (MLC) problems is to consider the interdependencies among labels. To address this, in this research we propose an unbiased, differentiable, trainable calibration error estimator for MLC problems by using Copula. Unlike other methods for calibrating MLC tasks that focus on marginal calibration, this novel estimator takes label interdependencies into account and enables us to tackle the strictest notion of calibration, that is, canonical calibration. To design the estimator, we begin by leveraging the kernel trick to construct a continuous distribution from the discrete label space. Then we take a semiparametric approach to construct the estimator, where the marginals are modeled non-parametrically and the Copula is modeled parametrically. Theoretically, we show that our estimator is unbiased and converges to the true $L^p$ calibration error. We also use our estimator as a regularizer at the time of training and observe that it reduces calibration error on test datasets significantly. Experiments on a well-established dataset endorse our claims.
Poster
Chanwoo Chun · SueYeon Chung · Daniel Lee
[ Hall A-E ]
Abstract
Analyzing the structure of sampled features from an input data distribution is challenging when constrained by limited measurements in both the number of inputs and features. Traditional approaches often rely on the eigenvalue spectrum of the sample covariance matrix derived from finite measurement matrices; however, these spectra are sensitive to the size of the measurement matrix, leading to biased insights. In this paper, we introduce a novel algorithm that provides unbiased estimates of the spectral moments of the kernel integral operator in the limit of infinite inputs and features from finitely sampled measurement matrices. Our method, based on dynamic programming, is efficient and capable of estimating the moments of the operator spectrum. We demonstrate the accuracy of our estimator on radial basis function (RBF) kernels, highlighting its consistency with the theoretical spectra. Furthermore, we showcase the practical utility and robustness of our method in understanding the geometry of learned representations in neural networks.
Poster
Ayush Bharti · Daolang Huang · Samuel Kaski · Francois-Xavier Briol
[ Hall A-E ]
Abstract
Simulation-based inference (SBI) is the preferred framework for estimating parameters of intractable models in science and engineering. A significant challenge in this context is the large computational cost of simulating data from complex models, and the fact that this cost often depends on parameter values. We therefore propose *cost-aware SBI methods* which can significantly reduce the cost of existing sampling-based SBI methods, such as neural SBI and approximate Bayesian computation. This is achieved through a combination of rejection and self-normalised importance sampling, which significantly reduces the number of expensive simulations needed. Our approach is studied extensively on models from epidemiology to telecommunications engineering, where we obtain significant reductions in the overall cost of inference.
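The self-normalised importance sampling ingredient can be sketched as follows; this toy example only illustrates the reweighting step, not the cost-aware rejection scheme.

```python
# Sketch of self-normalised importance sampling: estimate an expectation under an
# unnormalised target while drawing from a cheap proposal q, reweighting the draws.
import numpy as np

rng = np.random.default_rng(0)

def unnormalised_target(theta):          # e.g. prior times approximate likelihood
    return np.exp(-0.5 * (theta - 2.0) ** 2)

def proposal_sample(n):                  # cheap-to-simulate proposal
    return rng.normal(0.0, 3.0, size=n)

def proposal_density(theta):
    return np.exp(-0.5 * (theta / 3.0) ** 2) / (3.0 * np.sqrt(2 * np.pi))

theta = proposal_sample(10_000)
w = unnormalised_target(theta) / proposal_density(theta)
w /= w.sum()                             # self-normalisation of the weights
posterior_mean = np.sum(w * theta)
print(posterior_mean)                    # close to 2.0
```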
Poster
Elisabeth Griesbauer · Claudia Czado · Arnoldo Frigessi · Ingrid Haff
[ Hall A-E ]
Abstract
We propose TVineSynth, a vine copula based synthetic tabular data generator, which is designed to balance privacy and utility, using the vine tree structure and its truncation to control the trade-off. Contrary to synthetic data generators that achieve differential privacy (DP) by globally adding noise, TVineSynth performs a controlled approximation of the estimated data generating distribution, so that it does not suffer from poor utility of the resulting synthetic data for downstream prediction tasks. TVineSynth introduces a targeted bias into the vine copula model that, combined with the specific tree structure of the vine, causes the model to zero out privacy-leaking dependencies while relying on those that are beneficial for utility. Privacy is here measured with membership (MIA) and attribute inference attacks (AIA). Further, we theoretically justify how the construction of TVineSynth ensures AIA privacy under a natural privacy measure for continuous sensitive attributes. When compared to competitor models, with and without DP, on simulated and on real-world data, TVineSynth achieves a superior privacy-utility balance.
Poster
Cyrille Kone · Emilie Kaufmann · Laura Richert
[ Hall A-E ]
Abstract
We study the Pareto Set Identification (PSI) problem in a structured multi-output linear bandit model. In this setting each arm is associated with a feature vector belonging to $\mathbb{R}^h$ and its mean vector in $\mathbb{R}^d$ linearly depends on this feature vector through a common unknown matrix $\Theta \in \mathbb{R}^{h \times d}$. The goal is to identify the set of non-dominated arms by adaptively collecting samples from the arms. We introduce and analyze the first optimal design-based algorithms for PSI, providing nearly optimal guarantees in both the fixed-budget and the fixed-confidence settings. Notably, we show that the difficulty of these tasks mainly depends on the sub-optimality gaps of $h$ arms only. Our theoretical results are supported by an extensive benchmark on synthetic and real-world datasets.
Poster
Margarida Campos · João Cálem · Sophia Sklaviadis · Mario Figueiredo · Andre Martins
[ Hall A-E ]
Abstract
Conformal prediction is a distribution-free framework for uncertainty quantification that replaces point predictions with sets, offering marginal coverage guarantees (i.e., ensuring that the sets contain the true label with a specified probability, in expectation). In this paper, we uncover a novel connection between conformal prediction and sparse "softmax-like" transformations, such as sparsemax and $\gamma$-entmax (with $\gamma> 1$), which assign nonzero probability only to some labels. We introduce new non-conformity scores for classification which make the calibration process correspond to the widely used temperature scaling method. At test time, applying these sparse transformations with the calibrated temperature leads to a support set (i.e., the set of labels with nonzero probability) that automatically inherits the coverage guarantees of conformal prediction. Through experiments on computer vision and text classification benchmarks, we demonstrate that the proposed method achieves competitive results in terms of coverage, efficiency, and adaptiveness compared to standard non-conformity scores based on softmax.
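A sketch of the test-time rule described above: apply sparsemax at a calibrated temperature and return its support as the prediction set. The temperature value below is a placeholder; in the method it comes from conformal calibration of the proposed non-conformity scores.

```python
# Sparsemax support set at a calibrated temperature (temperature value assumed here).
import numpy as np

def sparsemax(z):
    z_sorted = np.sort(z)[::-1]
    k = np.arange(1, len(z) + 1)
    cssv = np.cumsum(z_sorted)
    support = 1 + k * z_sorted > cssv
    k_max = k[support][-1]
    tau = (cssv[k_max - 1] - 1.0) / k_max
    return np.maximum(z - tau, 0.0)

logits = np.array([2.1, 1.9, 0.3, -1.0])
temperature = 1.7                                  # placeholder for the calibrated value
probs = sparsemax(logits / temperature)
prediction_set = np.nonzero(probs > 0)[0]          # labels with nonzero probability
print(probs, prediction_set)
```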
Poster
Elizabeth Baker · Moritz Schauer · Stefan Sommer
[ Hall A-E ]
Abstract
We propose a new algorithm for learning a bridged diffusion process using score-matching methods. Our method relies on reversing the dynamics of the forward process and using this to learn a score function, which, via Doob's $h$-transform, gives us a bridged diffusion process; that is, a process conditioned on an endpoint. In contrast to prior methods, ours learns the score term $\nabla_x \log p(t, x; T, y)$, for given $t, y$ directly, completely avoiding the need for first learning a time-reversal. We compare the performance of our algorithm with existing methods and see that it outperforms using the (learned) time-reversals to learn the score term. The code can be found at https://github.com/libbylbaker/forward_bridge.
Poster
Jeroen Berrevoets · Jakob Raymaekers · Mihaela van der Schaar · Tim Verdonck · Ruicong Yao
[ Hall A-E ]
Abstract
The introduction of the NOTEARS algorithm resulted in a wave of research on differentiable Directed Acyclic Graph (DAG) learning. Differentiable DAG learning transforms the combinatorial problem of identifying the DAG underlying a Structural Causal Model (SCM) into a constrained continuous optimization problem. Being differentiable, these problems can be solved using gradient-based tools which allow integration into other differentiable objectives. However, in contrast to classical constraint-based algorithms, the identifiability properties of differentiable algorithms are poorly understood. We illustrate that even in the well-known Linear Non-Gaussian Additive Model (LiNGAM), the current state-of-the-art methods do not identify the true underlying DAG. To address the issue, we propose NOTIME (*Non-combinatorial Optimization of Trace exponential and Independence MEasures*), the first differentiable DAG learning algorithm with *provable* identifiability guarantees under the LiNGAM by building on a measure of (joint) independence. With its identifiability guarantees, NOTIME remains invariant to normalization of the data on a population level, a property lacking in existing methods. NOTIME compares favourably against NOTEARS and other (scale-invariant) differentiable DAG learners, across different noise distributions and normalization procedures. Introducing the first identifiability guarantees to general LiNGAM is an important step towards practical adoption of differentiable DAG learners.
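For context, the trace-exponential acyclicity characterisation that underlies NOTEARS-style differentiable DAG learning (and is echoed in NOTIME's name) can be sketched as follows; NOTIME's independence measure is not shown.

```python
# Trace-exponential acyclicity penalty used in NOTEARS-style differentiable DAG
# learning: h(W) = tr(exp(W * W)) - d equals 0 iff the weighted graph W is a DAG.
import numpy as np
from scipy.linalg import expm

def acyclicity(W):
    d = W.shape[0]
    return np.trace(expm(W * W)) - d

dag = np.array([[0.0, 1.2], [0.0, 0.0]])     # edge 1 -> 2 only, acyclic
cyc = np.array([[0.0, 1.2], [0.7, 0.0]])     # edges 1 -> 2 and 2 -> 1, cyclic
print(acyclicity(dag), acyclicity(cyc))      # ~0 for the DAG, > 0 for the cycle
```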
Poster
Julien Aubert · Louis Köhler · Luc Lehéricy · Giulia Mezzadri · Patricia Reynaud-Bouret
[ Hall A-E ]
Abstract
Learning for animals or humans is the process that leads to behaviors better adapted to the environment. This process highly depends on the individual that learns and is usually observed only through the individual's actions. This article presents ways to use this individual behavioral data to find the model that best explains how the individual learns. We propose two model selection methods: a general hold-out procedure and an AIC-type criterion, both adapted to non-stationary dependent data. We provide theoretical error bounds for these methods that are close to those of the standard i.i.d. case. To compare these approaches, we apply them to contextual bandit models and illustrate their use on both synthetic and experimental learning data in a human categorization task.
Poster
Chirag Modi · Diana Cai · Lawrence Saul
[ Hall A-E ]
Abstract
Black-box variational inference (BBVI) scales poorly to high-dimensional problems when it is used to estimate a multivariate Gaussian approximation with a full covariance matrix. In this paper, we extend the _batch-and-match_ (BaM) framework for score-based BBVI to problems where it is prohibitively expensive to store such covariance matrices, let alone to estimate them. Unlike classical algorithms for BBVI, which use stochastic gradient descent to minimize the reverse Kullback-Leibler divergence, BaM uses more specialized updates to match the scores of the target density and its Gaussian approximation. We extend the updates for BaM by integrating them with a more compact parameterization of full covariance matrices. In particular, borrowing ideas from factor analysis, we add an extra step to each iteration of BaM---a _patch_---that projects each newly updated covariance matrix into a more efficiently parameterized family of diagonal plus low rank matrices. We evaluate this approach on a variety of synthetic target distributions and real-world problems in high-dimensional inference.
Poster
Francesco Bacchiocchi · Matteo Bollini · Matteo Castiglioni · Alberto Marchesi · Nicola Gatti
[ Hall A-E ]
Abstract
Stackelberg games (SGs) constitute the most fundamental and acclaimed models of strategic interactions involving some form of commitment. Moreover, they form the basis of more elaborate models of this kind, such as Bayesian persuasion and principal-agent problems. Addressing learning tasks in SGs and related models is crucial to operationalize them in practice, where model parameters are usually unknown. In this paper, we revise the sample complexity of learning an optimal strategy to commit to in SGs. We provide a novel algorithm that (i) does not require any of the limiting assumptions made by state-of-the-art approaches and (ii) deals with a trade-off between sample complexity and termination probability arising when the representation of the leader’s strategies has finite precision. Such a trade-off has been completely neglected by existing algorithms and, if not properly managed, it may result in them using exponentially-many samples. Our algorithm requires novel techniques, which also pave the way to addressing learning problems in other models with commitment ubiquitous in the real world.
Poster
Achraf Azize · Debabrota Basu
[ Hall A-E ]
Abstract
In a Membership Inference (MI) game, an attacker tries to infer whether a target point was included or not in the input of an algorithm. Existing works show that some target points are easier to identify, while others are harder. This paper explains the target-dependent hardness of membership attacks by studying the powers of the optimal attacks in a *fixed-target* MI game. We characterise the optimal advantage and trade-off functions of attacks against the empirical mean in terms of the Mahalanobis distance between the target point and the data-generating distribution. We further derive the impacts of two privacy defences, i.e. adding Gaussian noise and sub-sampling, and that of target misspecification on optimal attacks. As by-products of our novel analysis of the Likelihood Ratio (LR) test, we provide a new covariance attack which generalises and improves the scalar product attack. Also, we propose a new optimal canary-choosing strategy for auditing privacy in the white-box federated learning setting. Our experiments validate that the Mahalanobis score explains the hardness of *fixed-target* MI games.
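The Mahalanobis score referred to above can be sketched as follows, using an empirical mean and covariance as stand-ins for the data-generating distribution.

```python
# Mahalanobis distance between a target point and the data-generating distribution,
# estimated here from a sample via empirical mean and covariance.
import numpy as np

rng = np.random.default_rng(0)
data = rng.multivariate_normal([0.0, 0.0], [[1.0, 0.3], [0.3, 2.0]], size=5000)
mu, cov = data.mean(axis=0), np.cov(data, rowvar=False)

def mahalanobis_score(target):
    diff = target - mu
    return float(np.sqrt(diff @ np.linalg.solve(cov, diff)))

print(mahalanobis_score(np.array([0.1, -0.2])))   # typical point: small score
print(mahalanobis_score(np.array([4.0, 6.0])))    # atypical point: large score
```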
Poster
Mátyás Schubert · Tom Claassen · Sara Magliacane
[ Hall A-E ]
Abstract
Causal discovery can be computationally demanding for large numbers of variables. If we only wish to estimate the causal effects on a small subset of target variables, we might not need to learn the causal graph for all variables, but only a small subgraph that includes the targets and their adjustment sets. In this paper, we focus on identifying causal effects between target variables in a computationally and statistically efficient way. This task combines causal discovery and effect estimation, aligning the discovery objective with the effects to be estimated. We show that definite non-ancestors of the targets are unnecessary to learn causal relations between the targets and to identify efficient adjustment sets. We sequentially identify and prune these definite non-ancestors with our Sequential Non-Ancestor Pruning (SNAP) framework, which can be used either as a preprocessing step to standard causal discovery methods, or as a standalone sound and complete causal discovery algorithm. Our results on synthetic and real data show that both approaches substantially reduce the number of independence tests and the computation time without compromising the quality of causal effect estimations.
Poster
Giora Simchoni · Saharon Rosset
[ Hall A-E ]
Abstract
We introduce copula-based neural networks (COPNN), a novel framework that extends beyond the limitations of Gaussian marginals for random effects in mixed models. COPNN integrates the flexibility of Gaussian copulas in capturing rich dependence structures with arbitrary marginal distributions, with the expressive power of deep neural networks (DNN), allowing it to model large non-Gaussian data in both regression and classification settings, while using batch learning and stochastic gradient descent. Unlike traditional linear and non-linear mixed models, which assume Gaussianity for random effects, COPNN leverages copulas to decouple the marginal distribution from the dependence structure, caused by spatial, temporal and high-cardinality categorical features. This is achieved by minimizing a batch negative log-likelihood (NLL) loss in the continuous case, and a batch negative pairwise log-likelihood in the binary case. We demonstrate COPNN’s effectiveness through extensive experiments on both simulated and real datasets. COPNN reduces NLL and MSE in the regression setting, and improves predictive accuracy in the classification setting, compared to previous state-of-the-art methods which integrate random effects into DNNs. Our real-world experiments, conducted on datasets from automotive pricing and retail traffic forecasting, further validate COPNN's ability to improve performance over traditional methods for dealing with high-cardinality categorical features.
Poster
Yuliang Ji · Jian Wu · Yuanzhe Xi
[ Hall A-E ]
Abstract
Deep neural networks have achieved substantial success across various scientific computing tasks. A pivotal challenge within this domain is the rapid and parallel approximation of matrix inverses, critical for numerous applications. Despite significant progress, there currently exists no universal neural-based method for approximating matrix inversion. This paper presents a theoretical analysis demonstrating the fundamental limitations of neural networks in developing a generalized matrix inversion model. We expand the class of Lipschitz functions to encompass a wider array of neural network models, thereby refining our theoretical approach. Moreover, we delineate specific conditions under which neural networks can effectively approximate matrix inverses. Our theoretical results are supported by experimental results from diverse matrix datasets, exploring the efficacy of neural networks in addressing the matrix inversion challenge.
Poster
Houssam Zenati · Judith Abécassis · Julie Josse · Bertrand Thirion
[ Hall A-E ]
Abstract
Uncovering causal mediation effects is of significant value to practitioners who aim to isolate treatment effects from potential mediator effects. We propose a double machine learning (DML) algorithm for mediation analysis that supports continuous treatments. To estimate the target mediated response curve, our method employs a kernel-based doubly robust moment function for which we prove asymptotic Neyman orthogonality. This allows us to obtain an asymptotic normality with nonparametric convergence rate while allowing for nonparametric or parametric estimation of the nuisance parameters. Subsequently, we derive an optimal bandwidth strategy along with a procedure to estimate asymptotic confidence intervals. Finally, to illustrate the benefits of our method, we provide a numerical evaluation of our approach on a simulation along with an application on medical real-world data to analyze the effect of glycemic control on cognitive functions.
Poster
Bariscan Bozkurt · Ben Deaner · Dimitri Meunier · Liyuan Xu · Arthur Gretton
[ Hall A-E ]
Abstract
We address the setting of Proxy Causal Learning (PCL), which has the goal of estimating causal effects from observed data in the presence of hidden confounding. Proxy methods accomplish this task using two proxy variables related to the latent confounder: a treatment proxy (related to the treatment) and an outcome proxy (related to the outcome). Two approaches have been proposed to perform causal effect estimation given proxy variables; however only one of these has found mainstream acceptance, since the other was understood to require density ratio estimation - a challenging task in high dimensions. In the present work, we propose a practical and effective implementation of the second approach, which bypasses explicit density ratio estimation and is suitable for continuous and high-dimensional treatments. We employ kernel ridge regression to derive estimators, resulting in simple closed-form solutions for dose-response and conditional dose-response curves, along with consistency guarantees. Our methods empirically demonstrate superior or comparable performance to existing frameworks on synthetic and real-world datasets.
Poster
Gabriel Moreira · Manuel Marques · Joao Costeira · Alexander Hauptmann
[ Hall A-E ]
Abstract
Learning image representations that capture rich semantic relationships remains a significant challenge. Existing approaches are either contrastive, lacking robust theoretical guarantees, or struggle to effectively represent the partial orders inherent to structured visual-semantic data. In this paper, we introduce a nuclear norm-based loss function, grounded in the same information theoretic principles that have proved effective in self-supervised learning. We present a theoretical characterization of this loss, demonstrating that, in addition to promoting class orthogonality, it encodes the spectral geometry of the data within a subspace lattice. This geometric representation allows us to associate logical propositions with subspaces, ensuring that our learned representations adhere to a predefined symbolic structure.
Poster
Xiusi Li · Sékou-Oumar Kaba · Siamak Ravanbakhsh
[ Hall A-E ]
Abstract
Causal representation learning (CRL) enhances machine learning models' robustness and generalizability by learning structural causal models associated with data-generating processes. We focus on a family of CRL methods that uses contrastive data pairs in the observable space, generated before and after a random, unknown intervention, to identify the latent causal model. Brehmer et al. (2022) showed that this is indeed possible, given that all latent variables can be intervened on individually. However, this is a highly restrictive assumption in many systems. In this work, we instead assume interventions on arbitrary subsets of latent variables, which is more realistic. We introduce a theoretical framework that calculates the degree to which we can identify a causal model, given a set of possible interventions, up to an abstraction that describes the system at a higher level of granularity.
Poster
Angel David Reyero Lobo · Alexis Ayme · Claire Boyer · Erwan Scornet
[ Hall A-E ]
Abstract
Supervised learning with missing data aims at building the best prediction of a target output based on partially-observed inputs. Major approaches to address this problem can be decomposed into $(i)$ impute-then-predict strategies, which first fill in the empty input components and then apply a unique predictor and $(ii)$ Pattern-by-Pattern (P-b-P) approaches, where a predictor is built on each missing pattern. In this paper, we theoretically analyze how three classical linear classifiers, namely perceptron, logistic regression and linear discriminant analysis (LDA), behave with Missing Completely At Random (MCAR) data, depending on the strategy (imputation or P-b-P) to handle missing values. We prove that both imputation and P-b-P approaches are ill-specified in a logistic regression framework, thus questioning the relevance of such approaches to handle missing data. The most favorable auspices to perform classification with missing data concern P-b-P LDA methods. We provide finite-sample bounds for the excess risk in this framework, even for high-dimensional settings or MNAR data. Experiments illustrate our theoretical findings.
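A minimal sketch of a Pattern-by-Pattern classifier, here with one LDA fit per missingness pattern using only the coordinates observed under that pattern (a simplified stand-in for the P-b-P LDA methods analysed above):

```python
# Pattern-by-Pattern (P-b-P) classification sketch: one LDA per missingness pattern,
# each trained on the coordinates observed under that pattern.
import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

def fit_pbp_lda(X, y):
    models = {}
    patterns = np.isnan(X)
    for pat in np.unique(patterns, axis=0):
        rows = (patterns == pat).all(axis=1)
        obs = ~pat
        if rows.sum() > 1 and obs.any() and len(np.unique(y[rows])) > 1:
            models[tuple(pat)] = LinearDiscriminantAnalysis().fit(X[rows][:, obs], y[rows])
    return models

def predict_pbp(models, x):
    pat = tuple(np.isnan(x))
    return models[pat].predict(x[~np.isnan(x)].reshape(1, -1))[0]

rng = np.random.default_rng(0)
X = rng.standard_normal((400, 3))
y = (X[:, 0] + X[:, 1] > 0).astype(int)
X[rng.random(X.shape) < 0.2] = np.nan          # MCAR missingness
models = fit_pbp_lda(X, y)
print(predict_pbp(models, np.array([0.5, np.nan, -0.2])))
```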
Poster
Chenyang Li · Yingyu Liang · Zhenmei Shi · Zhao Song
[ Hall A-E ]
Abstract
The weighted low-rank approximation problem is a fundamental numerical linear algebra problem and has many applications in machine learning. Given an $n \times n$ weight matrix $W$ and an $n \times n$ matrix $A$, the goal is to find two low-rank matrices $U, V \in \mathbb{R}^{n \times k}$ such that the cost $\| W \circ (U V^\top - A) \|_F^2$ is minimized. Previous work has to pay $\Omega(n^2)$ time when matrices $A$ and $W$ are dense, e.g., having $\Omega(n^2)$ non-zero entries. In this work, we show that there is a certain regime in which, even if $A$ and $W$ are dense, we can still hope to solve the weighted low-rank approximation problem in almost linear $n^{1+o(1)}$ time.
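To make the objective concrete, the following sketch evaluates $\| W \circ (U V^\top - A) \|_F^2$ and takes one plain gradient step on $U$; it is only an illustration, not the almost-linear-time algorithm of the paper.

```python
# Weighted low-rank objective and a naive gradient step on U.
import numpy as np

rng = np.random.default_rng(0)
n, k = 100, 5
A = rng.standard_normal((n, n))
W = rng.random((n, n))
U = rng.standard_normal((n, k))
V = rng.standard_normal((n, k))

def cost(U, V):
    return np.linalg.norm(W * (U @ V.T - A), "fro") ** 2

grad_U = 2 * (W**2 * (U @ V.T - A)) @ V        # gradient of the cost w.r.t. U
U_next = U - 1e-3 * grad_U
print(cost(U, V), cost(U_next, V))             # the cost decreases after the step
```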
Poster
Sihan Zeng · Sujay Bhatt · Alec Koppel · Sumitra Ganesh
[ Hall A-E ]
Abstract
We consider discrete-time stationary mean field games (MFG) with unknown dynamics and design algorithms for finding the equilibrium with finite-time complexity guarantees. Prior solutions to the problem assume either the contraction of a mean field optimality-consistency operator or strict weak monotonicity, which may be overly restrictive. In this work, we introduce a new class of solvable MFGs, named the "fully herding class", which expands the known solvable class of MFGs and for the first time includes problems with multiple equilibria. We propose a direct policy optimization method, Accelerated Single-loop Actor Critic Algorithm for Mean Field Games (ASAC-MFG), that provably finds a global equilibrium for MFGs within this class, under suitable access to a single trajectory of Markovian samples. Different from the prior methods, ASAC-MFG is single-loop and single-sample-path. We establish the finite-time and finite-sample convergence of ASAC-MFG to a mean field equilibrium via new techniques that we develop for multi-time-scale stochastic approximation. We support the theoretical results with illustrative numerical simulations. When the mean field does not affect the transition and reward, an MFG reduces to a Markov decision process (MDP) and ASAC-MFG becomes an actor-critic algorithm for finding the optimal policy in average-reward MDPs, with a sample complexity matching the …
Poster
Sungee Hong · Zhengling Qi · Raymond K. W. Wong
[ Hall A-E ]
Abstract
We study distributional off-policy evaluation (OPE), the goal of which is to learn the distribution of the return for a target policy using offline data generated by a different policy. The theoretical foundation of much existing work relies on supremum-extended statistical distances such as the supremum-Wasserstein distance, which are hard to estimate. In contrast, we study the more manageable expectation-extended statistical distances and provide a novel theoretical justification of their validity for learning the return distribution. Based on this attractive property, we propose a new method called Energy Bellman Residual Minimizer (EBRM) for distributional OPE. We provide corresponding in-depth theoretical analyses. We establish a finite-sample error bound for the EBRM estimator under the realizability assumption. Furthermore, we introduce a variant of our method based on a multi-step extension which improves the error bound for non-realizable settings. Notably, unlike prior distributional OPE methods, the theoretical guarantees of our method do not require the completeness assumption.
Poster
Aramayis Dallakyan · Yang Ni
[ Hall A-E ]
Abstract
The discovery of causal relationships from observational data is very challenging. Many recent approaches rely on complexity or uncertainty concepts to impose constraints on probability distributions, aiming to identify specific classes of directed acyclic graph (DAG) models. In this paper, we introduce a novel identifiability criterion for DAGs that places constraints on the conditional variances of additive noise models. We demonstrate that this criterion extends and generalizes existing identifiability criteria in the literature that employ (conditional) variances as measures of uncertainty in (conditional) distributions. For linear structural equation models, we present a new algorithm that leverages the concept of weak majorization applied to the diagonal elements of the Cholesky factor of the covariance matrix to learn a topological ordering of variables. Through extensive simulations and the analysis of bank connectivity data, we provide evidence of the effectiveness of our approach in successfully recovering DAGs. The code for reproducing the results in this paper is available in Supplementary Materials.
Poster
Bassel Hamoud · Ilnura Usmanova · Kfir Yehuda Levy
[ Hall A-E ]
Abstract
We present the first theoretical guarantees for zero constraint violation in Online Convex Optimization (OCO) across all rounds, addressing dynamic constraint changes. Unlike existing approaches in constrained OCO, which allow for occasional safety breaches, we provide the first approach for maintaining strict safety under the assumption of gradually evolving constraints, namely, that the constraints change by at most a small amount between consecutive rounds. This is achieved through a primal-dual approach and Online Gradient Ascent in the dual space. We show that employing a dichotomous learning rate enables ensuring both safety, via zero constraint violation, and sublinear regret. Our framework marks a departure from previous work by providing the first provable guarantees for maintaining absolute safety in the face of changing constraints in OCO.
Poster
Irit Chelly · Roy Uziel · Oren Freifeld · Ari Pakman
[ Hall A-E ]
Abstract
Neural models for amortized probabilistic clustering yield samples of cluster labels given a set-structured input, while avoiding lengthy Markov chain runs and the need for explicit data likelihoods. Existing methods which label each data point sequentially, like the Neural Clustering Process, often lead to cluster assignments highly dependent on the data order. Alternatively, methods that sequentially create full clusters do not provide assignment probabilities. In this paper, we introduce GFNCP, a novel framework for amortized clustering. GFNCP is formulated as a Generative Flow Network with a shared energy-based parametrization of policy and reward. We show that the flow matching conditions are equivalent to consistency of the clustering posterior under marginalization, which in turn implies order invariance. GFNCP also outperforms existing methods in clustering performance on both synthetic and real-world data.
Poster
Jia Lin Hau · Erick Delage · Esther Derman · Mohammad Ghavamzadeh · Marek Petrik
[ Hall A-E ]
Abstract
In Markov decision processes (MDPs), quantile risk measures such as Value-at-Risk are a standard metric for modeling RL agents' preferences for certain outcomes. This paper proposes a new Q-learning algorithm for quantile optimization in MDPs with strong convergence and performance guarantees. The algorithm leverages a new, simple dynamic program (DP) decomposition for quantile MDPs. Compared with prior work, our DP decomposition requires neither known transition probabilities nor solving complex saddle point equations and serves as a suitable foundation for other model-free RL algorithms. Our numerical results in tabular domains show that our Q-learning algorithm converges to its DP variant and outperforms earlier algorithms.
Poster
Eliot Beyler · Francis Bach
[ Hall A-E ]
Abstract
In this paper, we derive variational inference upper bounds on the log-partition function of a pairwise Markov random field on the Boolean hypercube, based on quantum relaxations of the Kullback-Leibler divergence. We then propose an efficient algorithm to compute these bounds based on primal-dual optimization. An improvement of these bounds through the use of "hierarchies", similar to sum-of-squares (SoS) hierarchies, is proposed, and we present a greedy algorithm to select among these relaxations. We carry out extensive numerical experiments and compare with state-of-the-art methods for this inference problem.
Poster
Daniel Galperin · Ullrich Köthe
[ Hall A-E ]
Abstract
Good generative models should not only synthesize high quality data, but also utilize interpretable representations that aid human understanding of their behavior. However, it is difficult to measure objectively if and to what degree desirable properties of disentangled representations have been achieved. Inspired by the principle of independent mechanisms, we address this difficulty by introducing a novel set of tractable information-theoretic evaluation metrics. We demonstrate the usefulness of our metrics on illustrative toy examples and conduct an in-depth comparison of various normalizing flow architectures and $\beta$-VAEs on the EMNIST dataset. Our method allows sorting latent features by importance and assessing the amount of residual correlations of the resulting concepts. The most interesting finding of our experiments is a ranking of model architectures in terms of their inductive bias to converge to aligned and disentangled representations during training.
Poster
Alex Buna · Patrick Rebeschini
[ Hall A-E ]
Abstract
Recent progress in robust statistical learning has mainly tackled convex problems, like mean estimation or linear regression, with non-convex challenges receiving less attention. Phase retrieval exemplifies such a non-convex problem, requiring the recovery of a signal from only the magnitudes of its linear measurements, without phase (sign) information. While several non-convex methods, especially those involving the Wirtinger Flow algorithm, have been proposed for noiseless or mild noise settings, developing solutions for heavy-tailed noise and adversarial corruption remains an open challenge. In this paper, we investigate an approach that leverages robust gradient descent techniques to improve the Wirtinger Flow algorithm's ability to simultaneously cope with fourth moment bounded noise and adversarial contamination in both the inputs (covariates) and outputs (responses). We address two scenarios: known zero-mean noise and completely unknown noise. For the latter, we propose a preprocessing step that alters the problem into a new format that does not fit traditional phase retrieval approaches but can still be resolved with a tailored version of the algorithm for the zero-mean noise context.
Poster
Antonios Valkanas · Boris Oreshkin · Mark Coates
[ Hall A-E ]
Abstract
Online deep learning tackles the challenge of learning from data streams by balancing two competing goals: fast learning and deep learning. However, existing research primarily emphasizes deep learning solutions, which are more adept at handling the ''deep'' aspect than the ''fast'' aspect of online learning. In this work, we introduce an alternative paradigm through a hybrid multilearner approach. We begin by developing a fast online logistic regression learner, which operates without relying on backpropagation. It leverages closed-form recursive updates of model parameters, efficiently addressing the fast learning component of the online learning challenge. This approach is further integrated with a cascaded multilearner design, where shallow and deep learners are co-trained in a cooperative, synergistic manner to solve the online learning problem. We demonstrate that this approach achieves state-of-the-art performance on standard online learning datasets. We make our code available: https://github.com/AntonValk/MODL
Poster
Liyuan Xu · Arthur Gretton
[ Hall A-E ]
Abstract
We consider the problem of causal effect estimation with an unobserved confounder, where we observe a single proxy variable that is associated with the confounder. Although it has been shown that the recovery of an average causal effect is impossible in general from a single proxy variable, we show that causal recovery is possible if the outcome is generated deterministically. This generalizes existing work on causal methods with a single proxy variable to the continuous treatment setting. We propose two kernel-based methods for this setting: the first based on the two-stage regression approach, and the second based on a maximum moment restriction approach. We prove that both approaches can consistently estimate the causal effect, and we empirically demonstrate that we can successfully recover the causal effect on challenging synthetic benchmarks.
Poster
Lucas GNECCO HEREDIA · Matteo Sammut · Muni Sreenivas Pydi · Rafael Pinot · Benjamin Negrevergne · Yann Chevaleyre
[ Hall A-E ]
Abstract
Randomization as a means to improve the adversarial robustness of machine learning models has recently attracted significant attention. Unfortunately, much of the theoretical analysis so far has focused on binary classification, providing only limited insights into the more complex multiclass setting. In this paper, we take a step toward closing this gap by drawing inspiration from the field of graph theory. Our analysis focuses on discrete data distributions, allowing us to cast the adversarial risk minimization problems within the well-established framework of set packing problems. By doing so, we are able to identify three structural conditions on the support of the data distribution that are necessary for randomization to improve robustness. Furthermore, we are able to construct several data distributions where (contrarily to binary classification) switching from a deterministic to a randomized solution significantly reduces the optimal adversarial risk. These findings highlight the crucial role randomization can play in enhancing robustness to adversarial attacks in multiclass classification.
Poster
Grigor Bezirganyan · Sana Sellami · Laure Berti-Equille · Sébastien Fournier
[ Hall A-E ]
Abstract
Multimodal AI models are increasingly used in fields like healthcare, finance, and autonomous driving, where information is drawn from multiple sources or modalities such as images, text, audio, and video. However, effectively managing uncertainty—arising from noise, insufficient evidence, or conflicts between modalities—is crucial for reliable decision-making. Current uncertainty-aware machine learning methods leveraging, for example, evidence averaging or evidence accumulation underestimate uncertainties in high-conflict scenarios. Moreover, the state-of-the-art evidence averaging strategy is not order invariant and fails to scale to multiple modalities. To address these challenges, we propose a novel multimodal learning method with order-invariant evidence fusion and introduce a conflict-based discounting mechanism that reallocates uncertain mass when unreliable modalities are detected. We provide both theoretical analysis and experimental validation, demonstrating that unlike the previous work, the proposed approach effectively distinguishes between conflicting and non-conflicting samples based on the provided uncertainty estimates, and outperforms the previous models in uncertainty-based conflict detection.
Poster
Anna Bonnet · Maxime Sangnier
[ Hall A-E ]
Abstract
This paper addresses nonparametric estimation of nonlinear multivariate Hawkes processes, where the interaction functions are assumed to lie in a reproducing kernel Hilbert space (RKHS). Motivated by applications in neuroscience, the model allows complex interaction functions, in order to express exciting and inhibiting effects, but also a combination of both (which is particularly interesting to model the refractory period of neurons), and considers in return that conditional intensities are rectified by the ReLU function. The latter feature incurs several methodological challenges, for which workarounds are proposed in this paper. In particular, it is shown that a representer theorem can be obtained for approximated versions of the log-likelihood and the least-squares criteria. Based on it, we propose an estimation method, that relies on two common approximations (of the ReLU function and of the integral operator). We provide a bound that controls the impact of these approximations. Numerical results on synthetic data confirm this fact as well as the good asymptotic behavior of the proposed estimator. It also shows that our method achieves a better performance compared to related nonparametric estimation techniques and suits neuronal applications.
Poster
Colin Dirren · Mattia Bianchi · Panagiotis D. Grontas · John Lygeros · Florian Dorfler
[ Hall A-E ]
Abstract
We study the convex-concave bilinear saddle-point problem $\min_x \max_y f(x) + y^\top Ax - g(y)$, where both, only one, or none of the functions $f$ and $g$ are strongly convex, and suitable rank conditions on the matrix $A$ hold. The solution of this problem is at the core of many machine learning tasks. By employing tools from monotone operator theory, we systematically prove the contractivity (in turn, the linear convergence) of several first-order primal-dual algorithms, including the Chambolle–Pock method. Our approach results in concise proofs, and it yields new convergence guarantees and tighter bounds compared to known results.
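One of the analysed methods, the Chambolle–Pock iteration, can be sketched for the bilinear saddle-point problem above; in this sketch $f$ and $g$ are strongly convex quadratics so both proximal maps are closed form, and the step sizes are chosen so that $\tau\sigma\|A\|^2 < 1$.

```python
# Chambolle-Pock primal-dual iteration for min_x max_y f(x) + y^T A x - g(y),
# with f(x) = a/2 ||x||^2 and g(y) = b/2 ||y||^2 so both proximal maps are explicit.
import numpy as np

rng = np.random.default_rng(0)
m, n, a, b = 20, 30, 1.0, 1.0
A = rng.standard_normal((m, n))
tau = sigma = 0.9 / np.linalg.norm(A, 2)   # ensures tau * sigma * ||A||^2 < 1
theta = 1.0

x = np.zeros(n); x_bar = x.copy(); y = np.zeros(m)
for _ in range(500):
    y = (y + sigma * A @ x_bar) / (1.0 + sigma * b)    # prox step for sigma * g
    x_new = (x - tau * A.T @ y) / (1.0 + tau * a)      # prox step for tau * f
    x_bar = x_new + theta * (x_new - x)                # extrapolation
    x = x_new

print(np.linalg.norm(x), np.linalg.norm(y))            # saddle point here is (0, 0)
```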
Poster
Wenfu Xia · Fengpei Li · Ying Sun · Ziping Zhao
[ Hall A-E ]
Abstract
Covariance matrix estimation is a fundamental problem in multivariate data analysis, which becomes particularly challenging in high-dimensional settings due to the curse of dimensionality. To enhance estimation accuracy, structural regularization is often imposed on the precision matrix (the inverse covariance matrix) for covariance selection. In this paper, we study covariance selection in a distributed setting, where data is spread across a network of agents. We formulate the problem as a Gaussian maximum likelihood estimation problem with structural penalties and propose a novel algorithmic framework called NetGGM. Unlike existing methods that rely on a central coordinator, NetGGM operates in a fully decentralized manner with low computational complexity. We provide theoretical guarantees showing that NetGGM converges linearly to the global optimum while ensuring consensus among agents. Numerical experiments validate its convergence properties and demonstrate that it outperforms state-of-the-art methods in precision matrix estimation.
Poster
Paul Mangold · Alain Durmus · Aymeric Dieuleveut · Sergey Samsonov · Eric Moulines
[ Hall A-E ]
Abstract
In this paper, we present a novel analysis of $\texttt{FedAvg}$ with constant step size, relying on the Markov property of the underlying process. We demonstrate that the global iterates of the algorithm converge to a stationary distribution and analyze its resulting bias and variance relative to the problem's solution. We provide a first-order bias expansion in both homogeneous and heterogeneous settings. Interestingly, this bias decomposes into two distinct components: one that depends solely on stochastic gradient noise and another on client heterogeneity. Finally, we introduce a new algorithm based on the Richardson-Romberg extrapolation technique to mitigate this bias.
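The Richardson-Romberg idea can be illustrated generically: run the same first-order-biased procedure at step sizes $h$ and $h/2$ and combine the two results to cancel the leading bias term. The sketch below uses a forward-difference derivative estimate as the biased procedure, not FedAvg itself.

```python
# Generic Richardson-Romberg extrapolation: a forward difference has bias of order
# h, so 2 * D(h/2) - D(h) cancels the leading bias term.
import numpy as np

f, x = np.sin, 1.0
true_derivative = np.cos(x)

def forward_diff(h):
    return (f(x + h) - f(x)) / h              # bias ~ (h / 2) * f''(x)

h = 1e-2
d_h, d_h2 = forward_diff(h), forward_diff(h / 2)
extrapolated = 2 * d_h2 - d_h                 # first-order bias cancelled
print(abs(d_h - true_derivative))             # error of order 1e-3
print(abs(extrapolated - true_derivative))    # error several orders of magnitude smaller
```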
Poster
Daniel Williams · Leyang Wang · Qizhen Ying · Song Liu · Mladen Kolar
[ Hall A-E ]
Abstract
This paper addresses differential inference in time-varying parametric probabilistic models, like graphical models with changing structures. Instead of estimating a high-dimensional model at each time and estimating changes later, we directly learn the differential parameter, i.e., the time derivative of the parameter. The main idea is treating the time score function of an exponential family model as a linear model of the differential parameter for direct estimation. We use time score matching to estimate parameter derivatives. We prove the consistency of a regularized score matching objective and demonstrate the finite-sample normality of a debiased estimator in high-dimensional settings. Our methodology effectively infers differential structures in high-dimensional graphical models, verified on simulated and real-world datasets. The code reproducing our experiments can be found at: \url{https://github.com/Leyangw/tsm}.
Poster
Okan Koc · Alexander Soen · Chao-Kai Chiang · Masashi Sugiyama
[ Hall A-E ]
Abstract
Current machine learning systems are brittle in the face of distribution shifts (DS), where the target distribution that the system is tested on differs from the source distribution used to train the system. This problem of robustness to DS has been studied extensively in the field of domain adaptation. For deep neural networks, popular methods for unsupervised domain adaptation (UDA) are domain matching methods that try to align the marginal distributions in the feature or output space. The current theoretical understanding of these methods, however, is limited and existing theoretical frameworks are not precise enough to characterize their performance in practice. To this end, we derive new bounds based on optimal transport that analyze the UDA problem. Our new bound includes a term which we dub as entanglement, consisting of an expectation of Wasserstein distance between conditionals with respect to changing data distributions. Analysis of the entanglement term provides a novel perspective on the unoptimizable aspects of UDA. In various experiments with multiple models across several DS scenarios, we show that this term can be used to explain the varying performance of UDA algorithms.
Poster
Jordan Penn · Lee Gunderson · Gecia Bravo-Hermsdorff · Ricardo Silva · David Watson
[ Hall A-E ]
Abstract
Instrumental variables (IVs) are widely used to estimate causal effects in the presence of unobserved confounding between an exposure $X$ and outcome $Y$. An IV must affect $Y$ exclusively through $X$ and be unconfounded with $Y$. We present a framework for relaxing these assumptions with tuneable and interpretable "budget constraints". Our algorithm returns a feasible set of causal effects that can be identified exactly given perfect knowledge of observable covariance statistics. This feasible set might contain disconnected sets of possible solutions for the causal effect. We discuss conditions under which this set is sharp, i.e., contains all and only effects consistent with the background assumptions and the joint distribution of observable variables. Our method applies to a wide class of semiparametric models, and we demonstrate how its ability to select specific subsets of instruments confers an advantage over convex relaxations in both linear and nonlinear settings. We adapt our algorithm to form confidence sets that are asymptotically valid under a common statistical assumption from the Mendelian randomization literature. An accompanying R package, budgetIVr, is available from CRAN.
Poster
Vladimir Braverman · Prathamesh Dharangutte · Shreyas Pai · Vihan Shah · Chen Wang
[ Hall A-E ]
Abstract
We study the dynamic correlation clustering problem with *adaptive* edge label flips. In correlation clustering, we are given an $n$-vertex complete graph whose edges are labeled either $(+)$ or $(-)$, and the goal is to minimize the total number of $(+)$ edges between clusters and the number of $(-)$ edges within clusters. We consider the dynamic setting with adversarial robustness, in which the *adaptive* adversary can flip the label of an edge based on the current output of the algorithm. Our main result is a randomized algorithm that always maintains an $O(1)$-approximation to the optimal correlation clustering with $O(\log^{2}{n})$ amortized update time. Prior to our work, no algorithm with $O(1)$-approximation and $\text{polylog}{(n)}$ update time for the adversarially robust setting was known. We further validate our theoretical results with experiments on synthetic and real-world datasets, showing competitive empirical performance. Our main technical ingredient is an algorithm that maintains *sparse-dense decomposition* with $\text{polylog}{(n)}$ update time, which could be of independent interest.
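For readers unfamiliar with the objective, the sketch below computes the correlation clustering disagreement cost on a small signed complete graph; the random instance and the cluster labels are hypothetical, and nothing here reflects the paper's dynamic algorithm.

```python
# Toy instance of the correlation clustering objective: disagreements are
# (+) edges across clusters plus (-) edges inside clusters.
import itertools
import numpy as np

def disagreements(signs: np.ndarray, labels: np.ndarray) -> int:
    """signs[i, j] in {+1, -1} for i < j; labels[i] is the cluster of vertex i."""
    cost = 0
    for i, j in itertools.combinations(range(len(labels)), 2):
        same_cluster = labels[i] == labels[j]
        if signs[i, j] == +1 and not same_cluster:
            cost += 1
        elif signs[i, j] == -1 and same_cluster:
            cost += 1
    return cost

rng = np.random.default_rng(1)
n = 6
signs = np.triu(rng.choice([-1, 1], size=(n, n)), k=1)   # random signed complete graph
print(disagreements(signs, labels=np.array([0, 0, 0, 1, 1, 1])))
```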
Poster
Pedro Seber e Silva · Richard Braatz
[ Hall A-E ]
Abstract
N-glycosylation has many essential biological roles, and is important for biotherapeutics as it can affect drug efficacy, duration of effect, and toxicity. The prediction of N-glycosylation and other important biopharmaceutical production values has mostly been limited to mechanistic modeling. We present a residual hybrid modeling approach that integrates mechanistic modeling with machine learning to produce significantly more accurate predictions for N-glycosylation and bioproduction. For the largest dataset, the residual hybrid models achieve an average 736-fold reduction in testing prediction error. Furthermore, the residual hybrid models have lower prediction errors than the mechanistic models for all of the predicted variables in the datasets. We provide the automatic machine learning software used in this work, allowing reproduction and use of our software for other tasks.
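As a hedged illustration of the general residual-hybrid idea (not the authors' software), one can fit a machine-learning model to the residuals of a mechanistic model and add the learned correction back; the mechanistic formula and data-generating process below are invented for the example.

```python
# Schematic residual hybrid model: fit an ML model to the residuals of a
# mechanistic model and add the learned correction back. The mechanistic
# formula and the data below are placeholders for illustration only.
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor

def mechanistic_model(x: np.ndarray) -> np.ndarray:
    return 2.0 * x[:, 0] + 0.5 * x[:, 1]          # placeholder first-principles model

rng = np.random.default_rng(0)
X = rng.uniform(size=(500, 2))
y = 2.0 * X[:, 0] + 0.5 * X[:, 1] + np.sin(4 * X[:, 0]) + 0.05 * rng.normal(size=500)

residuals = y - mechanistic_model(X)              # what the mechanistic model misses
correction = GradientBoostingRegressor().fit(X, residuals)

y_hat = mechanistic_model(X) + correction.predict(X)   # hybrid prediction
print(f"hybrid training MSE: {np.mean((y - y_hat) ** 2):.4f}")
```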
Poster
Sarbojit Roy · Malik Shahid Sultan · Tania Vallejo · Leena Ibrahim · Hernando Ombao
[ Hall A-E ]
Abstract
Interpretable classification of time series presents significant challenges in high dimensions. Traditional feature selection methods in the frequency domain often assume sparsity in spectral density matrices (SDMs) or their inverses, which can be restrictive for real-world applications. We propose a model-based approach for classifying high-dimensional stationary time series by assuming sparsity in the difference between inverse SDMs. The estimators for model parameters possess consistency under fairly general conditions. Additionally, we introduce a method to screen the most discriminatory frequencies for classification, which exhibits the ${\it sure\ screening\ property}$. The novelty of our method lies in the interpretability of the model parameters, making it especially suitable for fields like neuroscience, where understanding differences in brain network connectivity across various states is crucial. The proposed approach is evaluated using a variety of simulated examples. We apply it to EEG and calcium imaging datasets to demonstrate its practical relevance.
Poster
Tomoharu Iwata · Atsutoshi Kumagai · Yasutoshi Ida
[ Hall A-E ]
Abstract
We propose a few-shot learning method for linear regression, which learns how to choose regularization weights from multiple tasks with different feature spaces, and uses the knowledge for unseen tasks. Linear regression is ubiquitous in a wide variety of fields. Although regularization weight tuning is crucial to performance, it is difficult when only a small amount of training data are available. In the proposed method, task-specific regularization weights are generated using a neural network-based model by taking a task-specific training dataset as input, where our model is shared across all tasks. For each task, linear coefficients are optimized by minimizing the squared loss with an L2 regularizer using the generated regularization weights and the training dataset. Our model is meta-learned by minimizing the expected test error of linear regression with the task-specific coefficients using various training datasets. In our experiments using synthetic and real-world datasets, we demonstrate the effectiveness of the proposed method on few-shot regression tasks compared with existing methods.
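The task-specific inner step can be made concrete with a short sketch: given per-feature regularization weights (fixed by hand here; in the proposed method they would be produced by the shared, meta-learned network), the linear coefficients have a closed form.

```python
# Inner problem only: closed-form task-specific coefficients for a given vector
# of per-feature regularization weights (hand-set here; the paper's weights
# would come from a meta-learned network shared across tasks).
import numpy as np

def weighted_ridge(X: np.ndarray, y: np.ndarray, lam: np.ndarray) -> np.ndarray:
    """Minimize ||X w - y||^2 + sum_j lam_j * w_j^2."""
    return np.linalg.solve(X.T @ X + np.diag(lam), X.T @ y)

rng = np.random.default_rng(0)
X = rng.normal(size=(10, 3))                      # few-shot: only 10 examples
y = X @ np.array([1.0, -2.0, 0.0]) + 0.1 * rng.normal(size=10)
lam = np.array([0.1, 0.1, 10.0])                  # hypothetical generated weights
print(weighted_ridge(X, y, lam))
```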
Poster
Alexander Timans · Christoph-Nikolas Straehle · Kaspar Sakmann · Christian Andersson Naesseth · Eric Nalisnick
[ Hall A-E ]
Abstract
Multiple hypothesis testing (MHT) frequently arises in scientific inquiries, and concurrent testing of multiple hypotheses inflates the risk of Type-I errors or false positives, rendering MHT corrections essential. This paper addresses MHT in the context of conformal prediction, a flexible framework for predictive uncertainty quantification. Some conformal applications give rise to simultaneous testing, and positive dependencies among tests typically exist. We introduce max-rank, a novel correction that exploits these dependencies whilst efficiently controlling the family-wise error rate. Inspired by existing permutation-based corrections, max-rank leverages rank order information to improve performance and integrates readily with any conformal procedure. We establish its theoretical and empirical advantages over the common Bonferroni correction and its compatibility with conformal prediction, highlighting the potential to strengthen predictive uncertainty estimates.
Poster
Nicolas Menet · Jonas Hübotter · Parnian Kassraie · Andreas Krause
[ Hall A-E ]
Abstract
We consider the problem of computing the *probability of maximality* (PoM) of a Gaussian random vector, i.e., the probability for each dimension to be maximal. This is a key challenge in applications ranging from Bayesian optimization to reinforcement learning, where the PoM not only helps with finding an optimal action, but yields a fine-grained analysis of the action domain, crucial in tasks such as drug discovery. Existing techniques are costly, scaling polynomially in computation and memory with the vector size. We introduce LITE, the first approach for estimating Gaussian PoM with *almost-linear time and memory* complexity. LITE achieves SOTA accuracy on a number of tasks, while being in practice several orders of magnitude faster than the baselines. This also translates to a better performance on downstream tasks such as entropy estimation and optimal control of bandits. Theoretically, we cast LITE as entropy-regularized UCB and connect it to prior PoM estimators.
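For intuition about the estimand itself, here is the naive Monte Carlo baseline for PoM that LITE is designed to avoid; the mean and covariance are arbitrary illustrative values.

```python
# Naive Monte Carlo baseline for the probability of maximality (PoM) of a
# Gaussian vector; mean and covariance below are arbitrary illustrative values.
import numpy as np

rng = np.random.default_rng(0)
mu = np.array([0.0, 0.2, 0.5])
cov = np.array([[1.0, 0.3, 0.1],
                [0.3, 1.0, 0.2],
                [0.1, 0.2, 1.0]])

samples = rng.multivariate_normal(mu, cov, size=100_000)
pom = np.bincount(samples.argmax(axis=1), minlength=len(mu)) / len(samples)
print(pom)   # estimated probability of each dimension being the maximum
```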
Poster
Jaiden Fairoze · Guillermo Ortiz-Jimenez · Mel Vecerik · Somesh Jha · Sven Gowal
[ Hall A-E ]
Abstract
This work investigates the theoretical boundaries of creating publicly-detectable schemes to enable the provenance of watermarked imagery. Metadata-based approaches like C2PA provide unforgeability and public-detectability. ML techniques offer robust retrieval and watermarking. However, no existing scheme combines robustness, unforgeability, and public-detectability. In this work, we formally define such a scheme and establish its existence. Although theoretically possible, we find that at present, it is intractable to build certain components of our scheme without a leap in deep learning capabilities. We analyze these limitations and propose research directions that need to be addressed before we can practically realize robust and publicly-verifiable provenance.
Poster
Ross Irwin · Alessandro Tibo · Jon Paul Janet · Simon Olsson
[ Hall A-E ]
Abstract
Methods for jointly generating molecular graphs along with their 3D conformations have gained prominence recently due to their potential impact on structure-based drug design. Current approaches, however, often suffer from very slow sampling times or generate molecules with poor chemical validity. Addressing these limitations, we propose Semla, a scalable E(3)-equivariant message passing architecture. We further introduce an unconditional 3D molecular generation model, SemlaFlow, which is trained using equivariant flow matching to generate a joint distribution over atom types, coordinates, bond types and formal charges. Our model produces state-of-the-art results on benchmark datasets with as few as 20 sampling steps, corresponding to a two-order-of-magnitude speedup over the prior state of the art. Furthermore, we highlight limitations of current evaluation methods for 3D generation and propose new benchmark metrics for unconditional molecular generators. Finally, using these new metrics, we compare our model's ability to generate high quality samples against current approaches and further demonstrate SemlaFlow's strong performance.
Poster
Juntong Chen · Johannes Schmidt-Hieber · Claire Donnat · Olga Klopp
[ Hall A-E ]
Abstract
Graph Convolutional Networks (GCNs) have become a pivotal method in machine learning for modeling functions over graphs. Despite their widespread success across various applications, their statistical properties (e.g., consistency, convergence rates) remain ill-characterized. To begin addressing this knowledge gap, we consider networks for which the graph structure implies that neighboring nodes exhibit similar signals and provide statistical theory for the impact of convolution operators. Focusing on estimators based solely on neighborhood aggregation, we examine how two common convolutions—the original GCN and GraphSAGE convolutions—affect the learning error as a function of the neighborhood topology and the number of convolutional layers. We explicitly characterize the bias-variance-type trade-off incurred by GCNs as a function of the neighborhood size and identify specific graph topologies where convolution operators are less effective. Our theoretical findings are corroborated by synthetic experiments, and provide a starting point for a deeper quantitative understanding of convolutional effects in GCNs, toward rigorous guidelines for practitioners.
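The two convolutions under study can be written as single propagation steps on a dense adjacency matrix; the untrained, feature-transform-free layers below are only a schematic of the aggregation being analyzed, on a tiny hypothetical graph.

```python
# Schematic, untrained propagation steps (no feature transforms) for the two
# convolutions studied, written on a dense adjacency matrix.
import numpy as np

def gcn_step(A: np.ndarray, X: np.ndarray) -> np.ndarray:
    """Original GCN aggregation: D^{-1/2} (A + I) D^{-1/2} X with self-loops."""
    A_hat = A + np.eye(A.shape[0])
    d_inv_sqrt = 1.0 / np.sqrt(A_hat.sum(axis=1))
    return (d_inv_sqrt[:, None] * A_hat * d_inv_sqrt[None, :]) @ X

def sage_step(A: np.ndarray, X: np.ndarray) -> np.ndarray:
    """GraphSAGE (mean variant): concatenate own signal with the neighborhood mean."""
    deg = np.maximum(A.sum(axis=1, keepdims=True), 1.0)
    return np.concatenate([X, (A @ X) / deg], axis=1)

A = np.array([[0, 1, 1], [1, 0, 0], [1, 0, 0]], dtype=float)   # tiny star graph
X = np.array([[1.0], [2.0], [3.0]])                            # node signals
print(gcn_step(A, X).ravel())
print(sage_step(A, X))
```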
Poster
Yixin Yan · QIAO YANG · Ziping Zhao
[ Hall A-E ]
Abstract
Covariance matrix estimation is a fundamental problem in multivariate data analysis. In many situations, variables exhibit a positive linear dependency, i.e., a positive correlation. This paper tackles the challenge of estimating covariance matrices with positive correlations in high-dimensional settings. We propose a positive definite thresholding covariance estimation problem that includes nonconvex sparsity penalties and nonnegative correlation constraints. To address this problem, we introduce a multistage adaptive estimation algorithm based on majorization-minimization (MM). This algorithm progressively refines the estimates by solving a weighted $\ell_{1}$-regularized problem at each stage. Additionally, we present a comprehensive theoretical analysis that characterizes the estimation error associated with the estimates generated by the MM algorithm. The analysis reveals that the error comprises two components: the optimization error and the statistical error. The optimization error decreases to zero at a linear rate, allowing the proposed estimator to eventually reach the oracle statistical rate under mild conditions. Furthermore, we explore various extensions of the proposed estimation technique; in particular, we demonstrate that it can be extended to the correlation matrix estimation scenario. Our theoretical findings are supported by extensive numerical experiments conducted on both synthetic and real-world datasets.
Poster
Lingkai Kong · Yuanqi Du · Wenhao Mu · Kirill Neklyudov · Valentin De Bortoli · Dongxia Wu · Haorui Wang · Aaron Ferber · Yian Ma · Carla Gomes · Chao Zhang
[ Hall A-E ]
Abstract
Addressing real-world optimization problems becomes particularly challenging when analytic objective functions or constraints are unavailable. While numerous studies have addressed the issue of unknown objectives, limited research has focused on scenarios where feasibility constraints are not given explicitly. Overlooking these constraints can lead to spurious solutions that are unrealistic in practice. To deal with such unknown constraints, we propose to perform optimization within the data manifold using diffusion models. To constrain the optimization process to the data manifold, we reformulate the original optimization problem as a sampling problem from the product of the Boltzmann distribution defined by the objective function and the data distribution learned by the diffusion model. Depending on the differentiability of the objective function, we propose two different sampling methods. For differentiable objectives, we propose a two-stage framework that begins with a guided diffusion process for warm-up, followed by a Langevin dynamics stage for further correction. For non-differentiable objectives, we propose an iterative importance sampling strategy using the diffusion model as the proposal distribution. Comprehensive experiments on a synthetic dataset, six real-world black-box optimization datasets, and a multi-objective molecule optimization dataset show that our method achieves better or comparable performance with previous state-of-the-art baselines.
Poster
Juha Harviainen · Pekka Parviainen
[ Hall A-E ]
Abstract
Expert knowledge can greatly reduce the complexity of Bayesian network structure learning by constraining the search space. These constraints can come in the form of ancestral constraints that relate to the existence of paths between nodes. When the constraints are compiled into a directed acyclic graph, the complexity of learning with ancestral constraints is connected to the number of ideals of the constraint graph. First, we consider precedence constraints which define a partial order that the structure must obey. Taking the path cover number of the constraint graph as a parameter, we extend earlier results to the problems of sampling and weighted counting of network structures. We also consider the problems with related ancestral constraints which state that a node must or cannot be an ancestor of another. With positive ancestral constraints, we show that the problems are tractable under the additional assumption that the constraint graph has only a small number of incomparable edges. On the other hand, the optimization problem is NP-hard with negative ancestral constraints when the path cover number is at least two. Finally, we show that these problems become fixed-parameter tractable if the constraints are compatible with a subclass of partial orders called bucket orders.
Poster
Yi Fu · Anthony Tompkins · Yang Song · Maurice Pagnucco
[ Hall A-E ]
Abstract
Satisfiability (SAT) solvers based on techniques such as conflict driven clause learning (CDCL) have produced excellent performance on both synthetic and real world industrial problems. While these CDCL solvers only operate on a per-problem basis, graph neural network (GNN) based solvers bring new benefits to the field by allowing practitioners to exploit knowledge gained from previously solved problems to expedite solving of new SAT problems. However, one specific area that is often studied in the context of CDCL solvers, but largely overlooked in GNN solvers, is the relationship between graph-theoretic measures of structure in SAT problems and the generalisation ability of GNN solvers. To bridge the gap between structural graph properties (e.g., modularity, self-similarity) and the generalisability (or lack thereof) of GNN-based SAT solvers, we present StructureSAT: a curated dataset, along with code to further generate novel examples, containing a diverse set of SAT problems from well known problem domains. Furthermore, we utilise a novel splitting method that focuses on deconstructing the families into more detailed hierarchies based on their structural properties. With the new dataset, we aim to help explain problematic generalisation in existing GNN SAT solvers by exploiting knowledge of structural graph properties. We conclude with …
Poster
Parjanya Prashant · Seyedeh Baharan Khatami · Bruno Ribeiro · Babak Salimi
[ Hall A-E ]
Abstract
We consider the task of out-of-distribution (OOD) generalization, where the distribution shift is due to an unobserved confounder ($Z$) affecting both the covariates ($X$) and the labels ($Y$). This confounding introduces heterogeneity in the predictor, i.e., $P(Y \mid X) = E_{P(Z \mid X)}[P(Y \mid X,Z)]$, making traditional covariate and label shift assumptions unsuitable. OOD generalization differs from traditional domain adaptation in that it does not assume access to the covariate distribution ($X^\text{te}$) of the test samples during training. These conditions create a challenging scenario for OOD robustness: (a) $Z^\text{tr}$ is an unobserved confounder during training, (b) $P^\text{te}(Z) \neq P^\text{tr}(Z)$, (c) $X^\text{te}$ is unavailable during training, and (d) the predictive distribution depends on $P^\text{te}(Z)$. While prior work has developed complex predictors requiring multiple additional variables for identifiability of the latent distribution, we explore a set of identifiability assumptions that yield a surprisingly simple predictor using only a single additional variable. Our approach demonstrates superior empirical performance on several benchmark tasks.
Poster
Mingliang Ma · Abolfazl Safikhani
[ Hall A-E ]
Abstract
The objective of transfer learning is to enhance estimation and inference in a target dataset by leveraging knowledge gained from additional sources. Recent studies have explored transfer learning for independent observations in complex, high-dimensional models assuming sparsity, yet research on time series models remains limited. Our focus is on transfer learning for sequences of observations with temporal dependencies and a more intricate model parameter structure. Specifically, we investigate the vector autoregressive model (VAR), a widely recognized model for time series data, where the transition matrix can be deconstructed into a combination of a sparse matrix and a low-rank one. We propose a new transfer learning algorithm tailored for estimating high-dimensional VAR models characterized by low-rank and sparse structures. Additionally, we present a novel approach for selecting informative observations from auxiliary datasets. Theoretical guarantees are established, encompassing model parameter consistency, informative set selection, and the asymptotic distribution of estimators under mild conditions. The latter facilitates the construction of entry-wise confidence intervals for model parameters. Finally, we demonstrate the empirical efficacy of our methodologies through both simulated and real-world datasets.
Poster
Lyuzhou Chen · Taiyu Ban · Derui Lyu · Yijia Sun · Kangtao Hu · Xiangyu Wang · Huanhuan Chen
[ Hall A-E ]
Abstract
Causal discovery aims to infer a Directed Acyclic Graph (DAG) from observational data to represent causal relationships among variables. Traditional combinatorial methods search DAG spaces to identify optimal structures, while recent advances in continuous optimization improve this search process. However, integrating structural constraints informed by prior knowledge into these methods remains a substantial challenge. Existing methods typically integrate prior knowledge in a hard way, demanding precise information about causal relationships and struggling with erroneous priors. Such rigidity can lead to significant inaccuracies, especially when the priors are flawed. In response to these challenges, this work introduces the Edge Constraint Adaptive (ECA) method, a novel approach that softly represents the presence of edges, allowing for a differentiable representation of prior constraint loss. This soft integration can more flexibly adjust to both accurate and erroneous priors, enhancing both robustness and adaptability. Empirical evaluations demonstrate that our approach effectively leverages prior knowledge to improve causal structure accuracy while maintaining resilience against prior errors, thus offering significant advancements in the field of causal discovery.
Poster
Lorenz Kummer · Wilfried Gansterer · Nils Kriege
[ Hall A-E ]
Abstract
We investigate the vulnerability of Graph Neural Networks (GNNs) to bit-flip attacks (BFAs) by introducing an analytical framework to study the influence of architectural features, graph properties, and their interaction. The expressivity of GNNs refers to their ability to distinguish non-isomorphic graphs and depends on the encoding of node neighborhoods. We examine the vulnerability of neural multiset functions commonly used for this purpose and establish formal criteria to characterize a GNN's susceptibility to losing expressivity due to BFAs. This enables an analysis of the impact of homophily, graph structural variety, feature encoding, and activation functions on GNN robustness. We derive theoretical bounds for the number of bit flips required to degrade GNN expressivity on a dataset, identifying ReLU-activated GNNs operating on highly homophilous graphs with low-dimensional or one-hot encoded features as particularly susceptible. Empirical results using ten real-world datasets confirm the statistical significance of our key theoretical insights and offer actionable results to mitigate BFA risks in expressivity-critical applications.
Poster
Sheng Liu · Zihan Wang · Yuxiao Chen · Qi Lei
[ Hall A-E ]
Abstract
Reconstruction attacks and defenses are essential in understanding the data leakage problem in machine learning. However, prior work has centered around empirical observations of gradient inversion attacks, lacks theoretical grounding, and cannot disentangle the usefulness of defending methods from the computational limitation of attacking methods. In this work, we propose to view the problem as an inverse problem, enabling us to theoretically and systematically evaluate the data reconstruction attack. On various defense methods, we derive the algorithmic upper bound and the matching (in feature and architecture dimensions) information-theoretic lower bound on the reconstruction error for two-layer neural networks. To complement the theoretical results and investigate the utility-privacy trade-off, we define a natural evaluation metric of the defense methods with similar utility loss among the strongest attacks. We further propose a strong reconstruction attack that helps update some previous understanding of the strength of defense methods under our proposed evaluation metric.
Poster
Alexandre Perez-Lebel · Gaël Varoquaux · Sanmi Koyejo · Matthieu Doutreligne · Marine Le Morvan
[ Hall A-E ]
Abstract
Probabilistic classifiers are central for making informed decisions under uncertainty. Based on the maximum expected utility principle, optimal decision rules can be derived using the posterior class probabilities and misclassification costs. Yet, in practice only learned approximations of the oracle posterior probabilities are available. In this work, we quantify the excess risk (a.k.a. regret) incurred using approximate posterior probabilities in batch binary decision-making. We provide analytical expressions for miscalibration-induced regret ($R^{CL}$), as well as tight and informative upper and lower bounds on the regret of calibrated classifiers ($R^{GL}$). These expressions allow us to identify regimes where recalibration alone addresses most of the regret, and regimes where the regret is dominated by the grouping loss, which calls for post-training beyond recalibration. Crucially, both $R^{CL}$ and $R^{GL}$ can be estimated in practice using a calibration curve and a recent grouping loss estimator. In NLP experiments, we show that these quantities identify when the expected gain of more advanced post-training is worth the operational cost. Finally, we highlight the potential of multicalibration approaches as efficient alternatives to costlier fine-tuning approaches.
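As background for the regret analysis, the snippet below shows the standard cost-sensitive Bayes decision rule that serves as the reference: threshold the (approximate) posterior at $c_{FP}/(c_{FP}+c_{FN})$. The costs and probabilities are illustrative; this is the textbook rule, not the paper's estimators.

```python
# Standard cost-sensitive Bayes decision rule (background, not the paper's method):
# with false-positive cost c_fp and false-negative cost c_fn, the optimal rule
# thresholds the posterior P(Y=1 | x) at c_fp / (c_fp + c_fn).
import numpy as np

def bayes_decision(posterior_pos: np.ndarray, c_fp: float, c_fn: float) -> np.ndarray:
    threshold = c_fp / (c_fp + c_fn)
    return (posterior_pos >= threshold).astype(int)

probs = np.array([0.10, 0.35, 0.60, 0.90])        # (approximate) posteriors
print(bayes_decision(probs, c_fp=1.0, c_fn=4.0))  # threshold 0.2 -> [0 1 1 1]
```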
Poster
Gaëtan Serré · Argyris Kalogeratos · Nicolas Vayatis
[ Hall A-E ]
Abstract
In this paper, we present a deterministic particle-based method for global optimization of continuous Sobolev functions, called *Stein Boltzmann Sampling* (SBS). SBS initializes a number of particles representing candidate solutions uniformly, then uses the *Stein Variational Gradient Descent* (SVGD) algorithm to sequentially and deterministically move those particles in order to approximate a target distribution whose mass is concentrated around promising areas of the domain of the optimized function. The target is chosen to be a properly parametrized Boltzmann distribution. For the purpose of global optimization, we adapt the generic SVGD theoretical framework to address more general target distributions over a compact subset of $\mathbb{R}^d$, and we prove SBS's asymptotic convergence. In addition to the main SBS algorithm, we present two variants: SBS-PF, which includes a particle filtering strategy, and SBS-HYBRID, which uses SBS or SBS-PF as a continuation after other particle- or distribution-based optimization methods. A detailed comparison with state-of-the-art methods on benchmark functions demonstrates that SBS and its variants are highly competitive, while the combination of the two variants provides the best trade-off between accuracy and computational cost.
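Since SBS builds on SVGD, a minimal generic SVGD update with an RBF kernel is sketched below for a simple Boltzmann target; this is only the standard inner step, not the SBS, SBS-PF, or SBS-HYBRID procedures, and the bandwidth, step size, and target are arbitrary choices.

```python
# Generic SVGD update with an RBF kernel, applied to a simple Boltzmann target
# exp(-||x||^2 / (2 T)). Bandwidth and step size are arbitrary illustrative values.
import numpy as np

def svgd_step(x, grad_log_p, stepsize=0.05, bandwidth=1.0):
    """x: (n, d) particles; grad_log_p(x) returns the (n, d) score of the target."""
    n = x.shape[0]
    diff = x[:, None, :] - x[None, :, :]                 # diff[i, j] = x_i - x_j
    k = np.exp(-np.sum(diff ** 2, axis=-1) / bandwidth)  # RBF kernel matrix
    attract = k @ grad_log_p(x)                          # kernel-weighted scores
    repulse = (2.0 / bandwidth) * (k[:, :, None] * diff).sum(axis=1)
    return x + stepsize * (attract + repulse) / n

grad_log_p = lambda x, T=0.5: -x / T                     # score of the Boltzmann target
particles = np.random.default_rng(0).uniform(-3, 3, size=(200, 2))
for _ in range(300):
    particles = svgd_step(particles, grad_log_p)
print(particles.mean(axis=0), particles.var(axis=0))     # roughly mean 0, variance T
```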
Poster
firooz shahriari-mehr · Ashkan Panahi
[ Hall A-E ]
Abstract
We propose a novel decentralized convex optimization algorithm called ASY-DAGP, where each agent has its own distinct objective function and constraint set. Agents compute at different speeds, and their communication is delayed and directed. Employing local buffers, ASY-DAGP enhances asynchronous communication and is robust to challenging scenarios such as message failure. We validate these features through numerical experiments. By analyzing ASY-DAGP, we provide the first sublinear convergence rate for the above setup under mild assumptions. This rate depends on a novel characterization of delay profiles, which we term the delay factor. We calculate the delay factor for the well-known bounded delay profiles, providing new insights for these scenarios. Our analysis is conducted by introducing a novel approach tied to the celebrated PEP framework. Our approach does not require the design of Lyapunov functions and instead provides a novel view of optimization algorithms as linear systems.
Poster
Davin Hill · Joshua Bone · Aria Masoomi · Max Torop · Jennifer Dy
[ Hall A-E ]
Abstract
Explainability methods are often challenging to evaluate and compare. With a multitude of explainers available, practitioners must often compare and select explainers based on quantitative evaluation metrics. One particular differentiator between explainers is the diversity of explanations for a given dataset; i.e. whether all explanations are identical, unique and uniformly distributed, or somewhere between these two extremes. In this work, we define a complexity measure for explainers, globalness, which enables deeper understanding of the distribution of explanations produced by feature attribution and feature selection methods for a given dataset. We establish the axiomatic properties that any such measure should possess and prove that our proposed measure, Wasserstein Globalness, meets these criteria. We validate the utility of Wasserstein Globalness using image, tabular, and synthetic datasets, empirically showing that it both facilitates meaningful comparison between explainers and improves the selection process for explainability methods.
Poster
Spencer Hutchinson · Tianyi Chen · Mahnoosh Alizadeh
[ Hall A-E ]
Abstract
We study the problem of online convex optimization (OCO) under unknown linear constraints that are either static, or stochastically time-varying. For this problem, we introduce an algorithm that we term Optimistically Safe OCO (OSOCO) and show that it enjoys $\tilde{O}(\sqrt{T})$ regret and no constraint violation. In the case of static linear constraints, this improves on the previous best known $\tilde{O}(T^{2/3})$ regret under the same assumptions. In the case of stochastic time-varying constraints, our work supplements existing results that show $O(\sqrt{T})$ regret and $O(\sqrt{T})$ cumulative violation under more general convex constraints and a different set of assumptions. In addition to our theoretical guarantees, we also give numerical results that further validate the effectiveness of our approach.
Poster
Zijun Gao · Shu Ge · Jian Qian
[ Hall A-E ]
Abstract
Under the prevalent potential outcome model in causal inference, each unit is associated with multiple potential outcomes, of which at most one is observed, leading to many causal quantities being only partially identified. The inherent missing data issue echoes the multi-marginal optimal transport (MOT) problem, where marginal distributions are known, but how the marginals couple to form the joint distribution is unavailable. In this paper, we cast the causal partial identification problem in the framework of MOT with $K$ margins and $d$-dimensional outcomes and obtain the exact partially identified set. To estimate the partially identified set via MOT, we establish a convergence rate of the plug-in MOT estimator for the $\ell_2$ cost function stemming from the variance minimization problem and prove that it is minimax optimal for arbitrary $K$ and $d \le 4$. We also extend the convergence result to general quadratic objective functions. Numerically, we demonstrate the efficacy of our method over synthetic datasets and several real-world datasets where our proposal consistently outperforms the baseline by a significant margin (over 70\%). In addition, we provide efficient off-the-shelf implementations of MOT with general objective functions.
Poster
Alex Chebbah · Christian L. Vestergaard · jean-baptiste masson · Etienne Boursier
[ Hall A-E ]
Abstract
Entropy maximization and free energy minimization are general physics principles for modeling dynamic systems. Notable examples include modeling decision-making within the brain using the free-energy principle, optimizing the accuracy-complexity trade-off when accessing hidden variables with the information bottleneck principle (Tishby et al. 2000), and navigation in random environments using information maximization (Vergassola et al. 2007). Building on these principles, we propose a new class of bandit algorithms that maximize an approximation to the information of a key variable within the system. To this end, we develop an approximated, analytical physics-based representation of the entropy to forecast the information gain of each action and greedily choose the one with the largest information gain. This method yields strong performance in classical bandit settings. Motivated by its empirical success, we prove its asymptotic optimality for the multi-armed bandit problem with Gaussian rewards. Since it encompasses the system's properties in a single, global functional, this approach can be efficiently adapted to more complex bandit settings. This calls for further investigation of information maximization approaches for multi-armed bandit problems.
Poster
Ziqing Xu · Hancheng Min · Lachlan MacDonald · Jinqi Luo · Salma Tarmoun · Enrique Mallada · Rene Vidal
[ Hall A-E ]
Abstract
Despite the empirical success of Low-Rank Adaptation (LoRA) in fine-tuning pre-trained models, there is little theoretical understanding of how first-order methods with carefully crafted initialization adapt models to new tasks. In this work, we take the first step towards bridging this gap by theoretically analyzing the learning dynamics of LoRA for matrix factorization (MF) under gradient flow (GF), emphasizing the crucial role of initialization. For small initialization, we theoretically show that GF converges to a neighborhood of the optimal solution, with smaller initialization leading to lower final error. Our analysis shows that the final error is affected by the misalignment between the singular spaces of the pre-trained model and the target matrix, and reducing the initialization scale improves alignment. To address this misalignment, we propose a spectral initialization for LoRA in MF and theoretically prove that GF with small spectral initialization converges to the fine-tuning task with arbitrary precision. Numerical experiments from MF and image classification validate our findings.
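A toy numerical sketch of the two initializations (an assumed set-up, not the paper's gradient-flow analysis): with a small random initialization the adapter starts far from the target residual, whereas a spectral initialization taken from the top singular factors of the residual starts at its best rank-$r$ approximation.

```python
# Toy comparison of two initializations for a LoRA-style factorization
# W0 + B @ A approximating a target with a higher-rank shift (all values invented).
import numpy as np

rng = np.random.default_rng(0)
m, n, r = 20, 15, 4
W0 = rng.normal(size=(m, n))                                        # "pre-trained" weights
W_target = W0 + rng.normal(size=(m, 2 * r)) @ rng.normal(size=(2 * r, n))

# Small (near-zero) random initialization.
B_small, A_small = 1e-3 * rng.normal(size=(m, r)), 1e-3 * rng.normal(size=(r, n))

# Spectral initialization: top-r singular factors of the residual W_target - W0.
U, s, Vt = np.linalg.svd(W_target - W0, full_matrices=False)
B_spec = U[:, :r] * np.sqrt(s[:r])
A_spec = np.sqrt(s[:r])[:, None] * Vt[:r]

for name, B, A in [("small", B_small, A_small), ("spectral", B_spec, A_spec)]:
    err = np.linalg.norm(W0 + B @ A - W_target)
    print(f"{name:8s} init residual: {err:.3f}")
```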
Poster
Feihu Huang · Chunyu Xuan · Xinrui Wang · Siqi Zhang · Songcan Chen
[ Hall A-E ]
Abstract
Minimax optimization has recently been widely applied in many machine learning tasks such as generative adversarial networks, robust learning and reinforcement learning. In this paper, we study a class of nonconvex-nonconcave minimax optimization problems with nonsmooth regularization, where the objective function is possibly nonconvex in the primal variable $x$, and is nonconcave and satisfies the Polyak-Lojasiewicz (PL) condition in the dual variable $y$. Moreover, we propose a class of enhanced momentum-based gradient descent ascent methods (i.e., MSGDA and AdaMSGDA) to solve these stochastic nonconvex-PL minimax problems. In particular, our AdaMSGDA algorithm can use various adaptive learning rates in updating the variables $x$ and $y$ without relying on any specific form. Theoretically, we prove that our methods attain the best known sample complexity of $\tilde{O}(\epsilon^{-3})$, requiring only one sample per iteration, for finding an $\epsilon$-stationary solution. Some numerical experiments on PL-game and Wasserstein-GAN demonstrate the efficiency of our proposed methods.
Poster
Charles Westphal · Stephen Hailes · Mirco Musolesi
[ Hall A-E ]
Abstract
In this paper, we introduce Partial Information Decomposition of Features (PIDF), a new paradigm for simultaneous data interpretability and feature selection. Contrary to traditional methods that assign a single importance value, our approach is based on three metrics per feature: the mutual information shared with the target variable, the feature’s contribution to synergistic information, and the amount of this information that is redundant. In particular, we develop a novel procedure based on these three metrics, which reveals not only how features are correlated with the target but also the additional and overlapping information provided by considering them in combination with other features. We extensively evaluate PIDF using both synthetic and real-world data, demonstrating its potential applications and effectiveness, by considering case studies from genetics and neuroscience.
Poster
Jiajun He · Wenlin Chen · Mingtian Zhang · David Barber · Jose Miguel Hernandez-Lobato
[ Hall A-E ]
Abstract
Training generative models to sample from unnormalized density functions is an important and challenging task in machine learning. Traditional training methods often rely on the reverse Kullback-Leibler (KL) divergence due to its tractability. However, the mode-seeking behavior of reverse KL hinders effective approximation of multi-modal target distributions. To address this, we propose to minimize the reverse KL along diffusion trajectories of both model and target densities. We refer to this objective as the reverse diffusive KL divergence, which allows the model to capture multiple modes. Leveraging this objective, we train neural samplers that can efficiently generate samples from the target distribution in one step. We demonstrate that our method enhances sampling performance across various Boltzmann distributions, including both synthetic multi-modal densities and n-body particle systems.
Poster
Ruiyang Jin · Zaiwei Chen · Yiheng Lin · Jie Song · Adam Wierman
[ Hall A-E ]
Abstract
Independent learning (IL) is a popular approach for achieving scalability in large-scale multi-agent systems, yet it typically lacks global convergence guarantees. In this paper, we study two representative algorithms—independent $Q$-learning and independent natural actor-critic—within both value-based and policy-based frameworks, and provide the first finite-sample analysis for approximate global convergence. Our results show that IL can achieve global convergence up to a fixed error arising from agent interdependence, which characterizes the fundamental limit of IL in achieving true global convergence. To establish these results, we develop a novel approach by constructing a separable Markov decision process (MDP) for convergence analysis and then bounding the gap caused by the model discrepancy between this separable MDP and the original one. Finally, we present numerical experiments using a synthetic MDP and an electric vehicle charging example to demonstrate our findings and the practical applicability of IL.
Poster
Shireen Kudukkil Manchingal · Muhammad Mubashar · Kaizheng Wang · Fabio Cuzzolin
[ Hall A-E ]
Abstract
Predictions of uncertainty-aware models are diverse, ranging from single point estimates (often averaged over prediction samples) to predictive distributions, to set-valued or credal-set representations. We propose a novel unified evaluation framework for uncertainty-aware classifiers, applicable to a wide range of model classes, which allows users to tailor the trade-off between accuracy and precision of predictions via a suitably designed performance metric. This makes possible the selection of the most suitable model for a particular real-world application as a function of the desired trade-off. Our experiments, concerning Bayesian, ensemble, evidential, deterministic, credal and belief function classifiers on the CIFAR-10, MNIST and CIFAR-100 datasets, show that the metric behaves as desired.
Poster
Minsu Kim · Sanghyeok Choi · Hyeonah Kim · Jiwoo Son · Jinkyoo Park · Yoshua Bengio
[ Hall A-E ]
Abstract
We present the Generative Flow Ant Colony Sampler (GFACS), a novel meta-heuristic method that hierarchically combines amortized inference and parallel stochastic search. Our method first leverages Generative Flow Networks (GFlowNets) to amortize a multi-modal prior distribution over combinatorial solution space that encompasses both high-reward and diversified solutions. This prior is iteratively updated via parallel stochastic search in the spirit of Ant Colony Optimization (ACO), leading to the posterior distribution that generates near-optimal solutions. Extensive experiments across seven combinatorial optimization problems demonstrate GFACS's promising performances.
Poster
Disha Hegde · Mohamed Adil · Jon Cockayne
[ Hall A-E ]
Abstract
Gaussian processes are notorious for scaling cubically with the size of the training set, preventing application to very large regression problems. Computation-aware Gaussian processes (CAGPs) tackle this scaling issue by exploiting probabilistic linear solvers to reduce complexity, widening the posterior with additional *computational* uncertainty due to reduced computation. However, the most commonly used CAGP framework results in (sometimes dramatically) conservative uncertainty quantification, making the posterior difficult to use in practice. In this work, we prove that if the utilised probabilistic linear solver is *calibrated*, in a rigorous statistical sense, then so too is the induced CAGP. We thus propose a new CAGP framework, CAGP-GS, based on using Gauss-Seidel iterations for the underlying probabilistic linear solver. CAGP-GS performs favourably compared to existing approaches when the test set is low-dimensional and few iterations are performed. We test the calibratedness on a synthetic problem, and compare the performance to existing approaches on a large-scale global temperature regression problem.
Poster
Yuxiong Gao · Wentao Li · Rong Chen
[ Hall A-E ]
Abstract
State-space models have been used in many applications, including econometrics, engineering, medical research, etc. The maximum likelihood estimation (MLE) of the static parameter of general state-space models is not straightforward because the likelihood function is intractable. It is popular to use the sequential Monte Carlo (SMC) method to perform gradient ascent optimisation in either offline or online fashion. One problem with existing online SMC methods for MLE is that the score estimators are inconsistent, i.e. the bias does not vanish with increasing particle size. In this paper, two SMC algorithms are proposed based on an importance sampling weight function to use each set of generated particles more efficiently. The first one is an offline algorithm that locally approximates the likelihood function using importance sampling, where the locality is adapted by the effective sample size (ESS). The second one is a semi-online algorithm that has a computational cost linear in the particle size and uses score estimators that are consistent. We study its consistency and asymptotic normality. Their computational superiority is illustrated in numerical studies for long time series.
Poster
Ben Aoki-Sherwood · Catherine Bregou · David Liben-Nowell · Kiran Tomlinson · Thomas Zeng
[ Hall A-E ]
Abstract
The widely used Plackett-Luce ranking model assumes that individuals rank items by making repeated choices from a universe of items. But in many cases the universe is too big for people to plausibly consider all options. In the choice literature, this issue has been addressed by supposing that individuals first sample a small consideration set and then choose among the considered items. However, inferring unobserved consideration sets (or item consideration probabilities) in this ``consider then choose'' setting poses significant challenges, because even simple models of consideration with strong independence assumptions are not identifiable, even if item utilities are known. We apply the consider-then-choose framework to top-$k$ rankings, where we assume rankings are constructed according to a Plackett-Luce model after sampling a consideration set. While item consideration probabilities remain non-identified in this setting, we prove that we can infer bounds on the relative sizes of consideration probabilities. Additionally, given a condition on the expected consideration set size and known item utilities, we derive absolute upper and lower bounds on item consideration probabilities. We also provide algorithms to tighten those bounds on consideration probabilities by propagating inferred constraints. Thus, we show that we can learn useful information about consideration probabilities despite not being …
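To fix ideas, the following generative sketch samples a top-$k$ ranking from a consider-then-choose model: items enter a consideration set independently, and a Plackett-Luce ranking is drawn over the considered items; the utilities and consideration probabilities are hypothetical values, not the paper's estimates.

```python
# Generative sketch of consider-then-choose top-k rankings: items enter the
# consideration set independently, then a Plackett-Luce ranking is drawn over
# the considered items. Utilities and consideration probabilities are hypothetical.
import numpy as np

rng = np.random.default_rng(0)
utilities = np.array([2.0, 1.0, 0.5, 0.0, -1.0])
consider_p = np.array([0.9, 0.8, 0.6, 0.5, 0.3])

def sample_top_k(k: int) -> list:
    considered = np.flatnonzero(rng.random(len(utilities)) < consider_p)
    weights = np.exp(utilities[considered])
    pool = list(considered)
    ranking = []
    for _ in range(min(k, len(pool))):
        idx = rng.choice(len(pool), p=weights / weights.sum())  # Plackett-Luce choice
        ranking.append(int(pool.pop(idx)))
        weights = np.delete(weights, idx)
    return ranking

print([sample_top_k(3) for _ in range(3)])
```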
Poster
Sunmin Oh · Seungsu Han · Gunwoong Park
[ Hall A-E ]
Abstract
Much of science involves discovering and modeling causal relationships in nature. Significant progress has been made in developing statistical methods for representing and identifying causal knowledge from data using Linear Non-Gaussian Acyclic Models (LiNGAMs). Despite successes in learning LiNGAMs across various sample settings, the optimal sample complexity for high-dimensional LiNGAMs remains unexplored. This study establishes the optimal sample complexity for learning the structure of LiNGAMs under a sub-Gaussianity assumption. Specifically, it introduces a structure recovery algorithm using distance covariance that achieves the optimal sample complexity, $n = \Theta(d_{in} \log \frac{p}{d_{in}})$, without assuming faithfulness or a known indegree. The theoretical findings and superiority of the proposed algorithm compared to existing algorithms are validated through numerical experiments and real data analysis.
Poster
Rina Dechter · Annie Raichev · Jin Tian · Alexander Ihler
[ Hall A-E ]
Abstract
This paper focuses on the computational complexity of computing empirical plug-in estimates for causal effect queries. Given a causal graph and observational data, any identifiable causal query can be estimated from an expression over the observed variables, called the estimand. The estimand can then be evaluated by plugging in probabilities computed empirically from data. In contrast to conventional wisdom, which assumes that high-dimensional probabilistic functions will lead to exponential evaluation time, we show that estimand evaluation can be done efficiently, potentially in time linear in the data size, depending on the estimand's hypergraph. In particular, we show that both the $\it{treewidth}$ and $\it{hypertree width}$ of the estimand's structure bound the evaluation complexity, analogous to their role in bounding the complexity of inference in probabilistic graphical models. In settings with high-dimensional functions, the hypertree width often provides a more effective bound, since the empirical distributions are sparse.
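A textbook instance of such a plug-in estimand is back-door adjustment, evaluated below by summing only over the confounder configurations that actually occur in the data; the simulated data are illustrative and this is not the paper's algorithm.

```python
# Textbook plug-in estimand (back-door adjustment), evaluated by summing only
# over confounder configurations that actually occur in the simulated data.
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)
n = 5000
Z = rng.integers(0, 2, size=(n, 3))                              # observed confounders
X = rng.binomial(1, 1 / (1 + np.exp(-(Z.sum(axis=1) - 1.5))))
Y = rng.binomial(1, 1 / (1 + np.exp(-(1.5 * X + Z[:, 0] - 1.0))))
df = pd.DataFrame({"X": X, "Y": Y, "Z0": Z[:, 0], "Z1": Z[:, 1], "Z2": Z[:, 2]})

zcols = ["Z0", "Z1", "Z2"]
pz = df.groupby(zcols).size() / n                                # empirical P(z)
py_xz = df[df.X == 1].groupby(zcols)["Y"].mean()                 # empirical P(Y=1 | X=1, z)

# Plug-in estimate of P(Y=1 | do(X=1)) = sum_z P(Y=1 | X=1, z) P(z)
estimate = float((py_xz.reindex(pz.index).fillna(0.0) * pz).sum())
print(f"P(Y=1 | do(X=1)) ~= {estimate:.3f}")
```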
Poster
James Cheshire · Stephan Clemencon
[ Hall A-E ]
Abstract
In this article, bipartite ranking, a statistical learning problem involved in many applications and widely studied in the passive context, is approached in a much more general active setting than the discrete one previously considered in the literature. While the latter assumes that the conditional distribution is piecewise constant, the framework we develop in contrast permits dealing with continuous conditional distributions, provided that they fulfill a Hölder smoothness constraint. We first show that a naive approach based on discretisation at a uniform level, fixed a priori, and consisting in next applying the active strategy designed for the discrete setting, generally fails. Instead, we propose a novel algorithm, referred to as smooth-rank and designed for the continuous setting, which aims to minimise the distance between the ROC curve of the estimated ranking rule and the optimal one w.r.t. the $\sup$ norm. We show that, for a fixed confidence level $\epsilon>0$ and probability $\delta\in (0,1)$, smooth-rank is PAC$(\epsilon,\delta)$. In addition, we provide a problem-dependent upper bound on the expected sampling time of smooth-rank and establish a problem-dependent lower bound on the expected sampling time of any PAC$(\epsilon,\delta)$ algorithm. Beyond the theoretical analysis carried out, numerical results are …
Poster
Danial Davarnia · Mohammadreza Kiaghadi
[ Hall A-E ]
Abstract
Optimization problems with norm-bounding constraints appear in various applications, from portfolio optimization to machine learning, feature selection, and beyond. A widely used variant of these problems relaxes the norm-bounding constraint through Lagrangian relaxation and moves it to the objective function as a form of penalty or regularization term. A challenging class of these models uses the zero-norm function to induce sparsity in statistical parameter estimation models. Most existing exact solution methods for these problems use additional binary variables together with artificial bounds on variables to formulate them as a mixed-integer program in a higher dimension, which is then solved by off-the-shelf solvers. Other exact methods utilize specific structural properties of the objective function to solve certain variants of these problems, making them non-generalizable to other problems with different structures. An alternative approach employs nonconvex penalties with desirable statistical properties, which are solved using heuristic or local methods due to the structural complexity of those terms. In this paper, we develop a novel graph-based method to globally solve optimization problems that contain a generalization of norm-bounding constraints. This includes standard $\ell_p$-norms for $p \in [0, \infty)$ as well as nonconvex penalty terms, such as SCAD and MCP, as special cases. Our …
Poster
Kevin Rojas · Yixin Tan · Molei Tao · Yuriy Nevmyvaka · Wei Deng
[ Hall A-E ]
Abstract
The Momentum Schrödinger Bridge (mSB) \citep{mSB} has emerged as a leading method for accelerating generative diffusion processes and reducing transport costs. However, the lack of simulation-free properties inevitably results in high training costs and affects scalability. To obtain a trade-off between transport properties and scalability, we introduce variational Schrödinger momentum diffusion (VSMD), which employs linearized forward score functions (variational scores) to eliminate the dependence on simulated forward trajectories. Our approach leverages a multivariate diffusion process with adaptively transport-optimized variational scores. Additionally, we apply a critical-damping transform to stabilize training by removing the need for score estimations for both velocity and samples. Theoretically, we prove the convergence of samples generated with optimal variational scores and momentum diffusion. Empirical results demonstrate that VSMD efficiently generates anisotropic shapes while maintaining transport efficacy, outperforming overdamped alternatives, and avoiding complex denoising processes. Our approach also scales effectively to real-world data, achieving competitive results in time series and image generation, both in unconditional and conditional settings.
Poster
Nong Hieu · Jeremie Houssineau · Neil Chada · Emmanuel Delande
[ Hall A-E ]
Abstract
The special role of epistemic uncertainty in Machine Learning is now well recognised, and an increasing amount of research is focused on methods for dealing specifically with such a lack of knowledge. Yet, most often, a probabilistic representation is considered for both aleatoric and epistemic uncertainties, hence creating challenges in applications where decoupling these two types of uncertainty is necessary. In this work, we show that an alternative representation of epistemic uncertainty, based on possibility theory, maintains many of the convenient features of standard Bayesian inference while displaying specific behaviours and properties that closely match the ones of an intuitive notion of information. Our main contributions are: i) a general framework for jointly representing epistemic and aleatoric uncertainties, ii) a Bernstein-von Mises theorem for the analogue of Bayes' rule in possibility theory, iii) a version of the law of large numbers and of the central limit theorem for the associated variables, and iv) an analysis of the properties of the possibilistic maximum a posteriori. These results highlight that a dedicated and principled representation of epistemic uncertainty, that is compatible with standard Bayesian inference and preserves many of its strengths, is attainable.
Poster
Patrik Okanovic · Andreas Kirsch · Jannes Kasper · Torsten Hoefler · Andreas Krause · Nezihe Merve Gürel
[ Hall A-E ]
Abstract
We introduce MODEL SELECTOR, a framework for label-efficient selection of pretrained classifiers. Given a pool of unlabeled target data, MODEL SELECTOR samples a small subset of highly informative examples for labeling, in order to efficiently identify the best pretrained model for deployment on this target dataset. Through extensive experiments, we demonstrate that MODEL SELECTOR drastically reduces the need for labeled data while consistently picking the best or near-best performing model. Across 18 model collections on 16 different datasets, comprising over 1,500 pretrained models, MODEL SELECTOR reduces the labeling cost by up to 94.15% to identify the best model compared to the cost of the strongest baseline. Our results further highlight the robustness of MODEL SELECTOR in model selection, as it reduces the labeling cost by up to 72.41% when selecting a near-best model, whose accuracy is only within 1% of the best model.
Poster
Florian Hübler · Ilyas Fatkhullin · Niao He
[ Hall A-E ]
Abstract
Recent empirical evidence indicates that many machine learning applications involve heavy-tailed gradient noise, which challenges the standard assumptions of bounded variance in stochastic optimization. Gradient clipping has emerged as a popular tool to handle this heavy-tailed noise, as it achieves good performance both theoretically and practically. However, our current theoretical understanding of non-convex gradient clipping has three main shortcomings. First, the theory hinges on large, increasing clipping thresholds, which are in stark contrast to the small constant clipping thresholds employed in practice. Second, clipping thresholds require knowledge of problem-dependent parameters to guarantee convergence. Lastly, even with this knowledge, current sample complexity upper bounds for the method are sub-optimal in nearly all parameters. To address these issues and motivated by practical observations, we connect gradient clipping to its close relative --- Normalized SGD (NSGD) --- and study its convergence properties. First, we establish a parameter-free sample complexity for NSGD of $\mathcal{O}\left(\varepsilon^{-\frac{2p}{p-1}}\right)$ to find an $\varepsilon$-stationary point, only assuming a finite $p$-th central moment of the noise, $p\in(1,2]$. Furthermore, we prove the tightness of this result by providing a matching algorithm-specific lower bound. In the setting where all problem parameters are known, we show this complexity is improved to $\mathcal{O}\left(\varepsilon^{-\frac{3p-2}{p-1}}\right)$, …
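The two update rules being related can be stated in a few lines; the sketch below contrasts a clipped gradient step with an NSGD step on a single stochastic gradient (values chosen only for illustration).

```python
# The two updates being related, for a single stochastic gradient g:
# clipping rescales only when ||g|| exceeds the threshold c, while NSGD always
# takes a step of length lr in the gradient direction. Values are illustrative.
import numpy as np

def clipped_step(x, g, lr, c):
    scale = min(1.0, c / (np.linalg.norm(g) + 1e-12))
    return x - lr * scale * g

def normalized_step(x, g, lr):
    return x - lr * g / (np.linalg.norm(g) + 1e-12)

x = np.array([1.0, -2.0])
g = np.array([10.0, 5.0])            # heavy-tailed noise can make ||g|| very large
print(clipped_step(x, g, lr=0.1, c=1.0))
print(normalized_step(x, g, lr=0.1))
```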
Poster
Chen Xu · Xiuyuan Cheng · Yao Xie
[ Hall A-E ]
Abstract
Computing optimal transport (OT) for general high-dimensional data has been a long-standing challenge. Despite much progress, most of the efforts including neural network methods have been focused on the static formulation of the OT problem. The current work proposes to compute the dynamic OT between two arbitrary distributions $P$ and $Q$ by optimizing a flow model, where both distributions are only accessible via finite samples. Our method learns the dynamic OT by finding an invertible flow that minimizes the transport cost. The trained optimal transport flow subsequently allows for performing many downstream tasks, including infinitesimal density ratio estimation (DRE) and domain adaptation by interpolating distributions in the latent space. The effectiveness of the proposed model on high-dimensional data is demonstrated by strong empirical performance on OT baselines, image-to-image translation, and high-dimensional DRE.
Poster
Van Khoa NGUYEN · Maciej Falkiewicz · Giangiacomo Mercatali · Alexandros Kalousis
[ Hall A-E ]
Abstract
Traditional molecule generation methods often rely on sequence- or graph-based representations, which can limit their expressive power or require complex permutation-equivariant architectures. This paper introduces a novel paradigm for learning molecule generative models based on functional representations. Specifically, we propose Molecular Implicit Neural Generation (MING), a diffusion-based model that learns molecular distributions in the function space. Unlike standard diffusion processes in the data space, MING employs a novel functional denoising probabilistic process, which jointly denoises information in both the function's input and output spaces by leveraging an expectation-maximization procedure for latent implicit neural representations of data. This approach enables a simple yet effective model design that accurately captures underlying function distributions. Experimental results on molecule-related datasets demonstrate MING's superior performance and ability to generate plausible molecular samples, surpassing state-of-the-art data-space methods while offering a more streamlined architecture and significantly faster generation times. The code is available at https://github.com/v18nguye/MING.
Poster
Daniel Paulin · Peter Whalley · Neil Chada · Benedict Leimkuhler
[ Hall A-E ]
Abstract
We propose a scalable kinetic Langevin dynamics algorithm for sampling parameter spaces of big data and AI applications. Our scheme combines a symmetric forward/backward sweep over minibatches with a symmetric discretization of Langevin dynamics. For a particular Langevin splitting method (UBU), we show that the resulting Symmetric Minibatch Splitting-UBU (SMS-UBU) integrator has bias $\mathcal{O}(h^2 d^{1/2})$ in dimension $d>0$ with stepsize $h>0$, despite only using one minibatch per iteration, thus providing excellent control of the sampling bias as a function of the stepsize. We apply the algorithm to explore local modes of the posterior distribution of Bayesian neural networks (BNNs) and evaluate the calibration performance of the posterior predictive probabilities for neural networks with convolutional neural network architectures for classification problems on three different datasets (Fashion-MNIST, Celeb-A and chest X-ray). Our results indicate that BNNs sampled with SMS-UBU can offer significantly better calibration performance compared to standard methods of training and stochastic weight averaging.
Poster
Gilad Yehudai · Alon Cohen · Amit Daniely · Yoel Drori · Tomer Koren · Mariano Schain
[ Hall A-E ]
Abstract
We introduce a novel dynamic learning-rate scheduling scheme grounded in theory with the goal of simplifying the manual and time-consuming tuning of schedules in practice. Our approach is based on estimating the locally-optimal stepsize, guaranteeing maximal descent in the direction of the stochastic gradient of the current step. We first establish theoretical convergence bounds for our method within the context of smooth non-convex stochastic optimization. We then present a practical implementation of our algorithm and conduct systematic experiments across diverse datasets and optimization algorithms, comparing our scheme with existing state-of-the-art learning-rate schedulers. Our findings indicate that our method needs minimal tuning when compared to existing approaches, removing the need for auxiliary manual schedules and warm-up phases while achieving comparable performance with drastically reduced parameter tuning.
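A minimal sketch of the locally-optimal stepsize idea described above, using a one-dimensional quadratic model along the current stochastic gradient; the probe size, clipping range, and function names are my own assumptions, not the authors' estimator:

```python
import numpy as np

def locally_optimal_stepsize(f, x, g, probe=1e-2, eta_max=1.0):
    """Estimate the stepsize maximizing descent along -g for the current
    mini-batch loss f, via a one-dimensional quadratic model.

    Illustrative sketch of the 'locally-optimal stepsize' idea only;
    f, x, g, the probe size and the fallback rule are assumptions."""
    f0 = f(x)
    slope = -np.dot(g, g)                      # directional derivative along -g
    f_probe = f(x - probe * g)                 # one extra loss evaluation
    curv = (f_probe - f0 - slope * probe) / probe**2
    if curv <= 0:                              # locally non-convex: fall back
        return eta_max
    return float(np.clip(-slope / (2.0 * curv), 0.0, eta_max))

# toy usage on a quadratic loss, with the exact gradient standing in for a stochastic one
A = np.diag([1.0, 10.0])
loss = lambda x: 0.5 * x @ A @ x
x = np.array([1.0, 1.0])
g = A @ x
eta = locally_optimal_stepsize(loss, x, g)
print(eta, loss(x), loss(x - eta * g))
```

In a practical scheduler one would presumably recompute such an estimate per step from mini-batch losses, at the cost of one extra forward pass.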
Poster
Kuldeep S. Meel · Gunjan Kumar · Yash Pote
[ Hall A-E ]
Abstract
Given two distributions $\mathcal{P}$ and $\mathcal{Q}$ over a high-dimensional domain $\{0,1\}^n$, and a parameter $\varepsilon$, the goal of distance estimation is to determine the statistical distance between $\mathcal{P}$ and $\mathcal{Q}$, up to an additive tolerance $\pm \varepsilon$. Since exponential lower bounds (in $n$) are known for the problem in the standard sampling model, research has focused on richer query models where one can draw conditional samples. This paper presents the first polynomial query distance estimator in the conditional sampling model ($\mathsf{COND}$). We base our algorithm on the relatively weaker \textit{subcube conditional} sampling ($\mathsf{SUBCOND}$) oracle, which draws samples from the distribution conditioned on some of the dimensions. $\mathsf{SUBCOND}$ is a promising model for widespread practical use because it captures the natural behavior of discrete samplers. Our algorithm makes $\tilde{\mathcal{O}}(n^3/\varepsilon^5)$ queries to $\mathsf{SUBCOND}$.
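As a point of reference for what a $\mathsf{SUBCOND}$ query does, here is a toy subcube-conditional sampler built by rejection on top of an ordinary sampler; a native discrete sampler would condition directly, so this only illustrates the interface, not the paper's estimator:

```python
import random

def subcond_sample(sampler, fixed, max_tries=100_000):
    """Draw one sample from the distribution conditioned on a subcube.

    `sampler()` returns a tuple in {0,1}^n; `fixed` maps a subset of
    dimensions to required bit values.  Rejection sampling is only an
    illustrative stand-in for a native SUBCOND oracle (assumption)."""
    for _ in range(max_tries):
        x = sampler()
        if all(x[i] == b for i, b in fixed.items()):
            return x
    raise RuntimeError("conditioning event too rare for rejection sampling")

# toy usage: a product distribution over {0,1}^4
p = [0.2, 0.5, 0.7, 0.9]
sampler = lambda: tuple(int(random.random() < pi) for pi in p)
print(subcond_sample(sampler, fixed={0: 1, 3: 0}))
```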
Poster
Axel Roques · Samuel Gruffaz · Kyurae Kim · Alain Durmus · Laurent Oudre
[ Hall A-E ]
Abstract
Human physiological signals tend to exhibit both global and local structures: the former are shared across a population, while the latter reflect inter-individual variability. For instance, kinetic measurements of the gait cycle during locomotion present common characteristics, although idiosyncrasies may be observed due to biomechanical disposition or pathology. To better represent datasets with local-global structure, this work extends Convolutional Dictionary Learning (CDL), a popular method for learning interpretable representations, or dictionaries, of time-series data. In particular, we propose Personalized CDL (PerCDL), in which a local dictionary models local information as a personalized spatiotemporal transformation of a global dictionary. The transformation is learnable and can combine operations such as time-warping and rotation. Formal computational and statistical guarantees for PerCDL are provided and its effectiveness on synthetic and real human locomotion data is demonstrated.
Poster
Stefan Wahl · Armand Rousselot · Felix Draxler · Ullrich Köthe
[ Hall A-E ]
Abstract
Modeling distributions that depend on external control parameters is a common scenario in diverse applications like molecular simulations, where system properties like temperature affect molecular configurations. Despite the relevance of these applications, existing solutions are unsatisfactory as they require severely restricted model architectures or rely on energy-based training, which is prone to instability. We introduce TRADE, which overcomes these limitations by formulating the learning process as a boundary value problem. By initially training the model for a specific condition using either i.i.d.~samples or backward KL training, we establish a boundary distribution. We then propagate this information across other conditions using the gradient of the unnormalized density with respect to the external parameter. This formulation, akin to the principles of physics-informed neural networks, allows us to efficiently learn parameter-dependent distributions without restrictive assumptions. Experimentally, we demonstrate that TRADE achieves excellent results in a wide range of applications, ranging from Bayesian inference and molecular simulations to physical lattice models.
Poster
Nikolaos Nakis · Chrysoula Kosma · Giannis Nikolentzos · Michail Chatzianastasis · Iakovos Evdaimon · Michalis Vazirgiannis
[ Hall A-E ]
Abstract
Autoencoders based on Graph Neural Networks (GNNs) have garnered significant attention in recent years for their ability to learn informative latent representations of complex topologies, such as graphs. Despite the prevalence of Graph Autoencoders, there has been limited focus on developing and evaluating explainable neural-based graph generative models specifically designed for signed networks. To address this gap, we propose the Signed Graph Archetypal Autoencoder (SGAAE) framework. SGAAE extracts node-level representations that express node memberships over distinct extreme profiles, referred to as archetypes, within the network. This is achieved by projecting the graph onto a learned polytope, which governs its polarization. The framework employs the Skellam distribution for analyzing signed networks combined with relational archetypal analysis and GNNs. Our experimental evaluation demonstrates SGAAE's capability to successfully infer node memberships over underlying latent structures while extracting competing communities. Additionally, we introduce the 2-level network polarization problem and show how SGAAE is able to characterize such a setting. The proposed model achieves high performance in different tasks of signed link prediction across four real-world datasets, outperforming several baseline models. Finally, SGAAE allows for interpretable visualizations in the polytope space, revealing the distinct aspects of the network, as well as how nodes are …
Poster
Tim Weiland · Marvin Pförtner · Philipp Hennig
[ Hall A-E ]
Abstract
Mechanistic knowledge about the physical world is virtually always expressed via partial differential equations (PDEs). Recently, there has been a surge of interest in probabilistic PDE solvers---Bayesian statistical models mostly based on Gaussian process (GP) priors which seamlessly combine empirical measurements and mechanistic knowledge. As such, they quantify uncertainties arising from e.g. noisy or missing data, unknown PDE parameters or discretization error by design. Prior work has established connections to classical PDE solvers and provided solid theoretical guarantees. However, scaling such methods to large-scale problems remains a fundamental challenge primarily due to dense covariance matrices. Our approach addresses the scalability issues by leveraging the Markov property of many commonly used GP priors. It has been shown that such priors are solutions to stochastic PDEs (SPDEs) which when discretized allow for highly efficient GP regression through sparse linear algebra. In this work, we show how to leverage this prior class to make probabilistic PDE solvers practical, even for large-scale nonlinear PDEs, through greatly accelerated inference mechanisms. Additionally, our approach also allows for flexible and physically meaningful priors beyond what can be modeled with covariance functions. Experiments confirm substantial speedups and accelerated convergence of our physics-informed priors in nonlinear settings.
Poster
Behrooz Tahmasebi · Stefanie Jegelka
[ Hall A-E ]
Abstract
In learning with invariances (or symmetries), canonicalization is a widely used technique that projects data onto a smaller subset of the input space to reduce associated redundancies. The transformed dataset is then processed through a function from a designated function class to obtain the final invariant representation. Although canonicalization is often simple and flexible, both theoretical and empirical evidence suggests that the projection map can be discontinuous and unstable, which poses challenges for machine learning applications. However, the overall end-to-end representation can still remain continuous. Focusing on the importance of end-to-end regularity rather than the projection mapping itself, this paper explores the continuity and regularity of canonicalized models from a theoretical perspective. In a broad setting of input spaces and group actions, we establish necessary and sufficient conditions for the continuity or regularity of canonicalized models of any order, thereby characterizing the minimal conditions required for stability. To our knowledge, this represents the first comprehensive investigation into the end-to-end regularity of canonicalized models, offering critical insights into their design and application, as well as guidance for enhancing stability in practical settings.
Poster
Alex Chen · Qing Zhou
[ Hall A-E ]
Abstract
The assumption of independence between observations (units) in a dataset is prevalent across various methodologies for learning causal graphical models. However, this assumption often finds itself in conflict with real-world data, posing challenges to accurate structure learning. We propose a decorrelation-based approach for causal graph learning on dependent binary data, where the local conditional distribution is defined by a latent utility model with dependent errors across units. We develop a pairwise maximum likelihood method to estimate the covariance matrix for the dependence among the units. Then, leveraging the estimated covariance matrix, we develop an EM-like iterative algorithm to generate and de-correlate samples of the latent utility variables, which serve as de-correlated data. Any standard causal discovery method can be applied on the de-correlated data to learn the underlying causal graph. We demonstrate that the proposed de-correlation approach significantly improves the accuracy in causal graph learning, through numerical experiments on both synthetic and real-world datasets.
Poster
Michael Hellstern · Byol Kim · Zaid Harchaoui · Ali Shojaie
[ Hall A-E ]
Abstract
Spectral networks derived from multivariate time series data arise in many domains, from brain science to Earth science. Often, it is of interest to study how these networks change under different conditions. For instance, to better understand epilepsy, it would be interesting to capture the changes in the brain connectivity network as a patient experiences a seizure, using electroencephalography data. A common approach relies on estimating the networks in each condition and calculating their difference. Such estimates may behave poorly in high dimensions as the networks themselves may not be sparse in structure while their difference may be. We build upon this observation to develop an estimator of the difference in inverse spectral densities across two conditions. Using an $\ell_1$ penalty on the difference, consistency is established by only requiring the difference to be sparse. We illustrate the method on synthetic data experiments and on experiments with electroencephalography data.
Poster
Andrew Stirn · David Knowles
[ Hall A-E ]
Abstract
Widely used deep latent variable models (DLVMs), in particular Variational Autoencoders (VAEs), employ overly simplistic priors on the latent space. To achieve strong clustering performance, existing methods that replace the standard normal prior with a Gaussian mixture model (GMM) require defining the number of clusters to be close to the number of expected ground truth classes a-priori and are susceptible to poor initializations. We leverage VampPrior concepts (Tomczak and Welling, 2018) to fit a Bayesian GMM prior, resulting in the VampPrior Mixture Model (VMM), a novel prior for DLVMs. In a VAE, the VMM attains highly competitive clustering performance on benchmark datasets. Integrating the VMM into scVI (Lopez et al., 2018), a popular scRNA-seq integration method, significantly improves its performance and automatically arranges cells into clusters with similar biological characteristics.
Poster
Wenhao Li · Dan Qiao · Baoxiang Wang · Xiangfeng Wang · Wei Yin · Hao Shen · Bo Jin · Hongyuan Zha
[ Hall A-E ]
Abstract
The difficulty of appropriately assigning credit is particularly heightened in cooperative MARL with sparse reward, due to the concurrent time and structural scales involved. Automatic subgoal generation (ASG) has recently emerged as a viable MARL approach inspired by utilizing subgoals in intrinsically motivated reinforcement learning. However, end-to-end learning of complex task planning from sparse rewards without prior knowledge undoubtedly requires massive training samples. Moreover, the diversity-promoting nature of existing ASG methods can lead to the "over-representation" of subgoals, generating numerous spurious subgoals of limited relevance to the actual task reward and thus decreasing the sample efficiency of the algorithm. To address this problem and inspired by disentangled representation learning, we propose a novel "disentangled" decision-making method, Semantically Aligned task decomposition in MARL (SAMA), that prompts pretrained language models with chain-of-thought to suggest potential goals, provide suitable goal decomposition and subgoal allocation, and perform self-reflection-based replanning. Additionally, SAMA incorporates language-grounded MARL to train each agent's subgoal-conditioned policy. SAMA demonstrates considerable advantages in sample efficiency compared to state-of-the-art ASG methods, as evidenced by its performance on two challenging sparse-reward tasks, Overcooked and MiniRTS. The code is available at https://anonymous.4open.science/r/SAMA/.
Poster
Małgorzata Łazęcka · Ewa Szczurek
[ Hall A-E ]
Abstract
Integrating various data modalities brings valuable insights into underlying phenomena. Multimodal factor analysis (FA) uncovers shared axes of variation underlying different simple data modalities, where each sample is represented by a vector of features. However, FA is not suited for structured data modalities, such as text or single cell sequencing data, where multiple data points are measured for each sample and exhibit a clustering structure. To overcome this challenge, we introduce FACTM, a novel, multi-view and multi-structure Bayesian model that combines FA with correlated topic modeling and is optimized using variational inference. Additionally, we introduce a method for rotating latent factors to enhance interpretability with respect to binary features. On text and video benchmarks as well as real-world music and COVID-19 datasets, we demonstrate that FACTM outperforms other methods in identifying clusters in structured data, and integrating them with simple modalities via the inference of shared, interpretable factors.
Poster
Rohan Ghosh · Mehul Motani
[ Hall A-E ]
Abstract
Mutual information (MI) is widely employed as a measure of shared information between random variables. However, MI assumes unbounded computational resources—a condition rarely met in practice, where predicting a random variable $Y$ from $X$ must rely on finite resources. $\mathcal{V}$-information addresses this limitation by employing a predictive family $\mathcal{V}$ to emulate computational constraints, yielding a directed measure of shared information. Focusing on the mixed setting (continuous $X$ and discrete $Y$), here we highlight the upward bias of empirical $\mathcal{V}$-information, $\hat I_{\mathcal{V}}(X \rightarrow Y)$, even when $\mathcal{V}$ is low-complexity (e.g., shallow neural networks). To mitigate this bias, we introduce $\mathcal{V}$-Information Growth (VI-Growth), defined as $\hat I_{\mathcal{V}}(X \rightarrow Y) - \hat I_{\mathcal{V}}(X' \rightarrow Y')$, where $X', Y' \sim P_X P_Y$ represent independent variables. While VI-Growth effectively counters over-estimation, more complex predictive families may lead to under-estimation. To address this, we construct a sequence of predictive families $\mathcal{V}_1, \mathcal{V}_2, \ldots, \mathcal{V}$ of increasing complexity and compute the maximum of VI-Growth across these families, yielding the ordered VI-Growth (O-VIG). We provide theoretical results that justify this approach, showing that O-VIG is a provably tighter lower bound for the true $\mathcal{V}$-Information than empirical $\mathcal{V}$-Information itself, and exhibits stronger convergence properties than $\mathcal{V}$-Information. Empirically, O-VIG alleviates …
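A rough sketch of the quantities involved, with logistic regression as an assumed low-complexity predictive family and a label permutation standing in for the independent pair $(X', Y')$; the paper's families $\mathcal{V}_1, \mathcal{V}_2, \ldots$ and the O-VIG maximization are not reproduced here:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import log_loss

def empirical_v_information(X, y):
    """hat I_V(X -> Y) with V = logistic regressors (an assumed, simple family)."""
    clf = LogisticRegression(max_iter=1000).fit(X, y)
    h_cond = log_loss(y, clf.predict_proba(X))       # H_V(Y | X), in nats, on the same data
    p = np.bincount(y) / len(y)
    h_marg = -np.sum(p * np.log(p + 1e-12))          # best constant predictor (assumed in V)
    return h_marg - h_cond

def vi_growth(X, y, rng=np.random.default_rng(0)):
    """VI-Growth: subtract the V-information measured on an independent pair
    (X', Y'), obtained here by permuting the labels (illustrative sketch)."""
    y_perm = rng.permutation(y)
    return empirical_v_information(X, y) - empirical_v_information(X, y_perm)

# toy usage
rng = np.random.default_rng(1)
X = rng.normal(size=(500, 5))
y = (X[:, 0] + 0.5 * rng.normal(size=500) > 0).astype(int)
print(vi_growth(X, y, rng))
```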
Poster
Berfin Simsek · Amire Bendjeddou · Daniel Hsu
[ Hall A-E ]
Abstract
This work focuses on the gradient flow dynamics of a neural network model that uses correlation loss to approximate a multi-index function on high-dimensional standard Gaussian data. Specifically, the multi-index function we consider is a sum of neurons $f^*(x) = \sum_{j=1}^k \sigma^*(v_j^T x)$ where $v_1, ..., v_k$ are unit vectors, and $\sigma^*$ lacks the first and second Hermite polynomials in its Hermite expansion. It is known that, for the single-index case ($k=1$), overcoming the search phase requires polynomial time complexity. We first generalize this result to multi-index functions characterized by vectors in arbitrary directions. After the search phase, it is not clear whether the network neurons converge to the index vectors, or get stuck at a sub-optimal solution. When the index vectors are orthogonal, we give a complete characterization of the fixed points and prove that neurons converge to the nearest index vectors. Therefore, using $n \asymp k \log k$ neurons ensures finding the full set of index vectors with gradient flow with high probability over random initialization. When $v_i^T v_j = \beta \geq 0$ for all $i \neq j$, we prove the existence of a sharp threshold $\beta_c = c/(c+k)$ at which the fixed point that computes the average of the …
Poster
Ritwik Vashistha · Arya Farahi
[ Hall A-E ]
Abstract
As probabilistic models continue to permeate various facets of our society and contribute to scientific advancements, it becomes a necessity to go beyond traditional metrics such as predictive accuracy and error rates and assess their trustworthiness. Grounded in the competence-based theory of trust, this work formalizes the $\mathcal{I}$-trustworthy framework -- a novel framework for assessing the trustworthiness of probabilistic classifiers for inference tasks by linking conditional calibration to trustworthiness. To assess $\mathcal{I}$-trustworthiness, we use the local calibration error (LCE) and develop a method of hypothesis-testing. This method utilizes a kernel-based test statistic, Kernel Local Calibration Error (KLCE), to test local calibration of a probabilistic classifier. This study provides theoretical guarantees by offering convergence bounds for an unbiased estimator of KLCE. Additionally, we present a diagnostic tool designed to identify and measure biases in cases of miscalibration. The effectiveness of the proposed test statistic is demonstrated through its application to both simulated and real-world datasets. Finally, the LCE of related recalibration methods is studied, and we provide evidence of the insufficiency of existing methods to achieve $\mathcal{I}$-trustworthiness.
Poster
Yeqi Gao · Zhao Song · Junze Yin
[ Hall A-E ]
Abstract
Large language models (LLMs) have numerous real-life applications across various domains, such as natural language translation, sentiment analysis, language modeling, chatbots and conversational agents, creative writing, text classification, summarization, and generation. LLMs have shown great promise in improving the accuracy and efficiency of these tasks, and have the potential to revolutionize the field of natural language processing (NLP) in the years to come. The exponential function based attention unit is a fundamental element in LLMs. Several previous works have studied the convergence of exponential regression and softmax regression. In this paper, we propose an iterative algorithm to solve a rescaled version of the slightly different formulation of the softmax regression problem that arises in attention mechanisms of large language models. Specifically, we consider minimizing the squared loss between a certain function, which can be either the exponential function, hyperbolic sine function, or hyperbolic cosine function, and its inner product with a target $n$-dimensional vector $b$, scaled by the normalization term. This ``rescaled softmax regression'' differs from classical softmax regression in the location of the normalization factor. The efficiency and generalizability of this framework to multiple hyperbolic functions make it relevant for optimizing attention mechanisms. The analysis also leads to a corollary bounding solution changes …
Poster
Han Bao · Nontawat Charoenphakdee
[ Hall A-E ]
Abstract
Strict proper losses are fundamental loss functions inducing classifiers capable of estimating class probabilities. While practitioners have devised many loss functions, their properness is often unverified. In this paper, we identify several losses as improper, calling into question the validity of class probability estimates derived from their simplex-projected outputs. Nevertheless, we show that these losses are strictly proper composite with appropriate link functions, allowing predictions to be mapped into true class probabilities. We invent the calmness condition, which we prove suffices to identify that a loss has a strictly proper composite representation, and provide the general form of the inverse link. To further understand proper composite losses, we explore proper composite losses through the framework of property elicitation, revealing a connection between inverse link functions and Bregman projections. Numerical simulations are provided to demonstrate the behavior of proper composite losses and the effectiveness of the inverse link function.
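A standard worked example, not taken from the paper and in my own notation, of a strictly proper composite loss: the binary logistic loss with the logit link, whose inverse link (the sigmoid) maps real-valued scores back to true class probabilities:

```latex
% Logistic loss as a strictly proper composite loss with the logit link
% \psi(\eta) = \log\frac{\eta}{1-\eta}.  Notation is mine, not the paper's.
\[
\ell(\tilde{y}, v) \;=\; \log\bigl(1 + e^{-\tilde{y}\, v}\bigr), \qquad \tilde{y} \in \{-1, +1\},
\qquad
\psi^{-1}(v) \;=\; \frac{1}{1 + e^{-v}},
\]
\[
\arg\min_{v \in \mathbb{R}}\; \mathbb{E}\bigl[\ell(\tilde{Y}, v) \mid X = x\bigr]
\;=\; \psi\bigl(\Pr(\tilde{Y} = 1 \mid X = x)\bigr),
\]
% so applying the inverse link \psi^{-1} to the minimizing score recovers the
% conditional class probability, which is exactly what strict properness of the
% composite representation guarantees.
```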
Poster
Siu Lun Chau · Antonin Schrab · Arthur Gretton · Dino Sejdinovic · Krikamol Muandet
[ Hall A-E ]
Abstract
We introduce credal two-sample testing, a new hypothesis testing framework for comparing credal sets---convex sets of probability measures where each element captures aleatoric uncertainty and the set itself represents epistemic uncertainty that arises from the modeller's partial ignorance. Compared to classical two-sample tests, which focus on comparing precise distributions, the proposed framework provides a broader and more versatile set of hypotheses. This approach enables the direct integration of epistemic uncertainty, effectively addressing the challenges arising from partial ignorance in hypothesis testing. By generalising the two-sample test to compare credal sets, our framework enables reasoning for equality, inclusion, intersection, and mutual exclusivity, each offering unique insights into the modeller's epistemic beliefs. As the first work on nonparametric hypothesis testing for comparing credal sets, we focus on finitely generated credal sets derived from i.i.d. samples from multiple distributions—referred to as \emph{credal samples}. We formalise these tests as two-sample tests with nuisance parameters and introduce the first permutation-based solution for this class of problems, significantly improving upon existing methods. Our approach properly incorporates the modeller's epistemic uncertainty into hypothesis testing, leading to more robust and credible conclusions, with kernel-based implementations for real-world applications.
Poster
Kaan Ozkara · Bruce Huang · Ruida Zhou · Suhas Diggavi
[ Hall A-E ]
Abstract
Statistical heterogeneity of clients' local data is an important characteristic in federated learning, motivating personalized algorithms tailored to local data statistics. Though there has been a plethora of algorithms proposed for personalized supervised learning, discovering the structure of local data through personalized unsupervised learning is less explored. We initiate a systematic study of such personalized unsupervised learning by developing algorithms based on optimization criteria inspired by a hierarchical Bayesian statistical framework. We develop adaptive algorithms that discover the balance between using limited local data and collaborative information. We do this in the context of two unsupervised learning tasks: personalized dimensionality reduction (ADEPT-PCA and ADEPT-AE) and personalized diffusion models (ADEPT-DGM). We develop convergence analyses for our adaptive algorithms which illustrate the dependence on problem parameters (e.g., heterogeneity, local sample size). We also develop a theoretical framework for personalized diffusion models, which shows the benefits of collaboration even under heterogeneity. We finally evaluate our proposed algorithms using synthetic and real data, demonstrating the effective sample amplification for personalized tasks, induced through collaboration, despite data heterogeneity.
Poster
Sreejeet Maity · Aritra Mitra
[ Hall A-E ]
Abstract
One of the most basic problems in reinforcement learning (RL) is policy evaluation: estimating the long-term return, i.e., value function, corresponding to a given fixed policy. The celebrated Temporal Difference (TD) learning algorithm addresses this problem, and recent work has investigated finite-time convergence guarantees for this algorithm and variants thereof. However, these guarantees hinge on the reward observations being always generated from a well-behaved (e.g., sub-Gaussian) true reward distribution. Motivated by harsh, real-world environments where such an idealistic assumption may no longer hold, we revisit the policy evaluation problem from the perspective of *adversarial robustness*. In particular, we consider a Huber-contaminated reward model where an adversary can arbitrarily corrupt each reward sample with a small probability $\epsilon$. Under this observation model, we first show that the adversary can cause the vanilla TD algorithm to converge to any arbitrary value function. We then develop a novel algorithm called *Robust-TD* and prove that its finite-time guarantees match that of vanilla TD with linear function approximation up to a small $O(\epsilon)$ term that captures the effect of corruption. We complement this result with a minimax lower bound, revealing that such an additive corruption-induced term is unavoidable. To our knowledge, these results are the …
Poster
Son Nguyen · Lizhang Chen · Bo Liu · qiang liu
[ Hall A-E ]
Abstract
Modern deep learning heavily depends on adaptive optimizers such as Adam and its variants, which are renowned for their capacity to handle model scaling and streamline hyperparameter tuning. However, these algorithms typically experience high memory overhead caused by the accumulation of optimization states, leading to a critical challenge in training large-scale network models. In this study, we introduce a novel adaptive optimizer, H-Fac, which incorporates a memory-efficient factorization approach to address this challenge. By employing a rank-1 parameterization for both momentum and scaling parameter estimators, H-Fac reduces memory costs to a sublinear level while maintaining competitive performance across a wide range of architectures. We develop our algorithms based on principles derived from Hamiltonian dynamics, providing robust theoretical underpinnings in optimization dynamics and convergence guarantees. These optimization algorithms are designed to be both straightforward and adaptable, facilitating easy implementation in diverse settings.
Poster
Oren Yuval · Saharon Rosset
[ Hall A-E ]
Abstract
We present a methodology for model evaluation and selection in binary classification models in the presence of correlations in the data, where the sampling mechanism violates the i.i.d. assumption. Our methodology involves a formulation of the bias term between the standard Cross-Validation (CV) estimator and the mean generalization error, and practical data-based procedures to estimate this term. Consequently, we present the bias-corrected CV estimator, which is the standard CV estimate supplemented by an estimate of the bias term. This concept was introduced in the literature only in the context of a linear model with squared error loss as the criterion for prediction performance. Our suggested bias-corrected CV estimator can be applied to any learning model, including deep neural networks, and to standard criteria for prediction performance for classification tasks, including misclassification rate, cross-entropy and hinge loss. We demonstrate the applicability of the proposed methodology in various scenarios where the data contains complex correlation structures (such as clustered and spatial relationships) with synthetic data and real-world datasets, providing evidence that the bias-corrected CV estimator is better than the standard CV estimator.
Poster
Siyao Wang · Miles Lopes
[ Hall A-E ]
Abstract
Graph sparsification is a well-established technique for accelerating graph-based learning algorithms, which uses edge sampling to approximate dense graphs with sparse ones. Because the sparsification error is random and unknown, users must contend with uncertainty about the reliability of downstream computations. Although it is possible for users to obtain conceptual guidance from theoretical error bounds in the literature, such results are typically impractical at a numerical level. Taking an alternative approach, we propose to address these issues from a data-driven perspective by computing empirical error estimates. The proposed error estimates are highly versatile, and we demonstrate this in four use cases: Laplacian matrix approximation, graph cut queries, graph-structured regression, and spectral clustering. Moreover, we provide two theoretical guarantees for the error estimates, and explain why the cost of computing them is manageable in comparison to the overall cost of a typical graph sparsification workflow.
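For orientation, a toy version of the workflow the error estimates target: sparsify by i.i.d. weighted edge sampling with reweighting, then measure the Laplacian approximation error directly; the paper's data-driven error estimators themselves are not reproduced here, and all names are my own:

```python
import numpy as np

def sparsify_by_edge_sampling(edges, weights, m, rng=np.random.default_rng(0)):
    """Sample m edges i.i.d. proportionally to their weights and reweight them
    so the sparsified Laplacian is unbiased (illustrative sketch only)."""
    p = weights / weights.sum()
    idx = rng.choice(len(edges), size=m, p=p)
    sparse_w = {}
    for i in idx:
        sparse_w[i] = sparse_w.get(i, 0.0) + weights[i] / (m * p[i])
    return [(edges[i], w) for i, w in sparse_w.items()]

def laplacian(n, edge_weight_pairs):
    L = np.zeros((n, n))
    for (u, v), w in edge_weight_pairs:
        L[u, u] += w; L[v, v] += w
        L[u, v] -= w; L[v, u] -= w
    return L

# toy usage: a weighted cycle with chords
n = 30
edges = [(i, (i + 1) % n) for i in range(n)] + [(i, (i + 7) % n) for i in range(n)]
weights = np.ones(len(edges))
L_full = laplacian(n, list(zip(edges, weights)))
L_sparse = laplacian(n, sparsify_by_edge_sampling(edges, weights, m=40))
print("relative Laplacian error:",
      np.linalg.norm(L_sparse - L_full) / np.linalg.norm(L_full))
```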
Poster
Mikołaj Słupiński
[ Hall A-E ]
Abstract
In this paper, we propose a novel model called Recurrent Explicit Duration Switching Linear Dynamical Systems (REDSLDS) that incorporates recurrent explicit duration variables into the rSLDS model. We also propose an inference and learning scheme that involves the use of Pólya-gamma augmentation. We demonstrate the improved segmentation capabilities of our model on three benchmark datasets, including two quantitative datasets and one qualitative dataset.
Poster
Julia Herbinger
[ Hall A-E ]
Abstract
Global feature effect methods, such as partial dependence plots, provide an intelligible visualization of the expected marginal feature effect. However, such global feature effect methods can be misleading, as they do not represent local feature effects of single observations well when feature interactions are present. We formally introduce generalized additive decomposition of global effects (GADGET), which is a new framework based on recursive partitioning to find interpretable regions in the feature space such that the interaction-related heterogeneity of local feature effects is minimized. We provide a mathematical foundation of the framework and show that it is applicable to the most popular methods to visualize marginal feature effects, namely partial dependence, accumulated local effects, and Shapley additive explanations (SHAP) dependence. Furthermore, we introduce and validate a new permutation-based interaction detection procedure that is applicable to any feature effect method that fits into our proposed framework. We empirically evaluate the theoretical characteristics of the proposed methods based on various feature effect methods in different experimental settings. Moreover, we apply our introduced methodology to three real-world examples to showcase their usefulness.
Poster
Heishiro Kanagawa
[ Hall A-E ]
Abstract
We propose a kernel-based nonparametric test of relative goodness of fit, where the goal is to compare two models, both of which may have unobserved latent variables, such that the marginal distribution of the observed variables is intractable. The proposed test generalizes the recently proposed kernel Stein discrepancy (KSD) tests (Liu et al., 2016, Proceedings of the 33rd International Conference on Machine Learning, pp. 276–284; Chwialkowski et al., 2016, Proceedings of the 33rd International Conference on Machine Learning, pp. 2606–2615; Yang et al., 2018, Proceedings of the 35th International Conference on Machine Learning, pp. 5561–5570) to the case of latent variable models, a much more general class than the fully observed models treated previously. The new test, with a properly calibrated threshold, has a well-controlled type-I error. In the case of certain models with low-dimensional latent structures and high-dimensional observations, our test significantly outperforms the relative maximum mean discrepancy test, which is based on samples from the models and does not exploit the latent structure.
Poster
Cheng Li
[ Hall A-E ]
Abstract
In geostatistical problems with massive sample size, Gaussian processes can be approximated using sparse directed acyclic graphs to achieve scalable O(n) computational complexity. In these models, data at each location are typically assumed conditionally dependent on a small set of parents that usually include a subset of the nearest neighbours. These methodologies often exhibit excellent empirical performance, but the lack of theoretical validation leads to unclear guidance in specifying the underlying graphical model and sensitivity to graph choice. We address these issues by introducing radial-neighbour Gaussian processes, a class of Gaussian processes based on directed acyclic graphs in which directed edges connect every location to all of its neighbours within a predetermined radius. We prove that any radial-neighbour Gaussian process can accurately approximate the corresponding unrestricted Gaussian process in the Wasserstein-2 distance, with an error rate determined by the approximation radius, the spatial covariance function and the spatial dispersion of samples. We offer further empirical validation of our approach via applications on simulated and real-world data, showing excellent performance in both prior and posterior approximations to the original Gaussian process.
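A small sketch of the graph construction implied by the definition: after fixing an ordering of the locations, each location's parents are all earlier locations within a radius. The coordinate-based ordering and the radius value are my own choices for illustration, not the paper's recommendations:

```python
import numpy as np
from scipy.spatial import cKDTree

def radial_neighbour_parents(coords, radius):
    """Parent sets of a radial-neighbour DAG: each location's parents are all
    *earlier* locations (under a fixed ordering) within `radius`."""
    n = coords.shape[0]
    order = np.argsort(coords[:, 0])          # a simple coordinate-based ordering (assumption)
    rank = np.empty(n, dtype=int)
    rank[order] = np.arange(n)
    tree = cKDTree(coords)
    parents = [[] for _ in range(n)]
    for i in range(n):
        for j in tree.query_ball_point(coords[i], r=radius):
            if rank[j] < rank[i]:
                parents[i].append(j)
    return parents

# toy usage on random spatial locations
rng = np.random.default_rng(0)
coords = rng.uniform(size=(200, 2))
parents = radial_neighbour_parents(coords, radius=0.1)
print("average parent-set size:", np.mean([len(p) for p in parents]))
```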
Poster
Thomas Nagler
[ Hall A-E ]
Abstract
Thanks to their ability to capture complex dependence structures, copulas are frequently used to glue random variables into a joint model with arbitrary marginal distributions. More recently, they have been applied to solve statistical learning problems such as regression or classification. Framing such approaches as solutions of estimating equations, we generalize them in a unified framework. We can then obtain simultaneous, coherent inferences across multiple regression-like problems. We derive consistency, asymptotic normality, and validity of the bootstrap for corresponding estimators. The conditions allow for both continuous and discrete data as well as parametric, nonparametric, and semiparametric estimators of the copula and marginal distributions. The versatility of this methodology is illustrated by several theoretical examples, a simulation study, and an application to financial portfolio allocation. Supplementary materials for this article are available online.
Poster
Arindam Banerjee · Qiaobo Li · Yingxue Zhou
[ Hall A-E ]
Abstract
Generalization and optimization guarantees on the population loss often rely on uniform convergence based analysis, typically based on the Rademacher complexity of the predictors. The rich representation power of modern models has led to concerns about this approach. In this paper, we present generalization and optimization guarantees in terms of the complexity of the gradients, as measured by the Loss Gradient Gaussian Width (LGGW). First, we introduce generalization guarantees directly in terms of the LGGW under a flexible gradient domination condition. Second, we show that sample reuse in ERM does not make the empirical gradients deviate from the population gradients as long as the LGGW is small. Third, focusing on deep networks, we bound their single-sample LGGW in terms of the Gaussian width of the featurizer, i.e., the output of the last-but-one layer. To our knowledge, our generalization and optimization guarantees in terms of LGGW are the first results of its kind, and hold considerable promise towards quantitatively tight bounds for deep models.
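For reference, the standard definition of Gaussian width, together with a schematic (and entirely my own) notation for the gradient set whose width the abstract calls the LGGW; the paper's precise definition may differ, for instance in how it averages over samples:

```latex
% Gaussian width of a set T \subset \mathbb{R}^p (standard definition), applied
% schematically to a set of per-sample loss gradients over the parameter class.
\[
w(T) \;=\; \mathbb{E}_{g \sim \mathcal{N}(0, I_p)}\Bigl[\, \sup_{t \in T} \langle g, t \rangle \,\Bigr],
\qquad
T \;=\; \bigl\{\, \nabla_\theta \ell(\theta; z) \;:\; \theta \in \Theta \,\bigr\},
\]
% i.e. the LGGW measures how "wide" the collection of loss gradients is,
% rather than the width or Rademacher complexity of the predictor class itself.
```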
Poster
Abdullah Alchihabi · Hanping Zhang · Yuhong Guo
[ Hall A-E ]
Abstract
Reinforcement Learning (RL) has demonstrated remarkable success in solving sequential decision-making problems. However, in real-world scenarios, RL agents often struggle to generalize when faced with unseen actions that were not encountered during training. Some previous works on zero-shot action generalization rely on large datasets of action observations to capture the behaviors of new actions, making them impractical for real-world applications. In this paper, we introduce a novel zero-shot framework, Action Generalization from Limited Observations (AGLO). Our framework has two main components: an action representation learning module and a policy learning module. The action representation learning module extracts discriminative embeddings of actions from limited observations, while the policy learning module leverages the learned action representations, along with augmented synthetic action representations, to learn a policy capable of handling tasks with unseen actions. The experimental results demonstrate that our framework significantly outperforms state-of-the-art methods for zero-shot action generalization across multiple benchmark tasks, showcasing its effectiveness in generalizing to new actions with minimal action observations.
Poster
Ankur Nath · Alan Kuhnle
[ Hall A-E ]
Abstract
Modern instances of combinatorial optimization problems often exhibit billion-scale ground sets, which have many uninformative or redundant elements. In this work, we develop light-weight pruning algorithms to quickly discard elements that are unlikely to be part of an optimal solution. Under mild assumptions on the instance, we prove theoretical guarantees on the fraction of the optimal value retained and the size of the resulting pruned ground set. Through extensive experiments on real-world datasets for various applications, we demonstrate that our algorithm, QuickPrune, efficiently prunes over 90% of the ground set and outperforms state-of-the-art classical and machine learning heuristics for pruning.
Poster
Paula Cordero Encinar · Tobias Schroeder · Peter Yatsyshin · Andrew Duncan
[ Hall A-E ]
Abstract
Selecting cost-effective optimal sensor configurations for subsequent inference of parameters in black-box stochastic systems faces significant computational barriers. We propose a novel and robust approach, modelling the joint distribution over input parameters and solution with a joint energy-based model, trained on simulation data. Unlike existing simulation-based inference approaches, which must be tied to a specific set of point evaluations, we learn a functional representation of parameters and solution. This is used as a resolution-independent plug-and-play surrogate for the joint distribution, which can be conditioned over any set of points, permitting an efficient approach to sensor placement. We demonstrate the validity of our framework on a variety of stochastic problems, showing that our method provides highly informative sensor locations at a lower computational cost compared to conventional approaches.
Poster
Junyi ZHANG · Angelos Dassios · Zhong Chong · Qiufei Yao
[ Hall A-E ]
Abstract
The beta process is a widely used nonparametric prior in Bayesian machine learning. While various inference schemes have been developed for the beta process and related models, the current state-of-the-art method relies heavily on the stick-breaking representation with decreasing atom weights, which is available only for a special hyperparameter. In this paper, we introduce the truncated inverse-Lévy measure representation (TILe-Rep) that extends the decreasing atom weights representation of the beta process to general hyperparameters. The TILe-Rep fills the gap between the two stick-breaking representations in Teh et al. (2007) and Paisley et al. (2010). Moreover, it has a lower truncation error compared to other sequential representations of the beta process and potentially leads to the posterior consistency property of the Bayesian factor models. We demonstrate the usage of the TILe-Rep in the celebrated beta process factor analysis model and beta process sparse factor model.
Poster
Satish Keshri · Nazreen Shah · Ranjitha Prasad
[ Hall A-E ]
Abstract
The holy grail of machine learning is to enable Continual Federated Learning (CFL) to enhance the efficiency, privacy, and scalability of AI systems while learning from streaming data. The primary challenge of a CFL system is to overcome global catastrophic forgetting, wherein the accuracy of the global model trained on new tasks declines on the old tasks. In this work, we propose \emph{Continual Federated Learning with Aggregated Gradients} (C-FLAG), a novel replay-memory based federated strategy consisting of edge-based gradient updates on memory and aggregated gradients on the current data. We provide convergence analysis of the C-FLAG approach which addresses forgetting and bias while converging at a rate of $\mathcal{O}(1/\sqrt{T})$ over $T$ communication rounds. We formulate an optimization sub-problem that minimizes catastrophic forgetting, translating CFL into an iterative algorithm with adaptive learning rates that ensure seamless learning across tasks. We empirically show that C-FLAG outperforms several state-of-the-art baselines on both task and class-incremental settings with respect to metrics such as accuracy and forgetting.
Poster
Deep Chakraborty · Yann LeCun · Tim G. J. Rudner · Erik Learned-Miller
[ Hall A-E ]
Abstract
A number of different architectures and loss functions have been applied to the problem of self-supervised learning (SSL), with the goal of developing embeddings that provide the best possible pre-training for as-yet-unknown, lightly supervised downstream tasks. One of these SSL criteria is to maximize the entropy of a set of embeddings in some compact space. But the goal of maximizing the embedding entropy often depends—whether explicitly or implicitly—upon high dimensional entropy estimates, which typically perform poorly in more than a few dimensions. In this paper, we motivate an effective entropy maximization criterion (E2MC), defined in terms of easy-to-estimate, low-dimensional constraints. We demonstrate that using it to continue training an already-trained SSL model for only a handful of epochs leads to a consistent and, in some cases, significant improvement in downstream performance. We perform careful ablation studies to show that the improved performance is due to the proposed add-on criterion. We also show that continued pre-training with alternative criteria does not lead to notable improvements, and in some cases, even degrades performance.
Poster
Khimya Khetarpal · Zhaohan Daniel Guo · Bernardo Avila Pires · Yunhao Tang · Clare Lyle · Mark Rowland · Nicolas Heess · Diana Borsa · Arthur Guez · Will Dabney
[ Hall A-E ]
Abstract
Learning a good representation is a crucial challenge for reinforcement learning (RL) agents. Self-predictive algorithms jointly learn a latent representation and dynamics model by bootstrapping from future latent representations (BYOL). Recent work has developed theoretical insights into these algorithms by studying a continuous-time ODE model in the case of a fixed policy (BYOL-$\Pi$); this assumption is at odds with practical implementations, which explicitly condition their predictions on future actions. In this work, we take a step towards bridging the gap between theory and practice by analyzing an action-conditional self-predictive objective (BYOL-AC) using the ODE framework. Interestingly, we uncover that BYOL-$\Pi$ and BYOL-AC are related through the lens of variance. We unify the study of these objectives through two complementary lenses; a model-based perspective, where each objective is related to low-rank approximation of certain dynamics, and a model-free perspective, which relates the objectives to modified value, Q-value, and Advantage functions. This mismatch with the true value functions leads to the empirical observation (in both linear and deep RL experiments) that BYOL-$\Pi$ and BYOL-AC are either very similar in performance across many tasks or task-dependent.
Poster
Suqi Liu · Morgane Austern
[ Hall A-E ]
Abstract
We study the graph matching problem in the presence of vertex feature information using shallow graph neural networks. Specifically, given two graphs that are independent perturbations of a single random geometric graph with sparse binary features, the task is to recover an unknown one-to-one mapping between the vertices of the two graphs. We show that, under certain conditions on the sparsity and noise level of the feature vectors, a carefully designed two-layer graph neural network can, with high probability, recover the correct mapping between the vertices with the help of the graph structure. Additionally, we prove that our condition on the noise parameter is tight up to logarithmic factors. Finally, we compare the performance of the graph neural network to directly solving an assignment problem using the noisy vertex features and demonstrate that when the noise level is at least constant, this direct matching fails to achieve perfect recovery, whereas the graph neural network can tolerate noise levels growing as fast as a power of the size of the graph. Our theoretical findings are further supported by numerical studies as well as real-world data experiments.
Poster
Devansh Gupta · Meisam Razaviyayn · Vatsal Sharan
[ Hall A-E ]
Abstract
Differentially private zeroth-order optimization methods have recently gained popularity in private fine tuning of machine learning models due to their reduced memory requirements. Current approaches for privatizing zeroth-order methods rely on adding Gaussian noise to the estimated zeroth-order gradients. However, since the search direction in the zeroth-order methods is inherently random, researchers including Tang et al. (2024) and Zhang et al. (2024a) have raised an important question: is the inherent noise in zeroth-order estimators sufficient to ensure the overall differential privacy of the algorithm? This work settles this question for a class of oracle-based optimization algorithms where the oracle returns zeroth-order gradient estimates. In particular, we show that for a fixed initialization, there exist strongly convex objective functions such that running (Projected) Zeroth-Order Gradient Descent (ZO-GD) is not differentially private. Furthermore, we show that even with random initialization and without revealing intermediate iterates, the privacy loss in ZO-GD can grow superlinearly with the number of iterations when minimizing convex objective functions.
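A minimal sketch of the oracle in question: the standard two-point zeroth-order gradient estimate and a projection-free ZO-GD loop, with optional Gaussian noise added to the estimate (the privatization route the abstract questions). This illustrates the algorithm class being analyzed, not the paper's counterexample constructions; all names and constants are assumptions:

```python
import numpy as np

def zo_gradient(f, x, mu=1e-3, rng=np.random.default_rng(0)):
    """Two-point zeroth-order gradient estimate along a random direction."""
    u = rng.normal(size=x.shape)
    return (f(x + mu * u) - f(x - mu * u)) / (2.0 * mu) * u

def zo_gd(f, x0, steps=200, lr=0.05, sigma=0.0, rng=np.random.default_rng(1)):
    """Zeroth-order gradient descent; sigma > 0 adds Gaussian noise to the
    estimate (illustrative sketch, no projection step)."""
    x = x0.copy()
    for _ in range(steps):
        g = zo_gradient(f, x, rng=rng)
        if sigma > 0:
            g = g + rng.normal(scale=sigma, size=x.shape)
        x -= lr * g
    return x

# toy usage on a strongly convex quadratic
f = lambda x: 0.5 * np.sum((x - 1.0) ** 2)
print(zo_gd(f, np.zeros(5)))
```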
Poster
Shivvrat Arya · Tahrima Rahman · Vibhav Gogate
[ Hall A-E ]
Abstract
Our paper builds on the recent trend of using neural networks trained with self-supervised or supervised learning to solve the Most Probable Explanation (MPE) task in discrete graphical models. At inference time, these networks take an evidence assignment as input and generate the most likely assignment for the remaining variables via a single forward pass. We address two key limitations of existing approaches: (1) the inability to fully exploit the graphical model's structure and parameters, and (2) the suboptimal discretization of continuous neural network outputs. Our approach embeds model structure and parameters into a more expressive feature representation, significantly improving performance. Existing methods rely on standard thresholding, which often yields suboptimal results due to the non-convexity of the loss function. We introduce two methods to overcome discretization challenges: (1) an external oracle-based approach that infers uncertain variables using additional evidence from confidently predicted ones, and (2) a technique that identifies and selects the highest-scoring discrete solutions near the continuous output. Experimental results on various probabilistic models demonstrate the effectiveness and scalability of our approach, highlighting its practical impact.
Poster
Felix Jimenez · Matthias Katzfuss
[ Hall A-E ]
Abstract
For regression tasks, standard Gaussian processes (GPs) provide natural uncertainty quantification (UQ), while deep neural networks (DNNs) excel at representation learning. Deterministic UQ methods for neural networks have successfully combined the two and require only a single pass through the neural network. However, current methods necessitate changes to network training to address feature collapse, where unique inputs map to identical feature vectors. We propose an alternative solution, the deep Vecchia ensemble (DVE), which allows deterministic UQ to work in the presence of feature collapse, negating the need for network retraining. DVE comprises an ensemble of GPs built on hidden-layer outputs of a DNN, achieving scalability via Vecchia approximations that leverage nearest-neighbor conditional independence. DVE is compatible with pretrained networks and incurs low computational overhead. We demonstrate DVE's utility on several datasets and carry out experiments to understand the inner workings of the proposed method.
Poster
Haoming Yang · Ali Hasan · Vahid Tarokh
[ Hall A-E ]
Abstract
Regularizing continual learning techniques is important for anticipating algorithmic behavior under new realizations of data. We introduce a new approach to continual learning by imposing the properties of a parabolic partial differential equation (PDE) to regularize the expected behavior of the loss over time. This class of parabolic PDEs has a number of favorable properties that allow us to analyze the error incurred through forgetting and the error induced through generalization. Specifically, we do this through imposing boundary conditions where the boundary is given by a memory buffer. By using the memory buffer as a boundary, we can enforce long term dependencies by bounding the expected error by the boundary loss. Finally, we illustrate the empirical performance of the method on a series of continual learning tasks.
Poster
Iosif Lytras · Panayotis Mertikopoulos
[ Hall A-E ]
Abstract
Motivated by applications to deep learning which often fail standard Lipschitz smoothness requirements, we examine the problem of sampling from distributions that are not log-concave and are only weakly dissipative, with log-gradients allowed to grow superlinearly at infinity. In terms of structure, we only assume that the target distribution satisfies either a Log-Sobolev or a Poincaré inequality and a local Lipschitz smoothness assumption with modulus growing possibly polynomially at infinity. This set of assumptions greatly exceeds the operational limits of the "vanilla" ULA, making sampling from such distributions a highly involved affair. To account for this, we introduce a taming scheme which is tailored to the growth and decay properties of the target distribution, and we provide explicit non-asymptotic guarantees for the proposed sampler in terms of the KL divergence, total variation, and Wasserstein distance to the target distribution.
Poster
Wenjing Han · Yueming Wu · Xinwei Sun · Lingjing Hu · Yizhou Wang
[ Hall A-E ]
Abstract
In voxel-based neuroimaging disease prediction, it was recently found that in addition to lesion features, there exists another type of feature called "Procedural Bias", which is introduced during preprocessing and can further improve the prediction power. However, traditional sparse learning methods fail to simultaneously capture both types of features due to their heterogeneity in sparsity types. Specifically, the lesion features are spatially coherent and suffer from volumetric degeneration, while the procedural bias refers to enlarged voxels that are dispersedly distributed. In this paper, we propose a new method based on differential inclusion, which generates a sparse regularized solution path on a couple of parameters that are enforced with heterogeneous sparsity to capture lesion features and the procedural bias separately. Specifically, we employ Total Variation with a non-negative constraint for the parameter associated with degenerated and spatially coherent lesions; on the other hand, we impose $\ell_1$ sparsity with a non-positive constraint on the parameter related to enlarged and dispersedly distributed procedural bias. We theoretically show that our method enjoys model selection consistency and $\ell_2$ consistency in estimation. The utility of our method is demonstrated by improved prediction power and interpretability in the early prediction of Alzheimer's Disease.
Poster
Jiaru Zhang · Rui Ding · Qiang Fu · Huang Bojun · Zizhen Deng · Yang Hua · Haibing Guan · Shi Han · Dongmei Zhang
[ Hall A-E ]
Abstract
Causal discovery is a structured prediction task that aims to predict causal relations among variables based on their data samples. Supervised Causal Learning (SCL) is an emerging paradigm in this field. Existing Deep Neural Network (DNN)-based methods commonly adopt the “Node-Edge approach”, in which the model first computes an embedding vector for each variable-node, then uses these variable-wise representations to concurrently and independently predict for each directed causal-edge. In this paper, we first show that this architecture has some systematic bias that cannot be mitigated regardless of model size and data size. We then propose SiCL, a DNN-based SCL method that predicts a skeleton matrix together with a v-tensor (a third-order tensor representing the v-structures). According to the Markov Equivalence Class (MEC) theory, both the skeleton and the v-structures are *identifiable* causal structures under the canonical MEC setting, so predictions about skeleton and v-structures do not suffer from the identifiability limit in causal discovery, thus SiCL can avoid the systematic bias in Node-Edge architecture, and enable consistent estimators for causal discovery. Moreover, SiCL is also equipped with a specially designed pairwise encoder module with a unidirectional attention layer to model both internal and external relationships of pairs of nodes. Experimental results …
Poster
Qiran Dong · Paul Grigas · Vishal Gupta
[ Hall A-E ]
Abstract
Many applications require minimizing a family of optimization problems indexed by some hyperparameter $\lambda \in \Lambda$ to obtain an entire solution path. Traditional approaches proceed by discretizing $\Lambda$ and solving a series of optimization problems. We propose an alternative approach that parameterizes the solution path with a set of basis functions and solves a \emph{single} stochastic optimization problem to learn the entire solution path. Our method offers substantial complexity improvements over discretization. When using constant-step size SGD, the uniform error of our learned solution path relative to the true path exhibits linear convergence to a constant related to the expressiveness of the basis. When the true solution path lies in the span of the basis, this constant is zero. We also prove stronger results for special cases common in machine learning: When $\lambda \in [-1, 1]$ and the solution path is $\nu$-times differentiable, constant step-size SGD learns a path with $\epsilon$ uniform error after at most $O(\epsilon^{\frac{1}{1-\nu}} \log(1/\epsilon))$ iterations, and when the solution path is analytic, it only requires $O\left(\log^2(1/\epsilon)\log\log(1/\epsilon)\right)$. By comparison, the best-known discretization schemes in these settings require at least $O(\epsilon^{-1/2})$ discretization points (and even more gradient calls). Finally, we propose an adaptive variant of our method that …
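A toy instance of the proposed idea under assumptions of my own (ridge regression as the parametric family in $\lambda$, a monomial basis, uniform sampling of $\lambda$, and constant-step SGD); it only illustrates parameterizing the path with basis functions and solving a single stochastic problem:

```python
import numpy as np

rng = np.random.default_rng(0)
n, d, k = 100, 5, 6                          # samples, dimension, basis size
A = rng.normal(size=(n, d)); b = rng.normal(size=n)

def basis(lam):                               # simple monomial basis (an assumption)
    return np.array([lam ** j for j in range(k)])

def objective_grad(Theta, lam):
    """Gradient in Theta of f(x(lam), lam) = ||A x - b||^2 + lam ||x||^2,
    where the path is parameterized as x(lam) = Theta @ basis(lam)."""
    phi = basis(lam)
    x = Theta @ phi
    grad_x = 2.0 * A.T @ (A @ x - b) + 2.0 * lam * x
    return np.outer(grad_x, phi)              # chain rule through the basis

Theta = np.zeros((d, k))
for _ in range(20_000):                       # a single stochastic optimization over lam
    lam = rng.uniform(0.0, 1.0)
    Theta -= 1e-3 * objective_grad(Theta, lam)

# compare the learned path to the exact ridge solution at one lambda
lam = 0.3
x_exact = np.linalg.solve(A.T @ A + lam * np.eye(d), A.T @ b)
print(np.linalg.norm(Theta @ basis(lam) - x_exact))
```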
Poster
Mayleen Cortez-Rodriguez · Matthew Eichhorn · Christina Yu
[ Hall A-E ]
Abstract
Estimating causal effects under interference is pertinent to many real-world settings. Recent work with low-order potential outcomes models uses a rollout design to obtain unbiased estimators that require no interference network information. However, the required extrapolation can lead to prohibitively high variance. To address this, we propose a two-stage experiment that selects a sub-population in the first stage and restricts treatment rollout to this sub-population in the second stage. We explore the role of clustering in the first stage by analyzing the bias and variance of a polynomial interpolation-style estimator under this experimental design. Bias increases with the number of edges cut in the clustering of the interference network, but variance depends on qualities of the clustering that relate to homophily and covariate balance. There is a tension between clustering objectives that minimize the number of cut edges versus those that maximize covariate balance across clusters. Through simulations, we explore a bias-variance trade-off and compare the performance of the estimator under different clustering strategies.
Poster
Yusuke Tanaka · Takaharu Yaguchi · Tomoharu Iwata · Naonori Ueda
[ Hall A-E ]
Abstract
Operator learning has received significant attention in recent years, with the aim of learning a mapping between function spaces. Prior works have proposed deep neural networks (DNNs) for learning such mappings, enabling the learning of solution operators of partial differential equations (PDEs). However, these works still struggle to learn dynamics that obey the laws of physics. This paper proposes Energy-consistent Neural Operators (ENOs), a general framework for learning solution operators of PDEs that follow the energy conservation or dissipation law, from observed solution trajectories. We introduce a novel penalty function inspired by the energy-based theory of physics for training, in which the functional derivative is calculated using automatic differentiation, allowing one to bias the outputs of the DNN-based solution operators toward appropriate energetic behavior without explicit PDEs. Experiments on multiple systems show that ENO outperforms existing DNN models in predicting solutions from data, especially in super-resolution settings.
Poster
Matthieu Carreau · Roi Naveiro · William Caballero
[ Hall A-E ]
Abstract
Research in adversarial machine learning (AML) has shown that statistical models are vulnerable to maliciously altered data. However, despite advances in Bayesian machine learning models, most AML research remains concentrated on classical techniques. Therefore, we focus on extending the white-box model poisoning paradigm to attack generic Bayesian inference, highlighting its vulnerability in adversarial contexts. A suite of attacks is developed that allows an attacker to steer the Bayesian posterior toward a target distribution through the strategic deletion and replication of true observations, even when only sampling access to the posterior is available. Analytic properties of these algorithms are proven and their performance is empirically examined in both synthetic and real-world scenarios. With relatively little effort, the attacker is able to substantively alter the Bayesian's beliefs and, by accepting more risk, they can mold these beliefs to their will. By carefully constructing the adversarial posterior, surgical poisoning is achieved such that only targeted inferences are corrupted and others are minimally disturbed.
Poster
Julie Alberge · Vincent Maladiere · Olivier Grisel · Judith Abécassis · Gaël Varoquaux
[ Hall A-E ]
Abstract
When dealing with right-censored data, where some outcomes are missing due to a limited observation period, survival analysis, also known as *time-to-event analysis*, focuses on predicting the time until an event of interest occurs. Multiple classes of outcomes lead to a classification variant: predicting the most likely event, a less explored area known as *competing risks*. Classic competing risks models couple architecture and loss, limiting scalability. To address these issues, we design a strictly proper censoring-adjusted separable scoring rule, allowing optimization on a subset of the data as each observation is evaluated independently. The loss estimates outcome probabilities and enables stochastic optimization for competing risks, which we use for efficient gradient boosting trees. **SurvivalBoost** not only outperforms 12 state-of-the-art models across several metrics on 4 real-life datasets, both in competing risks and survival settings, but also provides excellent calibration, the ability to predict across any time horizon, and computation times faster than existing methods.
Poster
Rafał Karczewski · Samuel Kaski · Markus Heinonen · Vikas Garg
[ Hall A-E ]
Abstract
Several generative models with elaborate training and sampling procedures have been proposed to accelerate structure-based drug design (SBDD); however, their empirical performance turns out to be suboptimal. We seek to better understand this phenomenon from both theoretical and empirical perspectives. Since most of these models apply graph neural networks (GNNs), one may suspect that they inherit the representational limitations of GNNs. We analyze this aspect, establishing the first such results for protein-ligand complexes. A plausible counterview may attribute the underperformance of these models to their excessive parameterizations, inducing expressivity at the expense of generalization. We investigate this possibility with a simple metric-aware approach that learns an economical surrogate for affinity to infer an unlabelled molecular graph and optimizes for labels conditioned on this graph and molecular properties. The resulting model achieves state-of-the-art results using 100x fewer trainable parameters and affords up to 1000x speedup. Collectively, our findings underscore the need to reassess and redirect the existing paradigm and efforts for SBDD. Code is available at https://github.com/rafalkarczewski/SimpleSBDD.
Poster
Abdellah Rahmani · Pascal Frossard
[ Hall A-E ]
Abstract
Understanding causal relationships in multivariate time series is essential for predicting and controlling dynamic systems in fields like economics, neuroscience, and climate science. However, existing causal discovery methods often assume stationarity, limiting their effectiveness when time series consist of sequential regimes, i.e., consecutive temporal segments with unknown boundaries and changing causal structures. In this work, we first introduce a framework to describe and model such time series. Then, we present CASTOR, a novel method that concurrently learns the Directed Acyclic Graph (DAG) for each regime while determining the number of regimes and their sequential arrangement. CASTOR optimizes the data log-likelihood using an expectation-maximization algorithm, alternating between assigning regime indices (expectation step) and inferring causal relationships in each regime (maximization step). We establish the identifiability of the regimes and DAGs within our framework. Extensive experiments show that CASTOR consistently outperforms existing causal discovery models in detecting different regimes and learning their DAGs across various settings, including linear and nonlinear causal relationships, on both synthetic and real-world datasets.
Poster
Virginie Loison · Guillaume Staerman · Thomas Moreau
[ Hall A-E ]
Abstract
Physiological signal analysis often involves identifying events crucial to understanding biological dynamics. Many methods have been proposed to detect them, from handcrafted and supervised approaches to unsupervised techniques. All these methods tend to produce spurious events, mainly as they detect each event independently. This work introduces UNHaP (Unmix Noise from Hawkes Processes), a novel approach addressing the joint learning of temporal structures in events and the removal of spurious detections. By treating the event detection output as a mixture of structured Hawkes and unstructured Poisson events, UNHaP efficiently unmixes these processes and estimates their parameters. This approach significantly enhances event distribution characterization while minimizing false detection rates on simulated and real data.
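A hedged sketch of the modeling assumption (the exponential kernel and all parameter values are illustrative stand-ins, not UNHaP's estimates): detections are treated as a superposition of a structured Hawkes process and an unstructured homogeneous Poisson process, so each detection can be attributed probabilistically to one of the two components.

```python
# Hedged sketch of a Hawkes + Poisson superposition (illustrative only).
import numpy as np

mu, alpha, beta = 0.2, 0.8, 2.0      # Hawkes baseline, excitation, decay (assumed values)
noise_rate = 0.5                     # homogeneous Poisson rate for spurious detections

def hawkes_intensity(t, past_events):
    """Intensity of the structured process with an exponential kernel."""
    past = np.asarray([s for s in past_events if s < t])
    return mu + alpha * np.sum(np.exp(-beta * (t - past)))

events = [0.5, 0.9, 2.3]             # previously detected (structured) events
t_query = 2.5
structured = hawkes_intensity(t_query, events)
p_structured = structured / (structured + noise_rate)
print(f"P(structured | event at t={t_query}) ~ {p_structured:.2f}")
```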
Poster
Abhinav Agrawal · Justin Domke
[ Hall A-E ]
Abstract
Normalizing flow-based variational inference (flow VI) is a promising approximate inference approach, but its performance remains inconsistent across studies. Numerous algorithmic choices influence flow VI's performance. We conduct a step-by-step analysis to disentangle the impact of some of the key factors: capacity, objectives, gradient estimators, number of gradient estimates (batch size), and step-sizes. Each step examines one factor while neutralizing others using insights from the previous steps and/or using extensive parallel computation. To facilitate high-fidelity evaluation, we curate a benchmark of synthetic targets that represent common posterior pathologies and allow for exact sampling. We provide specific recommendations for different factors and propose a flow VI recipe that matches or surpasses leading turnkey Hamiltonian Monte Carlo (HMC) methods.
Poster
Danial Dervovic · Michael Cashmore
[ Hall A-E ]
Abstract
Missing data in supervised learning is well-studied, but the specific issue of missing labels during model evaluation has been overlooked. Ignoring samples with missing values, a common solution, can introduce bias, especially when data is Missing Not At Random (MNAR). We propose a multiple imputation technique for evaluating classifiers using metrics such as precision, recall, and ROC-AUC. This method not only offers point estimates but also a predictive distribution for these quantities when labels are missing. We empirically show that the predictive distribution's location and shape are generally correct, even in the MNAR regime. Moreover, we establish that this distribution is approximately Gaussian and provide finite-sample convergence bounds. Additionally, a robustness proof is presented, confirming the validity of the approximation under a realistic error model.
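A minimal sketch of the multiple-imputation idea under simplifying assumptions (the metric is plain accuracy, the missingness is generated at random for the toy data, and the classifier's own scores double as the imputation model, which in practice would be a separately fitted probability model):

```python
# Minimal sketch (assumptions, not the paper's implementation): evaluate a
# classifier when some test labels are missing, by multiply imputing the
# missing labels and reporting the resulting distribution of the metric.
import numpy as np

rng = np.random.default_rng(1)
n = 1000
p_hat = rng.uniform(0.05, 0.95, size=n)          # classifier scores on test points
y = (rng.uniform(size=n) < p_hat).astype(int)    # true labels (for simulation only)
observed = rng.uniform(size=n) < 0.6             # ~40% of labels are missing

def impute_once():
    """Draw one completed label vector from the imputation model P(y=1 | x)."""
    y_imp = y.copy()
    missing = ~observed
    y_imp[missing] = (rng.uniform(size=missing.sum()) < p_hat[missing]).astype(int)
    return y_imp

def accuracy(labels):
    return np.mean((p_hat > 0.5).astype(int) == labels)

draws = np.array([accuracy(impute_once()) for _ in range(200)])
print(f"imputed accuracy: {draws.mean():.3f} +/- {draws.std():.3f}")

cc = np.mean((p_hat[observed] > 0.5).astype(int) == y[observed])
print(f"complete-case estimate: {cc:.3f}")
```

The spread of the imputed draws is what provides the predictive distribution over the metric, rather than a single point estimate from the observed labels alone.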
Poster
Kaiqi Jiang · Wenzhe Fan · Mao Li · Xinhua Zhang
[ Hall A-E ]
Abstract
Fairness-aware classification models have gained increasing attention in recent years as concerns grow on discrimination against some demographic groups. Most existing models require full knowledge of the sensitive features, which can be impractical due to privacy, legal issues, and an individual’s fear of discrimination. The key challenge we will address is the group dependency of the unavailability, e.g., people of some age range may be more reluctant to reveal their age. Our solution augments general fairness risks with probabilistic imputations of the sensitive features, while jointly learning the group-conditionally missing probabilities in a variational auto-encoder. Our model is demonstrated effective on both image and tabular datasets, achieving an improved balance between accuracy and fairness.
Poster
Han Cui · Zhiyuan Yu · Jingbo Liu
[ Hall A-E ]
Abstract
Recently, Approximate Message Passing (AMP) has been integrated with stochastic localization (diffusion model) by providing a computationally efficient estimator of the posterior mean. Existing (rigorous) analysis typically proves the success of sampling for sufficiently small noise, but determining the exact threshold involves several challenges. In this paper, we focus on sampling from the posterior in the linear inverse problem, with an i.i.d. random design matrix, and show that the threshold for sampling coincides with that of posterior mean estimation. We prove convergence in smoothed KL divergence whenever the noise variance $\Delta$ is below $\Delta_{\rm AMP}$, which is the computation threshold for mean estimation introduced in (Barbier et al., 2020). We also show convergence in the Wasserstein distance under the same threshold assuming a dimension-free bound on the operator norm of the posterior covariance matrix, a condition strongly suggested by recent breakthroughs on operator norm bounds in similar replica symmetric systems. A key observation in our analysis is that phase transition does not occur along the sampling and interpolation paths assuming $\Delta<\Delta_{\rm AMP}$.
Poster
Yihan Zhou · Eric Price · Trung Nguyen
[ Hall A-E ]
Abstract
We address the problem of active logistic regression in the realizable setting. It is well known that active learning can require exponentially fewer label queries compared to passive learning, in some cases using $\log \frac{1}{\varepsilon}$ rather than $\mathrm{poly}(1/\varepsilon)$ samples to get error $\varepsilon$ larger than the optimum. We present the first algorithm that is polynomially competitive with the optimal algorithm on every input instance, up to factors polylogarithmic in the error and domain size. In particular, if any algorithm achieves sample complexity polylogarithmic in $\varepsilon$, so does ours. Our algorithm is based on efficient sampling and can be extended to learn a more general class of functions. We further support our theoretical results with experiments demonstrating performance gains for logistic regression compared to existing active learning algorithms.
Poster
Bailey Andrew · David Westhead · Luisa Cutillo
[ Hall A-E ]
Abstract
Multi-axis graphical modelling techniques allow us to perform network inference without making independence assumptions. This is done by replacing the independence assumption with a weaker assumption about the interaction between the axes; there are several choices for which assumption to use. In single-cell RNA sequencing data, genes may interact differently depending on whether they are expressed in the same cell, or in different cells. Unfortunately, current methods are not able to make this distinction. In this paper, we address this problem by introducing the strong product model for Gaussian graphical modelling.
Poster
Koshi Watanabe · Keisuke Maeda · Takahiro Ogawa · Miki Haseyama
[ Hall A-E ]
Abstract
Dimensionality reduction (DR) offers interpretable representations of complex high-dimensional data, and recent DR methods have leveraged hyperbolic geometry to obtain faithful low-dimensional embeddings of high-dimensional hierarchical relationships. However, existing methods are dependent on neighbor embedding, which frequently ruins the continuous nature of the hierarchical structures. This paper proposes hyperboloid Gaussian process latent variable models (hGP-LVMs) to embed high-dimensional hierarchical data while preserving the implicit continuity via nonparametric estimation. We adopt generative modeling using the GP, which provides effective hierarchical embedding and executes ill-posed hyperparameter tuning. This paper presents three variants of the proposed models that employ original point, sparse point, and Bayesian estimations, and we establish their learning algorithms by incorporating the Riemannian optimization and active approximation scheme of the GP-LVM. In addition, we employ the reparameterization trick for scalable learning of the latent variables in the Bayesian estimation method. The proposed hGP-LVMs were applied to several datasets, and the results demonstrate their ability to represent high-dimensional hierarchies in low-dimensional spaces.
Poster
Hao Liu · Junze Ye · Jose Blanchet · NIAN SI
[ Hall A-E ]
Abstract
We introduce ScoreFusion, a theoretically grounded method for fusing multiple pre-trained diffusion models that are assumed to generate from auxiliary populations. ScoreFusion is particularly useful for enhancing the generative modeling of a target population with limited observed data. Our starting point considers the family of KL barycenters of the auxiliary populations, which is proven to be an optimal parametric class in the KL sense, but difficult to learn. Nevertheless, by recasting the learning problem as score matching in denoising diffusion, we obtain a tractable way of computing the optimal KL barycenter weights. We prove a dimension-free sample complexity bound in total variation distance, provided that the auxiliary models are well-fitted for their own task and the auxiliary tasks combined capture the target well. The sample efficiency of ScoreFusion is demonstrated by learning handwritten digits. We also provide a simple adaptation of a Stable Diffusion denoising pipeline that enables sampling from the KL barycenter of two auxiliary checkpoints; on a portrait generation task, our method produces faces that enhance population heterogeneity relative to the auxiliary distributions.
Poster
Tatsuya Matsukawa · Tomohiro Shiraishi · Shuichi Nishino · Teruyuki Katsuoka · Ichiro Takeuchi
[ Hall A-E ]
Abstract
Auto Feature Engineering (AFE) plays a crucial role in developing practical machine learning pipelines by automating the transformation of raw data into meaningful features that enhance model performance. By generating features in a data-driven manner, AFE enables the discovery of important features that may not be apparent through human experience or intuition. On the other hand, since AFE generates features based on data, there is a risk that these features may be overly adapted to the data, making it essential to assess their reliability appropriately. Unfortunately, because most AFE problems are formulated as combinatorial search problems and solved by heuristic algorithms, it has been challenging to theoretically quantify the reliability of generated features. To address this issue, we propose a new statistical test for generated features by AFE algorithms based on a framework called selective inference. As a proof of concept, we consider a simple class of tree search-based heuristic AFE algorithms, and consider the problem of testing the generated features when they are used in a linear model. The proposed test can quantify the statistical significance of the generated features in the form of $p$-values, enabling theoretically guaranteed control of the risk of false findings.
Poster
Gunnar König · Eric Günther · Ulrike von Luxburg
[ Hall A-E ]
Abstract
In explainable machine learning, global feature importance methods try to determine how much each individual feature contributes to predicting the target variable, resulting in one importance score for each feature. But often, predicting the target variable requires interactions between several features (such as in the XOR function), and features might have complex statistical dependencies that allow one feature to be partially replaced by another. In commonly used feature importance scores, these cooperative effects are conflated with the features' individual contributions, making them prone to misinterpretations. In this work, we derive DIP, a new mathematical decomposition of individual feature importance scores that disentangles three components: the standalone contribution and the contributions stemming from interactions and dependencies. We prove that the DIP decomposition is unique and show how it can be estimated in practice. Based on these results, we propose a new visualization of feature importance scores that clearly illustrates the different contributions.
Poster
Henry Yuchi · Shixiang Zhu · Li Dong · Yigit Arisoy · Matthew Spencer
[ Hall A-E ]
Abstract
Modeling and analysis for event series generated by users of heterogeneous behavioral patterns are closely involved in our daily lives, including credit card fraud detection, online platform user recommendation, and social network analysis. The most commonly adopted approach to this task is to assign users to behavior-based categories and analyze each of them separately. However, this requires extensive data to fully understand the user behavior, presenting challenges in modeling newcomers without significant historical knowledge. In this work, we propose a novel discrete event prediction framework for new users with limited history, without needing to know the user's category. We treat the user event history as the "treatment" for future events and the user category as the key confounder. Thus, the prediction problem can be framed as counterfactual outcome estimation, where each event is re-weighted by its inverse propensity score. We demonstrate the improved performance of the proposed framework with a numerical simulation study and two real-world applications, including Netflix rating prediction and seller contact prediction for customer support at Amazon.
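The reweighting step can be illustrated with a toy inverse-propensity-weighting example; all variables, the binary "treatment" summary of the history, and the logistic propensity model below are illustrative stand-ins rather than the paper's framework.

```python
# Minimal sketch of inverse-propensity reweighting with a confounding category.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(2)
n = 5000
category = rng.integers(0, 3, size=n)                                 # latent user category (confounder)
history = (rng.uniform(size=n) < 0.2 + 0.25 * category).astype(int)   # "treatment" summary of the history
outcome = 0.5 * history + 0.3 * category + rng.normal(scale=0.1, size=n)

# Propensity model: P(history = 1 | category)
ps_model = LogisticRegression().fit(category.reshape(-1, 1), history)
ps = ps_model.predict_proba(category.reshape(-1, 1))[:, 1]
w = np.where(history == 1, 1.0 / ps, 1.0 / (1.0 - ps))                # inverse propensity weights

# IPW estimate of the effect of the history "treatment" on the outcome
ate = np.average(outcome[history == 1], weights=w[history == 1]) \
    - np.average(outcome[history == 0], weights=w[history == 0])
print(f"IPW-estimated effect: {ate:.3f} (true effect 0.5)")
```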
Poster
Maximilian Fleissner · Gautham Anil · Debarghya Ghoshdastidar
[ Hall A-E ]
Abstract
The NTK is a widely used tool in the theoretical analysis of deep learning, allowing us to look at supervised deep neural networks through the lens of kernel regression. Recently, several works have investigated kernel models for self-supervised learning, hypothesizing that these also shed light on the behavior of wide neural networks by virtue of the NTK. However, it remains an open question to what extent this connection is mathematically sound --- it is a common misconception that the kernel behavior of wide neural networks emerges irrespective of the loss function they are trained on. In this paper, we bridge the gap between the NTK and self-supervised learning, focusing on two-layer neural networks trained under the Barlow Twins loss. We prove that the NTK of Barlow Twins indeed becomes constant as the width of the network approaches infinity. Our analysis technique differs from previous works on the NTK and may be of independent interest. Overall, our work provides a first justification for the use of classic kernel theory to understand self-supervised learning of wide neural networks. Building on this result, we derive generalization error bounds for kernelized Barlow Twins and connect them to neural networks of …
Poster
N. Benjamin Erichson · Soon Hoe Lim · Michael Mahoney
[ Hall A-E ]
Abstract
In this paper, we present a novel approach to modeling long-term dependencies in sequential data by introducing a gated recurrent unit (GRU) with a weighted time-delay feedback mechanism. Our proposed model, named $\tau$-GRU, is a discretized version of a continuous-time formulation of a recurrent unit, where the dynamics are governed by delay differential equations (DDEs). We prove the existence and uniqueness of solutions for the continuous-time model and show that the proposed feedback mechanism can significantly improve the modeling of long-term dependencies. Our empirical results indicate that $\tau$-GRU outperforms state-of-the-art recurrent units and gated recurrent architectures on a range of tasks, achieving faster convergence and better generalization.
Poster
Ali Azizpour · Nicolas Zilberstein · Santiago Segarra
[ Hall A-E ]
Abstract
Graphons are continuous models that represent the structure of graphs and allow the generation of graphs of varying sizes. We propose Scalable Implicit Graphon Learning (SIGL), a scalable method that combines implicit neural representations (INRs) and graph neural networks (GNNs) to estimate a graphon from observed graphs. Unlike existing methods, which face important limitations like fixed resolution and scalability issues, SIGL learns a continuous graphon at arbitrary resolutions. GNNs are used to determine the correct node ordering, improving graph alignment. Furthermore, we characterize the asymptotic consistency of our estimator, showing that more expressive INRs and GNNs lead to consistent estimators. We evaluate SIGL in synthetic and real-world graphs, showing that it outperforms existing methods and scales effectively to larger graphs, making it ideal for tasks like graph data augmentation.
Poster
Son Luu · Zuheng Xu · Nikola Surjanovic · Miguel Biron-Lattes · Trevor Campbell · Alexandre Bouchard-Côté
[ Hall A-E ]
Abstract
The Hamiltonian Monte Carlo (HMC) algorithm is often lauded for its ability to effectively sample from high-dimensional distributions. In this paper we challenge the presumed domination of HMC for the Bayesian analysis of GLMs. By utilizing the structure of the compute graph rather than the graphical model, we reduce the time per sweep of a full-scan Gibbs sampler from $O(d^2)$ to $O(d)$, where $d$ is the number of GLM parameters. Our simple changes to the implementation of the Gibbs sampler allow us to perform Bayesian inference on high-dimensional GLMs that are practically infeasible with traditional Gibbs sampler implementations. We empirically demonstrate a substantial increase in effective sample size per unit time when comparing our Gibbs algorithms to state-of-the-art HMC algorithms. While Gibbs is superior in terms of dimension scaling, neither Gibbs nor HMC dominates the other: we provide numerical and theoretical evidence that HMC retains an edge in certain circumstances thanks to its advantageous condition number scaling. Interestingly, for GLMs of fixed data size, we observe that increasing dimensionality can stabilize or even decrease condition number, shedding light on the empirical advantage of our efficient Gibbs sampler.
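The speed-up comes from exploiting the compute graph; a plausible illustration of the kind of trick involved (not the authors' implementation) is to cache the linear predictor and the prior's sum of squares and update both incrementally when a single coordinate changes, avoiding a full recomputation per coordinate. The sketch below uses a Metropolis-within-Gibbs update on a logistic GLM for concreteness.

```python
# Hedged illustration of incremental per-coordinate updates in a GLM sampler.
import numpy as np

rng = np.random.default_rng(3)
n, d = 500, 200
X = rng.normal(size=(n, d)) / np.sqrt(d)
y = rng.integers(0, 2, size=n)

def log_post(eta, ssq):
    # Logistic log-likelihood with logit eta, plus a standard normal prior (illustrative model).
    return np.sum(y * eta - np.logaddexp(0.0, eta)) - 0.5 * ssq

beta = np.zeros(d)
eta = X @ beta                      # cached linear predictor, maintained incrementally
ssq = 0.0                           # cached sum of squares of beta (for the prior)
for sweep in range(5):
    for j in range(d):
        prop = beta[j] + 0.5 * rng.normal()             # random-walk proposal for one coordinate
        eta_prop = eta + X[:, j] * (prop - beta[j])     # O(n) update, no full X @ beta recompute
        ssq_prop = ssq - beta[j] ** 2 + prop ** 2
        log_acc = log_post(eta_prop, ssq_prop) - log_post(eta, ssq)
        if np.log(rng.uniform()) < log_acc:
            beta[j], eta, ssq = prop, eta_prop, ssq_prop

print("posterior sample (first 5 coords):", np.round(beta[:5], 3))
```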
Poster
Ruichen Luo · Sebastian Stich · Samuel Horvath · Martin Takac
[ Hall A-E ]
Abstract
LocalSGD and SCAFFOLD are widely used methods in distributed stochastic optimization, with numerous applications in machine learning, large-scale data processing, and federated learning. However, rigorously establishing their theoretical advantages over simpler methods, such as minibatch SGD (MbSGD), has proven challenging, as existing analyses often rely on strong assumptions, unrealistic premises, or overly restrictive scenarios. In this work, we revisit the convergence properties of LocalSGD and SCAFFOLD under a variety of existing or weaker conditions, including gradient similarity, Hessian similarity, weak convexity, and Lipschitz continuity of the Hessian. Our analysis shows that (i) LocalSGD achieves faster convergence compared to MbSGD for weakly convex functions without requiring stronger gradient similarity assumptions; (ii) LocalSGD benefits significantly from higher-order similarity and smoothness; and (iii) SCAFFOLD demonstrates faster convergence than MbSGD for a broader class of non-quadratic functions. These theoretical insights provide a clearer understanding of the conditions under which LocalSGD and SCAFFOLD outperform MbSGD.
Poster
Yiming Zhang · Parthe Pandit
[ Hall A-E ]
Abstract
In this paper we derive a novel iterative algorithm for learning kernel machines. Our algorithm, $\textsf{AxlePro}$, extends the $\textsf{EigenPro}$ family of algorithms via momentum-based acceleration. $\textsf{AxlePro}$ can be applied to train kernel machines with arbitrary positive semidefinite kernels. We provide a convergence guarantee for the algorithm and demonstrate the speed-up of $\textsf{AxlePro}$ over competing algorithms via numerical experiments. Furthermore, we also derive a version of $\textsf{AxlePro}$ to train large kernel models over arbitrarily large datasets.
Poster
Albert Saiapin · Kim Batselier
[ Hall A-E ]
Abstract
Many approximations were suggested to circumvent the cubic complexity of kernel-based algorithms, allowing their application to large-scale datasets. One strategy is to consider the primal formulation of the learning problem by mapping the data to a higher-dimensional space using tensor-product structured polynomial and Fourier features. The curse of dimensionality due to these tensor-product features was effectively solved by a tensor network reparameterization of the model parameters. However, another important aspect of model training — identifying optimal feature hyperparameters — has not been addressed and is typically handled using the standard cross-validation approach. In this paper, we introduce the Feature Learning (FL) model, which addresses this issue by representing tensor-product features as a learnable Canonical Polyadic Decomposition (CPD). By leveraging this CPD structure, we efficiently learn the hyperparameters associated with different features alongside the model parameters using an Alternating Least Squares (ALS) optimization method. We demonstrate the effectiveness of the FL model through experiments on real data of various dimensionality and scale. The results show that the FL model can be consistently trained 3-5 times faster than a standard cross-validated model while achieving prediction quality on par with it.
Poster
Sahil Sidheekh · Pranuthi Tenali · Saurabh Mathur · Erik Blasch · Kristian Kersting · Sriraam Natarajan
[ Hall A-E ]
Abstract
We consider the problem of late multimodal fusion for discriminative learning. Motivated by noisy, multi-source domains that require understanding the reliability of each data source, we explore the notion of credibility in the context of multimodal fusion. We propose a combination function that uses probabilistic circuits (PCs) to combine predictive distributions over individual modalities. We also define a probabilistic measure to evaluate the credibility of each modality via inference queries over the PC. Our experimental evaluation demonstrates that our fusion method can reliably infer credibility while being competitive with the state-of-the-art.
Poster
Takahiro Kawashima · Hideitsu Hino
[ Hall A-E ]
Abstract
Positive and negative dependence are fundamental concepts that characterize the attractive and repulsive behavior of random subsets. Although some probabilistic models are known to exhibit positive or negative dependence, it is challenging to seamlessly bridge them with a practicable probabilistic model. In this study, we introduce a new family of distributions, named the discrete kernel point process (DKPP), which includes determinantal point processes and parts of Boltzmann machines. We also develop some computational methods for probabilistic operations and inference with DKPPs, such as calculating marginal and conditional probabilities and learning the parameters. Our numerical experiments demonstrate the controllability of positive and negative dependence and the effectiveness of the computational methods for DKPPs.
Poster
Marcus Häggbom · Morten Karlsmark · Joakim Andén
[ Hall A-E ]
Abstract
Microcanonical gradient descent is a sampling procedure for energy-based models allowing for efficient sampling of distributions in high dimension. It works by transporting samples from a high-entropy distribution, such as Gaussian white noise, to a low-energy region using gradient descent. We put this model in the framework of normalizing flows, showing how it can often overfit by losing an unnecessary amount of entropy in the descent. As a remedy, we propose a mean-field microcanonical gradient descent that samples several weakly coupled data points simultaneously, allowing for better control of the entropy loss while paying little in terms of likelihood fit. We study these models in the context of stationary time series and 2D textures.
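A minimal sketch of microcanonical gradient descent on an assumed toy quadratic energy (the paper's energies come from models of stationary time series and textures, and its contribution is the mean-field coupling that controls entropy loss, which is not shown here):

```python
# Minimal sketch: transport white-noise samples to a low-energy region by
# gradient descent on an energy function E(x). Toy quadratic energy assumed.
import numpy as np

rng = np.random.default_rng(4)
target_mean = np.array([2.0, -1.0])

def energy(x):
    return 0.5 * np.sum((x - target_mean) ** 2, axis=-1)

def energy_grad(x):
    return x - target_mean

# Start from a high-entropy distribution (Gaussian white noise) ...
samples = rng.normal(size=(1000, 2))
# ... and descend the energy for a fixed number of steps.
step = 0.1
for _ in range(50):
    samples -= step * energy_grad(samples)

print("mean energy after descent:", energy(samples).mean())
```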
Poster
Manan Saxena · Tinghua Chen · Justin Silverman
[ Hall A-E ]
Abstract
Many scientific fields collect longitudinal count compositional data. Each observation is a multivariate count vector, where the total counts are arbitrary, and the information lies in the relative frequency of the counts. Multiple authors have proposed Bayesian Multinomial Logistic-Normal Dynamic Linear Models (MLN-DLMs) as a flexible approach to modeling these data. However, adoption of these methods has been limited by computational challenges. This article develops an efficient and accurate approach to posterior state estimation, called Fenrir. Our approach relies on a novel algorithm for MAP estimation and an accurate approximation to a key posterior marginal of the model. As there are no equivalent methods against which we can compare, we also develop an optimized Stan implementation of MLN-DLMs. Our experiments suggest that Fenrir can be three orders of magnitude more efficient than Stan and can even be incorporated into larger sampling schemes for joint inference of model hyperparameters. Our methods are made available to the community as a user-friendly software library written in C++ with an R interface.
Poster
Vahid Shahverdi · Giovanni Luca Marchetti · Kathlén Kohn
[ Hall A-E ]
Abstract
We study convolutional neural networks with monomial activation functions. Specifically, we prove that their parameterization map is regular and is an isomorphism almost everywhere, up to rescaling the filters. By leveraging tools from algebraic geometry, we explore the geometric properties of the image in function space of this map -- typically referred to as the neuromanifold. In particular, we compute the dimension and the degree of the neuromanifold, which measure the expressivity of the model, and describe its singularities. Moreover, for a generic large dataset, we derive an explicit formula that quantifies the number of critical points arising in the optimization of a regression loss.
Poster
Leonel Rozo · Miguel González-Duque · Noémie Jaquier · Soren Hauberg
[ Hall A-E ]
Abstract
Latent variable models are powerful tools for learning low-dimensional manifolds from high-dimensional data. However, when dealing with constrained data such as unit-norm vectors or symmetric positive-definite matrices, existing approaches ignore the underlying geometric constraints or fail to provide meaningful metrics in the latent space. To address these limitations, we propose to learn Riemannian latent representations of such geometric data. To do so, we estimate the pullback metric induced by a Wrapped Gaussian Process Latent Variable Model, which explicitly accounts for the data geometry. This enables us to define geometry-aware notions of distance and shortest paths in the latent space, while ensuring that our model only assigns probability mass to the data manifold. This generalizes previous work and allows us to handle complex tasks in various domains, including robot motion synthesis and analysis of brain connectomes.
Poster
Peiyuan Zhang · Jiaye Teng · Jingzhao Zhang
[ Hall A-E ]
Abstract
This work studies the generalization error of gradient methods. More specifically, we focus on how training steps $T$ and step-size $\eta$ might affect generalization in smooth stochastic convex optimization (SCO) problems. Recent works show that in some cases longer training can hurt generalization. Our work reexamines this for smooth SCO and finds that the conclusion can be case-dependent. In particular, we first study SCO problems when the loss is \emph{realizable}, i.e. an optimal solution minimizes all the data points. Our work provides excess risk lower bounds for Gradient Descent (GD) and Stochastic Gradient Descent (SGD) and finds that longer training may not hurt generalization. In the short training scenario $\eta T = O(n)$ ($n$ is sample size), our lower bounds tightly match and certify the respective upper bounds. However, for the long training scenario where $\eta T = \Omega(n)$, our analysis reveals a gap between the lower and upper bounds, indicating that longer training does hurt generalization for realizable objectives. A conjecture is proposed that the gap can be closed by improving upper bounds, supported by analyses in two special instances. Moreover, besides the realizable setup, we also provide the first tight excess risk lower bounds for GD and SGD under the …
Poster
Sebastian Salazar · Michal Kucer · Yixin Wang · Emily Casleton · David Blei
[ Hall A-E ]
Abstract
This paper introduces posterior mean matching (PMM), a new method for generative modeling that is grounded in Bayesian inference. PMM uses conjugate pairs of distributions to model complex data of various modalities like images and text, offering a flexible alternative to existing methods like diffusion models. PMM models iteratively refine noisy approximations of the target distribution using updates from online Bayesian inference. PMM is flexible because its mechanics are based on general Bayesian models. We demonstrate this flexibility by developing specialized examples: a generative PMM model of real-valued data using the Normal-Normal model, a generative PMM model of count data using a Gamma-Poisson model, and a generative PMM model of discrete data using a Dirichlet-Categorical model. For the Normal-Normal PMM model, we establish a direct connection to diffusion models by showing that its continuous-time formulation converges to a stochastic differential equation (SDE). Additionally, for the Gamma-Poisson PMM, we derive a novel SDE driven by a Cox process, which is a significant departure from traditional Brownian motion-based generative models. PMMs achieve performance that is competitive with generative models for language modeling and image generation.
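A hedged sketch of the Normal-Normal conjugate building block that PMM-style updates rely on (this shows only the online posterior refinement ingredient, with assumed noise levels, not the full generative model):

```python
# Minimal sketch: online conjugate Normal-Normal updates of a posterior mean.
import numpy as np

rng = np.random.default_rng(5)
true_x = 1.7                   # the "clean" value the posterior should concentrate on
obs_var = 0.5                  # known observation noise variance (assumed)

mu, var = 0.0, 10.0            # prior over the unknown mean
for t in range(100):
    y = true_x + np.sqrt(obs_var) * rng.normal()   # noisy observation
    # Conjugate Normal-Normal posterior update
    precision = 1.0 / var + 1.0 / obs_var
    mu = (mu / var + y / obs_var) / precision
    var = 1.0 / precision

print(f"posterior mean {mu:.3f}, posterior sd {np.sqrt(var):.3f}")
```

In the generative setting described above, it is this kind of iteratively refined posterior mean, rather than a fixed target value, that drives the model's noisy approximations toward the data distribution.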
Poster
Pierre Marion · Anna Korba · Peter Bartlett · Mathieu Blondel · Valentin De Bortoli · Arnaud Doucet · Felipe Llinares-López · Courtney Paquette · Quentin Berthet
[ Hall A-E ]
Abstract
Sampling and automatic differentiation are both ubiquitous in modern machine learning. At its intersection, differentiating through a sampling operation, with respect to the parameters of the sampling process, is a problem that is both challenging and broadly applicable. We introduce a general framework and a new algorithm for first-order optimization of parameterized stochastic diffusions, performing jointly, in a single loop, optimization and sampling steps. This approach is inspired by recent advances in bilevel optimization and automatic implicit differentiation, leveraging the point of view of sampling as optimization over the space of probability distributions. We provide theoretical and experimental results showcasing the performance of our method.
Poster
Yiming Wang · Yuxuan Song · Yiqun Wang · Minkai Xu · Rui Wang · Hao Zhou · Wei-Ying Ma
[ Hall A-E ]
Abstract
Retrosynthesis poses a key challenge in biopharmaceuticals, aiding chemists in finding appropriate reactant molecules for given product molecules. With reactants and products represented as 2D graphs, retrosynthesis constitutes a conditional graph-to-graph (G2G) generative task. Inspired by advancements in discrete diffusion models for graph generation, we aim to design a diffusion-based method to address this problem. However, integrating a diffusion-based G2G framework while retaining essential chemical reaction template information presents a notable challenge. Our key innovation involves a multi-stage diffusion process. We decompose the retrosynthesis procedure to first sample external groups from the dummy distribution given products, then generate external bonds to connect products and generated groups. Interestingly, this generation process mirrors the reverse of the widely adopted semi-template retrosynthesis workflow, i.e., from reaction center identification to synthon completion. Based on these designs, we introduce Retrosynthesis Diffusion (RetroDiff), a novel diffusion-based method for the retrosynthesis task. Experimental results demonstrate that RetroDiff surpasses all semi-template methods in accuracy, and outperforms template-based and template-free methods in large-scale scenarios and molecular validity, respectively.
Poster
Didier Chételat · Joseph Cotnareanu · Rylee Thompson · Yingxue Zhang · Mark Coates
[ Hall A-E ]
Abstract
Large language models (LLMs) contain substantial factual knowledge which is commonly elicited by multiple-choice question-answering prompts. Internally, such models process the prompt through multiple transformer layers, building varying representations of the problem within their hidden states. Ultimately, however, only the hidden state corresponding to the final layer and token position is used to predict the answer label. In this work, we propose instead to learn a small separate neural network predictor module on a collection of training questions, which takes the hidden states from all the layers at the last temporal position as input and outputs predictions. In effect, such a framework disentangles the representational abilities of LLMs from their predictive abilities. On a collection of hard benchmarks, our method achieves considerable improvements in performance, sometimes comparable to supervised fine-tuning procedures, but at a fraction of the computational cost.
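A minimal sketch of the predictor-module idea with synthetic stand-in hidden states; the shapes, the MLP head, and the four-way answer space are assumptions for illustration rather than the paper's architecture.

```python
# Minimal sketch: train a small separate predictor on the concatenated
# last-token hidden states from all layers, instead of reading the answer
# off the final layer only. Synthetic stand-in data.
import numpy as np
from sklearn.neural_network import MLPClassifier

rng = np.random.default_rng(6)
n_questions, n_layers, hidden_dim = 800, 12, 64
# Stand-in for hidden_states[layer][question, last_token, :] extracted from an LLM.
H = rng.normal(size=(n_questions, n_layers, hidden_dim))
labels = rng.integers(0, 4, size=n_questions)          # multiple-choice answer A-D

X = H.reshape(n_questions, n_layers * hidden_dim)      # concatenate across layers
split = 600
clf = MLPClassifier(hidden_layer_sizes=(256,), max_iter=300, random_state=0)
clf.fit(X[:split], labels[:split])
print("held-out accuracy:", clf.score(X[split:], labels[split:]))
```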
Poster
Shinsaku Sakaue · Han Bao · Taira Tsuchiya
[ Hall A-E ]
Abstract
This paper revisits the online learning approach to inverse linear optimization studied by Bärmann et al. (2017), where the goal is to infer an unknown linear objective function of an agent from sequential observations of the agent's input-output pairs. First, we provide a simple understanding of the online learning approach through its connection to online convex optimization of *Fenchel–Young losses*. As a byproduct, we present an offline guarantee on the *suboptimality loss*, which measures how well predicted objective vectors explain the agent's choices, without assuming the optimality of the agent's choices. Second, assuming that there is a gap between optimal and suboptimal objective values in the agent's decision problems, we obtain an upper bound independent of the time horizon $T$ on the sum of suboptimality and *estimate losses*, where the latter measures the quality of solutions recommended by predicted objective vectors. Interestingly, our gap-dependent analysis achieves a faster rate than the standard $O(\sqrt{T})$ regret bound by exploiting structures specific to inverse linear optimization, even though neither the loss functions nor their domains possess desirable properties, such as strong convexity.
Poster
Jong-Ik Park · Srinivasa Pranav · José Moura · Carlee Joe-Wong
[ Hall A-E ]
Abstract
Foundation models are now a major focus of leading technology organizations due to their ability to generalize across diverse tasks. Existing approaches for adapting foundation models to new applications often rely on Federated Learning (FL) and disclose the foundation model weights to clients when using it to initialize the global model. While these methods ensure client data privacy, they compromise model and information security. In this paper, we introduce Federated Learning Aggregation Biased by a Foundation Model (FedBaF), a novel method for dynamically integrating pre-trained foundation model weights during the FL aggregation phase. Unlike conventional methods, FedBaF preserves the confidentiality of the foundation model while still leveraging its power to train more accurate models, especially in non-IID and adversarial scenarios. Our comprehensive experiments use Pre-ResNet and foundation models like Vision Transformer to demonstrate that FedBaF not only matches, but often surpasses the test accuracy of traditional weight initialization methods by up to 11.4\% in IID and up to 15.8\% in non-IID settings. Additionally, FedBaF applied to a Transformer-based language model significantly reduced perplexity by up to 39.2\%.
Poster
Qingshi Sun · Nathan Justin · Andres Gomez · Phebe Vayanos
[ Hall A-E ]
Abstract
Logistic regression models are widely used in the social and behavioral sciences and in high-stakes domains, due to their simplicity and interpretability properties. At the same time, such domains are permeated by distribution shifts, where the distribution generating the data changes between training and deployment. In this paper, we study a distributionally robust logistic regression problem that seeks the model that will perform best against adversarial realizations of the data distribution drawn from a suitably constructed Wasserstein ambiguity set. Our model and solution approach differ from prior work in that we can capture settings where the likelihood of distribution shifts can vary across features, significantly broadening the applicability of our model relative to the state-of-the-art. We propose a graph-based solution approach that can be integrated into off-the-shelf optimization solvers. We evaluate the performance of our model and algorithms on numerous publicly available datasets. Our solution achieves a 408x speed-up relative to the state-of-the-art. Additionally, compared to the state-of-the-art, our model reduces average calibration error by up to 36.19% and worst-case calibration error by up to 41.70%, while increasing the average area under the ROC curve (AUC) by up to 18.02% and worst-case AUC by up to 48.37%.
Poster
Gergely Neu · Nneka Okolo
[ Hall A-E ]
Abstract
We study offline Reinforcement Learning in large infinite-horizon discounted Markov Decision Processes (MDPs) when the reward and transition models are linearly realizable under a known feature map. Starting from the classic linear-program formulation of the optimal control problem in MDPs, we develop a new algorithm that performs a form of gradient ascent in the space of feature occupancies, defined as the expected feature vectors that can potentially be generated by executing policies in the environment. We show that the resulting simple algorithm satisfies strong computational and sample complexity guarantees, achieved under the least restrictive data coverage assumptions known in the literature. In particular, we show that the sample complexity of our method scales optimally with the desired accuracy level and depends on a weak notion of coverage that only requires the empirical feature covariance matrix to cover a single direction in the feature space (as opposed to covering a full subspace). Additionally, our method can be implemented efficiently without requiring any computational oracles, and requires no prior knowledge of the coverage ratio (or even an upper bound on it), which altogether make it the strongest known algorithm for this setting to date.
Poster
Daniel Guzmán Olivares · Philipp Schmidt · Jacek Golebiowski · Artur Bekasov
[ Hall A-E ]
Abstract
Off-policy evaluation can leverage logged data to estimate the effectiveness of new policies in e-commerce, search engines, media streaming services, or automatic diagnostic tools in healthcare. However, the performance of baseline off-policy estimators like IPS deteriorates when the logging policy significantly differs from the evaluation policy. Recent work proposes sharing information across similar actions to mitigate this problem. In this work, we propose an alternative estimator that shares information across similar contexts using clustering. We study the theoretical properties of the proposed estimator, characterizing its bias and variance under different conditions. We also compare the performance of the proposed estimator and existing approaches in various synthetic problems, as well as a real-world recommendation dataset. Our experimental results confirm that clustering contexts improves estimation accuracy, especially in deficient information settings.
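A hedged sketch of the general idea (not the paper's estimator or its bias/variance analysis): importance weights are formed with logging propensities estimated at the level of context clusters rather than per individual context.

```python
# Minimal sketch: IPS-style off-policy value estimate with cluster-level
# logging propensities. All policies and data are illustrative stand-ins.
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(7)
n, n_actions, n_clusters = 5000, 5, 10
contexts = rng.normal(size=(n, 3))
logged_actions = rng.integers(0, n_actions, size=n)     # actions chosen by the logging policy
rewards = rng.uniform(size=n)

# Target policy to evaluate: here a uniform policy over actions (assumed for illustration).
target_probs = np.full((n, n_actions), 1.0 / n_actions)

clusters = KMeans(n_clusters=n_clusters, n_init=10, random_state=0).fit_predict(contexts)
# Empirical logging propensities per (cluster, action), with add-one smoothing.
counts = np.ones((n_clusters, n_actions))
for c, a in zip(clusters, logged_actions):
    counts[c, a] += 1
logging_probs = counts / counts.sum(axis=1, keepdims=True)

weights = target_probs[np.arange(n), logged_actions] / logging_probs[clusters, logged_actions]
print("clustered-IPS value estimate:", np.mean(weights * rewards))
```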
Poster
Bijan Mazaheri · Chandler Squires · Caroline Uhler
[ Hall A-E ]
Abstract
Heterogeneous data from multiple populations, sub-groups, or sources can be represented as a "mixture model" with a single latent class influencing all of the observed covariates. Heterogeneity can be resolved at different levels by grouping populations according to different notions of similarity. This paper proposes grouping with respect to the causal response of an intervention or perturbation on the system. This is distinct from previous notions, such as grouping by similar covariate values (e.g., clustering) or similar correlations between covariates (e.g., Gaussian mixture models). To solve the problem, we "synthetically sample" from a counterfactual distribution using higher-order multi-linear moments of the observable data. To understand how these "causal mixtures" fit in with more classical notions, we develop a hierarchy of mixture identifiability.
Poster
Yang Luo · Michael O'Neill
[ Hall A-E ]
Abstract
We develop a new class of self-tuning algorithms to solve a root-finding problem involving a Lipschitz continuous operator, with applications in convex optimization, minimax saddle point problems and variational inequalities. Our methods are adaptive to the unknown, problem-specific parameters, such as the Lipschitz constant and the variance of the stochastic operator. Unlike prior work, our approach does not rely on restrictive assumptions, such as a bounded domain, boundedness of the operator or a light-tailed distribution. We prove a $\tilde{\mathcal{O}}(N^{-1/2})$ average-iterate convergence rate of the restricted merit function under an affine noise assumption, matching the optimal rate up to log factors. In addition, we improve the convergence rate to $\mathcal{O}(N^{-1})$ under a strong growth condition, which characterizes many cutting-edge machine learning models, matching the optimal rate for the \textit{deterministic regime}. Finally, we illustrate the effectiveness of the proposed algorithms through numerical experiments on saddle point problems. Our results suggest that the adaptive step sizes automatically take advantage of the structure of the noise and yield improved convergence in certain settings, such as when the strong growth condition holds. To the best of our knowledge, this is the first method for root-finding problems under mild assumptions that adapts to …
Poster
Sanjeeb Dash · Joao Goncalves · Tian Gao
[ Hall A-E ]
Abstract
Acyclic directed mixed graphs (ADMGs) – graphs that contain both directed and bidirected edges but no directed cycles – are used to model causal and conditional independence relationships between a set of random variables in the presence of latent or unmeasured variables. Bow-free ADMGs, Arid ADMGs, and Ancestral ADMGs (AADMGs) are three widely studied classes of ADMGs, where each class is contained in the previously mentioned class. There are a number of published methods – primarily heuristic ones – to find score-maximizing AADMGs from data. Bow-free and Arid ADMGs can model certain equality restrictions – such as Verma constraints – between observed variables that maximal AADMGs cannot. In this work, we develop the first exact methods – based on integer programming – to find score-maximizing Bow-free and Arid ADMGs. Our methods work for data that follows a continuous Gaussian distribution and for scores that linearly decompose into the sum of scores of c-components of an ADMG. To improve scaling, we develop an effective linear-programming based heuristic that yields solutions with high parent set sizes and/or large districts. We show that our proposed algorithms obtain better scores than other state-of-the-art methods and return graphs that have excellent fits to data.
Poster
Alix Lhéritier · Maurizio Filippone
[ Hall A-E ]
Abstract
Mixture Density Networks (MDNs) can model arbitrarily complex mappings between inputs and mixture densities, enabling flexible conditional density estimation, at the risk of severe overfitting. A Bayesian approach can alleviate this problem by specifying a prior over the parameters of the neural network. However, these priors can be difficult to specify due to the lack of interpretability. We propose a novel neural network construction for conditional mixture densities that allows one to specify the prior in the predictive distribution domain. The construction is based on mapping the targets to the unit hypercube via a diffeomorphism, enabling the use of mixtures of Beta distributions. We prove that the prior predictive distributions are calibrated in the sense that they are equal to the unconditional density function defined by the diffeomorphism. Contrary to Bayesian Gaussian MDNs, which exhibit tied functional and distributional complexity, we show that our construction allows us to decouple them. We propose an extension that models correlations in the covariates via Gaussian copulas, potentially reducing the necessary number of mixture components. Our experiments show competitive performance on standard benchmarks with respect to the state of the art.
Poster
Eduard Tulchinskii · Daria Voronkova · Ilya Trofimov · Evgeny Burnaev · Serguei Barannikov
[ Hall A-E ]
Abstract
Topological methods for comparing weighted graphs are valuable in various learning tasks but often suffer from computational inefficiency on large datasets. We introduce RTD-Lite, a scalable algorithm that efficiently compares topological features, specifically connectivity or cluster structures at arbitrary scales, of two weighted graphs with one-to-one correspondence between vertices. By leveraging minimal spanning trees in auxiliary graphs, RTD-Lite captures topological discrepancies with O(n^2) time and memory complexity. This efficiency enables its application in tasks like dimensionality reduction and neural network training. Experiments on synthetic and real-world datasets demonstrate that RTD-Lite effectively identifies topological differences while significantly reducing computation time compared to existing methods. Moreover, integrating RTD-Lite into neural network training as a loss function component enhances the preservation of topological structures in learned representations. Our code is publicly available at https://github.com/ArGintum/RTD-Lite.
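A hedged sketch of the ingredient RTD-Lite builds on (the actual method compares cluster structures across all scales between two graphs on a shared vertex set; here we only build the two minimum spanning trees and compare their sorted edge weights as a crude discrepancy proxy):

```python
# Minimal sketch: compare the minimum spanning trees of two weighted graphs
# defined on the same vertex set (toy point clouds with Euclidean distances).
import numpy as np
from scipy.sparse.csgraph import minimum_spanning_tree

rng = np.random.default_rng(10)
n = 30
points_a = rng.normal(size=(n, 2))
points_b = points_a + 0.05 * rng.normal(size=(n, 2))   # perturbed copy, same vertices

def mst_weights(points):
    """Sorted edge weights of the Euclidean minimum spanning tree."""
    dists = np.linalg.norm(points[:, None] - points[None, :], axis=-1)
    mst = minimum_spanning_tree(dists).toarray()
    return np.sort(mst[mst > 0])

w_a, w_b = mst_weights(points_a), mst_weights(points_b)
print("topological discrepancy proxy:", np.abs(w_a - w_b).sum())
```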
Poster
Fabio Feser · Marina Evangelou
[ Hall A-E ]
Abstract
Tuning the regularization parameter in penalized regression models is an expensive task, requiring multiple models to be fit along a path of parameters. Strong screening rules drastically reduce computational costs by lowering the dimensionality of the input prior to fitting. We develop strong screening rules for group-based Sorted L-One Penalized Estimation (SLOPE) models: Group SLOPE and Sparse-group SLOPE. The developed rules are applicable to the wider family of group-based OWL models, including OSCAR. Our experiments on both synthetic and real data show that the screening rules significantly accelerate the fitting process. The screening rules make it accessible for group SLOPE and sparse-group SLOPE to be applied to high-dimensional datasets, particularly those encountered in genetics.
Poster
Adrien Corenflos · Zheng Zhao · Thomas Schön · Simo Särkkä · Jens Sjölund
[ Hall A-E ]
Abstract
Given an unconditional diffusion model targeting a joint model $\pi(x, y)$, using it to perform conditional simulation $\pi(x \mid y)$ is still largely an open question and is typically achieved by learning conditional drifts to the denoising SDE after the fact. In this work, we express \emph{exact} conditional simulation within the \emph{approximate} diffusion model as an inference problem on an augmented space corresponding to a partial SDE bridge. This perspective allows us to implement efficient and principled particle Gibbs and pseudo-marginal samplers marginally targeting the conditional distribution $\pi(x \mid y)$. Contrary to existing methodology, our methods do not introduce any additional approximation to the unconditional diffusion model aside from the Monte Carlo error. We showcase the benefits and drawbacks of our approach on a series of synthetic and real data examples.
Poster
Jonas Wahl · Jakob Runge
[ Hall A-E ]
Abstract
Assessing the accuracy of the output of causal discovery algorithms is crucial in developing and comparing novel methods. Common evaluation metrics such as the structural Hamming distance are useful for assessing individual links of causal graphs. However, many state-of-the-art causal discovery methods do not output single causal graphs, but rather their Markov equivalence classes (MECs) which encode all of the graph's separation and connection statements. In this work, we propose additional measures of distance that capture the difference in separations of two causal graphs, which link-based distances are not fit to assess. The proposed distances have low polynomial time complexity and are applicable to directed acyclic graphs (DAGs) as well as to maximal ancestral graphs (MAGs) that may contain bidirected edges. We complement our theoretical analysis with toy examples and empirical experiments that highlight the differences to existing comparison metrics.
Poster
Tuan Nguyen · Jay Barrett · Kwang-Sung Jun
[ Hall A-E ]
Abstract
We study the problem of estimating the \emph{value} of the largest mean among $K$ distributions via samples from them (rather than estimating \emph{which} distribution has the largest mean), which arises from various machine learning tasks including Q-learning and Monte Carlo Tree Search (MCTS). While there have been a few proposed algorithms, their performance analyses have been limited to their biases rather than a precise error metric. In this paper, we propose a novel algorithm called HAVER (Head AVERaging) and analyze its mean squared error. Our analysis reveals that HAVER has a compelling performance in two respects. First, HAVER estimates the maximum mean as well as the oracle who knows the identity of the best distribution and reports its sample mean. Second, perhaps surprisingly, HAVER exhibits even better rates than this oracle when there are many distributions near the best one. Both of these improvements are the first of their kind in the literature, and we also prove that the naive algorithm that reports the largest empirical mean does not achieve these bounds. Finally, we confirm our theoretical findings via numerical experiments where we implement HAVER in bandit, Q-learning, and MCTS algorithms. In these experiments, HAVER consistently outperforms the baseline methods, …
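A hedged sketch of the head-averaging idea (the paper's selection rule and threshold differ; the confidence width below is an assumption): rather than reporting the single largest empirical mean, average the arms whose empirical means fall close to the leader.

```python
# Minimal sketch of head averaging versus the naive largest-empirical-mean estimator.
import numpy as np

rng = np.random.default_rng(8)
true_means = np.array([1.0, 0.98, 0.95, 0.5, 0.2])
n_pulls = 200
samples = rng.normal(loc=true_means, scale=1.0, size=(n_pulls, len(true_means)))

emp_means = samples.mean(axis=0)
width = np.sqrt(2.0 * np.log(len(true_means) * n_pulls) / n_pulls)   # assumed confidence width
head = emp_means >= emp_means.max() - width                          # arms near the leader

naive_estimate = emp_means.max()                  # largest empirical mean (positively biased)
head_avg_estimate = emp_means[head].mean()        # head-averaged estimate
print(f"naive: {naive_estimate:.3f}, head average: {head_avg_estimate:.3f}, "
      f"truth: {true_means.max():.3f}")
```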
Poster
Teodor Rotaru · Panagiotis Patrinos · François Glineur
[ Hall A-E ]
Abstract
We investigate a difference-of-convex (DC) formulation where the second term is allowed to be weakly convex. We examine the precise behavior of a single iteration of the difference-of-convex algorithm (DCA), providing a tight characterization of the objective function decrease, distinguishing between six distinct parameter regimes. Our proofs, inspired by the performance estimation framework, are notably simplified compared to related prior research. We subsequently derive sublinear convergence rates for the DCA towards critical points, assuming at least one of the functions is smooth. Additionally, we explore the underexamined equivalence between proximal gradient descent (PGD) and DCA iterations, demonstrating how DCA, a parameter-free algorithm, without the need for a stepsize, serves as a tool for studying the exact convergence rates of PGD. Finally, we propose a method to optimize the DC decomposition to achieve optimal convergence rates, potentially transforming the subtracted function to become weakly convex.
Poster
ziqi Liu
[ Hall A-E ]
Abstract
Long-term time series forecasting is essential in areas like finance and weather prediction. Besides traditional methods that operate in the time domain, many recent models transform time series data into the frequency domain to better capture complex patterns. However, these methods often use filtering techniques to remove certain frequency signals as noise, which may unintentionally discard important information and reduce prediction accuracy. To address this, we propose the Frequency Decomposition Mixture-of-Experts (FreqMoE) model, which dynamically decomposes time series data into frequency bands, each processed by a specialized expert. A gating mechanism adjusts the importance of each expert's output based on frequency characteristics, and the aggregated results are fed into a prediction module that iteratively refines the forecast using residual connections. Our experiments demonstrate that FreqMoE outperforms state-of-the-art models, achieving the best performance on 51 out of 70 metrics across all tested datasets, while significantly reducing the number of required parameters to under 50k, providing notable efficiency advantages. Code is available at: https://github.com/sunbus100/FreqMoE-main
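A minimal sketch of the frequency-band mixture-of-experts idea described above, not the released FreqMoE implementation: split the input series into rFFT frequency bands, give each band to its own small expert, and mix the expert forecasts with a softmax gate driven by band energies. The class name, band boundaries, gate input, and the absence of the iterative residual refinement module are all assumptions made for brevity.

```python
import torch
import torch.nn as nn

class FreqBandMoE(nn.Module):
    """Illustrative frequency-band mixture of experts (sketch only)."""
    def __init__(self, seq_len, horizon, n_bands=4, hidden=64):
        super().__init__()
        self.seq_len, self.horizon, self.n_bands = seq_len, horizon, n_bands
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(seq_len, hidden), nn.ReLU(), nn.Linear(hidden, horizon))
            for _ in range(n_bands)
        )
        self.gate = nn.Linear(n_bands, n_bands)   # gate over per-band energies (assumption)

    def forward(self, x):                          # x: (batch, seq_len)
        spec = torch.fft.rfft(x, dim=-1)
        freqs = spec.shape[-1]
        edges = [round(b * freqs / self.n_bands) for b in range(self.n_bands + 1)]
        band_signals, band_energy = [], []
        for b in range(self.n_bands):
            mask = torch.zeros_like(spec)
            mask[..., edges[b]:edges[b + 1]] = 1.0               # keep one frequency band
            banded = torch.fft.irfft(spec * mask, n=self.seq_len, dim=-1)
            band_signals.append(banded)
            band_energy.append(banded.pow(2).mean(dim=-1))
        weights = torch.softmax(self.gate(torch.stack(band_energy, dim=-1)), dim=-1)
        preds = torch.stack([e(s) for e, s in zip(self.experts, band_signals)], dim=-1)
        return (preds * weights.unsqueeze(1)).sum(dim=-1)        # (batch, horizon)

model = FreqBandMoE(seq_len=96, horizon=24)
print(model(torch.randn(8, 96)).shape)   # torch.Size([8, 24])
```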
Poster
Fares Fourati · Salma Kharrat · Vaneet Aggarwal · Mohamed-Slim Alouini
[ Hall A-E ]
Abstract
Optimizing expensive, non-convex, black-box Lipschitz continuous functions presents significant challenges, particularly when the Lipschitz constant of the underlying function is unknown. Such problems often demand numerous function evaluations to approximate the global optimum, which can be prohibitive in terms of time, energy, or resources. In this work, we introduce Every Call is Precious (ECP), a novel global optimization algorithm that minimizes unpromising evaluations by strategically focusing on potentially optimal regions. Unlike previous approaches, ECP eliminates the need to estimate the Lipschitz constant, thereby avoiding additional function evaluations. ECP guarantees no-regret performance for infinite evaluation budgets and achieves minimax-optimal regret bounds within finite budgets. Extensive ablation studies validate the algorithm's robustness, while empirical evaluations show that ECP outperforms 10 benchmark algorithms—including Lipschitz, Bayesian, bandit, and evolutionary methods—across 30 multi-dimensional non-convex synthetic and real-world optimization problems, positioning ECP as a competitive approach for global optimization.
Poster
yuyang deng · Fuli Qiao · Mehrdad Mahdavi
[ Hall A-E ]
Abstract
Stochastic compositional minimax problems are prevalent in machine learning, yet established convergence results for this class of problems remain limited. In this paper, we propose a formal definition of the stochastic compositional minimax problem, which involves optimizing a minimax loss with a compositional structure either in the primal, the dual, or both the primal and dual variables. We introduce a simple yet effective algorithm, stochastically Corrected stOchastic gradient Descent Ascent (CODA), which is a primal-dual type algorithm with compositional correction steps, and establish its convergence rate in the aforementioned three settings. We also propose a variance-reduced variant, CODA+, which achieves the best-known rate on nonconvex-strongly-concave and nonconvex-concave compositional minimax problems. This work initiates the theoretical study of the stochastic compositional minimax problem in various settings and may inform modern machine learning scenarios such as domain adaptation or robust model-agnostic meta-learning.
Poster
Sagnik Chatterjee · MANUJ MUKHERJEE · Alhad Sethi
[ Hall A-E ]
Abstract
In this work, we give generalization bounds of statistical learning algorithms trained on samples drawn from a dependent data source both in expectation and with high probability, using the Online-to-Batch conversion paradigm. We show that the generalization error of statistical learners in the dependent data setting is equivalent to the generalization error of statistical learners in the i.i.d. setting up to a term that depends on the decay rate of the underlying mixing stochastic process, and is independent of the complexity of the statistical learner. Our proof techniques involve defining a new notion of stability of online learning algorithms based on Wasserstein distances, and employing "near-martingale" concentration bounds for dependent random variables to arrive at appropriate upper bounds for the generalization error of statistical learners trained on dependent data. Finally, we prove that the Exponential Weighted Averages (EWA) algorithm satisfies our new notion of stability, and instantiate our bounds using the EWA algorithm.
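For readers unfamiliar with the algorithm the bounds are instantiated with, here is a minimal sketch of the standard exponentially weighted averages scheme for prediction with expert advice. The learning-rate choice is a common textbook default, not a value taken from the paper.

```python
import numpy as np

def ewa(expert_losses, eta=None):
    """Exponentially weighted averages over K experts.

    expert_losses: (T, K) array of per-round losses in [0, 1].
    Returns the per-round mixture weights and the learner's cumulative loss.
    A standard rate eta = sqrt(8 ln K / T) is used when none is given.
    """
    T, K = expert_losses.shape
    if eta is None:
        eta = np.sqrt(8.0 * np.log(K) / T)
    cum = np.zeros(K)
    weights, learner_loss = [], 0.0
    for t in range(T):
        w = np.exp(-eta * cum)
        w /= w.sum()                            # normalize the exponential weights
        weights.append(w)
        learner_loss += w @ expert_losses[t]    # loss of the weighted average
        cum += expert_losses[t]
    return np.array(weights), learner_loss

rng = np.random.default_rng(1)
losses = rng.uniform(size=(500, 10))
losses[:, 3] *= 0.5                             # expert 3 is better on average
w, L = ewa(losses)
print(L, losses.sum(axis=0).min())              # learner loss vs. best expert in hindsight
```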
Poster
Tomoharu Iwata · Atsutoshi Kumagai
[ Hall A-E ]
Abstract
We propose neural network-based models for tensor completion in few-observation settings. The proposed model can meta-learn an inductive bias from multiple heterogeneous tensors without shared modes. Although many tensor completion methods have been proposed, existing methods cannot leverage knowledge across heterogeneous tensors, and their performance is low when only a small number of elements are observed. The proposed model encodes each element of a given tensor by considering information about other elements, while reflecting the tensor structure via a self-attention mechanism. The missing values are predicted by a tensor-specific linear projection from the encoded vectors. The proposed model is shared across different tensors, and it is meta-learned such that the expected tensor completion performance is improved using multiple tensors. Through experiments using synthetic and real-world tensors, we demonstrate that the proposed method achieves better performance than existing meta-learning and tensor completion methods.
Poster
Krzysztof Choromanski · Isaac Reid · Arijit Sehanobish · Kumar Avinava Dubey
[ Hall A-E ]
Abstract
We present the first linear time complexity randomized algorithms for unbiased approximation of the celebrated family of general random walk kernels (RWKs) for sparse graphs. This includes both labelled and unlabelled instances. The previous fastest methods for general RWKs were of cubic time complexity and not applicable to labelled graphs. Our method samples dependent random walks to compute novel graph embeddings in $\mathbb{R}^{d}$ whose dot product is equal to the true RWK in expectation. It does so without instantiating the direct product graph in memory, meaning we can scale to massive datasets that cannot be stored on a single machine. We derive exponential concentration bounds to prove that our estimator is sharp, and show that the ability to approximate general RWKs (rather than just special cases) unlocks efficient implicit graph kernel learning. Our method is up to **27×** faster than its counterparts for efficient computation on large graphs and scales to graphs **128×** bigger than the largest examples amenable to brute-force computation.
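To make the walk-sampling idea concrete, here is an illustrative sketch of turning sampled random walks on a sparse, labelled graph into count-vector embeddings whose dot products count co-occurring labelled walks, the quantity random walk kernels are built from, without ever forming the direct product graph. This is not the paper's unbiased estimator (which uses dependent walks and a specific weighting); all names and parameter choices below are assumptions for illustration.

```python
import numpy as np
from collections import Counter

def sample_walk_features(adj, labels, walk_len=4, n_walks=200, seed=0):
    """Sample random walks on a sparse adjacency list, record the label
    sequences they traverse, and return a sparse count vector (sketch only)."""
    rng = np.random.default_rng(seed)
    counts = Counter()
    nodes = list(adj)
    for _ in range(n_walks):
        v = nodes[rng.integers(len(nodes))]
        seq = [labels[v]]
        for _ in range(walk_len - 1):
            if not adj[v]:                       # dead end: stop the walk early
                break
            v = adj[v][rng.integers(len(adj[v]))]
            seq.append(labels[v])
        counts[tuple(seq)] += 1.0 / n_walks
    return counts

def dot(c1, c2):
    return sum(v * c2.get(k, 0.0) for k, v in c1.items())

# two tiny labelled graphs given as adjacency lists
g1 = {0: [1, 2], 1: [0, 2], 2: [0, 1]}
l1 = {0: 'a', 1: 'b', 2: 'a'}
g2 = {0: [1], 1: [0, 2], 2: [1]}
l2 = {0: 'a', 1: 'a', 2: 'b'}
print(dot(sample_walk_features(g1, l1), sample_walk_features(g2, l2)))
```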
Poster
Elena Grigorescu · Young-San Lin · Maoyuan Song
[ Hall A-E ]
Abstract
*Learning-augmented algorithms* have been extensively studied in the computer science community recently, particularly in the context of online problems, in which machine-learning predictors can provide additional information about the future in order to overcome classical impossibility results. Such algorithms use *advice* prudently to improve the performance of classical algorithms, while ensuring robustness against inaccurate advice. In this paper, we present learning-augmented algorithmic frameworks for two fundamental optimization settings, extending and generalizing prior work. For *online packing with concave objectives*, we present a simple but overarching strategy that *switches* between the advice and the state-of-the-art online algorithm. For *online covering with convex objectives*, we greatly extend primal-dual methods for online convex covering programs and the previous learning-augmented framework for online covering linear programs from the literature to many new applications. We show that our algorithms break impossibility results when the advice is accurate, while maintaining performance comparable to state-of-the-art classical online algorithms even when the advice is erroneous.
Poster
Shpresim Sadiku · Moritz Wagner · Sai Ganesh Nagarajan · Sebastian Pokutta
[ Hall A-E ]
Abstract
We study the problem of finding optimal sparse, manifold-aligned counterfactual explanations for classifiers. Canonically, this can be formulated as an optimization problem with multiple non-convex components, including classifier loss functions and manifold alignment (or _plausibility_) metrics. The added complexity of enforcing _sparsity_, or shorter explanations, complicates the problem further. Existing methods often focus on specific models and plausibility measures, relying on convex $\ell_1$ regularizers to enforce sparsity. In this paper, we tackle the canonical formulation using the accelerated proximal gradient (APG) method, a simple yet efficient first-order procedure capable of handling smooth non-convex objectives and non-smooth $\ell_p$ (where $0 \leq p < 1$) regularizers. This enables our approach to seamlessly incorporate various classifiers and plausibility measures while producing sparser solutions. Our algorithm only requires differentiable data-manifold regularizers and supports box constraints for bounded feature ranges, ensuring the generated counterfactuals remain \emph{actionable}. Finally, experiments on real-world datasets demonstrate that our approach effectively produces sparse, manifold-aligned counterfactual explanations while maintaining proximity to the factual data and computational efficiency.
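The following is a minimal proximal-gradient sketch for the kind of sparse counterfactual search described above, under heavy simplifying assumptions: plain (non-accelerated) proximal gradient rather than APG, a linear-logistic classifier, the $\ell_0$ penalty (the $p=0$ endpoint of the $\ell_p$, $0 \leq p < 1$ family, whose prox is hard thresholding), box constraints, and no data-manifold or plausibility regularizer. All function and variable names are hypothetical.

```python
import numpy as np

def sparse_counterfactual(x0, w, b, target=1, lam=0.05, step=0.1,
                          n_iter=300, lower=0.0, upper=1.0):
    """Sketch: minimize  BCE(sigmoid(w @ x + b), target) + lam * ||x - x0||_0
    subject to lower <= x <= upper (box constraints keep x actionable)."""
    x = x0.copy()
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-(w @ x + b)))
        grad = (p - target) * w                      # gradient of the BCE loss in x
        z = x - step * grad                          # gradient step
        # prox of lam * ||x - x0||_0: keep a coordinate's change only if it is
        # large enough to pay the sparsity price (hard thresholding)
        d = z - x0
        d[d ** 2 < 2.0 * step * lam] = 0.0
        x = np.clip(x0 + d, lower, upper)            # project onto the box
    return x

rng = np.random.default_rng(2)
w, b = rng.normal(size=8), -1.0
x0 = rng.uniform(size=8)
x_cf = sparse_counterfactual(x0, w, b)
print(np.count_nonzero(x_cf - x0), 1 / (1 + np.exp(-(w @ x_cf + b))))
```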
Poster
Ananda Theertha Suresh · Andrew Thangaraj · Aditya Khandavally
[ Hall A-E ]
Abstract
Given the ease of creating synthetic data from machine learning models, new models can potentially be trained on synthetic data generated by previous models. This recursive training process raises concerns about its long-term impact on model quality. As models are recursively trained on data generated in previous rounds, their ability to capture the nuances of the original human-generated data may degrade. This is often referred to as model collapse. In this work, we ask how fast model collapse occurs for some well-studied distribution families under maximum likelihood (ML or near-ML) estimation during recursive training. Surprisingly, even for fundamental distributions such as discrete and Gaussian distributions, the exact rate of model collapse is unknown. We theoretically characterize the rate of collapse in these fundamental settings and complement the analysis with experimental evaluations. Our results show that for discrete distributions, the time to forget a symbol is approximately linear in the number of times it occurred in the original corpus, and for Gaussian models, the standard deviation shrinks to zero after roughly $n$ iterations, where $n$ is the number of samples at each iteration. Both of these findings imply that model forgetting, at least in these simple distributions …
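The Gaussian case is easy to simulate: at each round, draw $n$ samples from the current model and refit the mean and standard deviation by maximum likelihood. The toy run below only illustrates the qualitative shrinkage of the scale over rounds; the stated rate of roughly $n$ iterations is the paper's theoretical result, not something this snippet proves.

```python
import numpy as np

def recursive_gaussian_fit(n=100, n_rounds=500, seed=3):
    """Recursive training of a Gaussian: sample n points from the current
    model, refit (mu, sigma) by maximum likelihood, repeat."""
    rng = np.random.default_rng(seed)
    mu, sigma = 0.0, 1.0
    sigmas = []
    for _ in range(n_rounds):
        x = rng.normal(mu, sigma, size=n)
        mu, sigma = x.mean(), x.std()     # ML estimates become the next model
        sigmas.append(sigma)
    return np.array(sigmas)

s = recursive_gaussian_fit(n=100, n_rounds=500)
print(s[::100])   # the scale shrinks as rounds accumulate
```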
Poster
Markus Heinonen · Ba-Hien Tran · Michael Kampffmeyer · Maurizio Filippone
[ Hall A-E ]
Abstract
Introducing training-time augmentations is a key technique to enhance generalization and to prepare deep neural networks for test-time corruptions. Inspired by the success of generative diffusion models, we propose a novel approach of coupling data mollification, in the form of image noising and blurring, with label smoothing to align predicted label confidences with image degradation. The method is simple to implement, introduces negligible overheads, and can be combined with existing augmentations. We demonstrate improved robustness and uncertainty quantification on the corrupted image benchmarks of the CIFAR, TinyImageNet and ImageNet datasets.
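A hedged sketch of the coupling described above: noise each training image with a random severity and smooth its one-hot label toward uniform by the same severity, so that predicted confidence is encouraged to drop as the input degrades. The function name, the noise-only mollification (no blurring), and the linear confidence-degradation mapping are assumptions; the paper's exact schedule may differ.

```python
import torch
import torch.nn.functional as F

def mollify_batch(images, labels, num_classes, max_sigma=0.5):
    """Sketch of training-time data mollification coupled with label smoothing."""
    t = torch.rand(images.shape[0], 1, 1, 1)                 # per-image severity in [0, 1]
    noisy = images + t * max_sigma * torch.randn_like(images)
    one_hot = F.one_hot(labels, num_classes).float()
    # smooth the label toward uniform in proportion to the degradation severity
    smooth = (1.0 - t.view(-1, 1)) * one_hot + t.view(-1, 1) / num_classes
    return noisy, smooth                                      # train with soft-target cross-entropy

# usage inside a standard training step (soft targets need log-softmax + KL/CE)
imgs, ys = torch.randn(16, 3, 32, 32), torch.randint(0, 10, (16,))
x_moll, y_soft = mollify_batch(imgs, ys, num_classes=10)
print(x_moll.shape, y_soft.sum(dim=-1))   # soft labels still sum to 1
```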
Poster
Csaba Tóth · Masaki Adachi · Michael A. Osborne · Harald Oberhauser
[ Hall A-E ]
Abstract
The signature kernel is a kernel between time series of arbitrary length and comes with strong theoretical guarantees from stochastic analysis. It has found applications in machine learning, such as covariance functions for Gaussian processes. A strength of the underlying signature features is that they provide a structured, global description of a time series. However, this property can quickly become a curse when local information is essential and forgetting is required; so far this has only been addressed with ad-hoc methods such as slicing the time series into smaller segments. To overcome this, we propose a principled and data-driven approach by introducing a novel forgetting mechanism for signature features. This allows the model to dynamically adapt its observed context length to focus on more recent information. To achieve this, we revisit the recently introduced Random Fourier Signature Features and develop Random Fourier Decayed Signature Features (RFDSF) with Gaussian processes (GPs). The result is a scalable Bayesian time series forecasting algorithm with variational inference that processes and transforms a time series into a joint predictive distribution over the time steps in one pass using recurrence. For example, processing a sequence of length $10^4$ steps in less …
Poster
El Mahdi Chayti · Nikita Doikov · Martin Jaggi
[ Hall A-E ]
Abstract
We study stochastic second-order methods for solving general non-convex optimization problems. We propose using a special version of momentum to stabilize the stochastic gradient and Hessian estimates in Newton's method. We show that momentum provably improves the variance of stochastic estimates and allows the method to converge for any noise level. Using the cubic regularization technique, we prove a global convergence rate for our method on general non-convex problems to a second-order stationary point, even when using only a single stochastic data sample per iteration. This starkly contrasts with all existing stochastic second-order methods for non-convex problems, which typically require large batches. Therefore, we are the first to demonstrate global convergence of Stochastic Cubic Newton for batches of arbitrary size in the non-convex case. Additionally, we show improved speed on convex stochastic problems for our regularized Newton methods with momentum.
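A minimal sketch of the momentum-stabilization idea, under clearly stated simplifications: keep exponential moving averages of single-sample gradient and Hessian estimates and take a regularized Newton step with gradient-norm-dependent damping. The paper's Stochastic Cubic Newton instead solves a cubic-regularized subproblem with these averaged estimates; the damping rule, constants, and toy problem below are assumptions for illustration.

```python
import numpy as np

def momentum_newton(grad_fn, hess_fn, x0, steps=200, beta=0.9, reg=1.0):
    """Momentum-averaged stochastic gradient/Hessian estimates plus a
    damped Newton step (illustrative sketch, not the paper's method)."""
    x = x0.copy()
    g_bar = np.zeros_like(x0)
    H_bar = np.eye(len(x0))
    for _ in range(steps):
        g_bar = beta * g_bar + (1.0 - beta) * grad_fn(x)   # momentum on gradients
        H_bar = beta * H_bar + (1.0 - beta) * hess_fn(x)   # momentum on Hessians
        lam = reg * np.sqrt(np.linalg.norm(g_bar))          # cubic-style damping heuristic
        x = x - np.linalg.solve(H_bar + lam * np.eye(len(x)), g_bar)
    return x

# toy non-convex problem f(x) = (||x||^2 - 1)^2 with noisy single-sample oracles
rng = np.random.default_rng(4)
f_grad = lambda x: 4 * x * (x @ x - 1.0) + 0.5 * rng.normal(size=x.shape)
f_hess = lambda x: (8 * np.outer(x, x) + 4 * (x @ x - 1.0) * np.eye(len(x))
                    + 0.5 * rng.normal(size=(len(x), len(x))))
print(momentum_newton(f_grad, f_hess, np.array([2.0, -1.5])))
```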
Poster
Boning Zhang · Dongzhu Liu · Osvaldo Simeone · Guanchu Wang · Dimitrios Pezaros · Guangxu Zhu
[ Hall A-E ]
Abstract
To support real-world decision-making, it is crucial for models to be well-calibrated, i.e., to assign reliable confidence estimates to their predictions. Uncertainty quantification is particularly important in personalized federated learning (PFL), as participating clients typically have small local datasets, making it difficult to unambiguously determine optimal model parameters. Bayesian PFL (BPFL) methods can potentially enhance calibration, but they often come with considerable computational and memory requirements due to the need to track the variances of all the individual model parameters. Furthermore, different clients may exhibit heterogeneous uncertainty levels owing to varying local dataset sizes and distributions. To address these challenges, we propose LR-BPFL, a novel BPFL method that learns a global deterministic model along with personalized low-rank Bayesian corrections. To tailor the local model to each client's inherent uncertainty level, LR-BPFL incorporates an adaptive rank selection mechanism. We evaluate LR-BPFL across a variety of datasets, demonstrating its advantages in terms of calibration and accuracy, as well as computational and memory requirements. The code is available at https://github.com/Bernie0115/LR-BPFL.