Poster
Samuel Tesfazgi · Leonhard Sprandl · Sandra Hirche
[ Hall A-E ]
Abstract
The practical deployment of learning-based autonomous systems would greatly benefit from tools that flexibly obtain safety guarantees in the form of certificate functions from data. While the geometrical properties of such certificate functions are well understood, synthesizing them using machine learning techniques still remains a challenge. To mitigate this issue, we propose a diffeomorphic function learning framework where prior structural knowledge of the desired output is encoded in the geometry of a simple surrogate function, which is subsequently augmented through an expressive, topology-preserving state-space transformation. Thereby, we achieve an indirect function approximation framework that is guaranteed to remain in the desired hypothesis space. To this end, we introduce a novel approach to construct diffeomorphic maps based on RBF networks, which facilitate precise, local transformations around data. Finally, we demonstrate our approach by learning diffeomorphic Lyapunov functions from real-world data and apply our method to different attractor systems.
Poster
Katherine Tieu · Dongqi Fu · Jun Wu · Jingrui He
[ Hall A-E ]
Abstract
In the era of foundation models, Out-of-Distribution (OOD) problems, i.e., the data discrepancy between the training environments and testing environments, hinder AI generalization. Further, relational data such as graphs violate the Independent and Identically Distributed (IID) condition, making the problem more challenging, and harder still when time is involved. Motivated by this, to realize robust invariant learning over temporal graphs, we investigate which components of temporal graphs are most invariant and representative with respect to labels. With the Information Bottleneck (IB) method, we propose an error-bounded Invariant Link Selector that can distinguish invariant components from variant components during the training process, making the deep learning model generalizable across different testing scenarios. Besides deriving a series of rigorous generalizable optimization functions, we also equip the training with task-specific loss functions, e.g., temporal link prediction, so that pre-trained models can solve real-world application tasks such as citation recommendation and merchandise recommendation, as demonstrated in our experiments against state-of-the-art (SOTA) methods. Our code is available at https://github.com/kthrn22/OOD-Linker
Poster
Linlin Yu · Kangshuo Li · Pritom Saha · Yifei Lou · Feng Chen
[ Hall A-E ]
Abstract
Accurate quantification of both aleatoric and epistemic uncertainties is essential when deploying Graph Neural Networks (GNNs) in high-stakes applications such as drug discovery and financial fraud detection, where reliable predictions are critical. Although Evidential Deep Learning (EDL) efficiently quantifies uncertainty using a Dirichlet distribution over predictive probabilities, existing EDL-based GNN (EGNN) models require modifications to the network architecture and retraining, failing to take advantage of pre-trained models. We propose a plug-and-play framework for uncertainty quantification in GNNs that works with pre-trained models without the need for retraining. Our Evidential Probing Network (EPN) uses a lightweight Multi-Layer Perceptron (MLP) head to extract evidence from learned representations, allowing efficient integration with various GNN architectures. We further introduce evidence-based regularization techniques, referred to as EPN-reg, to enhance the estimation of epistemic uncertainty with theoretical justifications. Extensive experiments demonstrate that the proposed EPN-reg achieves state-of-the-art performance in accurate and efficient uncertainty quantification, making it suitable for real-world deployment.
Poster
Tyler LaBonte · Kuo-Wei Lai · Vidya Muthukumar
[ Hall A-E ]
Abstract
Modern machine learning methods have recently demonstrated remarkable capability to generalize under task shift, where latent knowledge is transferred to a different, often more difficult, task under a similar data distribution. We investigate this phenomenon in an overparameterized linear regression setting where the task shifts from classification during training to regression during evaluation. In the zero-shot case, wherein no regression data is available, we prove that task shift is impossible in both sparse signal and random signal models for any Gaussian covariate distribution. In the few-shot case, wherein limited regression data is available, we propose a simple postprocessing algorithm which asymptotically recovers the ground-truth predictor. Our analysis leverages a fine-grained characterization of individual parameters arising from minimum-norm interpolation which may be of independent interest. Our results show that while minimum-norm interpolators for classification cannot transfer to regression a priori, they experience surprisingly structured attenuation which enables successful task shift with limited additional data.
Poster
Antonin Schrab · Ilmun Kim
[ Hall A-E ]
Abstract
We propose a general method for constructing robust permutation tests under data corruption. The proposed tests effectively control the non-asymptotic type I error under data corruption, and we prove their consistency in power under minimal conditions. This contributes to the practical deployment of hypothesis tests for real-world applications with potential adversarial attacks. For the two-sample and independence settings, we show that our kernel robust tests are minimax optimal, in the sense that they are guaranteed to be non-asymptotically powerful against alternatives uniformly separated from the null in the kernel MMD and HSIC metrics at some optimal rate (tight with matching lower bound). We point out that existing differentially private tests can be adapted to be robust to data corruption, and we demonstrate in experiments that our proposed tests achieve much higher power than these private tests. Finally, we provide publicly available implementations and empirically illustrate the practicality of our robust tests.
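As background for the permutation-testing machinery this abstract builds on, the following is a minimal sketch of a standard (non-robust) kernel two-sample permutation test using a Gaussian-kernel MMD statistic. The function names, the fixed bandwidth, and the biased MMD estimator are illustrative assumptions; the corruption-robust construction is the paper's contribution and is not shown here.

```python
import numpy as np

def gaussian_kernel(X, Y, bandwidth=1.0):
    # Pairwise Gaussian kernel k(x, y) = exp(-||x - y||^2 / (2 * bandwidth^2)).
    sq = ((X[:, None, :] - Y[None, :, :]) ** 2).sum(-1)
    return np.exp(-sq / (2 * bandwidth ** 2))

def mmd_stat(Z, n, bandwidth=1.0):
    # Biased MMD^2 estimate between the first n rows of Z and the rest.
    X, Y = Z[:n], Z[n:]
    return (gaussian_kernel(X, X, bandwidth).mean()
            + gaussian_kernel(Y, Y, bandwidth).mean()
            - 2 * gaussian_kernel(X, Y, bandwidth).mean())

def permutation_test(X, Y, num_perms=200, alpha=0.05, seed=0):
    # Standard permutation test: reject if the observed MMD^2 is large
    # relative to statistics computed under random relabelings.
    rng = np.random.default_rng(seed)
    Z = np.vstack([X, Y])
    n = len(X)
    observed = mmd_stat(Z, n)
    perm_stats = [mmd_stat(Z[rng.permutation(len(Z))], n)
                  for _ in range(num_perms)]
    # "+1" correction keeps the test exact at finite sample sizes.
    p = (1 + sum(s >= observed for s in perm_stats)) / (1 + num_perms)
    return p <= alpha, p
```

Under the null, the relabeled statistics are exchangeable with the observed one, which is what gives the non-asymptotic type I error control referred to in the abstract.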
Poster
James McInerney · Nathan Kallus
[ Hall A-E ]
Abstract
Uncertainty quantification in deep learning is crucial for safe and reliable decision-making in downstream tasks. Existing methods quantify uncertainty at the last layer or other approximations of the network which may miss some sources of uncertainty in the model. To address this gap, we propose an uncertainty quantification method for large networks based on variation due to regularization. Essentially, predictions that are more (less) sensitive to the regularization of network parameters are less (more, respectively) certain. This principle can be implemented by deterministically tweaking the training loss during the fine-tuning phase and reflects confidence in the output as a function of all layers of the network. We show that regularization variation (RegVar) provides rigorous uncertainty estimates that, in the infinitesimal limit, exactly recover the Laplace approximation in Bayesian deep learning. We demonstrate its success in several deep learning architectures, showing it can scale tractably with the network size while maintaining or improving uncertainty quantification quality. Our experiments across multiple datasets show that RegVar not only identifies uncertain predictions effectively but also provides insights into the stability of learned representations.
Poster
Inioluwa Raji · Lydia Liu
[ Hall A-E ]
Abstract
Automated decision systems (ADS) are broadly deployed to inform or support human decision-making across a wide range of consequential contexts. However, various context-specific details complicate the goal of establishing meaningful experimental evaluations for prediction-based interventions. Notably, specific experimental design decisions may induce cognitive biases in human decision makers, which could then significantly alter the observed effect sizes of the prediction intervention. In this paper, we formalize and investigate various models of human decision-making in the presence of a predictive model aid. We show that each of these behavioral models produces dependencies across decision subjects and results in the violation of existing assumptions, with consequences for treatment effect estimation. This work aims to further advance the scientific validity of intervention-based evaluation schemes for the assessment of ADS deployments.
Poster
Antonio Ribeiro · Thomas Schön · Dave Zachariah · Francis Bach
[ Hall A-E ]
Abstract
Adversarial training can be used to learn models that are robust against perturbations. For linear models, it can be formulated as a convex optimization problem. Compared to methods proposed in the context of deep learning, leveraging the optimization structure allows significantly faster convergence rates. Still, the use of generic convex solvers can be inefficient for large-scale problems. Here, we propose tailored optimization algorithms for the adversarial training of linear models, which render large-scale regression and classification problems more tractable. For regression problems, we propose a family of solvers based on iterative ridge regression and, for classification, a family of solvers based on projected gradient descent. The methods are based on extended variable reformulations of the original problem. We illustrate their efficiency in numerical examples.
Poster
Måns Magnusson · Jakob Torgander · Paul Bürkner · Lu Zhang · Bob Carpenter · Aki Vehtari
[ Hall A-E ]
Abstract
The general applicability and robustness of posterior inference algorithms is critical to widely used probabilistic programming languages such as Stan, PyMC, Pyro, and Turing.jl. When designing a new inference algorithm, whether it involves Monte Carlo sampling or variational approximation, the fundamental problem is evaluating its accuracy and efficiency across a range of representative target posteriors. To solve this problem, we propose posteriordb, a database of models and data sets defining target densities along with reference Monte Carlo draws. We further provide a guide to the best practices in using posteriordb for algorithm evaluation and comparison. To provide a wide range of realistic posteriors, posteriordb currently comprises 120 representative models with data, and has been instrumental in developing several inference algorithms.
Poster
Mahbod Majid · Rattana Pukdee · Vishwajeet Agrawal · Burak Varici · Pradeep Ravikumar
[ Hall A-E ]
Abstract
Self-supervised learning methods that mask parts of the input data and train models to predict the missing components have led to significant advances in machine learning. These approaches learn conditional distributions $p(x_T \mid x_S)$ simultaneously, where $x_S$ and $x_T$ are subsets of the observed variables. In this paper, we examine the core problem of when all these conditional distributions are consistent with some joint distribution, and whether common models used in practice can learn consistent conditionals. We explore this problem in two settings. First, for the complementary conditioning sets where $S \cup T$ is the complete set of variables, we introduce the concept of path consistency, a necessary condition for a consistent joint. Second, we consider the case where we have access to $p(x_T \mid x_S)$ for all subsets $S$ and $T$. In this case, we propose the concepts of autoregressive and swap consistency, which we show are necessary and sufficient conditions for a consistent joint. For both settings, we analyze when these consistency conditions hold and show that standard discriminative models \emph{may fail to satisfy them}. Finally, we corroborate via experiments that the proposed consistency measures can be used as proxies for evaluating the consistency of conditionals $p(x_T \mid x_S)$, …
Poster
Alex Chen · Philippe Chlenski · Kenneth Munyuza · Antonio Moretti · Christian Andersson Naesseth · Itsik Pe'er
[ Hall A-E ]
Abstract
Hyperbolic space naturally encodes hierarchical structures such as phylogenies (binary trees), where inward-bending geodesics reflect paths through least common ancestors, and the exponential growth of neighborhoods mirrors the super-exponential scaling of topologies. This scaling challenge limits the efficiency of Euclidean-based approximate Bayesian inference methods. Motivated by the geometric connections between trees and hyperbolic space, we develop novel hyperbolic extensions of two sequential search algorithms: Combinatorial and Nested Combinatorial Sequential Monte Carlo (\textsc{Csmc} and \textsc{Ncsmc}). Our approach introduces consistent and unbiased estimators, along with variational inference methods (\textsc{H-Vcsmc} and \textsc{H-Vncsmc}), which outperform their Euclidean counterparts. Empirical results demonstrate improved speed, scalability and performance in high-dimensional Bayesian phylogenetic inference tasks.
Poster
Zhirui Chen · P. N. Karthik · Yeow Meng Chee · Vincent Tan
[ Hall A-E ]
Abstract
We consider a multi-armed bandit setting with finitely many arms, in which each arm yields an $M$-dimensional vector reward upon selection. We assume that the reward of each dimension (a.k.a. {\em objective}) is generated independently of the others. The best arm for any given objective is the arm whose mean reward in that objective is largest. The end goal is to identify the best arm of {\em every} objective in the shortest (expected) time subject to an upper bound on the probability of error (i.e., fixed-confidence regime). We establish a problem-dependent lower bound on the limiting growth rate of the expected stopping time, in the limit of vanishing error probabilities. This lower bound, we show, is characterised by a max-min optimisation problem that is computationally expensive to solve at each time step. We propose an algorithm that uses the novel idea of {\em surrogate proportions} to sample the arms at each time step, eliminating the need to solve the max-min optimisation problem at each step. We demonstrate theoretically that our algorithm is asymptotically optimal. In addition, we provide extensive empirical studies to substantiate the efficiency of our algorithm. While existing works on pure exploration with multi-objective multi-armed bandits …
Poster
Daniel Marks · Dario Paccagnan
[ Hall A-E ]
Abstract
Generalisation bounds are crucial for providing data-driven models with performance and safety guarantees. In this respect, bounds that do not require a held-out test set are particularly valuable as they allow the use of all data for training. While many such bounds do not improve upon the train-test approach, which remains the gold standard, the P2L algorithm (Paccagnan et al., 2023) has shown great potential. However, P2L comes with limitations, including computational overhead, reliance on consistent data, and restriction to non-Bayesian settings. In this work, we overcome these challenges in general settings and employ the corresponding results to show that classical Gaussian process (GP) training procedures can be interpreted as instantiations of P2L, thus inheriting tight, self-certified bounds. Three contributions underpin these conclusions. First, we introduce early stopping in P2L, equipping it with a tight generalisation bound to reduce training costs and address the non-consistent case. Second, we adapt P2L to the Bayesian setting and demonstrate its equivalence to posterior updating in a hierarchical model. Third, we show that greedy subset-of-data GPs are special P2L instantiations. Numerical evidence shows that the resulting P2L bounds we obtain compare favourably with the train-test and PAC-Bayes approaches on various real-world datasets.
Poster
Rickmer Schulte · David Rügamer
[ Hall A-E ]
Abstract
Additive models (AMs) have sparked a lot of interest in machine learning recently, allowing the incorporation of interpretable structures into a wide range of model classes. Many commonly used approaches to fit a wide variety of potentially complex additive models build on the idea of boosting additive models. While boosted additive models (BAMs) work well in practice, certain theoretical aspects are still poorly understood, including general convergence behavior and what optimization problem is being solved when accounting for the implicit regularizing nature of boosting. In this work, we study the solution paths of BAMs and establish connections with other approaches for certain classes of problems. Along these lines, we derive novel convergence results for BAMs, which yield crucial insights into the inner workings of the method. While our results generally provide reassuring theoretical evidence for the practical use of BAMs, they also uncover some "pathologies" of boosting for certain additive model classes concerning their convergence behavior that require caution in practice. We empirically validate our theoretical findings through several numerical experiments.
Poster
Haoyang Hong · Ioanna Papanikolaou · Sonali Parbhoo
[ Hall A-E ]
Abstract
Mitigating shortcuts, where models exploit spurious correlations in training data, remains a significant challenge for improving generalization. Regularization methods have been proposed to address this issue by enhancing model generalizability. However, we demonstrate that these methods can sometimes overregularize, inadvertently suppressing causal features along with spurious ones. In this work, we analyze the theoretical mechanisms by which regularization mitigates shortcuts and explore the limits of its effectiveness. Additionally, we identify the conditions under which regularization can successfully eliminate shortcuts without compromising causal features. Through experiments on synthetic and real-world datasets, our comprehensive analysis provides valuable insights into the strengths and limitations of regularization techniques for addressing shortcuts, offering guidance for developing more robust models.
Poster
Haoye Lu · Spencer Szabados · Yaoliang Yu
[ Hall A-E ]
Abstract
In recent years, diffusion models have become the leading approach for distribution learning. This paper focuses on structure-preserving diffusion models (SPDM), a specific subset of diffusion processes tailored for distributions with inherent structures, such as group symmetries. We complement existing sufficient conditions for constructing SPDMs by proving complementary necessary ones. Additionally, we propose a new framework that considers the geometric structures affecting the diffusion process. Leveraging this framework, we design a structure-preserving bridge model that maintains alignment between the model’s endpoint couplings. Empirical evaluations on equivariant diffusion models demonstrate their effectiveness in learning symmetric distributions and modeling transitions between them. Experiments on real-world medical images confirm that our models preserve equivariance while maintaining high sample quality. We also showcase the practical utility of our framework by implementing an equivariant denoising diffusion bridge model, which achieves reliable equivariant image noise reduction and style transfer, irrespective of prior knowledge of image orientation.
Poster
Zhongxi Fang · Xun Su · Tomohisa Tabuchi · Jianming Huang · Hiroyuki Kasai
[ Hall A-E ]
Abstract
Multidimensional Scaling (MDS) is an essential technique in multivariate analysis, with Weighted MDS (WMDS) commonly employed for tasks such as dimensionality reduction and graph drawing. However, the optimization of WMDS poses significant challenges due to the highly non-convex nature of its objective function. Stress Majorization, a method classified under the Majorization-Minimization algorithm, is among the most widely used solvers for this problem because it guarantees non-increasing loss values during optimization, even with a non-convex objective function. Despite this advantage, Stress Majorization suffers from high computational complexity, specifically $\mathcal{O}(\max(n^3, n^2 p))$ per iteration, where $n$ denotes the number of data points, and $p$ represents the projection dimension, rendering it impractical for large-scale data analysis. To mitigate the computational challenge, we introduce StableMDS, a novel gradient descent-based method that reduces the computational complexity to $\mathcal{O}(n^2 p)$ per iteration. StableMDS achieves this computational efficiency by applying gradient descent independently to each point, thereby eliminating the need for costly matrix operations inherent in Stress Majorization. Furthermore, we theoretically ensure non-increasing loss values and optimization stability akin to Stress Majorization. These advancements not only enhance computational efficiency but also maintain stability, thereby broadening the applicability of WMDS to larger datasets.
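To make the complexity comparison concrete, the weighted stress objective $\sigma(X) = \sum_{i<j} w_{ij}\,(\lVert x_i - x_j\rVert - d_{ij})^2$ can be minimised by plain full-batch gradient descent at $\mathcal{O}(n^2 p)$ cost per iteration, the same order StableMDS targets. This is a generic sketch, not the StableMDS update itself; all names, the learning rate, and the numerical-stability constants are illustrative.

```python
import numpy as np

def weighted_stress(X, D, W):
    # sigma(X) = sum_{i<j} w_ij * (||x_i - x_j|| - d_ij)^2
    diff = X[:, None, :] - X[None, :, :]
    dist = np.sqrt((diff ** 2).sum(-1) + 1e-12)
    iu = np.triu_indices(len(X), k=1)
    return (W[iu] * (dist[iu] - D[iu]) ** 2).sum()

def wmds_gradient_descent(D, W, p=2, lr=5e-3, iters=500, seed=0):
    # Full-batch gradient descent on the weighted stress; each iteration
    # costs O(n^2 p) since only pairwise differences are needed, avoiding
    # the O(n^3) matrix operations of Stress Majorization.
    rng = np.random.default_rng(seed)
    n = D.shape[0]
    X = rng.normal(size=(n, p))
    for _ in range(iters):
        diff = X[:, None, :] - X[None, :, :]
        dist = np.sqrt((diff ** 2).sum(-1) + 1e-12)
        np.fill_diagonal(dist, 1.0)      # avoid division by zero on i == j
        coef = W * (dist - D) / dist     # per-pair d(stress)/d(dist) factor
        np.fill_diagonal(coef, 0.0)
        grad = 2 * (coef[:, :, None] * diff).sum(axis=1)
        X -= lr * grad
    return X
```

Unlike this plain sketch, the abstract's method additionally guarantees non-increasing loss values, which generic gradient descent with a fixed step size does not.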
Poster
Mohammadreza Mousavi Kalan · Eitan Neugut · Samory Kpotufe
[ Hall A-E ]
Abstract
We consider the problem of transfer learning in outlier detection where target abnormal data is rare. While transfer learning has been considered extensively in traditional classification, the problem of transfer in outlier detection and more generally in imbalanced classification settings has received less attention. We propose a general algorithmic approach which is shown theoretically to yield strong guarantees w.r.t. a range of changes in abnormal distribution, and at the same time is amenable to practical implementation. We then investigate different instantiations of this general algorithmic approach, e.g., based on multi-layer neural networks, and show empirically that they significantly outperform natural extensions of transfer methods from traditional classification (which are the only solutions available at the moment).
Poster
Madhumitha Shridharan · Garud Iyengar
[ Hall A-E ]
Abstract
We consider a non-convex optimization formulation for learning the weighted adjacency matrix $W$ of a directed acyclic graph (DAG) that uses acyclicity constraints that are functions of $|W_{ij}|^\beta$, for $\beta \in \mathbb{N}$. State-of-the-art algorithms for this problem use gradient-based Karush-Kuhn-Tucker (KKT) optimality conditions which only yield useful search directions for $\beta =1$. Therefore, constraints with $\beta \geq 2$ have been ignored in the literature, and their empirical performance remains unknown. We introduce $\beta$-th Order Taylor Series Expansion Based Local Search ($\beta$-LS) which yields actionable descent directions for any $\beta \in \mathbb{N}$. Our empirical experiments show that $2$-LS obtains solutions of higher quality than $1$-LS, $3$-LS and $4$-LS. $2$-LSopt, an optimized version of $2$-LS, obtains high quality solutions significantly faster than the state of the art which uses $\beta=1$. Moreover, $2$-LSopt does not need any graph-size specific hyperparameter tuning. We prove that $\beta$-LSopt is guaranteed to converge to a Coordinate-Wise Local Stationary Point (Cst) for any $\beta \in \mathbb{N}$. If the objective function is convex, $\beta$-LSopt converges to a local minimum.
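For context, a standard smooth way to encode acyclicity as a function of $|W_{ij}|^\beta$ is the trace-exponential characterization $h(W) = \operatorname{tr}\!\big(\exp(|W|^{\circ\beta})\big) - d$, which vanishes exactly when $W$ encodes a DAG (powers of the nonnegative matrix $|W|^{\circ\beta}$ count weighted closed walks). This is a generic illustration of the constraint family, not the paper's $\beta$-LS search algorithm.

```python
import numpy as np
from scipy.linalg import expm

def acyclicity(W, beta=2):
    # h(W) = tr(exp(|W|^beta)) - d, elementwise power. Since
    # tr(exp(A)) = d + sum_k tr(A^k) / k! and tr(A^k) sums weighted
    # closed walks of length k, h(W) == 0 iff the digraph is acyclic.
    A = np.abs(W) ** beta
    return np.trace(expm(A)) - W.shape[0]
```

Any $\beta \in \mathbb{N}$ is admissible here; the abstract's point is that gradient-based KKT conditions only yield useful search directions for $\beta = 1$, whereas the proposed local search handles larger $\beta$.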
Poster
Ulysse Gazin · Ruth Heller · Etienne Roquain · Aldo Solari
[ Hall A-E ]
Abstract
In a split conformal framework with $K$ classes, a calibration sample of $n$ labeled examples is observed for inference on the label of a new unlabeled example. We explore the setting where a 'batch' of $m$ independent such unlabeled examples is given, and the goal is to construct a batch prediction set with $1-\alpha$ coverage. Unlike individual prediction sets, the batch prediction set is a collection of label vectors of size $m$, while the calibration sample consists of univariate labels. A natural approach is to apply the Bonferroni correction, which concatenates individual prediction sets at level $1-\alpha/m$. We propose a uniformly more powerful solution, based on specific combinations of conformal $p$-values that exploit the Simes inequality. We provide a general recipe for valid inference with any combinations of conformal $p$-values, and compare the performance of several useful choices. Intuitively, the pooled evidence of relatively 'easy' examples within the batch can help provide narrower batch prediction sets. Additionally, we introduce a more computationally intensive method that aggregates batch scores and can be even more powerful. The theoretical guarantees are established when all examples are independent and identically distributed (iid), as well as more generally when iid is assumed only conditionally within …
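To make the Bonferroni-versus-Simes comparison concrete, here is a minimal sketch of split-conformal $p$-values together with the two combination rules. The code is illustrative background, not the paper's full batch prediction-set construction; note that Simes rejects whenever Bonferroni does, but not conversely.

```python
import numpy as np

def conformal_pvalue(cal_scores, test_score):
    # Split-conformal p-value: rank of the test nonconformity score among
    # the n calibration scores (larger score = less conforming).
    n = len(cal_scores)
    return (1 + np.sum(cal_scores >= test_score)) / (n + 1)

def bonferroni_reject(pvals, alpha):
    # Reject the joint hypothesis if any p-value clears level alpha / m.
    return np.min(pvals) <= alpha / len(pvals)

def simes_reject(pvals, alpha):
    # Simes test: reject if the k-th smallest p-value is <= k * alpha / m
    # for some k; uniformly dominates the Bonferroni rule.
    p = np.sort(pvals)
    m = len(p)
    return bool(np.any(p <= alpha * np.arange(1, m + 1) / m))
```

For example, with $m = 2$ and $\alpha = 0.05$, the p-values $(0.04, 0.05)$ are rejected by Simes ($0.05 \le 2 \cdot 0.05 / 2$) but not by Bonferroni ($0.04 > 0.025$), which is the kind of extra power the batch prediction sets exploit.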
Poster
Zilong Deng · Simon Khan · Shaofeng Zou
[ Hall A-E ]
Abstract
In this work, we study the sample complexity of risk-sensitive Reinforcement Learning (RL) with a generative model, where we aim to maximize the Conditional Value at Risk (CVaR) with risk tolerance level $\tau$ at each step, named Iterated CVaR. We first build a connection between Iterated CVaR RL and $(s, a)$-rectangular distributionally robust RL with a CVaR-specific uncertainty set. We then develop nearly matching upper and lower bounds on the sample complexity for this problem. Specifically, we prove that a value-iteration-based algorithm, ICVaR-VI, achieves an $\epsilon$-optimal policy with at most $\tilde{\mathcal{O}}\left(\frac{SA}{(1-\gamma)^4\tau^2\epsilon^2}\right)$ samples, where $\gamma$ is the discount factor, and $S, A$ are the sizes of the state and action spaces. Furthermore, if $\tau \geq \gamma$, the sample complexity can be further improved to $\tilde{\mathcal{O}}\left( \frac{SA}{(1-\gamma)^3\epsilon^2} \right)$. We further show a minimax lower bound of ${\tilde{\mathcal{O}}}\left(\frac{(1-\gamma \tau)SA}{(1-\gamma)^4\tau\epsilon^2}\right)$. For a constant risk level $0<\tau\leq 1$, our upper and lower bounds match with each other, demonstrating the tightness and optimality of our analyses. We also investigate a limiting case with a small risk level …
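For reference, the CVaR objective at tolerance $\tau$ averages the worst $\tau$-fraction of outcomes, so $\tau = 1$ recovers the risk-neutral mean. A minimal empirical estimator (illustrative only, not the ICVaR-VI algorithm):

```python
import numpy as np

def empirical_cvar(rewards, tau):
    # CVaR at level tau of a reward sample: the average of the worst
    # (lowest) tau-fraction of outcomes; tau = 1 gives the plain mean.
    sorted_r = np.sort(rewards)
    k = max(1, int(np.ceil(tau * len(rewards))))
    return sorted_r[:k].mean()
```

Small $\tau$ focuses the objective on rare bad outcomes, which is why the sample-complexity bounds above degrade as $\tau \to 0$.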
Poster
Anshul Thakur · Soheila Molaei · Patrick Schwab · Danielle Belgrave · Kim Branson · David Clifton
[ Hall A-E ]
Abstract
Federated Learning (FL) involves a server aggregating local models from clients to compute a global model. However, this process can struggle to position the global model in low-loss regions of the parameter space for all clients, resulting in subpar convergence and inequitable performance across clients. This issue is particularly pronounced in non-IID settings, common in clinical contexts, where variations in data distribution, class imbalance, and training sample sizes result in client heterogeneity. To address this issue, we propose a mode connectivity-based FL framework that ensures the global model resides within the overlapping low-loss regions of all clients in the parameter space. This framework models the low-loss regions as non-linear mode connections between the current global and local models, and optimises to identify an intersection among these mode connections to define the new global model. This approach enhances training stability and convergence, yielding better and more equitable performance compared to standard FL frameworks like federated averaging. Empirical evaluations across multiple healthcare datasets demonstrate the benefits of the proposed framework.
Poster
Abdullah Tokmak · Kiran Krishnan · Thomas Schön · Dominik Baumann
[ Hall A-E ]
Abstract
Popular safe Bayesian optimization (BO) algorithms learn control policies for safety-critical systems in unknown environments. However, most algorithms make a smoothness assumption, which is encoded by a known bounded norm in a reproducing kernel Hilbert space (RKHS). The RKHS is a potentially infinite-dimensional space, and it remains unclear how to reliably obtain the RKHS norm of an unknown function. In this work, we propose a safe BO algorithm capable of estimating the RKHS norm from data. We provide statistical guarantees on the RKHS norm estimation, integrate the estimated RKHS norm into existing confidence intervals and show that we retain theoretical guarantees, and prove safety of the resulting safe BO algorithm. We apply our algorithm to safely optimize reinforcement learning policies on physics simulators and on a real inverted pendulum, demonstrating improved performance, safety, and scalability compared to the state-of-the-art.
Poster
Hue Dang · Matthew Wicker · Goetz Botterweck · Andrea Patane
[ Hall A-E ]
Abstract
We tackle the problem of computing guarantees for the robustness of neural networks against quantisation of their inputs, parameters and activation values. In particular, we pose the problem of bounding the worst-case discrepancy between the original neural network and all possible quantised ones parametrised by a given maximum quantisation diameter $\epsilon > 0$ over a finite dataset. To achieve this, we first reformulate the problem in terms of bilinear optimisation, which can be solved for provable bounds on the robustness guarantee. We then show how a quick scheme based on interval bound propagation can be developed and implemented during training so as to allow for the learning of neural networks robust against a continuous family of quantisation techniques. We evaluate our methodology on a variety of architectures on datasets such as MNIST, F-MNIST and CIFAR10. We demonstrate how non-trivial bounds on guaranteed accuracy can be obtained on several architectures and how quantisation robustness can be significantly improved through robust training.
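The interval-bound-propagation idea can be sketched for a single linear layer: any quantiser with diameter $\epsilon$ keeps the input inside an axis-aligned $\epsilon$-box, and that box is pushed through the layer in closed form. This generic sketch covers input quantisation only; the paper additionally handles parameter and activation quantisation, and all names here are illustrative.

```python
import numpy as np

def quantisation_box(x, eps):
    # Any quantiser with diameter eps maps x somewhere inside [x - eps, x + eps].
    return x - eps, x + eps

def ibp_linear(W, b, lower, upper):
    # Propagate the box [lower, upper] through y = W x + b:
    # centre goes through the layer exactly, radius through |W|.
    mid = (lower + upper) / 2
    rad = (upper - lower) / 2
    y_mid = W @ mid + b
    y_rad = np.abs(W) @ rad
    return y_mid - y_rad, y_mid + y_rad
```

Chaining such steps through every layer yields sound (if conservative) output bounds, which is what makes the scheme cheap enough to use inside the training loop.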
Poster
Xingzhi Sun · Danqi Liao · Kincaid MacDonald · Yanlei Zhang · Guillaume Huguet · Guy Wolf · Ian Adelstein · Tim G. J. Rudner · Smita Krishnaswamy
[ Hall A-E ]
Abstract
Rapid growth of high-dimensional datasets in fields such as single-cell RNA sequencing and spatial genomics has led to unprecedented opportunities for scientific discovery, but it also presents unique computational and statistical challenges. Traditional methods struggle with geometry-aware data generation, interpolation along meaningful trajectories, and transporting populations via feasible paths. To address these issues, we introduce Geometry-Aware Generative Autoencoder (GAGA), a novel framework that combines extensible manifold learning with generative modeling. GAGA constructs a neural network embedding space that respects the intrinsic geometries discovered by manifold learning and learns a novel warped Riemannian metric on the data space. This warped metric is derived from both the points on the data manifold and negative samples off the manifold, allowing it to characterize a meaningful geometry across the entire latent space. Using this metric, GAGA can uniformly sample points on the manifold, generate points along geodesics, and interpolate between populations across the learned manifold. GAGA shows competitive performance in simulated and real-world datasets, including a 30% improvement over SOTA in single-cell population-level trajectory inference.
Poster
David Huk · Mark Steel · Ritabrata Dutta
[ Hall A-E ]
Abstract
We propose reinterpreting copula density estimation as a discriminative task. Under this novel estimation scheme, we train a classifier to distinguish samples from the joint density from those of the product of independent marginals, recovering the copula density in the process. We derive equivalences between well-known copula classes and classification problems naturally arising in our interpretation. Furthermore, we show our estimator achieves theoretical guarantees akin to maximum likelihood estimation. By identifying a connection with density ratio estimation, we benefit from the rich literature and models available for such problems. Empirically, we demonstrate the applicability of our approach by estimating copulas of real and high-dimensional datasets, outperforming competing copula estimators in density evaluation as well as sampling.
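The discriminative recipe in the abstract can be sketched in a few lines: label joint samples 1, samples with independently shuffled columns 0, and read the density ratio off the classifier's odds. The quadratic feature map, the logistic-regression model, and all names are illustrative assumptions for 2-d Gaussian-type data, not the paper's estimator.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def _feat(X):
    # Quadratic features so a *linear* classifier can represent a Gaussian
    # log-density ratio (an illustrative choice for 2-d data).
    return np.column_stack([X, X ** 2, X[:, [0]] * X[:, [1]]])

def copula_density_ratio(X, seed=0):
    # Density ratio estimation via classification: with balanced classes,
    # the odds P(joint | x) / P(product | x) estimate
    # p(x) / prod_j p_j(x_j), i.e. the dependence structure.
    rng = np.random.default_rng(seed)
    X_indep = np.column_stack([rng.permutation(col) for col in X.T])
    Z = np.vstack([X, X_indep])
    y = np.concatenate([np.ones(len(X)), np.zeros(len(X_indep))])
    clf = LogisticRegression(max_iter=1000).fit(_feat(Z), y)
    def ratio(x):
        p = clf.predict_proba(_feat(np.atleast_2d(x)))[:, 1]
        return p / (1 - p)
    return ratio
```

For positively correlated data, the estimated ratio is large where both coordinates move together and small where they disagree, matching the shape of the underlying copula density.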
Poster
Seyedeh Baharan Khatami · Harsh Parikh · Haowei Chen · Sudeepa Roy · Babak Salimi
[ Hall A-E ]
Abstract
Estimating causal effects in social network data presents unique challenges due to the presence of spillover effects and network-induced confounding. While much of the existing literature addresses causal inference in social networks, many methods rely on strong assumptions about the form of network-induced confounding. These assumptions often fail to hold in high-dimensional networks, limiting the applicability of such approaches. To address this, we propose a novel methodology that integrates graph machine learning techniques with the double machine learning framework, facilitating accurate and efficient estimation of both direct and peer effects in a single observational social network. Our estimator achieves semiparametric efficiency under mild regularity conditions, enabling consistent uncertainty quantification. Through extensive simulations, we demonstrate the accuracy, robustness, and scalability of our method. Finally, we apply the proposed approach to examine the impact of Self-Help Group participation on financial risk tolerance, highlighting its practical relevance.
Poster
Swetha Ganesh · Washim Uddin Mondal · Vaneet Aggarwal
[ Hall A-E ]
Abstract
We present two Policy Gradient-based algorithms with general parametrization in the context of infinite-horizon average reward Markov Decision Process (MDP). The first one employs Implicit Gradient Transport for variance reduction, ensuring an expected regret of the order $\tilde{\mathcal{O}}(T^{2/3})$. The second approach, rooted in Hessian-based techniques, ensures an expected regret of the order $\tilde{\mathcal{O}}(\sqrt{T})$. These results significantly improve the state-of-the-art $\tilde{\mathcal{O}}(T^{3/4})$ regret and achieve the theoretical lower bound. We also show that the average-reward function is approximately $L$-smooth, a result that was previously assumed in earlier works.
Poster
HAO ZHU · Daniel M. Steinberg · Piotr Koniusz
[ Hall A-E ]
Abstract
In this work, we present a novel theoretical framework for analyzing and modeling protein fitness landscapes using spectral graph theory. By representing the protein sequence space as a generalized Hamming graph and studying its spectral properties, we derive a set of powerful tools for quantifying the ruggedness, epistasis, and other key characteristics of the landscape. We prove strong approximation and sampling results, showing that the landscape can be efficiently learned and optimized from limited and noisy data. Building on this foundation, we introduce Propagational Convolutional Neural Networks (PCNNs), a new class of inductive surrogate oracles. We provide rigorous theoretical guarantees on the generalization and convergence properties of PCNNs, using techniques from the Neural Tangent Kernel framework. Extensive experiments on real-world protein engineering tasks demonstrate the superiority of PCNNs over state-of-the-art methods, achieving higher fitness and better generalization from limited data.
Poster
Andi Nika · Jonathan Nöther · Debmalya Mandal · Parameswaran Kamalaruban · Adish Singla · Goran Radanovic
[ Hall A-E ]
Abstract
We study data poisoning attacks in learning from human preferences. More specifically, we consider the problem of teaching/enforcing a target policy $\pi^\dagger$ by synthesizing preference data. We seek to understand the susceptibility of different preference-based learning paradigms to poisoned preference data by analyzing the number of samples required by the attacker to enforce $\pi^\dagger$. We first propose a general data poisoning formulation in learning from human preferences and then study it for two popular paradigms, namely: (a) reinforcement learning from human feedback (RLHF) that operates by learning a reward model using preferences; (b) direct preference optimization (DPO) that directly optimizes policy using preferences. We conduct a theoretical analysis of the effectiveness of data poisoning in a setting where the attacker is allowed to augment a pre-existing dataset and also study its special case where the attacker can synthesize the entire preference dataset from scratch. As our main results, we provide lower/upper bounds on the number of samples required to enforce $\pi^\dagger$. Finally, we discuss the implications of our results in terms of the susceptibility of these learning paradigms under such data poisoning attacks.
Poster
Yingqian Cui · Jie Ren · Pengfei He · Hui Liu · Jiliang Tang · Yue Xing
[ Hall A-E ]
Abstract
We present a theoretical analysis of the performance of transformers with softmax attention in in-context learning with linear regression tasks. While the existing theoretical literature predominantly focuses on providing convergence upper bounds to show that trained transformers with single-/multi-head attention can obtain a good in-context learning performance, our research centers on comparing the exact convergence of single- and multi-head attention more rigorously. We conduct an exact theoretical analysis to demonstrate that multi-head attention with a substantial embedding dimension performs better than single-head attention. When the number of in-context examples $D$ increases, the prediction loss under both single- and multi-head attention is in $O(1/D)$, and the loss for multi-head attention has a smaller multiplicative constant. In addition to the simplest data distribution setting, our technical framework for calculating the exact convergence further facilitates studying more scenarios, e.g., noisy labels, local examples, correlated features, and prior knowledge. We observe that, in general, multi-head attention is preferred over single-head attention. Our results verify the effectiveness of the design of multi-head attention in the transformer architecture.
Poster
Honghua Zhang · Benjie Wang · Marcelo Arenas · Guy Van den Broeck
[ Hall A-E ]
Abstract
Probabilistic circuits (PCs) are a unifying representation for probabilistic models that support tractable inference. Numerous applications of PCs, like controllable text generation, depend on the ability to efficiently multiply two circuits. Existing multiplication algorithms require that the circuits respect the same structure, i.e., that variable scopes decompose according to the same vtree. In this work, we propose and study the task of restructuring structured(-decomposable) PCs, that is, transforming a structured PC such that it conforms to a target vtree. We propose a generic approach for this problem and show that it leads to novel polynomial-time algorithms for multiplying circuits respecting different vtrees, as well as a practical depth-reduction algorithm that preserves structured decomposability. Our work opens up new avenues for tractable PC inference, suggesting the possibility of training with less restrictive PC structures while enabling efficient inference by changing their structures at inference time.
Poster
Krzysztof Kacprzyk · Mihaela van der Schaar
[ Hall A-E ]
Abstract
Symbolic regression (SR) is a machine learning approach aimed at discovering mathematical closed-form expressions that best fit a given dataset. Traditional complexity measures in SR, such as the number of terms or expression tree depth, often fail to capture the difficulty of specific analytical tasks a user might need to perform. In this paper, we introduce a new complexity measure designed to quantify the difficulty of conducting single-feature global perturbation analysis (SGPA)—a type of analysis commonly applied in fields like physics and risk scoring to understand the global impact of perturbing individual input features. We present a unified mathematical framework that formalizes and generalizes these established practices, providing a precise method to assess how challenging it is to apply SGPA to different closed-form equations. This approach enables the definition of novel complexity metrics and constraints directly tied to this practical analytical task. Additionally, we establish a reconstruction theorem, offering potential insights for developing future optimization techniques in SR.
Poster
Nikola Pavlovic · Sudeep Salgia · Qing Zhao
[ Hall A-E ]
Abstract
We consider the problem of contextual kernel bandits with stochastic contexts, where the underlying reward function belongs to a known Reproducing Kernel Hilbert Space (RKHS). We study this problem under the additional constraint of joint differential privacy, where the agent needs to ensure that the sequence of query points is differentially private with respect to both the sequence of contexts and rewards. We propose a novel algorithm that improves upon the state of the art and achieves an error rate of $\mathcal{O}\left(\sqrt{\dfrac{\gamma_T}{T}} + \dfrac{\gamma_T}{T \varepsilon}\right)$ after $T$ queries for a large class of kernel families, where $\gamma_T$ represents the effective dimensionality of the kernel and $\varepsilon > 0$ is the privacy parameter. Our results are based on a novel estimator for the reward function that simultaneously enjoys high utility along with low sensitivity to observed rewards and contexts, which is crucial to obtaining improved performance.
Poster
Samuele Fonio · Roberto Esposito · Marco Aldinucci
[ Hall A-E ]
Abstract
Non-Euclidean geometries have garnered significant research interest, particularly in their application to Deep Learning. Utilizing specific manifolds as embedding spaces has been shown to enhance neural network representational capabilities by aligning these spaces with the data's latent structure. In this paper, we focus on hyperbolic manifolds and introduce a novel framework, Hyperbolic Prototypical Entailment Cones (HPEC). The core innovation of HPEC lies in utilizing angular relationships, rather than traditional distance metrics, to more effectively capture the similarity between data representations and their corresponding prototypes. This is achieved by leveraging hyperbolic entailment cones, a mathematical construct particularly suited for embedding hierarchical structures in the Poincaré ball, along with a novel Backclip mechanism. Our experimental results demonstrate that this approach significantly enhances performance in high-dimensional embedding spaces. To substantiate these findings, we evaluate HPEC on four diverse datasets across various embedding dimensions, consistently surpassing state-of-the-art methods in Prototype Learning.
Poster
Flavio Giorgi · Cesare Campagnano · Fabrizio Silvestri · Gabriele Tolomei
[ Hall A-E ]
Abstract
Explainable Artificial Intelligence (XAI) has emerged as a critical area of research to unravel the opaque inner logic of (deep) machine learning models. Among the various XAI techniques proposed in the literature, counterfactual explanations stand out as one of the most promising approaches. However, these ``what-if'' explanations are frequently complex and technical, making them difficult for non-experts to understand and, more broadly, challenging for humans to interpret. To bridge this gap, in this work, we exploit the power of open-source Large Language Models to generate natural language explanations when prompted with valid counterfactual instances produced by state-of-the-art explainers for graph-based models. Experiments across several graph datasets and counterfactual explainers show that our approach effectively produces accurate natural language representations of counterfactual instances, as demonstrated by key performance metrics.
Poster
Daniel Dold · Julius Kobialka · Nicolai Palm · Emanuel Sommer · David Rügamer · Oliver Dürr
[ Hall A-E ]
Abstract
Understanding the structure of neural network loss surfaces, particularly the emergence of low-loss tunnels, is critical for advancing neural network theory and practice. In this paper, we propose a novel approach to directly embed loss tunnels into the loss landscape of neural networks. Exploring the properties of these loss tunnels offers new insights into their length and structure and sheds light on some common misconceptions. We then apply our approach to Bayesian neural networks, where we improve subspace inference by identifying pitfalls and proposing a more natural prior that better guides the sampling procedure.
Poster
Francesco Micheli · Efe Balta · Anastasios Tsiamis · John Lygeros
[ Hall A-E ]
Abstract
We address the challenge of sequential data-driven decision-making under context distributional uncertainty. This problem arises in numerous real-world scenarios where the learner optimizes black-box objective functions in the presence of uncontrollable contextual variables. We consider the setting where the context distribution is uncertain but known to lie within an ambiguity set defined as a ball in the Wasserstein distance. We propose a novel algorithm for Wasserstein Distributionally Robust Bayesian Optimization that can handle continuous context distributions while maintaining computational tractability. Our theoretical analysis combines recent results in self-normalized concentration in Hilbert spaces and finite-sample bounds for distributionally robust optimization to establish sublinear regret bounds that match state-of-the-art results. Through extensive comparisons with existing approaches on both synthetic and real-world problems, we demonstrate the simplicity, effectiveness, and practical applicability of our proposed method.
Poster
Yves Rychener · Daniel Kuhn · Yifan Hu
[ Hall A-E ]
Abstract
We investigate group fairness regularizers in federated learning, aiming to train a globally fair model in a distributed setting. Ensuring global fairness in distributed training presents unique challenges, as fairness regularizers typically involve probability metrics between distributions across all clients and are not naturally separable by client. To address this, we introduce a function-tracking scheme for the global fairness regularizer based on a Maximum Mean Discrepancy (MMD), which incurs a small communication overhead. This scheme seamlessly integrates into most federated learning algorithms while preserving rigorous convergence guarantees, as demonstrated in the context of FedAvg. Additionally, when enforcing differential privacy, the kernel-based MMD regularization enables straightforward analysis through a change of kernel, leveraging an intuitive interpretation of kernel convolution. Numerical experiments confirm our theoretical insights.
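The global fairness regularizer in the abstract above is built on the Maximum Mean Discrepancy between group-wise distributions, which is simple to compute directly. A minimal sketch of that underlying quantity (RBF kernel and hypothetical group scores; the paper's communication-efficient function-tracking scheme is not reproduced here):

```python
import numpy as np

def mmd2(x, y, bandwidth=1.0):
    """Biased (V-statistic) estimate of squared MMD with an RBF kernel."""
    def gram(a, b):
        d2 = ((a[:, None, :] - b[None, :, :]) ** 2).sum(axis=-1)
        return np.exp(-d2 / (2 * bandwidth ** 2))
    return gram(x, x).mean() + gram(y, y).mean() - 2 * gram(x, y).mean()

rng = np.random.default_rng(0)
scores_g0 = rng.normal(0.0, 1.0, size=(200, 1))   # model scores, group 0
scores_g1 = rng.normal(1.0, 1.0, size=(200, 1))   # shifted scores, group 1

penalty = mmd2(scores_g0, scores_g1)   # large when group score distributions differ
matched = mmd2(scores_g0, rng.normal(0.0, 1.0, size=(200, 1)))
print(penalty, matched)
```

Adding such a penalty to the training loss pushes the model's score distributions across groups together; the federated difficulty the abstract addresses is that this quantity couples samples held by different clients.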
Poster
BEOMJUN KIM · Jaehwan Kim · Kangyeon Kim · Sunwoo Kim · Heejin Ahn
[ Hall A-E ]
Abstract
Evaluating dataset quality is an essential task, as the performance of artificial intelligence (AI) systems heavily depends on it. A traditional method for evaluating dataset quality involves training an AI model on the dataset and testing it on a separate test set. However, this approach requires significant computational time. In this paper, we propose a computationally efficient method for quantifying dataset quality. Specifically, our method measures how well the dataset covers the input probability distribution, ensuring that a high-quality dataset minimizes out-of-distribution inputs. We present a GPU-accelerated algorithm for approximately implementing the proposed method. We highlight three applications of our approach. First, it can evaluate the impact of data management practices, such as data cleaning and core set selection. We experimentally demonstrate that the quality assessment provided by our method strongly correlates with the traditional approach, achieving an $R^2 \geq 0.985$ in most cases while being 60-1200 times faster. Second, it can monitor the quality of continuously growing datasets with computation time proportional to the added data size. Finally, our method can estimate the performance of traditional methods for large datasets.
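One crude proxy for the coverage idea in the abstract above (not the paper's GPU-accelerated algorithm) is the fraction of samples from the input distribution that fall near some dataset point; a dataset that misses part of the input space scores lower:

```python
import numpy as np

def coverage_score(dataset, probe, eps=0.5):
    """Fraction of probe points (drawn from the input distribution) that lie
    within distance eps of some dataset point -- a crude coverage proxy."""
    d = np.linalg.norm(probe[:, None, :] - dataset[None, :, :], axis=-1)
    return (d.min(axis=1) <= eps).mean()

rng = np.random.default_rng(0)
probe = rng.normal(size=(500, 2))        # samples from the input distribution
full = rng.normal(size=(1000, 2))        # dataset covering the whole distribution
truncated = full[full[:, 0] > 0]         # dataset missing half the input space

print(coverage_score(full, probe), coverage_score(truncated, probe))
```

The distance threshold `eps` and the data here are illustrative assumptions; the pairwise-distance computation is exactly the kind of work a GPU implementation would accelerate.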
Poster
Christina Baek · Aditi Raghunathan · Zico Kolter
[ Hall A-E ]
Abstract
Under distribution shifts, deep networks exhibit a surprising phenomenon: in-distribution (ID) versus out-of-distribution (OOD) accuracy is often strongly linearly correlated across architectures and hyperparameters, accompanied by the same linear trend in ID versus OOD agreement between the predictions of any pair of such independently trained networks. The latter phenomenon, called ``agreement-on-the-line'', enables precise unlabeled OOD performance estimation of models. In this work, we discover that agreement-on-the-line emerges even in linear classifiers over Gaussian class conditional distributions. We provide theoretical guarantees for this phenomenon in classifiers optimized via randomly initialized gradient descent, approximated by linear interpolations between random vectors and the Bayes-optimal classifier. Next, we prove a lower bound on the residual of the correlation between ID versus OOD agreement that grows proportionally with the residual of accuracy. Real-world experiments on CIFAR10C shifts validate our findings and the broader relevance of our theoretical framework.
Poster
Ashkan Soleymani · Behrooz Tahmasebi · Stefanie Jegelka · Patrick Jaillet
[ Hall A-E ]
Abstract
While invariances naturally arise in almost any type of real-world data, no efficient and robust test exists for detecting them in observational data under arbitrarily given group actions. We tackle this problem by studying measures of invariance that can capture even negligible underlying patterns. Our first contribution is to show that, while detecting subtle asymmetries is computationally intractable, a randomized method can be used to robustly estimate closeness measures to invariance within constant factors. This provides a general framework for robust statistical tests of invariance. Despite the extensive and well-established literature, our methodology, to the best of our knowledge, is the first to provide statistical tests for general group invariances with finite-sample guarantees on Type II errors. In addition, we focus on kernel methods and propose deterministic algorithms for robust testing with respect to both finite and infinite groups, accompanied by a rigorous analysis of their convergence rates and sample complexity. Finally, we revisit the general framework in the specific case of kernel methods, showing that recent closeness measures to invariance, defined via group averaging, are provably robust, leading to powerful randomized algorithms.
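As a toy illustration of a group-averaging closeness measure like those discussed above, consider the sign-flip group acting on the real line: the mean-squared deviation of a function from its group average is zero exactly when the function is invariant (an assumed finite-group setup; not the paper's kernel-based tests or their finite-sample guarantees):

```python
import numpy as np

rng = np.random.default_rng(0)
group = [lambda x: x, lambda x: -x]      # Z_2 acting on the real line by sign flip

def deviation(f, samples):
    """Monte Carlo estimate of E[(f(x) - mean_g f(g.x))^2]; zero (on the
    sampled support) exactly when f is invariant under the group action."""
    vals = np.array([[f(g(x)) for g in group] for x in samples])
    return ((vals[:, 0] - vals.mean(axis=1)) ** 2).mean()

xs = rng.normal(size=200)
even = lambda x: x ** 2                  # invariant under the sign flip
odd = lambda x: x ** 3 + x               # clearly not invariant

print(deviation(even, xs), deviation(odd, xs))
```

For infinite or large groups, the inner mean over `group` is where the randomized and deterministic estimation schemes studied in the paper come in.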
Poster
Gil Kur · Aditya Guntuboyina
[ Hall A-E ]
Abstract
We study the task of estimating a log-concave density in $\mathbb{R}^d$ using the Maximum Likelihood Estimator, known as the log-concave MLE. We show that for every $d \geq 4$, the log-concave MLE attains an \emph{adaptive rate} when the negative logarithm of the underlying density is the maximum of $k$ affine functions, meaning that the estimation error for such a density is significantly lower than the minimax rate for the class of log-concave densities. Specifically, we prove that for such densities, the risk of the log-concave MLE is of order $c(k) \cdot n^{-\frac{4}{d}}$ in terms of the Hellinger squared distance. This result complements the work of Kim et al. (AoS 2018) and Feng et al. (AoS 2021), who addressed the cases $d = 1$ and $d \in \{2,3\}$, respectively. Our proof provides a unified and relatively simple approach for all $d \geq 1$, and is based on techniques from stochastic convex geometry and empirical process theory, which may be of independent interest.
Poster
Yuta Nakahara · Shota Saito · Naoki Ichijo · Koki Kazama · Toshiyasu Matsushima
[ Hall A-E ]
Abstract
Deterministic decision trees have difficulty evaluating uncertainty, especially for small samples. To solve this problem, we interpret the decision trees as stochastic models and consider prediction problems in the framework of Bayesian decision theory. Our models have three kinds of parameters: a tree shape, leaf parameters, and inner parameters. To make Bayesian optimal decisions, we have to calculate the posterior distribution of these parameters. Previously, two types of methods have been proposed. One marginalizes out the leaf parameters and samples the tree shape and the inner parameters by Metropolis-Hastings (MH) algorithms. The other marginalizes out both the leaf parameters and the tree shape based on a concept called meta-trees and approximates the posterior distribution for the inner parameters by a bagging-like method. In this paper, we propose a novel MH algorithm where the leaf parameters and the tree shape are marginalized out by using the meta-trees and only the inner parameters are sampled. Moreover, we update all the inner parameters simultaneously in each MH step. This algorithm accelerates the convergence and mixing of the Markov chain. We evaluate our algorithm on various benchmark datasets with other state-of-the-art methods. Further, our model provides a novel statistical evaluation of feature importance.
Poster
Ziyad Benomar · Vianney Perchet
[ Hall A-E ]
Abstract
The field of learning-augmented algorithms has gained significant attention in recent years. Using potentially inaccurate predictions, these algorithms must exhibit three key properties: consistency, robustness, and smoothness. In scenarios with stochastic predictions, a strong average-case performance is required. Typically, the design of such algorithms involves a natural tradeoff between consistency and robustness, and previous works aimed to achieve Pareto-optimal tradeoffs for specific problems. However, in some settings, this comes at the expense of smoothness. In this paper, we explore the tradeoffs between all the mentioned criteria and show how they can be balanced.
Poster
Amartya Sanyal · Yaxi Hu · Yaodong Yu · Yian Ma · Yixin Wang · Bernhard Schölkopf
[ Hall A-E ]
Abstract
Accuracy-on-the-line is a widely observed phenomenon in machine learning, where a model's accuracy on in-distribution (ID) and out-of-distribution (OOD) data is positively correlated across different hyperparameters and data configurations. But when does this useful relationship break down? In this work, we explore its robustness. The key observation is that noisy data and the presence of nuisance features can be sufficient to shatter the Accuracy-on-the-line phenomenon. In these cases, ID and OOD accuracy can become negatively correlated, leading to "Accuracy-on-the-wrong-line". This phenomenon can also occur in the presence of spurious (shortcut) features, which tend to overshadow the more complex signal (core, non-spurious) features, resulting in a large nuisance feature space. Moreover, scaling to larger datasets does not mitigate this undesirable behaviour and may even exacerbate it. We formally prove a lower bound on OOD error in a linear classification model, characterising the conditions on the noise and nuisance features for a large OOD error. We finally demonstrate this phenomenon across both synthetic and real datasets with noisy data and nuisance features.
Poster
Tathagata Sadhukhan · Manit Paul · Raaz Dwivedi
[ Hall A-E ]
Abstract
Nearest neighbor (NN) algorithms have been extensively used for missing data problems in recommender systems and sequential decision-making systems. Prior theoretical analysis has established favorable guarantees for NN when the underlying data is sufficiently smooth and the missingness probabilities are lower bounded. Here we analyze NN with non-smooth non-linear functions with vast amounts of missingness. In particular, we consider matrix completion settings where the entries of the underlying matrix follow a latent non-linear factor model, with the non-linearity belonging to a Hölder function class that is less smooth than Lipschitz. Our results establish the following favorable properties for a suitable two-sided NN: (1) the mean squared error (MSE) of NN adapts to the smoothness of the non-linearity, (2) under certain regularity conditions, the NN error rate matches the rate obtained by an oracle equipped with the knowledge of both the row and column latent factors, and finally (3) NN's MSE is non-trivial for a wide range of settings even when several matrix entries might be missing deterministically. We support our theoretical findings via extensive numerical simulations and a case study with data from a mobile health study, HeartSteps.
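A simplified sketch of a two-sided nearest-neighbor estimate — averaging observed entries whose row and column both look similar to the target's, with similarity measured on commonly observed entries — under a noiseless rank-1 model (illustrative thresholds and data; not the paper's exact estimator or assumptions):

```python
import numpy as np

def two_sided_nn(M, mask, i, j, eta=0.5):
    """Estimate entry (i, j) by averaging observed entries whose row looks
    similar to row i AND whose column looks similar to column j, with
    similarity computed on commonly observed entries."""
    n, m = M.shape

    def row_dist(a, b):
        common = mask[a] & mask[b]
        return ((M[a, common] - M[b, common]) ** 2).mean() if common.any() else np.inf

    def col_dist(a, b):
        common = mask[:, a] & mask[:, b]
        return ((M[common, a] - M[common, b]) ** 2).mean() if common.any() else np.inf

    rows = [a for a in range(n) if a != i and row_dist(a, i) <= eta]
    cols = [b for b in range(m) if b != j and col_dist(b, j) <= eta]
    vals = [M[a, b] for a in rows for b in cols if mask[a, b]]
    return np.mean(vals) if vals else np.nan

rng = np.random.default_rng(0)
u, v = rng.normal(size=(30, 1)), rng.normal(size=(1, 30))
M = u @ v                            # noiseless rank-1 latent factor matrix
mask = rng.random(M.shape) < 0.7     # roughly 70% of entries observed
est = two_sided_nn(M, mask, 0, 0)
print(est, M[0, 0])
```

Restricting to entries that are close in both the row and the column direction is what makes the estimator "two-sided"; a one-sided variant would use only row (or only column) neighbors.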
Poster
Zhi Zhang · Kyle Ritscher · Oscar Madrid
[ Hall A-E ]
Abstract
This paper introduces and analyzes quantile additive trend filtering, a novel approach to model the conditional quantiles of the response variable given multivariate covariates. Under the assumption that the true model is additive, and that the components are functions whose $r$th order weak derivatives have bounded total variation, our estimator is a constrained version of quantile trend filtering within additive models. The primary theoretical contributions are the error rates of our estimator in both fixed and growing input dimensions. In the fixed dimension case, we show that our estimator attains a rate that mirrors the non-quantile minimax rate for additive trend filtering, featuring the main term $n^{-2r/(2r+1)}$. For growing input dimension ($d$), our rate has an additional polynomial factor $d^{(2r+2)/(2r+1)}$. We propose a practical algorithm for implementing quantile additive trend filtering using dimension-wise backfitting. Experiments on both real data and simulations confirm our theoretical findings. We provide a public implementation of the algorithm at https://github.com/zzh237/QATF.
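The quantile side of such estimators rests on the pinball loss, whose minimizer over constants is the empirical $\tau$-quantile. A quick numerical check of that fact (illustrative data; the trend-filtering penalty and backfitting algorithm are not reproduced here):

```python
import numpy as np

def pinball(residual, tau):
    """Quantile (pinball) loss: tau-weighted under-prediction,
    (1 - tau)-weighted over-prediction."""
    return np.where(residual >= 0, tau * residual, (tau - 1) * residual)

# Minimizing mean pinball loss over a constant recovers the tau-quantile.
rng = np.random.default_rng(0)
y = rng.normal(size=10_000)
grid = np.linspace(-3, 3, 601)
losses = [pinball(y - c, 0.9).mean() for c in grid]
best = grid[int(np.argmin(losses))]
print(best, np.quantile(y, 0.9))
```

Replacing the constant with an additive function of the covariates and adding a total-variation penalty on discrete derivatives yields the kind of objective the abstract studies.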
Poster
Brendan Mallery · James Murphy · Shuchin Aeron
[ Hall A-E ]
Abstract
We consider synthesis and analysis of probability measures using the entropy-regularized Wasserstein-2 cost and its unbiased version, the Sinkhorn divergence. The synthesis problem consists of computing the barycenter, with respect to these costs, of reference measures given a set of coefficients belonging to the simplex. The analysis problem consists of finding the coefficients for the closest barycenter in the Wasserstein-2 distance to a given measure. Under the weakest assumptions on the measures thus far in the literature, we compute the derivative of the entropy-regularized Wasserstein-2 cost. We leverage this to establish a characterization of barycenters with respect to the entropy-regularized Wasserstein-2 cost as solutions that correspond to a fixed point of an average of the entropy-regularized displacement maps. This characterization yields a finite-dimensional, convex, quadratic program for solving the analysis problem when the measure being analyzed is a barycenter with respect to the entropy-regularized Wasserstein-2 cost. We show that these coefficients, as well as the value of the barycenter functional, can be estimated from samples with dimension-independent rates of convergence, and that barycentric coefficients are stable with respect to perturbations in the Wasserstein-2 metric. We employ the barycentric coefficients as features for classification of corrupted point cloud data, and show …
Poster
Hongni Wang · Junxi Zhang · Na Li · Linglong Kong · Bei Jiang · Xiaodong Yan
[ Hall A-E ]
Abstract
In healthcare and precision medicine, estimating optimal treatment regimes for right-censored data while ensuring fairness across ethnic subgroups is crucial but remains underexplored. The problem presents two key challenges: measuring heterogeneous treatment effects (HTE) under fairness constraints and dealing with censoring mechanisms. We propose a general framework for estimating HTE using nonparametric methods and integrating user-controllable fairness constraints to address these problems. Under mild regularization assumptions, our method is theoretically grounded, demonstrating the double robustness property of the HTE estimator. Using this framework, we demonstrate that optimal treatment strategies balance fairness and utility. Through extensive simulations and real-world data analysis, we uncover the potential of this method to guide the selection of treatment methods that are equitable and effective.
Poster
Yang Hu · Tianyi Chen · Na Li · Kai Wang · Bo Dai
[ Hall A-E ]
Abstract
Off-policy evaluation (OPE) is one of the most fundamental problems in reinforcement learning (RL) to estimate the expected long-term payoff of a given target policy with *only* experiences from another behavior policy that is potentially unknown. The distribution correction estimation (DICE) family of estimators has advanced the state of the art in OPE by breaking the *curse of horizon*. However, the major bottleneck of applying DICE estimators lies in the difficulty of solving the saddle-point optimization involved, especially with neural network implementations. In this paper, we tackle this challenge by establishing a *linear representation* of the value function and the stationary distribution correction ratio, *i.e.*, the primal and dual variables in the DICE framework, using the spectral decomposition of the transition operator. Such a primal-dual representation not only bypasses the non-convex non-concave optimization in vanilla DICE, thereby enabling a computationally efficient algorithm, but also paves the way for more efficient utilization of historical data. We highlight that our algorithm, **SpectralDICE**, is the first to leverage the linear representation of primal-dual variables that is both computation and sample efficient, the performance of which is supported by a rigorous theoretical sample complexity guarantee and a thorough empirical evaluation on various benchmarks.
Poster
Alessio Russo · Yichen Song · Aldo Pacchiano
[ Hall A-E ]
Abstract
We study the sample complexity of pure exploration in an online learning problem with a feedback graph. This graph dictates the feedback available to the learner, covering scenarios between full-information, pure bandit feedback, and settings with no feedback on the chosen action. While variants of this problem have been investigated for regret minimization, no prior work has addressed the pure exploration setting, which is the focus of our study. We derive an instance-specific lower bound on the sample complexity of learning the best action with fixed confidence, even when the feedback graph is unknown and stochastic, and present unidentifiability results for Bernoulli rewards. Additionally, our findings reveal how the sample complexity scales with key graph-dependent quantities. Lastly, we introduce TaS-FG (Track and Stop for Feedback Graphs), an asymptotically optimal algorithm, and demonstrate its efficiency across different graph configurations.
Poster
Qiaobo Li · Zixiang Chen · Yihe Deng · Yiwen Kou · Yuan Cao · Quanquan Gu
[ Hall A-E ]
Abstract
Representation learning, particularly multi-task representation learning, has gained widespread popularity in various deep learning applications, ranging from computer vision to natural language processing, due to its remarkable generalization performance. Despite its growing use, our understanding of the underlying mechanisms remains limited. In this paper, we provide a theoretical analysis elucidating why multi-task representation learning outperforms its single-task counterpart in scenarios involving over-parameterized two-layer convolutional neural networks trained by gradient descent. Our analysis is based on a data model that encompasses both task-shared and task-specific features, a setting commonly encountered in real-world applications. We also present experiments on synthetic and real-world data to illustrate and validate our theoretical findings.
Poster
Tingting Ni · Maryam Kamgarpour
[ Hall A-E ]
Abstract
We consider discounted infinite-horizon constrained Markov decision processes (CMDPs), where the goal is to find an optimal policy that maximizes the expected cumulative reward while satisfying expected cumulative constraints. Motivated by the application of CMDPs in online learning for safety-critical systems, we focus on developing a model-free and $\textit{simulator-free}$ algorithm that ensures $\textit{constraint satisfaction during learning}$. To this end, we employ the LB-SGD algorithm proposed by Usmanova et al. (2022), which utilizes an interior-point approach based on the log-barrier function of the CMDP. Under the commonly assumed conditions of relaxed Fisher non-degeneracy and bounded transfer error in policy parameterization, we establish the theoretical properties of the LB-SGD algorithm. In particular, unlike existing CMDP approaches that ensure policy feasibility only upon convergence, the LB-SGD algorithm guarantees feasibility throughout the learning process and converges to the $\varepsilon$-optimal policy with a sample complexity of $\tilde{\mathcal{O}}(\varepsilon^{-6})$. Compared to the state-of-the-art policy gradient-based algorithm, C-NPG-PDA, the LB-SGD algorithm requires an additional $\mathcal{O}(\varepsilon^{-2})$ samples to ensure policy feasibility during learning with the same Fisher non-degenerate parameterization.
Poster
Sobihan Surendran · Antoine Godichon-Baggioni · Sylvain Le Corff
[ Hall A-E ]
Abstract
Variational Autoencoders (VAE) are popular generative models used to sample from complex data distributions. Despite their empirical success in various machine learning tasks, significant gaps remain in understanding their theoretical properties, particularly regarding convergence guarantees. This paper aims to bridge that gap by providing non-asymptotic convergence guarantees for VAE trained using both Stochastic Gradient Descent and Adam algorithms. We derive a convergence rate of $\mathcal{O}(\log n / \sqrt{n})$, where $n$ is the number of iterations of the optimization algorithm, with explicit dependencies on the batch size, the number of variational samples, and other key hyperparameters. Our theoretical analysis applies to both Linear VAE and Deep Gaussian VAE, as well as several VAE variants, including $\beta$-VAE and IWAE. Additionally, we empirically illustrate the impact of hyperparameters on convergence, offering new insights into the theoretical understanding of VAE training.
Poster
Gilad Turok · Chirag Modi · Bob Carpenter
[ Hall A-E ]
Abstract
Hamiltonian Monte Carlo (HMC) is the mainstay of applied Bayesian inference for differentiable models. However, HMC still struggles to sample from hierarchical models that induce densities with multiscale geometry: a large step size is needed to efficiently explore low curvature regions while a small step size is needed to accurately explore high curvature regions. We introduce the delayed rejection generalized HMC (DR-G-HMC) sampler that overcomes this challenge by employing dynamic step size selection, inspired by differential equation solvers. In generalized HMC, each iteration does a single leapfrog step. DR-G-HMC sequentially makes proposals with geometrically decreasing step sizes upon rejection of earlier proposals. This simulates Hamiltonian dynamics that can adjust its step size along a (stochastic) Hamiltonian trajectory to deal with regions of high curvature. DR-G-HMC makes generalized HMC competitive by decreasing the number of rejections which otherwise cause inefficient backtracking and prevents directed movement. We present experiments to demonstrate that DR-G-HMC (1) correctly samples from multiscale densities, (2) makes generalized HMC methods competitive with the state of the art No-U-Turn sampler, and (3) is robust to tuning parameters.
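The delayed-rejection idea described in the abstract can be sketched as follows. This is an illustrative sketch only, assuming a simple Gaussian-style target: the plain Metropolis test below omits the delayed-rejection acceptance correction for rejected ("ghost") proposals and the partial momentum refresh that the actual DR-G-HMC sampler uses.

```python
import numpy as np

def leapfrog(q, p, grad_logp, eps, n_steps=1):
    """Leapfrog integration of Hamiltonian dynamics (one step, as in G-HMC)."""
    p = p + 0.5 * eps * grad_logp(q)
    for _ in range(n_steps - 1):
        q = q + eps * p
        p = p + eps * grad_logp(q)
    q = q + eps * p
    p = p + 0.5 * eps * grad_logp(q)
    return q, p

def dr_step(q, logp, grad_logp, eps0, max_tries=3, shrink=0.5, rng=None):
    """On rejection, retry the proposal with a geometrically smaller step size.

    Simplified: the true DR-G-HMC acceptance ratio must correct for the
    earlier rejected proposals; the bare Metropolis test here is illustrative.
    """
    rng = np.random.default_rng() if rng is None else rng
    p = rng.standard_normal(np.shape(q))
    h0 = logp(q) - 0.5 * float(p @ p)
    eps = eps0
    for _ in range(max_tries):
        q_new, p_new = leapfrog(q, p, grad_logp, eps)
        h1 = logp(q_new) - 0.5 * float(p_new @ p_new)
        if np.log(rng.uniform()) < h1 - h0:
            return q_new, True
        eps *= shrink  # geometric step-size decrease after each rejection
    return q, False
```

The geometric shrinkage is what lets a single sampler use large steps in flat regions while falling back to small steps in high-curvature regions.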
Poster
Haotian Ye · Himanshu Jain · Chong You · Ananda Theertha Suresh · Haowei Lin · James Zou · Felix Yu
[ Hall A-E ]
Abstract
In real-world applications of large language models, outputs are often required to be confined: selecting items from predefined product or document sets, generating phrases that comply with safety standards, or conforming to specialized formatting styles. To control the generation, constrained decoding has been widely adopted. However, existing prefix-tree-based constrained decoding is inefficient under GPU-based model inference paradigms, and it introduces unintended biases into the output distribution. This paper introduces Dynamic Importance Sampling for Constrained Decoding (DISC) with GPU-based Parallel Prefix-Verification (PPV), a novel algorithm that leverages dynamic importance sampling to achieve theoretically guaranteed asymptotic unbiasedness and overcomes the inefficiency of prefix-tree. Extensive experiments demonstrate the superiority of our method over existing methods in both efficiency and output quality. These results highlight the potential of our methods to improve constrained generation in applications where adherence to specific constraints is essential.
Poster
Abhishek Sharma · Leo Benac · Sonali Parbhoo · Finale Doshi-Velez
[ Hall A-E ]
Abstract
Within batch reinforcement learning, safe policy improvement seeks to ensure that the learned policy performs at least as well as the behavior policy that generated the dataset. The core challenge is seeking improvements while balancing risk when many state-action pairs may be infrequently visited. In this work, we introduce Decision Points RL (DPRL), an algorithm that restricts the set of state-action pairs (or regions for continuous states) considered for improvement. DPRL ensures high-confidence improvement in densely visited states (called `decision points') while still utilizing data from sparsely visited states by using them for trajectory-based value estimates. By selectively limiting the state-actions where the policy deviates from the behavior, we achieve tighter theoretical guarantees that depend only on the counts of frequently observed state-action pairs rather than on state-action space size. Our empirical results confirm DPRL provides both safety and performance improvements across synthetic and real-world applications.
Poster
Brian Cho · Dominik Meier · Kyra Gan · Nathan Kallus
[ Hall A-E ]
Abstract
In multi-armed bandits, reward maximization and pure exploration are often at odds with each other. The former focuses on exploiting arms with the highest means, while the latter may require constant exploration across all arms. In this work, we focus on good arm identification (GAI), a pure exploration objective that aims to label arms with means above a threshold as quickly as possible. We show that GAI can be efficiently solved by combining a reward-maximizing sampling algorithm with a novel nonparametric anytime-valid sequential test for labeling arm means. We begin by presenting the theoretical guarantees of our proposed sequential test. Under nonparametric assumptions, our test ensures strict error control and asymptotically achieves the minimax optimal e-power, a notion of power for anytime-valid tests. Building on this, we propose an algorithm for GAI by pairing regret-minimizing sampling schemes with our sequential test as a stopping criterion. We show that this approach achieves minimax optimal stopping times for labeling arms with means above a threshold, under an error probability constraint $\delta$. Our empirical results validate our approach beyond the minimax setting, reducing the expected number of samples for all stopping times by at least 50% across both synthetic and real-world settings.
Poster
Aleksandar Armacki · Shuhua Yu · Pranay Sharma · Gauri Joshi · Dragana Bajovic · Dusan Jakovetic · Soummya Kar
[ Hall A-E ]
Abstract
We study high-probability convergence in online learning, in the presence of heavy-tailed noise. To combat the heavy tails, a general framework of nonlinear SGD methods is considered, subsuming several popular nonlinearities like sign, quantization, component-wise and joint clipping. In our work the nonlinearity is treated in a black-box manner, allowing us to establish unified guarantees for a broad range of nonlinear methods. For symmetric noise and non-convex costs we establish convergence of gradient norm-squared, at a rate $\widetilde{\mathcal{O}}(t^{-1/4})$, while for the last iterate of strongly convex costs we establish convergence to the population optima, at a rate $\mathcal{O}(t^{-\zeta})$, where $\zeta \in (0,1)$ depends on noise and problem parameters. Further, if the noise is a (biased) mixture of symmetric and non-symmetric components, we show convergence to a neighbourhood of stationarity, whose size depends on the mixture coefficient, nonlinearity and noise. Compared to state-of-the-art works, which only consider clipping and require unbiased noise with bounded $p$-th moments, $p \in (1,2]$, we provide guarantees for a broad class of nonlinearities, without any assumptions on noise moments. While the rate exponents in state-of-the-art depend on noise moments and vanish as $p \rightarrow 1$, our exponents are constant and strictly better whenever $p < 6/5$ for non-convex …
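The black-box treatment of the nonlinearity can be sketched as follows. The function names, the Student-t noise model, and the step-size schedule are illustrative assumptions, not the paper's exact setup; the point is that the update only touches the gradient through a generic map `nl`.

```python
import numpy as np

# Examples of nonlinearities covered by the framework: each maps a
# stochastic gradient g to nl(g) before the update x <- x - lr * nl(g).
def sign_nl(g):
    return np.sign(g)                        # component-wise sign

def joint_clip(g, tau=1.0):
    n = np.linalg.norm(g)
    return g if n <= tau else (tau / n) * g  # joint (norm) clipping

def componentwise_clip(g, tau=1.0):
    return np.clip(g, -tau, tau)

def nonlinear_sgd(grad_fn, x0, nl, lr=0.1, steps=100, rng=None):
    """Nonlinear SGD with heavy-tailed gradient noise; `nl` is a black box."""
    rng = np.random.default_rng() if rng is None else rng
    x = np.asarray(x0, dtype=float)
    for t in range(steps):
        # Student-t noise (df=3): finite variance but heavy tails.
        g = grad_fn(x) + rng.standard_t(df=3, size=x.shape)
        x = x - lr / np.sqrt(t + 1) * nl(g)  # assumed decaying step size
    return x
```

Swapping `joint_clip` for `sign_nl` or `componentwise_clip` requires no change to the update, which mirrors how the analysis covers all of them at once.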
Poster
Yatong Chen · Andrew Estornell · Yevgeniy Vorobeychik · Yang Liu
[ Hall A-E ]
Abstract
Individuals often aim to reverse undesired outcomes in interactions with automated systems, like loan denials, by either implementing system-recommended actions (recourse), or manipulating their features. While providing recourse benefits users and enhances system utility, it also provides information about the decision process that can be used for more effective strategic manipulation, especially when the individuals collectively share such information with each other. We show that this tension leads rational utility-maximizing systems to frequently withhold recourse, resulting in decreased population utility, particularly impacting sensitive groups. To mitigate these effects, we explore the role of recourse subsidies, finding them effective in increasing the provision of recourse actions by rational systems, as well as lowering the potential social cost and mitigating unfairness caused by recourse withholding.
Poster
Peyman Morteza
[ Hall A-E ]
Abstract
We develop a mathematical framework to address a broad class of metric and preference learning problems within a Hilbert space. We obtain a novel representer theorem for the simultaneous task of metric and preference learning. Our key observation is that the representer theorem for this task can be derived by regularizing the problem with respect to the norm inherent in the task structure. For the general task of metric learning, our framework leads to a simple and self-contained representer theorem and offers new geometric insights into the derivation of representer theorems for this task. In the case of Reproducing Kernel Hilbert Spaces (RKHSs), we illustrate how our representer theorem can be used to express the solution of the learning problems in terms of finite kernel terms similar to classical representer theorems. Lastly, our representer theorem leads to a novel nonlinear algorithm for metric and preference learning. We compare our algorithm against challenging baseline methods on real-world rank inference benchmarks, where it achieves competitive performance. Notably, our approach significantly outperforms vanilla ideal point methods and surpasses strong baselines across multiple datasets. Code available at: https://github.com/PeymanMorteza/Metric-Preference-Learning-RKHS
Poster
Florian Kalinke · Zoltan Szabo · Bharath Sriperumbudur
[ Hall A-E ]
Abstract
Kernel methods underpin many of the most successful approaches in data science and statistics, and they allow representing probability measures as elements of a reproducing kernel Hilbert space without loss of information. Recently, the kernel Stein discrepancy (KSD), which combines Stein's method with the flexibility of kernel techniques, gained considerable attention. Through the Stein operator, KSD allows the construction of powerful goodness-of-fit tests where it is sufficient to know the target distribution up to a multiplicative constant. However, the typical U- and V-statistic-based KSD estimators suffer from a quadratic runtime complexity, which hinders their application in large-scale settings. In this work, we propose a Nyström-based KSD acceleration---with runtime $\mathcal{O} \left(mn+m^3\right)$ for $n$ samples and $m\ll n$ Nyström points---show its $\sqrt{n}$-consistency with a classical sub-Gaussian assumption, and demonstrate its applicability for goodness-of-fit testing on a suite of benchmarks. We also show the $\sqrt{n}$-consistency of the quadratic-time KSD estimator.
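The Nyström idea behind the stated $\mathcal{O}(mn+m^3)$ runtime can be sketched with a generic Nyström feature map for a Gaussian kernel (an illustration of the approximation principle, not the paper's KSD-specific estimator):

```python
import numpy as np

def gauss_kernel(X, Y, bw=1.0):
    """Gaussian kernel matrix between row-wise point sets X and Y."""
    d2 = ((X[:, None, :] - Y[None, :, :]) ** 2).sum(-1)
    return np.exp(-d2 / (2 * bw ** 2))

def nystroem_features(X, m, bw=1.0, rng=None):
    """m-dimensional features phi with phi @ phi.T ~ K (the full Gram matrix).

    Cost: O(mn) kernel evaluations plus one O(m^3) eigendecomposition,
    versus O(n^2) for the exact n x n Gram matrix.
    """
    rng = np.random.default_rng() if rng is None else rng
    idx = rng.choice(len(X), size=m, replace=False)  # Nystrom landmark points
    K_mm = gauss_kernel(X[idx], X[idx], bw)          # m x m
    K_nm = gauss_kernel(X, X[idx], bw)               # n x m
    w, V = np.linalg.eigh(K_mm + 1e-10 * np.eye(m))
    w = np.clip(w, 1e-12, None)
    # phi @ phi.T = K_nm @ K_mm^{-1} @ K_nm.T, the Nystrom approximation.
    return K_nm @ V / np.sqrt(w)
```

Replacing the exact Gram matrix by such low-rank features is what breaks the quadratic runtime barrier of U- and V-statistic KSD estimators.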
Poster
Paulius Rauba · Qiyao Wei · Mihaela van der Schaar
[ Hall A-E ]
Abstract
We consider the problem of auditing *black-box* large language models (LLMs) to ensure they behave reliably when deployed in production settings, particularly in high-stakes domains such as legal, medical, and regulatory compliance. Existing approaches for LLM auditing often focus on isolated aspects of model behavior, such as detecting specific biases or evaluating fairness. We are interested in a more general question---can we understand how the outputs of black-box LLMs depend on *each input token*? There is a critical need to have such tools in real-world applications that rely on inaccessible API endpoints to language models. However, this is a highly non-trivial problem, as LLMs are stochastic functions (i.e., two outputs may differ by chance), while computing prompt-level gradients to approximate input sensitivity is infeasible. To address this, we propose Distribution-Based Sensitivity Analysis (DBSA), a lightweight model-agnostic procedure to evaluate the sensitivity of the output of a language model for each input token, without making any distributional assumptions about the LLM. DBSA is developed as a *practical tool* for practitioners, enabling quick, plug-and-play visual exploration of LLMs' reliance on specific input tokens. Through illustrative examples, we demonstrate how DBSA can enable users to inspect LLM inputs and find sensitivities that …
Poster
Shimeng Huang · Niklas Pfister · Jack Bowden
[ Hall A-E ]
Abstract
Observational genome-wide association studies are now widely used for causal inference in genetic epidemiology. To maintain privacy, such data is often only publicly available as summary statistics, and often studies for the endogenous covariates and the outcome are available separately. This has necessitated methods tailored to two-sample summary statistics. Current state-of-the-art methods modify linear instrumental variable (IV) regression---with genetic variants as instruments---to account for unmeasured confounding. However, since the endogenous covariates can be high dimensional, standard IV assumptions are generally insufficient to identify all causal effects simultaneously. We ensure identifiability by assuming the causal effects are sparse and propose a sparse causal effect two-sample IV estimator, spaceTSIV, adapting the spaceIV estimator by Pfister and Peters (2022) for two-sample summary statistics. We provide two methods, based on L0- and L1-penalization, respectively. We prove identifiability of the sparse causal effects in the two-sample setting and consistency of spaceTSIV. The performance of spaceTSIV is compared with existing two-sample IV methods in simulations. Finally, we showcase our methods using real proteomic and gene-expression data for drug-target discovery.
Poster
Safwan Labbi · Daniil Tiapkin · Lorenzo Mancini · Paul Mangold · Eric Moulines
[ Hall A-E ]
Abstract
In this paper, we present the Federated Upper Confidence Bound Value Iteration algorithm ($\texttt{Fed-UCBVI}$), a novel extension of the $\texttt{UCBVI}$ algorithm (Azar et al., 2017) tailored for the federated learning framework. We prove that the regret of $\texttt{Fed-UCBVI}$ scales as $\tilde O(\sqrt{H^3 |S| |A| T / M})$, with a small additional term due to heterogeneity, where $|S|$ is the number of states, $|A|$ is the number of actions, $H$ is the episode length, $M$ is the number of agents, and $T$ is the number of episodes. Notably, in the single-agent setting, this upper bound matches the minimax lower bound up to polylogarithmic factors, while in the multi-agent scenario, $\texttt{Fed-UCBVI}$ has linear speed-up. To conduct our analysis, we introduce a new measure of heterogeneity, which may hold independent theoretical interest. Furthermore, we show that, unlike existing federated reinforcement learning approaches, $\texttt{Fed-UCBVI}$'s communication complexity only marginally increases with the number of agents.
Poster
Ruijia Zhang · Siliang Zeng · Chenliang Li · Alfredo Garcia · Mingyi Hong
[ Hall A-E ]
Abstract
The goal of the inverse reinforcement learning (IRL) task is to identify the underlying reward function and the corresponding optimal policy from a set of expert demonstrations. Most algorithms with theoretical guarantees in IRL assume the reward has a linear structure. In this work, we want to extend our understanding of the IRL problem when the reward is parametrized by some neural network structures. Meanwhile, conventional IRL algorithms usually adopt a nested structure, thus exhibiting computational inefficiency, especially when the MDP is high-dimensional. We address this problem by proposing the first neural single-loop maximum likelihood algorithm. Due to the nonlinearity of neural network approximation, the previous global convergence result established in linear reward scenarios is no longer guaranteed. We give a nonasymptotic convergence analysis of our proposed neural algorithm by using the overparameterization of certain neural networks. However, it still remains unknown whether the proposed neural algorithm can identify the globally optimal reward and the corresponding optimal policy. Under some over-parameterized neural network structures, we provide affirmative answers to both questions. To our knowledge, this is the first IRL algorithm with a non-asymptotic convergence guarantee that identifies a provably global optimum within neural network settings.
Poster
Julianna Piskorz · Nicolás Astorga · Jeroen Berrevoets · Mihaela van der Schaar
[ Hall A-E ]
Abstract
Making treatment effect estimation actionable for personalized decision-making requires overcoming the costs and delays of acquiring necessary features. While many machine learning models estimate Conditional Average Treatment Effects (CATE), they mostly assume that _all_ relevant features are readily available at prediction time – a scenario that is rarely realistic. In practice, acquiring features, such as medical tests, can be both expensive and time-consuming, highlighting the need for strategies that select the most informative features for each individual, enhancing decision accuracy while controlling costs. Existing active feature acquisition (AFA) methods, developed for supervised learning, fail to address the unique challenges of CATE, such as confounding, overlap, and the structural similarities of potential outcomes under different treatments. To tackle these challenges, we propose specialised feature acquisition metrics and estimation strategies tailored to the CATE setting. We demonstrate the effectiveness of our methods through experiments on synthetic datasets designed to reflect common biases and data issues. In doing so, this work aims to bridge the gap between cutting-edge CATE estimation techniques and their practical, cost-efficient application in personalised treatment assignment.
Poster
Remi Khellaf · Aurélien Bellet · Julie Josse
[ Hall A-E ]
Abstract
We study Federated Causal Inference, an approach to estimate treatment effects from decentralized data across centers. We compare three classes of Average Treatment Effect (ATE) estimators derived from the Plug-in G-Formula, ranging from simple meta-analysis to one-shot and multi-shot federated learning, the latter leveraging the full data to learn the outcome model (albeit requiring more communication). Focusing on Randomized Controlled Trials (RCTs), we derive the asymptotic variance of these estimators for linear models. Our results provide practical guidance on selecting the appropriate estimator for various scenarios, including heterogeneity in sample sizes, covariate distributions, treatment assignment schemes, and center effects. We validate these findings with a simulation study.
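The simplest of the estimator classes mentioned above, meta-analysis, can be sketched as an inverse-variance-weighted combination of per-center difference-in-means estimates. This is an illustrative sketch under an RCT assumption, not the paper's exact Plug-in G-Formula estimators.

```python
import numpy as np

def center_ate(y, t):
    """Difference-in-means ATE and its variance estimate in one RCT center."""
    y1, y0 = y[t == 1], y[t == 0]
    ate = y1.mean() - y0.mean()
    var = y1.var(ddof=1) / len(y1) + y0.var(ddof=1) / len(y0)
    return ate, var

def meta_analysis_ate(centers):
    """Inverse-variance-weighted combination of per-center ATEs.

    `centers` is a list of (outcomes, treatments) pairs; no individual-level
    data leaves a center, only (ate, var) summaries are shared.
    """
    ates, vars_ = zip(*(center_ate(y, t) for y, t in centers))
    w = 1.0 / np.asarray(vars_)
    return float(np.sum(w * np.asarray(ates)) / np.sum(w))
```

One-shot and multi-shot federated estimators instead share (partial) outcome-model fits, trading extra communication for lower variance under heterogeneity.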
Poster
Zijun Gao
[ Hall A-E ]
Abstract
Accurate heterogeneous treatment effect (HTE) estimation is essential for personalized recommendations, making it important to evaluate and compare HTE estimators. Due to missing counterfactuals, traditional evaluation methods are inapplicable. Current HTE assessment methods rely on additional estimation or matching on test data, whose uncertainty is often ignored, leading to incorrect HTE estimator selection. We propose incorporating uncertainty quantification into HTE estimator comparisons. In addition, we suggest shifting the focus to the estimation and inference of the relative error between methods rather than the absolute errors. Methodology-wise, we develop a relative error estimator based on the semi-parametric efficient influence function and establish the estimator's asymptotic distribution for inference. Compared to absolute error-based methods, the relative error estimator (1) is less sensitive to the errors in nuisance function estimation, satisfies a "global double robustness" property, and (2) its confidence intervals are often narrower, making it more powerful for determining the more accurate HTE estimator. Through extensive empirical study of simulated data and ACIC benchmark datasets, we show that the relative error-based method more effectively identifies the better HTE estimator with statistical confidence, even with limited test data or inaccurate nuisance estimators.
Poster
Petar Bevanda · Max Beier · Alexandre Capone · Stefan Sosnowski · Sandra Hirche · Armin Lederer
[ Hall A-E ]
Abstract
We propose a family of Gaussian processes (GP) for dynamical systems with linear time-invariant responses, which are nonlinear only in initial conditions. This linearity allows us to tractably quantify forecasting and representational uncertainty, simultaneously alleviating the challenge of computing the distribution of trajectories from a GP-based dynamical system and enabling a new probabilistic treatment of learning Koopman operator representations. Using a trajectory-based equivariance – which we refer to as Koopman equivariance – we obtain a GP model with enhanced generalization capabilities. To allow for large-scale regression, we equip our framework with variational inference based on suitable inducing points. Experiments demonstrate on-par and often better forecasting performance compared to kernel-based methods for learning dynamical systems.
Poster
Raphael Carpintero Perez · Sébastien da Veiga · Josselin Garnier · Brian Staber
[ Hall A-E ]
Abstract
In computational physics, machine learning has now emerged as a powerful complementary tool to explore efficiently candidate designs in engineering studies. Outputs in such supervised problems are signals defined on meshes, and a natural question is the extension of general scalar output regression models to such complex outputs. Changes between input geometries in terms of both size and adjacency structure in particular make this transition non-trivial. In this work, we propose an innovative strategy for Gaussian process regression where inputs are large and sparse graphs with continuous node attributes and outputs are signals defined on the nodes of the associated inputs. The methodology relies on the combination of regularized optimal transport, dimension reduction techniques, and the use of Gaussian processes indexed by graphs. In addition to enabling signal prediction, the main point of our proposal is to come with confidence intervals on node values, which is crucial for uncertainty quantification and active learning. Numerical experiments highlight the efficiency of the method to solve real problems in fluid dynamics and solid mechanics.
Poster
Saptarshi Chakraborty · Peter Bartlett
[ Hall A-E ]
Abstract
The field of unpaired image-to-image translation has undergone a significant transformation with the introduction of Generative Adversarial Networks (GANs), with CycleGAN and DiscoGAN as prominent variants. While these models show impressive empirical performance, their statistical properties are under-studied. In this paper, we propose a framework for analyzing the generalization error in cross-domain deep generative models. Our findings reveal that when provided with independent and identically distributed (i.i.d.) samples from two domains, the translation error, measured under the Wasserstein-1 loss, scales as $\tilde{\mathcal{O}} \left(\min(n, m)^{-1/\max(d,\tilde{d})}\right)$, provided that the true model possesses sufficient smoothness and the network sizes are chosen appropriately. Here, $n$ and $m$ represent the sizes of the sample sets, while $d$ and $\tilde{d}$ denote the dimensions of the respective data domains. Furthermore, we highlight the importance of a cycle loss term for ensuring distributional cycle consistency. Additionally, we provide insights into the relationship between the network size and the number of data points. Notably, as the true model exhibits greater smoothness, it suffices to work with smaller networks.
Poster
Jaehyun Park · Junyeop Kwon · Dabeen Lee
[ Hall A-E ]
Abstract
We study model-based reinforcement learning with non-linear function approximation where the transition function of the underlying Markov decision process (MDP) is given by a multinomial logit (MNL) model. We develop a provably efficient discounted value iteration-based algorithm that works for both infinite-horizon average-reward and discounted-reward settings. For average-reward communicating MDPs, the algorithm guarantees a regret upper bound of $\tilde{\mathcal{O}}(dD\sqrt{T})$ where $d$ is the dimension of feature mapping, $D$ is the diameter of the underlying MDP, and $T$ is the horizon. For discounted-reward MDPs, our algorithm achieves $\tilde{\mathcal{O}}(d(1-\gamma)^{-2}\sqrt{T})$ regret where $\gamma$ is the discount factor. Then we complement these upper bounds by providing several regret lower bounds. We prove a lower bound of $\Omega(d\sqrt{DT})$ for learning communicating MDPs of diameter $D$ and a lower bound of $\Omega(d(1-\gamma)^{-3/2}\sqrt{T})$ for learning discounted-reward MDPs with discount factor $\gamma$. Lastly, we show a regret lower bound of $\Omega(dH^{3/2}\sqrt{K})$ for learning $H$-horizon episodic MDPs with MNL function approximation where $K$ is the number of episodes, which improves upon the best-known lower bound for the finite-horizon setting.
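The MNL transition model underlying the results above can be sketched in a few lines: the probability of each candidate next state is a softmax over feature-parameter inner products. Names and shapes here are illustrative.

```python
import numpy as np

def mnl_transition_probs(phi_next, theta):
    """MNL transition model: P(s' | s, a) over the candidate next states.

    phi_next: (k, d) array of feature vectors phi(s, a, s') for the k
    candidate next states; theta: (d,) parameter the learner estimates.
    """
    logits = phi_next @ theta
    logits -= logits.max()   # subtract the max for numerical stability
    p = np.exp(logits)
    return p / p.sum()
```

The regret bounds are then stated in terms of the feature dimension $d$ rather than the number of states, which is what makes the function approximation useful.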
Poster
Ricardo Baptista · Aram-Alexandre Pooladian · Michael Brennan · Youssef Marzouk · Jonathan Niles-Weed
[ Hall A-E ]
Abstract
Conditional simulation is a fundamental task in statistical modeling: Generate samples from the conditionals given finitely many data points from a joint distribution. One promising approach is to construct conditional Brenier maps, where the components of the map pushforward a reference distribution to conditionals of the target. While many estimators exist, few, if any, come with statistical or algorithmic guarantees. To this end, we propose a non-parametric estimator for conditional Brenier maps based on the computational scalability of \emph{entropic} optimal transport. Our estimator leverages a result of \citet{carlier2010knothe}, which shows that optimal transport maps under a rescaled quadratic cost asymptotically converge to conditional Brenier maps; our estimator is precisely the entropic analogue of these converging maps. We provide heuristic justifications for how to choose the scaling parameter in the cost as a function of the number of samples by fully characterizing the Gaussian setting. We conclude by comparing the performance of the estimator to other machine learning and non-parametric approaches on benchmark datasets and Bayesian inference problems.
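The two ingredients of such an estimator can be sketched in numpy: a plain Sinkhorn solver for entropic OT, and a rescaled quadratic cost in the spirit of Carlier et al. (2010), where the conditioning coordinate is up-weighted so the map becomes approximately conditional. The weighting scheme and parameter values below are illustrative assumptions, not the paper's tuned choices.

```python
import numpy as np

def sinkhorn(a, b, C, reg=1.0, n_iter=1000):
    """Entropic OT plan between histograms a, b for cost matrix C."""
    K = np.exp(-C / reg)
    u = np.ones_like(a)
    for _ in range(n_iter):
        v = b / (K.T @ u)
        u = a / (K @ v)
    return u[:, None] * K * v[None, :]   # transport plan diag(u) K diag(v)

def rescaled_cost(src, tgt, eps_t=1.0):
    """Quadratic cost with the conditioning coordinate weighted by 1/eps_t.

    For 2-d points (x, y): as eps_t -> 0 the OT map increasingly preserves
    x and transports y conditionally (a Knothe-type limit).
    """
    dx = (src[:, None, 0] - tgt[None, :, 0]) ** 2 / eps_t
    dy = (src[:, None, 1] - tgt[None, :, 1]) ** 2
    return dx + dy
```

The heuristic question studied in the abstract is how `eps_t` should shrink with the sample size so that the entropic plan tracks the conditional Brenier map.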
Poster
Hisham Husain · Julien Monteil
[ Hall A-E ]
Abstract
Latent variable collaborative filtering methods have been a standard approach to modelling user-click interactions due to their simplicity and effectiveness. However, there is limited work on analyzing the mathematical properties of these methods, in particular on preventing the overfitting towards the identity, and such methods typically utilize loss functions that overlook the geometry between items. In this work, we introduce a notion of generalization gap in collaborative filtering and analyze this with respect to latent collaborative filtering models. We present a geometric upper bound that gives rise to loss functions, and a way to meaningfully utilize the geometry of the items to improve recommendations. We show how these losses can be minimized and give the recipe for a new latent collaborative filtering algorithm, which we refer to as GeoCF, due to the geometric nature of our results. We then show experimentally that our proposed GeoCF algorithm can outperform all other existing methods on the Movielens20M and Netflix datasets, as well as two large-scale internal datasets. In summary, our work proposes a theoretically sound method which paves the way to better understand generalization of collaborative filtering at large.
Poster
Talal Alrawajfeh · Joonas Jälkö · Antti Honkela
[ Hall A-E ]
Abstract
Differential privacy (DP) provides robust privacy guarantees for statistical inference, but this can lead to unreliable results and biases in downstream applications. While several noise-aware approaches have been proposed which integrate DP perturbation into the inference, they are limited to specific types of simple probabilistic models. In this work, we propose a novel method for noise-aware approximate Bayesian inference based on stochastic gradient variational inference which can also be applied to high-dimensional and non-conjugate models. We also propose a more accurate evaluation method for noise-aware posteriors. Empirically, our inference method has similar performance to existing methods in the domain where they are applicable. Outside this domain, we obtain accurate coverages on high-dimensional Bayesian linear regression and well-calibrated predictive probabilities on Bayesian logistic regression with the UCI Adult dataset.
Poster
Calvin Osborne · Eliza O'Reilly
[ Hall A-E ]
Abstract
Random feature maps are used to decrease the computational cost of kernel machines in large-scale problems. The Mondrian kernel is one such example of a fast random feature approximation of the Laplace kernel, generated by a computationally efficient hierarchical random partition of the input space known as the Mondrian process. In this work, we study a variation of this random feature map by applying a uniform random rotation to the input space before running the Mondrian process to approximate a kernel that is invariant under rotations. We obtain a closed-form expression for the isotropic kernel that is approximated, as well as a uniform convergence rate of the uniformly rotated Mondrian kernel to this limit. To this end, we utilize techniques from the theory of stationary random tessellations in stochastic geometry and prove a new result on the geometry of the typical cell of the superposition of uniformly rotated Mondrian tessellations. Finally, we test the empirical performance of this random feature map on both synthetic and real-world datasets, demonstrating its improved performance over the Mondrian kernel on a dataset that is debiased from the standard coordinate axes.
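The uniform random rotation applied before running the Mondrian process can be sampled with a standard QR-based construction (a sketch of the rotation step only; the Mondrian feature construction itself is omitted and the function name is illustrative):

```python
import numpy as np

def haar_rotation(d, rng=None):
    """Sample a rotation uniformly (Haar measure on SO(d)) via QR."""
    rng = np.random.default_rng() if rng is None else rng
    A = rng.standard_normal((d, d))
    Q, R = np.linalg.qr(A)
    Q = Q * np.sign(np.diag(R))   # sign fix so Q is Haar-distributed on O(d)
    if np.linalg.det(Q) < 0:      # restrict from O(d) to SO(d)
        Q[:, 0] = -Q[:, 0]
    return Q

# The uniformly rotated Mondrian features of data X are then the ordinary
# (axis-aligned) Mondrian features computed on the rotated points X @ Q.T.
```

Averaging over the random rotation is what removes the dependence of the approximated kernel on the standard coordinate axes.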
Poster
Ojash Neopane · Aaditya Ramdas · Aarti Singh
[ Hall A-E ]
Abstract
Estimation of the Average Treatment Effect (ATE) is a core problem in causal inference with strong connections to Off-Policy Evaluation in Reinforcement Learning. This paper considers the problem of adaptively selecting the treatment allocation probability in order to improve estimation of the ATE. The majority of prior work on adaptive ATE estimation focuses on asymptotic guarantees, and in turn overlooks important practical considerations such as the difficulty of learning the optimal treatment allocation as well as hyper-parameter selection. Existing non-asymptotic methods are limited by poor empirical performance and exponential dependence on problem parameters. In order to address these gaps, we propose and analyze the Clipped Second Moment Tracking (ClipSMT) algorithm, a variant of an existing algorithm with strong asymptotic optimality guarantees, and provide finite sample bounds on its Neyman regret. Our analysis shows that, in the superpopulation setting, ClipSMT achieves exponential improvements in Neyman regret on two fronts: improving the dependence on $T$ from $O(\sqrt{T})$ to $O(\log T)$, as well as reducing the exponential dependence on problem parameters to a polynomial dependence---although the setting we consider is slightly less general. We conclude with simulations which show the marked improvement of ClipSMT over existing approaches.
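The idea of tracking second moments with a clipped allocation can be sketched as follows. This is an illustrative plug-in Neyman allocation with an assumed decaying clip schedule, not the paper's exact rule or constants.

```python
import numpy as np

def clipped_neyman_allocation(second_moments, t, clip0=0.1):
    """Illustrative clipped plug-in Neyman allocation.

    second_moments: length-2 array of estimated second moments of the
    control and treatment outcomes; t: current round. The plug-in
    allocation s1 / (s0 + s1) is clipped away from {0, 1} by a schedule
    that decays with t, so both arms keep being sampled while the
    estimates converge.
    """
    s0, s1 = np.sqrt(np.maximum(second_moments, 1e-12))
    pi = s1 / (s0 + s1)
    clip = clip0 / np.sqrt(t + 1)   # assumed decaying clip schedule
    return float(np.clip(pi, clip, 1 - clip))
```

The clipping is what tames the exponential dependence on problem parameters: without it, a badly initialized allocation can starve one arm of samples.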
Poster
Edmund Lau · Zach Furman · George Wang · Daniel Murfet · Susan Wei
[ Hall A-E ]
Abstract
The Local Learning Coefficient (LLC) is introduced as a novel complexity measure for deep neural networks (DNNs). Recognizing the limitations of traditional complexity measures, the LLC leverages Singular Learning Theory (SLT), which has long recognized the significance of singularities in the loss landscape geometry. This paper provides an extensive exploration of the LLC's theoretical underpinnings, offering both a clear definition and intuitive insights into its application. Moreover, we propose a new scalable estimator for the LLC, which is then effectively applied across diverse architectures including deep linear networks up to 100M parameters, ResNet image models, and transformer language models. Empirical evidence suggests that the LLC provides valuable insights into how training heuristics might influence the effective complexity of DNNs. Ultimately, the LLC emerges as a crucial tool for reconciling the apparent contradiction between deep learning's complexity and the principle of parsimony.
Poster
Alejandro de la Concha · Nicolas Vayatis · Argyris Kalogeratos
[ Hall A-E ]
Abstract
The multiple two-sample testing problem in a graph-structured setting is a common scenario in fields such as Spatial Statistics and Neuroscience. Each node $v$ in a fixed graph deals with a two-sample testing problem between two node-specific probability density functions, $p_v$ and $q_v$. The goal is to identify nodes where the null hypothesis $p_v = q_v$ should be rejected, under the assumption that connected nodes would yield similar test outcomes. We propose the non-parametric collaborative two-sample testing (CTST) framework that efficiently leverages the graph structure and minimizes the assumptions over $p_v$ and $q_v$. CTST integrates elements from f-divergence estimation, Kernel Methods, and Multitask Learning. We use synthetic experiments and a real sensor network detecting seismic activity to demonstrate that CTST outperforms state-of-the-art non-parametric statistical tests that apply at each node independently and hence disregard the geometry of the problem.
Poster
Arnab Bhattacharyya · Constantinos Daskalakis · Themistoklis Gouleakis · Yuhao Wang
[ Hall A-E ]
Abstract
We provide efficient algorithms for the problem of distribution learning from high-dimensional Gaussian data where in each sample, some of the variable values are missing. We suppose that the variables are {\em missing not at random (MNAR)}. The missingness model, denoted by $\mathbb{S}(\mathbf{y})$, is the function that maps any point $\mathbf{y}\in \mathbb{R}^d$ to the subset of its coordinates that are seen. In this work, we assume that it is known. We study the following two settings:
- [**Self-censoring**] An observation $\mathbf{x}$ is generated by first sampling the true value $\mathbf{y}$ from a $d$-dimensional Gaussian $\mathcal{N}(\mathbf{\mu}^*, \Sigma^*)$ with unknown $\mathbf{\mu}^*$ and $\Sigma^*$. For each coordinate $i$, there exists a set $S_i\subseteq \mathbb{R}$ such that $x_i=y_i$ if and only if $y_i\in S_i$. Otherwise, $x_i$ is missing and takes a generic value (e.g., ``?''). We design an algorithm that learns $\mathcal{N}(\mathbf{\mu}^*, \Sigma^*)$ up to TV distance $\varepsilon$, using $\textup{poly}(d, 1/\varepsilon)$ samples, assuming only that each pair of coordinates is observed with sufficiently high probability.
- [**Linear thresholding**] An observation $\mathbf{x}$ is generated by first sampling $\mathbf{y}$ from a $d$-dimensional Gaussian $\mathcal{N}(\mathbf{\mu}^*, \Sigma)$ with unknown $\mathbf{\mu}^*$ and known $\Sigma$, and then applying the missingness model $\mathbb{S}$ where $\mathbb{S}(\mathbf{y}) = \{i \in [d]: \mathbf{v}_i^T \mathbf{y} \leq b_i\}$ …
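The self-censoring mechanism is easy to simulate; the sketch below is a minimal illustration of the data-generating model as described in the abstract (the function name and the use of predicates to encode the sets $S_i$ are our choices):

```python
def self_censor(y, observed_in):
    """Sketch of the self-censoring missingness model: coordinate i of the
    true sample y is observed iff y[i] lies in the known set S_i, encoded
    here as a predicate; otherwise the entry takes the generic value "?"."""
    return [yi if observed_in[i](yi) else "?" for i, yi in enumerate(y)]
```

For example, with $S_1 = S_2 = (0, \infty)$, a draw $\mathbf{y} = (0.5, -2.0)$ is observed as $(0.5, \text{?})$; the learner must recover the Gaussian despite this value-dependent censoring.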
Poster
Tomoya Murata · Atsushi Nitanda · Taiji Suzuki
[ Hall A-E ]
Abstract
This study extends the problem setting of Invariant Risk Minimization (IRM) for Out-of-Distribution generalization to settings with unknown clustered environments. In this scenario, where a given set of environments exhibits an unknown clustered structure, our objective is to identify a single invariant feature extractor and per-cluster regressors (or classifiers) built on top of the feature extractor. To achieve this, we propose a new framework called Clustered IRM for simultaneously identifying the cluster structure and the invariant features. Our theoretical analysis demonstrates that the required number of training environments for such identification is only $O(d_\mathrm{sp} + K^2)$, where $d_\mathrm{sp}$ represents the dimensionality of the spurious features, and $K$ is the number of clusters. Numerical experiments validate the effectiveness of our proposed framework.
Poster
Diego Martinez-Taboada · Aaditya Ramdas
[ Hall A-E ]
Abstract
We present a sequential version of the kernelized Stein discrepancy goodness-of-fit test, which allows for conducting goodness-of-fit tests for unnormalized densities that are continuously monitored and adaptively stopped. That is, the sample size need not be fixed prior to data collection; the practitioner can choose whether to stop the test or continue to gather evidence at any time while controlling the false discovery rate. In stark contrast to related literature, we do not impose uniform boundedness on the Stein kernel. Instead, we exploit the potential boundedness of the Stein kernel at arbitrary point evaluations to define test martingales that give way to the subsequent novel sequential tests. We prove the validity of the test, as well as an asymptotic lower bound for the logarithmic growth of the wealth process under the alternative. We further illustrate the empirical performance of the test with a variety of distributions, including restricted Boltzmann machines.
Poster
Jonathan Geuter · Clément Bonet · Anna Korba · David Alvarez-Melis
[ Hall A-E ]
Abstract
Deep Equilibrium Models (DEQs) are a class of implicit neural networks that solve for a fixed point of a neural network in their forward pass. Traditionally, DEQs take sequences as inputs, but have since been applied to a variety of data. In this work, we present Distributional Deep Equilibrium Models (DDEQs), extending DEQs to discrete measure inputs, such as sets or point clouds. We provide a theoretically grounded framework for DDEQs. Leveraging Wasserstein gradient flows, we show how the forward pass of the DEQ can be adapted to find fixed points of discrete measures under permutation-invariance, and derive adequate network architectures for DDEQs. In experiments, we show that they can compete with state-of-the-art models in tasks such as point cloud classification and point cloud completion, while being significantly more parameter-efficient.
Poster
Junyu Cao · Ruijiang Gao · Esmaeil Keyvanshokooh
[ Hall A-E ]
Abstract
Human doctors frequently recommend actionable recourses that allow patients to modify their conditions to access more effective treatments. Inspired by such healthcare scenarios, we propose the Recourse Linear UCB (\textsf{RLinUCB}) algorithm, which optimizes both action selection and feature modifications by balancing exploration and exploitation. We further extend this to the Human-AI Linear Recourse Bandit (\textsf{HR-Bandit}), which integrates human expertise to enhance performance. \textsf{HR-Bandit} offers three key guarantees: (i) a warm-start guarantee for improved initial performance, (ii) a human-effort guarantee to minimize required human interactions, and (iii) a robustness guarantee that ensures sublinear regret even when human decisions are suboptimal. Empirical results, including a healthcare case study, validate its superior performance against existing benchmarks.
Poster
Michael Ito · Jiong Zhu · Dexiong Chen · Danai Koutra · Jenna Wiens
[ Hall A-E ]
Abstract
In this work, we theoretically demonstrate that current graph positional encodings (PEs) are not beneficial and could potentially hurt performance in tasks involving heterophilous graphs, where nodes that are close tend to have different labels. This limitation is critical as many real-world networks exhibit heterophily, and even highly homophilous graphs can contain local regions of strong heterophily. To address this limitation, we propose Learnable Laplacian Positional Encodings (LLPE), a new PE that leverages the full spectrum of the graph Laplacian, enabling them to capture graph structure on both homophilous and heterophilous graphs. Theoretically, we prove LLPE's ability to approximate a general class of graph distances and demonstrate its generalization properties. Empirically, our evaluation on 12 benchmarks demonstrates that LLPE improves accuracy across a variety of GNNs, including graph transformers, by up to 35% and 14% on synthetic and real-world graphs, respectively. Going forward, our work represents a significant step towards developing PEs that effectively capture complex structures in heterophilous graphs.
Poster
Han Bao · Shinsaku Sakaue
[ Hall A-E ]
Abstract
Inverse optimization aims to recover the unknown state in forward optimization after observing a state-outcome pair. This is relevant when we want to identify the underlying state of a system or to design a system with desirable outcomes. Whereas inverse optimization has been investigated from an algorithmic perspective over the past two decades, its formulation is intimately tied to the principal's subjective choice of a desirable state---indeed, this is crucial to make the inverse problem well-posed. We go beyond conventional inverse optimization by building upon prediction markets, where multiple agents submit their beliefs until converging to market equilibria. The market equilibria express the crowd consensus on a desirable state, effectively eschewing the subjective design. To this end, we derive a proper scoring rule for prediction market design in the context of inverse optimization.
Poster
Matthijs Ebbens · Nicole Funk · Jan Höckendorff · Christian Sohler · Vera Weil
[ Hall A-E ]
Abstract
We study the $k$-center problem in the context of individual fairness. Let $P$ be a set of $n$ points in a metric space and $r_x$ be the distance between $x \in P$ and its $\lceil n/k \rceil$-th nearest neighbor. The problem asks to optimize the $k$-center objective under the constraint that, for every point $x$, there is a center within distance $r_x$. We give bicriteria $(\beta,\gamma)$-approximation algorithms that compute clusterings such that every point $x \in P$ has a center within distance $\beta r_x$ and the clustering cost is at most $\gamma$ times the optimal cost. Our main contributions are a deterministic $O(n^2+ kn \log n)$ time $(2,2)$-approximation algorithm and a randomized $O(nk\log(n/\delta)+k^2/\varepsilon)$ time $(10,2+\varepsilon)$-approximation algorithm, where $\delta$ denotes the failure probability. For the latter, we develop a randomized sampling procedure to compute constant factor approximations for the values $r_x$ for all $x\in P$ in subquadratic time; we believe this procedure to be of independent interest within the context of individual fairness.
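The fairness radii $r_x$ have a direct (quadratic-time) definition, which the paper's sampling procedure approximates in subquadratic time. A minimal sketch of the exact computation, assuming neighbors exclude the point itself (conventions may differ):

```python
import math

def fairness_radii(points, k, dist):
    """Compute r_x for each x in points: the distance from x to its
    ceil(n/k)-th nearest neighbor, under an arbitrary metric dist."""
    n = len(points)
    m = math.ceil(n / k)
    radii = []
    for i, x in enumerate(points):
        # Sort distances to all other points; the (m-1)-index entry is the
        # m-th nearest neighbor distance.
        ds = sorted(dist(x, y) for j, y in enumerate(points) if j != i)
        radii.append(ds[m - 1])
    return radii
```

For six points on a line with $k = 2$ (so $m = 3$), the endpoint $0$ gets $r_x = 3$ while the interior point $2$ gets $r_x = 2$: interior points have closer "fair" neighborhoods, so they demand nearer centers.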
Poster
Jake Fawkes · Lucile Ter-Minassian · Desi Ivanova · Uri Shalit · Chris Holmes
[ Hall A-E ]
Abstract
Merging datasets across institutions is a lengthy and costly procedure, especially when it involves private information. Data hosts may therefore want to prospectively gauge which datasets are most beneficial to merge with, without revealing sensitive information. For causal estimation this is particularly challenging as the value of a merge depends not only on reduction in epistemic uncertainty but also on improvement in overlap. To address this challenge, we introduce the first \emph{cryptographically secure} information-theoretic approach for quantifying the value of a merge in the context of heterogeneous treatment effect estimation. We do this by evaluating the \emph{Expected Information Gain} (EIG) using multi-party computation to ensure that no raw data is revealed. We further demonstrate that our approach can be combined with differential privacy (DP) to meet arbitrary privacy requirements whilst preserving more accurate computation compared to DP alone. To the best of our knowledge, this work presents the first privacy-preserving method for dataset acquisition tailored to causal estimation. Code is publicly available: \url{https://github.com/LucileTerminassian/causal_prospective_merge}.
Poster
James Odgers · Ruby Sedgwick · Chrysoula Kappatou · Ruth Misener · Sarah Filippi
[ Hall A-E ]
Abstract
This work develops a Bayesian non-parametric approach to signal separation where the signals may vary according to latent variables. Our key contribution is to augment Gaussian Process Latent Variable Models (GPLVMs) for the case where each data point comprises the weighted sum of a known number of pure component signals observed across several input locations. Our framework allows arbitrary non-linear variations in the signals while being able to incorporate useful priors for the linear weights, such as including a sum-to-one constraint. Our contributions are particularly relevant to spectroscopy, where changing conditions may cause the underlying pure component signals to vary from sample to sample. To demonstrate the uses of our model we compare our method to alternatives on several simulated and real datasets.
Poster
Riccardo Poiani · Marc Jourdan · Emilie Kaufmann · Rémy Degenne
[ Hall A-E ]
Abstract
We study the fixed-confidence best-arm identification problem in unimodal bandits, in which the means of the arms increase with the index of the arm up to their maximum, then decrease. We derive two lower bounds on the stopping time of any algorithm. The instance-dependent lower bound suggests that due to the unimodal structure, only three arms contribute to the leading confidence-dependent cost. However, a worst-case lower bound shows that a linear dependence on the number of arms is unavoidable in the confidence-independent cost. We propose modifications of Track-and-Stop and a Top Two algorithm that leverage the unimodal structure. Both versions of Track-and-Stop are asymptotically optimal for one-parameter exponential families. The Top Two algorithm is asymptotically near-optimal for Gaussian distributions and we prove a non-asymptotic guarantee matching the worst-case lower bound. The algorithms can be implemented efficiently and we demonstrate their competitive empirical performance.
Poster
Tianyu Chen · Vansh Bansal · James Scott
[ Hall A-E ]
Abstract
Neural posterior estimation (NPE), a simulation-based computational approach for Bayesian inference, has shown great success in approximating complex posterior distributions. Existing NPE methods typically rely on normalizing flows, which approximate a distribution by composing many simple, invertible transformations. But flow-based models, while state of the art for NPE, are known to suffer from several limitations, including training instability and sharp trade-offs between representational power and computational cost. In this work, we demonstrate the effectiveness of conditional diffusions coupled with high-capacity summary networks for amortized NPE. Conditional diffusions address many of the challenges faced by flow-based methods. Our results show that, across a highly varied suite of benchmarking problems for NPE architectures, diffusions offer improved stability, superior accuracy, and faster training times, even with simpler, shallower models. Building on prior work on diffusions for NPE, we show that these gains persist across a variety of different summary network architectures. Code is available at https://github.com/TianyuCodings/cDiff.
Poster
Amartya Banerjee · Lee · Nir Sharon · Caroline Moosmüller
[ Hall A-E ]
Abstract
Capturing data from dynamic processes through cross-sectional measurements is seen in many fields, such as computational biology. Trajectory inference deals with the challenge of reconstructing continuous processes from such observations. In this work, we propose methods for B-spline approximation and interpolation of point clouds through consecutive averaging that is intrinsic to the Wasserstein space. Combining subdivision schemes with optimal transport-based geodesics, our methods carry out trajectory inference at a chosen level of precision and smoothness, and can automatically handle scenarios where particles undergo division over time. We prove linear convergence rates and rigorously evaluate our method on cell data characterized by bifurcations, merges, and trajectory splitting scenarios like *supercells*, comparing its performance against state-of-the-art trajectory inference and interpolation methods. The results not only underscore the effectiveness of our method in inferring trajectories but also highlight the benefit of performing interpolation and approximation that respect the inherent geometric properties of the data.
Poster
Masanari Kimura · Howard Bondell
[ Hall A-E ]
Abstract
The density ratio of two probability distributions is one of the fundamental tools in mathematical and computational statistics and machine learning, and it has a variety of known applications. Therefore, density ratio estimation from finite samples is a very important task, but it is known to be unstable when the distributions are distant from each other. One approach to address this problem is density ratio estimation using incremental mixtures of the two distributions. We geometrically reinterpret existing methods for density ratio estimation based on incremental mixtures. We show that these methods can be regarded as iterating on the Riemannian manifold along a particular curve between the two probability distributions. Making use of the geometry of the manifold, we propose to consider incremental density ratio estimation along generalized geodesics on this manifold. To achieve such a method requires Monte Carlo sampling along geodesics via transformations of the two distributions. We show how to implement an iterative algorithm to sample along these geodesics and show how changing the distances along the geodesic affects the variance and accuracy of the estimation of the density ratio.
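The telescoping identity behind incremental-mixture density ratio estimation can be verified directly. The sketch below only illustrates the identity along the mixture-geometry curve $m_\alpha = \alpha p + (1-\alpha) q$; in the actual methods each factor is *estimated* from samples, which is easier because adjacent mixtures are close (the function name and grid construction here are our own):

```python
def telescoped_ratio(p, q, x, alphas):
    """p(x)/q(x) as a telescoping product of ratios of successive mixture
    densities m_a(x) = a*p(x) + (1-a)*q(x), with m_0 = q and m_1 = p."""
    grid = [0.0] + sorted(alphas) + [1.0]
    ratio = 1.0
    for a0, a1 in zip(grid, grid[1:]):
        m0 = a0 * p(x) + (1 - a0) * q(x)
        m1 = a1 * p(x) + (1 - a1) * q(x)
        ratio *= m1 / m0  # each factor is a ratio of two nearby densities
    return ratio
```

Because each intermediate $m_{\alpha}$ sits between $p$ and $q$, every factor is a ratio of two close densities, which is exactly why stepping along a curve between the distributions stabilizes the estimate; the paper generalizes the choice of curve to geodesics on the manifold.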
Poster
Yasunari Hikima · Ken Kobayashi · Akinori Tanaka · Akiyoshi Sannai · Naoki Hamada
[ Hall A-E ]
Abstract
Multi-objective optimization aims to find a set of solutions that achieve the best trade-off among multiple conflicting objective functions. While various multi-objective optimization algorithms have been proposed so far, most of them aim to find finite solutions as an approximation of the Pareto set, which may not adequately capture the entire structure of the Pareto set, especially when the number of variables is large. To overcome this limitation, we propose a method to obtain a parametric hypersurface representing the entire Pareto set instead of a finite set of points. Since the Pareto set of an $M$-objective optimization problem typically forms an $(M-1)$-dimensional simplex, we use a Bézier simplex as a model to express the Pareto set. We then develop a stochastic gradient descent-based algorithm that updates the Bézier simplex model toward the Pareto set, introducing a preconditioning matrix to enhance convergence. Our convergence analysis demonstrated that the proposed algorithm outperforms naive stochastic gradient descent in terms of convergence rate. Furthermore, we validate the effectiveness of our method through various multi-objective optimization problem instances, including real-world problems.
Poster
Jiaqi Han · Mingjian Jiang · Yuxuan Song · Stefano Ermon · Minkai Xu
[ Hall A-E ]
Abstract
Preference optimization has made significant progress recently, with numerous methods developed to align language models with human preferences. This paper introduces $f$-divergence Preference Optimization ($f$-PO), a novel framework that generalizes and extends existing approaches. $f$-PO minimizes $f$-divergences between the optimized policy and the optimal policy, encompassing a broad family of alignment methods using various divergences. Our approach unifies previous algorithms like DPO and EXO, while offering new variants through different choices of $f$-divergences. We provide theoretical analysis of $f$-PO's properties and conduct extensive experiments on state-of-the-art language models using benchmark datasets. Results demonstrate $f$-PO's effectiveness across various tasks, achieving superior performance compared to existing methods on popular benchmarks such as AlpacaEval 2, Arena-Hard, MT-Bench, and Open LLM Leaderboard v2. Additionally, we present ablation studies exploring the impact of different $f$-divergences, offering insights into the trade-offs between regularization and performance in offline preference optimization. Our work contributes both practical algorithms and theoretical understanding to the field of language model alignment. Code is available at https://github.com/MinkaiXu/fPO.
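The unifying object in the abstract above is the $f$-divergence itself; different generators $f$ recover different alignment objectives. A minimal discrete sketch (the generator names and this toy setting are ours, not the paper's implementation):

```python
import math

def f_divergence(p, q, f):
    """Discrete f-divergence D_f(p || q) = sum_x q(x) * f(p(x) / q(x)),
    for a convex generator f with f(1) = 0."""
    return sum(qi * f(pi / qi) for pi, qi in zip(p, q) if qi > 0)

# Two common generators: forward KL and reverse KL.
kl_gen = lambda t: t * math.log(t) if t > 0 else 0.0
rkl_gen = lambda t: -math.log(t)
```

With $f(t) = t \log t$ this reduces to $\mathrm{KL}(p \,\|\, q)$, the divergence implicitly minimized by DPO-style objectives; swapping in other generators yields the new variants the framework covers.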
Poster
Zexuan Sun · Garvesh Raskutti
[ Hall A-E ]
Abstract
As opaque black-box predictive models such as neural networks become more prevalent, the need to develop interpretations for these models is of great interest. The concept of $\textit{variable importance}$ is an interpretability measure that applies to any predictive model and assesses how much a variable or set of variables improves prediction performance. When the number of variables is large, estimating variable importance presents a significant challenge because re-training neural networks or other black-box algorithms requires significant additional computation. In this paper, we address this challenge for algorithms using gradient descent and gradient boosting (e.g. neural networks, gradient-boosted decision trees). By using the ideas of early stopping of gradient-based methods in combination with warm-start using the $\textit{dropout}$ method, we develop a scalable method to estimate variable importance for any algorithm that can be expressed as an $\textit{iterative kernel update equation}$. Importantly, we provide theoretical guarantees by using the theory for early stopping of kernel-based methods for neural networks with sufficiently large width and gradient-boosted decision trees that use symmetric trees as weak learners. We also demonstrate the efficacy of our methods through simulations and a real data example which illustrates the computational benefit of early stopping rather than fully re-training …
Poster
Tao Wen · Zihan Wang · Quan Zhang · Qi Lei
[ Hall A-E ]
Abstract
Deep learning models can suffer from severe performance degradation when relying on spurious correlations between input features and labels, performing well on training data but with poor prediction accuracy for minority groups. This problem arises especially when training data are limited or imbalanced. While most prior work focuses on learning invariant features (with consistent correlations to the label $y$), it overlooks the potential harm of spurious correlations between features. We hereby propose Elastic Representation (ElRep) to learn features by imposing Nuclear- and Frobenius-norm penalties on the representation from the last layer of a neural network. Similar to the elastic net, ElRep enjoys the benefits of learning important features without losing feature diversity. The proposed method is simple yet effective. It can be integrated into many deep learning approaches to mitigate spurious correlations and improve group robustness. Moreover, we theoretically show that ElRep has minimum negative impacts on in-distribution predictions. This is a remarkable advantage over approaches that prioritize minority groups at the cost of overall performance.
Poster
Sachindra Pasan Dissanayake · Faisal Hamman · Barproda Halder · Ilia Sucholutsky · Qiuyi Zhang · Sanghamitra Dutta
[ Hall A-E ]
Abstract
Knowledge distillation deploys complex machine learning models in resource-constrained environments by training a smaller student model to emulate internal representations of a complex teacher model. However, the teacher's representations can also encode nuisance or additional information not relevant to the downstream task. Distilling such irrelevant information can actually impede the performance of a capacity-limited student model. This observation motivates our primary question: What are the information-theoretic limits of knowledge distillation? To this end, we leverage Partial Information Decomposition to quantify and explain the transferred knowledge and knowledge left to distill for a downstream task. We theoretically demonstrate that the task-relevant transferred knowledge is succinctly captured by the measure of redundant information about the task between the teacher and student. We propose a novel multi-level optimization to incorporate redundant information as a regularizer, leading to our framework of Redundant Information Distillation (RID). RID leads to more resilient and effective distillation under nuisance teachers as it succinctly quantifies task-relevant knowledge rather than simply aligning student and teacher representations.
Poster
Kihyuk (Ki) Hong · Woojin Chae · Yufan Zhang · Dabeen Lee · Ambuj Tewari
[ Hall A-E ]
Abstract
We study the problem of infinite-horizon average-reward reinforcement learning with linear Markov decision processes (MDPs). The associated Bellman operator of the problem not being a contraction makes the algorithm design challenging. Previous approaches either suffer from computational inefficiency or require strong assumptions on dynamics, such as ergodicity, for achieving a regret bound of $\widetilde{\mathcal{O}}(\sqrt{T})$. In this paper, we propose the first algorithm that achieves $\widetilde{\mathcal{O}}(\sqrt{T})$ regret with computational complexity polynomial in the problem parameters, without making strong assumptions on dynamics. Our approach approximates the average-reward setting by a discounted MDP with a carefully chosen discounting factor, and then applies an optimistic value iteration. We propose an algorithmic structure that plans for a nonstationary policy through optimistic value iteration and follows that policy until a specified information metric in the collected data doubles. Additionally, we introduce a value function clipping procedure for limiting the span of the value function for sample efficiency.
Poster
Zheyang Shen · Jeremias Knoblauch · Sam Power · Chris Oates
[ Hall A-E ]
Abstract
Deterministic mathematical models, such as those specified via differential equations, are a powerful tool to communicate scientific insight. However, such models are necessarily simplified descriptions of the real world. Generalised Bayesian methodologies have been proposed for inference with misspecified models, but these are typically associated with vanishing parameter uncertainty as more data are observed. In the context of a misspecified deterministic mathematical model, this has the undesirable consequence that posterior predictions become deterministic and certain, while being incorrect. Taking this observation as a starting point, we propose *Prediction-Centric Uncertainty Quantification*, where a mixture distribution based on the deterministic model confers improved uncertainty quantification in the predictive context. Computation of the mixing distribution is cast as a (regularised) gradient flow of the maximum mean discrepancy (MMD), enabling consistent numerical approximations to be obtained. Results are reported on both a toy model from population ecology and a real model of protein signalling in cell biology.
Poster
Fengxue Zhang · Thomas Desautels · Yuxin Chen
[ Hall A-E ]
Abstract
Multi-fidelity Bayesian optimization (MFBO) is a powerful approach that utilizes low-fidelity, cost-effective sources to expedite the exploration and exploitation of a high-fidelity objective function. Existing MFBO methods with theoretical foundations either lack justification for performance improvements over single-fidelity optimization or rely on strong assumptions about the relationships between fidelity sources to construct surrogate models and direct queries to low-fidelity sources. To mitigate the dependency on cross-fidelity assumptions while maintaining the advantages of low-fidelity queries, we introduce a random sampling and partition-based MFBO framework with deep kernel learning. This framework is robust to cross-fidelity model misspecification and explicitly illustrates the benefits of low-fidelity queries. Our results demonstrate that the proposed algorithm effectively manages complex cross-fidelity relationships and efficiently optimizes the target fidelity function.
Poster
Lucas Kook
[ Hall A-E ]
Abstract
Discovering causal relationships from observational data is a fundamental yet challenging task. Invariant causal prediction (ICP, Peters, Bühlmann, and Meinshausen) is a method for causal feature selection which requires data from heterogeneous settings and exploits that causal models are invariant. ICP has been extended to general additive noise models and to nonparametric settings using conditional independence tests. However, the latter often suffer from low power (or poor Type I error control) and additive noise models are not suitable for applications in which the response is not measured on a continuous scale, but reflects categories or counts. Here, we develop transformation-model (tram) based ICP, allowing for continuous, categorical, count-type, and uninformatively censored responses (these model classes, generally, do not allow for identifiability when there is no exogenous heterogeneity). As an invariance test, we propose tram-GCM based on the expected conditional covariance between environments and score residuals with uniform asymptotic level guarantees. For the special case of linear shift trams, we also consider tram-Wald, which tests invariance based on the Wald statistic. We provide an open-source 𝖱 package tramicp and evaluate our approach on simulated data and in a case study investigating causal features of survival in critically ill patients. Supplementary materials …
Poster
Yannick Eich · Bastian Alt · Heinz Koeppl
[ Hall A-E ]
Abstract
We propose a novel, tractable latent state inference scheme for Markov jump processes, for which exact inference is often intractable. Our approach is based on an entropic matching framework that can be embedded into the well-known expectation propagation algorithm. We demonstrate the effectiveness of our method by providing closed-form results for a simple family of approximate distributions and apply it to the general class of chemical reaction networks, which are a crucial tool for modeling in systems biology. Moreover, we derive closed-form expressions for point estimation of the underlying parameters using an approximate expectation maximization procedure. We evaluate our method across various chemical reaction networks and compare it to multiple baseline approaches, demonstrating superior performance in approximating the mean of the posterior process. Finally, we discuss the limitations of our method and potential avenues for future improvement, highlighting its promising direction for addressing complex continuous-time Bayesian inference problems.
Poster
Xiaoxue Han · Huzefa Rangwala · Yue Ning
[ Hall A-E ]
Abstract
Graph Neural Networks (GNNs) are susceptible to distribution shifts, creating vulnerability and security issues in critical domains. There is a pressing need to enhance the generalizability of GNNs on out-of-distribution (OOD) test data. Existing methods that target learning an invariant (feature, structure)-label mapping often depend on oversimplified assumptions about the data generation process, which do not adequately reflect the actual dynamics of distribution shifts in graphs. In this paper, we introduce a more realistic graph data generation model using Structural Causal Models (SCMs), allowing us to redefine distribution shifts by pinpointing their origins within the generation process. Building on this, we propose a causal decoupling framework, DeCaf, that independently learns unbiased feature-label and structure-label mappings. We provide a detailed theoretical framework that shows how our approach can effectively mitigate the impact of various distribution shifts. We evaluate DeCaf across both real-world and synthetic datasets that demonstrate different patterns of shifts, confirming its efficacy in enhancing the generalizability of GNNs. Our code is available at: https://github.com/hanxiaoxue114/DeCaf-GraphOOD.
Poster
Ha Manh Bui · Enrique Mallada · Anqi Liu
[ Hall A-E ]
Abstract
By leveraging the representation power of deep neural networks, neural upper confidence bound (UCB) algorithms have shown success in contextual bandits. To further balance the exploration and exploitation, we propose Neural-$\sigma^2$-LinearUCB, a variance-aware algorithm that utilizes $\sigma^2_t$, i.e., an upper bound of the reward noise variance at round $t$, to enhance the uncertainty quantification quality of the UCB, resulting in a regret performance improvement. We provide an oracle version for our algorithm characterized by an oracle variance upper bound $\sigma^2_t$ and a practical version with a novel estimation for this variance bound. Theoretically, we provide rigorous regret analysis for both versions and prove that our oracle algorithm achieves a better regret guarantee than other neural-UCB algorithms in the neural contextual bandits setting. Empirically, our practical method enjoys a similar computational efficiency, while outperforming state-of-the-art techniques by having a better calibration and lower regret across multiple standard settings, including on the synthetic, UCI, MNIST, and CIFAR-10 datasets.
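The core mechanism of a variance-aware linear UCB update can be sketched with a weighted ridge regression; this is a hypothetical simplification of the paper's estimator (function names, the weighting by $1/\sigma_t^2$, and the bonus form are our assumptions, and the neural feature map is omitted):

```python
import numpy as np

def vw_update(A, b, x, r, sigma2):
    """One variance-weighted ridge update: an observation (x, r) with noise
    variance sigma2 is down-weighted by 1/sigma2 in both the design matrix
    A and the response vector b."""
    w = 1.0 / sigma2
    A = A + w * np.outer(x, x)
    b = b + w * r * x
    return A, b

def ucb_score(A, b, x, beta=1.0):
    """Optimistic score: point estimate theta^T x plus an exploration bonus
    that shrinks in directions the data have already explored."""
    theta = np.linalg.solve(A, b)
    return float(theta @ x + beta * np.sqrt(x @ np.linalg.solve(A, x)))
```

Down-weighting noisy rounds tightens the confidence ellipsoid where observations are reliable, which is the source of the improved regret guarantee relative to variance-agnostic neural-UCB updates.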
Poster
Frederiek Wesel · Kim Batselier
[ Hall A-E ]
Abstract
In this paper we establish a new connection between Tensor Network (TN)-constrained kernel machines and Gaussian Processes (GPs). We prove that the outputs of Canonical Polyadic Decomposition (CPD) and Tensor Train (TT)-constrained kernel machines converge in the limit of large ranks to the same GP which we fully characterize, when specifying appropriate i.i.d. priors across their components. We show that TT-constrained models achieve faster convergence to the GP compared to their CPD counterparts for the same number of model parameters. The convergence to the GP occurs as the ranks tend to infinity, as opposed to the standard approach which introduces TNs as an additional constraint on the posterior. This implies that the newly established priors allow the models to learn features more freely as they necessitate infinitely more parameters to converge to a GP, which is characterized by a fixed learning representation and thus no feature learning. As a consequence, the newly derived priors yield more flexible models which can better fit the data, albeit at increased risk of overfitting. We demonstrate these considerations by means of two numerical experiments.
Poster
Shanyun Gao · Raghavendra Addanki · Tong Yu · Ryan Rossi · Murat Kocaoglu
[ Hall A-E ]
Abstract
Change point detection in time series aims to identify moments when the probability distribution of time series changes. It is widely applied in many areas, such as human activity sensing and medical science. In the context of multivariate time series, this typically involves examining the joint distribution of multiple variables: If the distribution of any one variable changes, the entire time series undergoes a distribution shift. However, in practical applications, we may be interested only in certain components of the time series, exploring abrupt changes in their distributions while accounting for the presence of other components. Here, assuming an underlying structural causal model that governs the time-series data generation, we address this task by proposing a two-stage non-parametric algorithm that first learns parts of the causal structure through constraint-based discovery methods, and then employs conditional relative Pearson divergence estimation to identify the change points. The conditional relative Pearson divergence quantifies the distribution difference between consecutive segments in the time series, while the causal discovery method allows a focus on the causal mechanism, facilitating access to independent and identically distributed (IID) samples. Theoretically, the typical assumption of samples being IID in conventional change point detection methods can be relaxed based on …
Poster
Fuqiang Liu · Sicong Jiang · Luis Miranda-Moreno · Seongjin Choi · Lijun Sun
[ Hall A-E ]
Abstract
Large Language Models (LLMs) have recently demonstrated significant potential in the field of time series forecasting, offering impressive capabilities in handling complex temporal data. However, their robustness and reliability in real-world applications remain under-explored, particularly concerning their susceptibility to adversarial attacks. In this paper, we introduce a targeted adversarial attack framework for LLM-based time series forecasting. By employing both gradient-free and black-box optimization methods, we generate minimal yet highly effective perturbations that significantly degrade the forecasting accuracy across multiple datasets and LLM architectures. Our experiments, which include models such as LLMTime with GPT-3.5, GPT-4, LLaMa, and Mistral, as well as TimeGPT and TimeLLM, show that adversarial attacks lead to much more severe performance degradation than random noise, and demonstrate the broad effectiveness of our attacks across different LLMs. The results underscore the critical vulnerabilities of LLMs in time series forecasting, highlighting the need for robust defense mechanisms to ensure their reliable deployment in practical applications. The code repository can be found at https://github.com/JohnsonJiang1996/AdvAttack_LLM4TS.
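The gradient-free, black-box setting can be illustrated with a plain random-search stand-in for the paper's optimizers (all names below are hypothetical): an attacker only needs query access to the forecaster.

```python
import random

def black_box_attack(series, forecast, horizon_truth, eps=0.05,
                     iters=200, seed=0):
    """Random-search attack sketch: probe small perturbations (at most
    eps per point) and keep whichever most increases the forecaster's
    squared error against held-out truth.  A crude stand-in for the
    paper's gradient-free / black-box optimizers."""
    rng = random.Random(seed)
    def err(xs):
        return sum((p - t) ** 2
                   for p, t in zip(forecast(xs), horizon_truth))
    best, best_err = list(series), err(series)
    for _ in range(iters):
        cand = [x + rng.uniform(-eps, eps) for x in series]
        e = err(cand)
        if e > best_err:
            best, best_err = cand, e
    return best, best_err
```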
Poster
Michael Lindon · Nathan Kallus
[ Hall A-E ]
Abstract
Motivated by monitoring the arrival of incoming adverse events such as customer support calls or crash events from users exposed to an experimental product change, we consider sequential hypothesis testing of continuous-time counting processes. Specifically, we provide a multivariate confidence process on the cumulative rates $(\Lambda^A_t, \Lambda^B_t)$ giving an anytime-valid coverage guarantee $\mathbb{P}[(\Lambda^A_t, \Lambda^B_t) \in C^\alpha_t \, \forall t >0] \geq 1-\alpha$. This provides a simultaneous confidence process on $\Lambda^A_t$, $\Lambda^B_t$, and their difference $\Lambda^B_t-\Lambda^A_t$, allowing each arm of the experiment and the difference between them to be safely monitored throughout the experiment. We extend our results by constructing a closed-form $e$-process for testing the equality of rates with a time-uniform Type-I error guarantee at a nominal $\alpha$. We characterize the asymptotic growth rate of the proposed $e$-process under the alternative and show that it has power 1 when the average rates of the two processes differ in the limit.
Poster
Tung Long Vuong
[ Hall A-E ]
Abstract
In recent years, deep discrete representation learning (DRL) has achieved significant success across various domains. Most DRL frameworks (e.g., the widely used VQ-VAE and its variants) have primarily focused on generative settings, where the quality of a representation is implicitly gauged by the fidelity of its generation. In fact, the goodness of a discrete representation remains ambiguously defined across the literature. In this work, we adopt a practical approach that examines DRL from a task-driven perspective. We propose a unified framework that explores the usefulness of discrete features in relation to downstream tasks, with generation naturally viewed as one possible application. In this context, the properties of discrete representations as well as the way they benefit certain tasks are also relatively understudied. We therefore provide an additional theoretical analysis of the trade-off between representational capacity and sample complexity, shedding light on how discrete representation utilization impacts task performance. Finally, we demonstrate the flexibility and effectiveness of our framework across diverse applications.
Poster
Chudi Zhong · Panyu Chen · Cynthia Rudin
[ Hall A-E ]
Abstract
Faithful explanations are essential for machine learning models in high-stakes applications. Inherently interpretable models are well-suited for these applications because they naturally provide faithful explanations by revealing their decision logic. However, model designers often need to keep these models proprietary to maintain their value. This creates a tension: we need models that are interpretable—allowing human decision-makers to understand and justify predictions, but not transparent, so that the model's decision boundary is not easily replicated by attackers. Shielding the model's decision boundary is particularly challenging alongside the requirement of completely faithful explanations, since such explanations reveal the true logic of the model for an entire subspace around each query point. This work provides an approach, FaithfulDefense, that creates model explanations for logical models that are completely faithful, yet reveal as little as possible about the decision boundary. FaithfulDefense is based on a maximum set cover formulation, and we provide multiple formulations for it, taking advantage of submodularity.
Poster
Jarren Briscoe · Garrett Kepler · Daryl DeFord · Assefaw Gebremedhin
[ Hall A-E ]
Abstract
Evaluating machine learning models is crucial not only for determining their technical accuracy but also for assessing their potential societal implications. While the potential for low-sample-size bias in algorithms is well known, we demonstrate the significance of sample-size bias induced by combinatorics in classification metrics. This revelation challenges the efficacy of these metrics in assessing bias with high resolution, especially when comparing groups of disparate sizes, which frequently arise in social applications. We provide analyses of the bias that appears in several commonly applied metrics and propose a model-agnostic assessment and correction technique. Additionally, we analyze counts of undefined cases in metric calculations, which can lead to misleading evaluations if improperly handled. This work illuminates the previously unrecognized challenge of combinatorics and probability in standard evaluation practices and thereby advances approaches for performing fair and trustworthy classification methods.
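A minimal combinatorial illustration of such sample-size bias (ours, not the paper's correction technique): the expected accuracy of whichever constant prediction happens to fit the sample better exceeds 1/2 at small n purely because of finite-sample fluctuations, and the excess shrinks as n grows.

```python
from math import comb

def expected_max_accuracy(n, p=0.5):
    """E[max(K, n-K)] / n for K ~ Binomial(n, p): the expected accuracy
    of the better of the two constant predictions on a sample of size n.
    Illustrates pure sample-size bias in a metric; an illustration only."""
    return sum(comb(n, k) * p**k * (1 - p)**(n - k) * max(k, n - k) / n
               for k in range(n + 1))
```

For n = 2 this evaluates to 0.75, far above the population value of 0.5, even though the "classifier" ignores the data entirely; the metric itself carries the bias at small sample sizes.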
Poster
Ashwin Renganathan · Kade Carlson
[ Hall A-E ]
Abstract
Classical evolutionary approaches for multiobjective optimization are quite accurate but incur a lot of queries to the objectives; this can be prohibitive when objectives are expensive oracles. A sample-efficient approach to solving multiobjective optimization is via Gaussian process (GP) surrogates and Bayesian optimization (BO). Multiobjective Bayesian optimization (MOBO) involves the construction of an acquisition function which is optimized to acquire new observation candidates sequentially. This ``inner'' optimization can be hard due to various reasons: acquisition functions being nonconvex, nondifferentiable and/or unavailable in analytical form; batch sampling usually exacerbates these problems and the success of MOBO heavily relies on this inner optimization. This, ultimately, affects their sample efficiency. To overcome these challenges, we propose a Thompson sampling (TS) based approach ($q\texttt{POTS}$). Whereas TS chooses candidates according to the probability that they are optimal, $q\texttt{POTS}$ chooses candidates according to the probability that they are Pareto optimal. Instead of a hard acquisition function optimization, $q\texttt{POTS}$ solves a cheap multiobjective optimization on the GP posteriors with evolutionary approaches. This way we get the best of both worlds: accuracy of evolutionary approaches and sample-efficiency of MOBO. New candidates are chosen on the posterior GP Pareto frontier according to a maximin distance criterion. $q\texttt{POTS}$ is endowed …
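The core selection rule, choosing candidates by the probability that they are Pareto optimal, can be sketched with a single posterior draw. In the sketch below, independent Gaussians stand in for the joint GP posterior and all function names are illustrative:

```python
import random

def pareto_mask(points):
    """True for points not dominated by any other point (maximization)."""
    mask = []
    for i, p in enumerate(points):
        dominated = any(
            all(q[k] >= p[k] for k in range(len(p))) and
            any(q[k] > p[k] for k in range(len(p)))
            for j, q in enumerate(points) if j != i)
        mask.append(not dominated)
    return mask

def ts_pareto_candidates(post_means, post_stds, rng=random):
    """One Thompson step: draw a sample for every candidate from its
    posterior (independent Gaussians here, unlike the true joint GP
    posterior) and keep the candidates that are Pareto-optimal in it."""
    sample = [tuple(rng.gauss(m, s) for m, s in zip(ms, ss))
              for ms, ss in zip(post_means, post_stds)]
    return [i for i, keep in enumerate(pareto_mask(sample)) if keep]
```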
Poster
David Burt · Yunyi Shen · Tamara Broderick
[ Hall A-E ]
Abstract
Spatial prediction tasks are key to weather forecasting, studying air pollution impacts, and other scientific endeavors. Determining how much to trust predictions made by statistical or physical methods is essential for the credibility of scientific conclusions. Unfortunately, classical approaches for validation fail to handle mismatch between locations available for validation and (test) locations where we want to make predictions. This mismatch is often not an instance of covariate shift (as commonly formalized) because the validation and test locations are fixed (e.g., on a grid or at select points) rather than i.i.d. from two distributions. In the present work, we formalize a check on validation methods: that they become arbitrarily accurate as validation data becomes arbitrarily dense. We show that classical and covariate-shift methods can fail this check. We propose a method that builds from existing ideas in the covariate-shift literature, but adapts them to the validation data at hand. We prove that our proposal passes our check. And we demonstrate its advantages empirically on simulated and real data.
Poster
Debmalya Mandal · Andi Nika · Parameswaran Kamalaruban · Adish Singla · Goran Radanovic
[ Hall A-E ]
Abstract
We study data corruption robustness for reinforcement learning with human feedback (RLHF) in an offline setting. Given an offline dataset of pairs of trajectories along with feedback about human preferences, an $\varepsilon$-fraction of the pairs is corrupted (e.g., feedback flipped or trajectory features manipulated), capturing an adversarial attack or noisy human preferences. We aim to design algorithms that identify a near-optimal policy from the corrupted data, with provable guarantees. Existing theoretical works have separately studied the settings of corruption robust RL (learning from scalar rewards directly under corruption) and offline RLHF (learning from human feedback without corruption); however, they are inapplicable to our problem of dealing with corrupted data in the offline RLHF setting. To this end, we design novel corruption robust offline RLHF methods under various assumptions on the coverage of the data-generating distributions. At a high level, our methodology robustifies an offline RLHF framework by first learning a reward model along with confidence sets and then learning a pessimistic optimal policy over the confidence set. Our key insight is that learning the optimal policy can be done by leveraging an offline corruption-robust RL oracle in different ways (e.g., zero-order oracle or first-order oracle), depending on the data coverage assumptions. To …
Poster
Hung-Hsu Chou · Johannes Maly · Claudio Mayrink Verdun · Bernardo da Costa · Heudson Mirandola
[ Hall A-E ]
Abstract
Over the past years, there has been significant interest in understanding the implicit bias of gradient descent optimization and its connection to the generalization properties of overparametrized neural networks. Several works observed that when training linear diagonal networks on the square loss for regression tasks (which corresponds to overparametrized linear regression) gradient descent converges to special solutions, e.g., non-negative ones. We connect this observation to Riemannian optimization and view overparametrized GD with identical initialization as a Riemannian GD. We use this fact for solving non-negative least squares (NNLS), an important problem behind many techniques, e.g., non-negative matrix factorization. We show that gradient flow on the reparametrized objective converges globally to NNLS solutions, providing convergence rates also for its discretized counterpart. Unlike previous methods, we do not rely on the calculation of exponential maps or geodesics. We further show accelerated convergence using a second-order ODE, lending itself to accelerated descent methods. Finally, we establish the stability against negative perturbations and discuss generalization to other constrained optimization problems.
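The reparametrization at the core of the argument is easy to state: writing x = u ⊙ u and running plain gradient descent on u keeps the iterate nonnegative by construction. A toy sketch of the idea only (small dense system, pure Python; the paper additionally gives rates and an accelerated second-order variant):

```python
def nnls_via_overparam(A, b, lr=1e-3, steps=20000, init=0.5):
    """Nonnegative least squares via the reparametrization x = u * u:
    plain gradient descent on u with identical positive initialization,
    so x stays nonnegative throughout.  Illustrative sketch only."""
    n, d = len(A), len(A[0])
    u = [init] * d
    for _ in range(steps):
        x = [ui * ui for ui in u]
        resid = [sum(A[i][j] * x[j] for j in range(d)) - b[i]
                 for i in range(n)]
        # gradient of 0.5 * ||A x - b||^2 in x, then chain rule to u
        grad_x = [sum(A[i][j] * resid[i] for i in range(n))
                  for j in range(d)]
        u = [ui - lr * 2.0 * ui * g for ui, g in zip(u, grad_x)]
    return [ui * ui for ui in u]
```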
Poster
James Thornton · Louis Béthune · Ruixiang ZHANG · Arwen Bradley · Preetum Nakkiran · Shuangfei Zhai
[ Hall A-E ]
Abstract
Diffusion models may be formulated as a time-indexed sequence of energy-based models, where the score corresponds to the negative gradient of an energy function. As opposed to learning the score directly, an energy parameterization is attractive as the energy itself can be used to control generation via Monte Carlo samplers. Architectural constraints and training instability in energy parameterized models have so far yielded inferior performance compared to directly approximating the score or denoiser. We address these deficiencies by introducing a novel training regime for the energy function through distillation of pre-trained diffusion models. We further showcase the synergies between energy and score by casting the diffusion sampling procedure as a Feynman-Kac model with energy-weighted potentials. This formalism enables composition and low temperature sampling through sequential Monte Carlo.
Poster
Wee Chaimanowong · Ying Zhu
[ Hall A-E ]
Abstract
Permutation invariance is among the most common symmetries that can be exploited to simplify complex problems in machine learning. There has been a tremendous surge of research activities in building permutation invariant machine learning architectures. However, less attention is given to: (1) how to statistically test for the assumption of permutation invariance of coordinates in a random vector where the dimension is allowed to grow with the sample size; (2) how to estimate permutation invariant density functions; (3) how much ``smaller'' is the class of smooth functions with permutation invariance compared to that without permutation invariance. In this paper, we take a step back and examine these fundamental questions. In particular, our testing method is based on a sorting trick, and our estimation method is based on an averaging trick. These tricks substantially simplify the exploitation of permutation invariance. We also analyze the metric entropy of permutation invariant function classes and compare them with their counterparts without imposing permutation invariance.
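Both tricks are one-liners in spirit. A sketch, with the obvious caveat that averaging over all d! permutations is only feasible for small dimension (which is precisely what the sorting trick sidesteps):

```python
import itertools
import statistics

def symmetrize(f, x):
    """Averaging trick: make any estimate f permutation-invariant by
    averaging it over all coordinate permutations of x.  Exact but
    O(d!), hence only workable for small dimension d."""
    return statistics.fmean(f(list(p)) for p in itertools.permutations(x))

def sort_canonicalize(x):
    """Sorting trick: every permutation of x maps to the same sorted
    representative, so permutation-invariant quantities can be computed
    on it in O(d log d) instead of averaging over d! permutations."""
    return sorted(x)
```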
Poster
Junkyu Lee · Tian Gao · Elliot Nelson · Miao Liu · Debarun Bhattacharjya · Songtao Lu
[ Hall A-E ]
Abstract
Many practical reinforcement learning environments have a discrete factored action space that induces a large combinatorial set of actions, thereby posing significant challenges. Existing approaches leverage the regular structure of the action space and resort to a linear decomposition of Q-functions, which avoids enumerating all combinations of factored actions. In this paper, we consider Q-functions defined over a lower dimensional projected subspace of the original action space, and study the condition for the unbiasedness of decomposed Q-functions using causal effect estimation from the no unobserved confounder setting in causal statistics. This leads to a general scheme which we call action decomposed reinforcement learning that uses the projected Q-functions to approximate the Q-function in standard model-free reinforcement learning algorithms. The proposed approach is shown to improve sample complexity in a model-based reinforcement learning setting. We demonstrate improvements in sample efficiency compared to state-of-the-art baselines in online continuous control environments and a real-world offline sepsis treatment environment.
Poster
Prathamesh Dharangutte · Jie Gao · Ruobin Gong · Guanyang Wang
[ Hall A-E ]
Abstract
This work proposes a class of differentially private mechanisms for linear queries, in particular range queries, that leverages correlated input perturbation to simultaneously achieve unbiasedness, consistency, statistical transparency, and control over utility requirements in terms of accuracy targets expressed either in certain query margins or as implied by the hierarchical database structure. The proposed Cascade Sampling algorithm instantiates the mechanism exactly and efficiently. Our theoretical and empirical analysis demonstrates that we achieve near-optimal utility, effectively compete with other methods, and retain all the favorable statistical properties discussed earlier.
Poster
Jim Zhao · Nikita Doikov · Aurelien Lucchi
[ Hall A-E ]
Abstract
This paper addresses the optimization problem of minimizing non-convex continuous functions, a problem highly relevant in high-dimensional machine learning scenarios, particularly those involving over-parameterization. We analyze a randomized coordinate second-order method named SSCN, which can be interpreted as applying the cubic regularization of Newton's method in random subspaces. This approach effectively reduces the computational complexity associated with utilizing second-order information, making it applicable in higher-dimensional scenarios. Theoretically, we establish strong global convergence guarantees for non-convex functions to a stationary point, with interpolating rates for arbitrary subspace sizes and allowing inexact curvature estimation, starting from an arbitrary initialization. When increasing the subspace size, our complexity matches the $\mathcal{O}(\epsilon^{-3/2})$ rate of the full Newton's method with cubic regularization. Additionally, we propose an adaptive sampling scheme ensuring the exact convergence rate of $\mathcal{O}(\epsilon^{-3/2}, \epsilon^{-3})$ to a second-order stationary point, without requiring all coordinates to be sampled. Experimental results demonstrate substantial speed-ups achieved by SSCN compared to conventional first-order methods and other second-order subspace methods.
Poster
Yuying Duan · Gelei Xu · Yiyu Shi · Michael Lemmon
[ Hall A-E ]
Abstract
With the emerging application of Federated Learning (FL) in finance, hiring and healthcare, FL models are regulated to be fair, preventing disparities with respect to legally protected attributes such as race or gender. Two concepts of fairness are important in FL: global and local fairness. Global fairness addresses the disparity across the entire population and local fairness is concerned with the disparity within each client. Prior fair FL frameworks have improved either global or local fairness without considering both. Furthermore, while the majority of studies on fair FL focuses on binary settings, many real-world applications are multi-class problems. This paper proposes a framework that investigates the minimum accuracy lost for enforcing a specified level of global and local fairness in multi-class FL settings. Our framework leads to a simple post-processing algorithm that derives fair outcome predictors from the Bayesian optimal score functions. Experimental results show that our algorithm outperforms the current state of the art (SOTA) with regard to the accuracy-fairness tradeoffs, computational and communication costs. Codes are available at: https://github.com/papersubmission678/The-cost-of-local-and-global-fairness-in-FL.
Poster
Ziwei Su · Diego Klabjan
[ Hall A-E ]
Abstract
Stochastic simulation models are generative models that mimic complex systems to help with decision-making. The reliability of these models heavily depends on well-calibrated input model parameters. However, in many practical scenarios, only output-level data are available to learn the input model parameters, which is challenging due to the often intractable likelihood of the stochastic simulation model. Moreover, stochastic simulation models are frequently inexact, with discrepancies between the model and the target system. No existing methods can effectively learn and quantify the uncertainties of input parameters using only output-level data. In this paper, we propose to learn differentiable input parameters of stochastic simulation models using output-level data via kernel score minimization with stochastic gradient descent. We quantify the uncertainties of the learned input parameters using a frequentist confidence set procedure based on a new asymptotic normality result that accounts for model inexactness. The proposed method is evaluated on exact and inexact G/G/1 queueing models as well as a stochastic volatility model.
Poster
Xudong Sun · Nutan Chen · Alexej Gossmann · Yu Xing · Matteo Wohlrapp · Emilio Dorigatti · Carla Feistner · Felix Drost · Daniele Scarcella · Lisa Beer · Carsten Marr
[ Hall A-E ]
Abstract
A probabilistic graphical model is proposed, modeling the joint evolution of model parameters and multipliers with a hypervolume-based likelihood, promoting multi-objective descent in structural risk minimization. We address multi-objective model parameter optimization via a surrogate single-objective penalty loss with time-varying multipliers, equivalent to online scheduling of the loss landscape. The multi-objective descent goal is dispatched hierarchically into a series of constrained optimization sub-problems with shrinking bounds according to Pareto dominance. The bound serves as a setpoint for the low-level multiplier controller, which schedules loss landscapes via output feedback of each loss term. Our method forms a closed loop of the model parameter dynamics, circumvents the excessive memory requirements and extra computational burden of existing multi-objective deep learning methods, and is robust against controller hyperparameter variation, as demonstrated on domain generalization tasks.
Poster
Ambrus Tamás · Szabolcs Szentpéteri · Balázs Csanád Csáji
[ Hall A-E ]
Abstract
Stochastic multi-armed bandits (MABs) provide a fundamental reinforcement learning model to study sequential decision making in uncertain environments. The upper confidence bounds (UCB) algorithm gave birth to the renaissance of bandit algorithms, as it achieves near-optimal regret rates under various moment assumptions. Up until recently most UCB methods relied on concentration inequalities leading to confidence bounds which depend on moment parameters, such as the variance proxy, that are usually unknown in practice. In this paper, we propose a new distribution-free, data-driven UCB algorithm for symmetric reward distributions, which needs no moment information. The key idea is to combine a refined, one-sided version of the recently developed resampled median-of-means (RMM) method with UCB. We prove a near-optimal regret bound for the proposed anytime, parameter-free RMM-UCB method, even for heavy-tailed distributions.
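The classical median-of-means estimator that the refined RMM index builds on can be stated in a few lines. This is the textbook two-sided version, not the paper's resampled, one-sided variant:

```python
import statistics

def median_of_means(xs, k):
    """Split the sample into k equal blocks, average each block, and
    take the median of the block means.  Robust to heavy tails without
    any variance (proxy) parameter."""
    m = len(xs) // k
    block_means = [statistics.fmean(xs[i * m:(i + 1) * m])
                   for i in range(k)]
    return statistics.median(block_means)
```

A single extreme outlier can corrupt at most one block mean, so it moves the median of the block means far less than it moves the plain sample mean.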
Poster
Yan Yang · Bin Gao · Ya-xiang Yuan
[ Hall A-E ]
Abstract
Bilevel reinforcement learning (RL), which features intertwined two-level problems, has attracted growing interest recently. The inherent non-convexity of the lower-level RL problem is, however, an impediment to developing bilevel optimization methods. By employing the fixed point equation associated with the regularized RL, we characterize the hyper-gradient via fully first-order information, thus circumventing the assumption of lower-level convexity. This, remarkably, distinguishes our development of hyper-gradient from the general AID-based bilevel frameworks since we take advantage of the specific structure of RL problems. Moreover, we design both model-based and model-free bilevel reinforcement learning algorithms, facilitated by access to the fully first-order hyper-gradient. Both algorithms enjoy the convergence rate $\mathcal{O}\left(\epsilon^{-1}\right)$. To extend the applicability, a stochastic version of the model-free algorithm is proposed, along with results on its convergence rate and sampling complexity. In addition, numerical experiments demonstrate that the hyper-gradient indeed serves as an integration of exploitation and exploration.
Poster
Yue Xing
[ Hall A-E ]
Abstract
In recent years, studies such as \cite{carmon2019unlabeled,gowal2021improving} demonstrate that incorporating additional real or generated data with pseudo-labels can enhance adversarial training through a two-stage training framework. In this paper, we perform a theoretical analysis of the asymptotic behavior of this method in high-dimensional regression problems when using two-layer neural networks. We first derive the asymptotics of the two-stage training framework using linear regression as a preliminary. Then, we analyze the convergence of two-layer neural networks in the two-stage framework. The analysis considers two different regimes: in the first stage of the framework, it is a high-dimensional regime, and in the second stage, the sample size is much larger than the data dimension. To analyze adversarial training, we track the change of the adversarial attack, and reveal that training with two-layer neural networks gives a prediction performance similar to training a linear model with some particular $\mathcal{L}_2$ regularization corresponding to different regimes. To highlight our technical contribution, we are the first to investigate adversarial training in two-layer neural networks under moderate attack strength, which is different from most existing literature in vanishing attack strength.
Poster
Sandeep Nagar · Girish Varma
[ Hall A-E ]
Abstract
The inverse of an invertible convolution is an important operation that comes up in Normalizing Flows, Image Deblurring, etc. The naive algorithm for backpropagation of this operation using Gaussian elimination has running time $O(n^3)$ where $n$ is the number of pixels in the image. We give a fast parallel backpropagation algorithm with running time $O(\sqrt{n})$ for a square image and provide a GPU implementation of the same. Inverses of convolutions are usually used in the sampling pass of Normalizing Flows, making sampling slow. We propose instead to use the inverse of the convolution in the forward (image to latent vector) pass of the Normalizing Flow. Since the sampling pass is the inverse of the forward pass, it then uses convolutions only, resulting in efficient sampling times. We use our parallel backpropagation algorithm to optimize the inverse convolution layer, resulting in fast training times as well. We implement this approach in various Normalizing Flow backbones, resulting in our Inverse-Flow models. We benchmark Inverse-Flow on standard datasets and show significantly improved sampling times with similar bits per dimension compared to previous models.
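Why an invertible convolution can be inverted at all is easiest to see in 1-D, where a causal kernel makes the operation lower-triangular and forward substitution recovers the input sequentially. The paper's contribution is a parallel $O(\sqrt{n})$ backpropagation scheme for the 2-D case, which this sketch does not attempt:

```python
def causal_conv(x, k):
    """1-D causal convolution: y[i] = sum_j k[j] * x[i-j], with k[0] != 0."""
    return [sum(k[j] * x[i - j] for j in range(len(k)) if i - j >= 0)
            for i in range(len(x))]

def invert_causal_conv(y, k):
    """Invert by forward substitution: in matrix form the convolution
    is lower-triangular with k[0] on the diagonal, so each x[i] follows
    from y[i] and the already-recovered x[i-1], x[i-2], ..."""
    x = []
    for i, yi in enumerate(y):
        acc = sum(k[j] * x[i - j] for j in range(1, len(k)) if i - j >= 0)
        x.append((yi - acc) / k[0])
    return x
```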
Poster
Debmalya Mandal · Goran Radanovic
[ Hall A-E ]
Abstract
We study the setting of \emph{performative reinforcement learning} where the deployed policy affects both the reward, and the transition of the underlying Markov decision process. Prior work~\cite{MTR23} has addressed this problem under the tabular setting and established last-iterate convergence of repeated retraining with iteration complexity explicitly depending on the number of states. In this work, we generalize the results to \emph{linear Markov decision processes} which is the primary theoretical model of large-scale MDPs. The main challenge with linear MDP is that the regularized objective is no longer strongly convex and we want a bound that scales with the dimension of the features, rather than states which can be infinite. Our first result shows that repeatedly optimizing a regularized objective converges to a \emph{performatively stable policy}. In the absence of strong convexity, our analysis leverages a new recurrence relation that uses a specific linear combination of optimal dual solutions for proving convergence. We then tackle the finite sample setting where the learner has access to a set of trajectories drawn from the current policy. We consider a reparametrized version of the primal problem, and construct an empirical Lagrangian which is to be optimized from the samples. We show that, under a …
Poster
David Bosch · Ashkan Panahi
[ Hall A-E ]
Abstract
The Convex Gaussian Min-Max Theorem (CGMT) allows for the study of min-max optimization problems over bilinear Gaussian forms by instead considering an alternative optimization problem whose statistical properties are tied to that of the primary optimization. We prove a generalization of the CGMT to a family of problems in machine learning (ML) with correlated entries in the data matrix. This family includes various familiar examples of problems with shared weights or repeated features. In particular, we make use of our theorem to obtain asymptotically exact learning curves for regression with vector valued labels, regression with complex variables, and regression with convolution.
Poster
Yufeng Zhang · Fengzhuo Zhang · Zhuoran Yang · Zhaoran Wang
[ Hall A-E ]
Abstract
In-Context Learning (ICL) has been found effective across a wide range of applications, where Large Language Models (LLMs) learn to complete tasks from the examples in the prompt without tuning their parameters. In this work, we conduct a comprehensive study to understand ICL from a statistical perspective. First, we show that perfectly pretrained LLMs perform Bayesian Model Averaging (BMA) for ICL under a dynamic model of examples in the prompt. The average error analysis for ICL is then built for the perfectly pretrained LLMs with the analysis of BMA. Second, we demonstrate how the attention structure boosts the BMA implementation. With sufficient examples in the prompt, attention is proven to perform BMA under the Gaussian linear ICL model, which also motivates the explicit construction of the hidden concepts from the attention heads' values. Finally, we analyze the pretraining behavior of LLMs. The pretraining error is decomposed as the generalization error and the approximation error. The generalization error is upper bounded via the PAC-Bayes framework. Then the ICL average error of the pretrained LLMs is shown to be the sum of $O(T^{-1})$ and the pretraining error. In addition, we analyze the ICL performance of the pretrained LLMs with …
Poster
Diantong Li · Fengxue Zhang · Chong Liu · Yuxin Chen
[ Hall A-E ]
Abstract
Multi-objective Bayesian optimization has been widely adopted in scientific experiment design, including drug discovery and hyperparameter optimization. In practice, regulatory or safety concerns often impose additional thresholds on certain attributes of the experimental outcomes. Previous work has primarily focused on constrained single-objective optimization tasks or active search under constraints. Existing constrained multi-objective algorithms address the issue with heuristics and approximations, posing challenges to the analysis of sample efficiency. We propose a novel constrained multi-objective Bayesian optimization algorithm **COMBOO** that balances active learning of the level-set defined on multiple unknowns with multi-objective optimization within the feasible region. We provide both theoretical analysis and empirical evidence, demonstrating the efficacy of our approach on various synthetic benchmarks and real-world applications.
Poster
Mingyu Pu · Songhao Wang · Haowei Wang · Szu Hui Ng
[ Hall A-E ]
Abstract
Gaussian Process (GP) models are widely utilized as surrogate models in scientific and engineering fields. However, standard GP models are limited to continuous variables due to the difficulties in establishing correlation structures for categorical variables. To overcome this limitation, we introduce **WE**ighted Euclidean distance matrices **G**aussian **P**rocess (WEGP). WEGP constructs the kernel function for each categorical input by estimating the Euclidean distance matrix (EDM) among all categorical choices of this input. The EDM is represented as a linear combination of several predefined base EDMs, each scaled by a positive weight. The weights, along with other kernel hyperparameters, are inferred using a fully Bayesian framework. We analyze the predictive performance of WEGP theoretically. Numerical experiments validate the accuracy of our GP model, and by integrating WEGP into Bayesian Optimization (BO), we achieve superior performance on both synthetic and real-world optimization problems. The code is available at: https://github.com/pmy0124nus/WEGP.
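A rough sketch of the EDM-combination idea (the base matrices, weights, and the exponential distance-to-correlation map below are illustrative assumptions, not WEGP's exact construction, which infers the weights in a fully Bayesian way):

```python
import numpy as np

def categorical_kernel(base_edms, weights):
    # Combine predefined base EDMs with positive weights into one EDM,
    # then map distances to correlations with an exponential transform
    # (an illustrative choice of map).
    D = sum(w * B for w, B in zip(weights, base_edms))
    return np.exp(-D)

# Two hypothetical base EDMs over three categorical levels.
D_uniform = np.array([[0., 1., 1.],
                      [1., 0., 1.],
                      [1., 1., 0.]])   # all levels equidistant
D_ordinal = np.array([[0., 1., 4.],
                      [1., 0., 1.],
                      [4., 1., 0.]])   # ordinal-like squared distances

K = categorical_kernel([D_uniform, D_ordinal], weights=[0.5, 0.3])
```

Levels 0 and 2 end up less correlated than levels 0 and 1, because the ordinal base EDM places them further apart.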
Poster
Wenyuan Zhao · Haoyuan Chen · Tie Liu · Rui Tuo · Chao Tian
[ Hall A-E ]
Abstract
With the strengths of both deep learning and kernel methods like Gaussian Processes (GPs), Deep Kernel Learning (DKL) has gained considerable attention in recent years. From the computational perspective, however, DKL becomes challenging when the input dimension of the GP layer is high. To address this challenge, we propose the Deep Additive Kernel (DAK) model, which incorporates i) an additive structure for the last-layer GP; and ii) induced prior approximation for each GP unit. This naturally leads to a last-layer Bayesian neural network (BNN) architecture. The proposed method enjoys the interpretability of DKL as well as the computational advantages of BNN. Empirical results show that the proposed approach outperforms state-of-the-art DKL methods in both regression and classification tasks.
Poster
Saeyoung Rho · Andrew Tang · Noah Bergam · Rachel Cummings · Vishal Misra
[ Hall A-E ]
Abstract
In causal inference with observational studies, synthetic control (SC) has emerged as a prominent tool. SC has traditionally been applied to aggregate-level datasets, but more recent work has extended its use to individual-level data. Because individual-level datasets contain a much greater number of observed units, this shift introduces the curse of dimensionality to SC. To address this, we propose Cluster Synthetic Control (ClusterSC), based on the idea that there may exist groups of individuals whose behavior aligns within a group but diverges across groups. ClusterSC incorporates a clustering step to select only the donors relevant to the target. We provide theoretical guarantees on the improvements induced by ClusterSC, supported by empirical demonstrations on synthetic and real-world datasets. The results indicate that ClusterSC consistently outperforms classical SC approaches.
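The two-step idea can be sketched minimally as follows (a crude two-cluster Lloyd iteration plus least-squares SC weights; real SC typically constrains weights to a simplex, and both the clustering and the fit here are deliberate simplifications):

```python
import numpy as np

def cluster_synthetic_control(donors, target, n_iter=10):
    # Step 1: crude two-cluster Lloyd's algorithm on pre-treatment
    # trajectories, deterministically initialised at the first and last donor.
    centers = donors[[0, -1]].astype(float)
    for _ in range(n_iter):
        labels = np.argmin(((donors[:, None] - centers[None]) ** 2).sum(-1), axis=1)
        for c in (0, 1):
            if (labels == c).any():
                centers[c] = donors[labels == c].mean(0)
    # Step 2: fit SC weights using only donors in the target's cluster.
    target_cluster = int(np.argmin(((target - centers) ** 2).sum(-1)))
    relevant = donors[labels == target_cluster]
    w, *_ = np.linalg.lstsq(relevant.T, target, rcond=None)
    return relevant, w

# Five donors behave like the target, five do not.
donors = np.vstack([np.full((5, 4), 1.0), np.full((5, 4), 10.0)])
target = np.full(4, 1.0)
relevant, w = cluster_synthetic_control(donors, target)
```

Only the five matching donors survive the clustering step, and the fitted weights reconstruct the target exactly on this toy data.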
Poster
Jeongyeol Kwon · Luke Dotson · Yudong Chen · Qiaomin Xie
[ Hall A-E ]
Abstract
Previous studies on two-timescale stochastic approximation (SA) mainly focused on bounding mean-squared errors under diminishing stepsize schemes. In this work, we investigate \emph{constant} stepsize schemes through the lens of Markov processes, proving that the iterates of both timescales converge to a unique joint stationary distribution in the Wasserstein metric. We derive explicit geometric and non-asymptotic convergence rates, as well as the variance and bias introduced by constant stepsizes in the presence of Markovian noise. Specifically, with two constant stepsizes $\alpha < \beta$, we show that the biases scale linearly with both stepsizes as $\Theta(\alpha)+\Theta(\beta)$ up to higher-order terms, while the variance of the slower iterate (resp., faster iterate) scales only with its own stepsize as $O(\alpha)$ (resp., $O(\beta)$). Unlike previous work, our results require no additional assumptions such as $\beta^2 \ll \alpha$, nor any extra dependence on dimensions. These fine-grained characterizations allow tail-averaging and extrapolation techniques to reduce variance and bias, improving the mean-squared error bound to $O(\beta^4 + \frac{1}{t})$ for both iterates.
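On a toy linear problem, the constant-stepsize scheme and the variance-reducing effect of tail-averaging can be sketched as follows (the updates, i.i.d. noise model, and averaging window are illustrative choices, not the paper's Markovian-noise setting):

```python
import numpy as np

def two_timescale_sa(alpha, beta, a=2.0, T=20_000, seed=0):
    # Slow iterate x (stepsize alpha) tracks the unknown mean a from
    # noisy observations; fast iterate y (stepsize beta > alpha) tracks x.
    rng = np.random.default_rng(seed)
    x = y = 0.0
    xs, ys = [], []
    for _ in range(T):
        nx, ny = rng.normal(size=2)
        x += alpha * (a - x + nx)   # slow timescale
        y += beta * (x - y + ny)    # fast timescale
        xs.append(x)
        ys.append(y)
    # Tail-averaging (second half of the run) damps the O(alpha), O(beta)
    # stationary variance of the raw iterates.
    half = T // 2
    return np.mean(xs[half:]), np.mean(ys[half:])

x_bar, y_bar = two_timescale_sa(alpha=0.01, beta=0.1)
```

Both tail-averaged iterates land close to the target value $a = 2$ despite the constant stepsizes.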
Poster
Xiaoyan Hu · Ho-fung Leung · Farzan Farnia
[ Hall A-E ]
Abstract
Existing frameworks for evaluating and comparing generative models consider an offline setting, where the evaluator has access to large batches of data produced by the models. However, in practical scenarios, the goal is often to identify and select the best model using the fewest possible generated samples to minimize the costs of querying data from the sub-optimal models. In this work, we propose an online evaluation and selection framework to find the generative model that maximizes a standard assessment score among a group of available models. We view the task as a multi-armed bandit (MAB) and propose upper confidence bound (UCB) bandit algorithms to identify the model producing data with the best evaluation score that quantifies the quality and diversity of generated data. Specifically, we develop the MAB-based selection of generative models considering the Fréchet Distance (FD) and Inception Score (IS) metrics, resulting in the FD-UCB and IS-UCB algorithms. We prove regret bounds for these algorithms and present numerical results on standard image datasets. Our empirical results suggest the efficacy of MAB approaches for the sample-efficient evaluation and selection of deep generative models. The project code is available at [https://github.com/yannxiaoyanhu/dgm-online-eval](https://github.com/yannxiaoyanhu/dgm-online-eval).
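The MAB view can be sketched with a generic UCB loop over a scalar per-sample quality score; the score functions and exploration constant below are stand-ins for the FD/IS machinery, not the actual FD-UCB or IS-UCB algorithms:

```python
import numpy as np

def ucb_model_selection(models, T=2000, c=2.0, seed=0):
    # Each "model" is a callable returning one noisy per-sample quality
    # score (higher is better); UCB allocates queries toward the model
    # whose generated samples score best.
    rng = np.random.default_rng(seed)
    K = len(models)
    counts = np.zeros(K)
    sums = np.zeros(K)
    for t in range(T):
        if t < K:
            k = t  # query each model once
        else:
            ucb = sums / counts + np.sqrt(c * np.log(t + 1) / counts)
            k = int(np.argmax(ucb))
        sums[k] += models[k](rng)
        counts[k] += 1
    return counts

# Three hypothetical generators with mean sample scores 0.3, 0.5, 0.8.
models = [lambda rng, m=m: m + 0.1 * rng.normal() for m in (0.3, 0.5, 0.8)]
pulls = ucb_model_selection(models)
```

Most of the query budget is spent on the best-scoring model, which is the sample-efficiency property the abstract targets.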
Poster
Aldo Carranza · Susan Athey
[ Hall A-E ]
Abstract
We consider the problem of using observational bandit feedback data from multiple heterogeneous data sources to learn a personalized decision policy that robustly generalizes across diverse target settings. To achieve this, we propose a minimax regret optimization objective to ensure uniformly low regret under general mixtures of the source distributions. We develop a policy learning algorithm tailored to this objective, combining doubly robust offline policy evaluation techniques and no-regret learning algorithms for minimax optimization. Our regret analysis shows that this approach achieves the minimal worst-case mixture regret up to a moderated vanishing rate of the total data across all sources. Our analysis, extensions, and experimental results demonstrate the benefits of this approach for learning robust decision policies from multiple data sources.
Poster
Aya Kayal · Sattar Vakili · Laura Toni · Alberto Bernacchia
[ Hall A-E ]
Abstract
Reinforcement Learning (RL) problems are being considered under increasingly more complex structures. While tabular and linear models have been thoroughly explored, the analytical study of RL under non-linear function approximation, especially kernel-based models, has recently gained traction for their strong representational capacity and theoretical tractability. In this context, we examine the question of statistical efficiency in kernel-based RL within the reward-free RL framework, specifically asking: how many samples are required to design a near-optimal policy? Existing work addresses this question under restrictive assumptions about the class of kernel functions. We first explore this question assuming a generative model, then relax this assumption at the cost of increasing the sample complexity by a factor of $H$, the episode length. We tackle this fundamental problem using a broad class of kernels and a simpler algorithm compared to prior work. Our approach derives new confidence intervals for kernel ridge regression, specific to our RL setting, that may be of broader applicability. We further validate our theoretical findings through simulations.
Poster
Maryam Aliakbarpour · Syomantak Chaudhuri · Thomas Courtade · Alireza Fallah · Michael Jordan
[ Hall A-E ]
Abstract
Local Differential Privacy (LDP) offers strong privacy guarantees without requiring users to trust external parties. However, LDP applies uniform protection to all data features, including less sensitive ones, which degrades performance of downstream tasks. To overcome this limitation, we propose a Bayesian framework, Bayesian Coordinate Differential Privacy (BCDP), that enables feature-specific privacy quantification. This more nuanced approach complements LDP by adjusting privacy protection according to the sensitivity of each feature, enabling improved performance of downstream tasks without compromising privacy. We characterize the properties of BCDP and articulate its connections with standard non-Bayesian privacy frameworks. We further apply our BCDP framework to the problems of private mean estimation and ordinary least-squares regression. The BCDP-based approach obtains improved accuracy compared to a purely LDP-based approach, without compromising on privacy.
Poster
Paweł Teisseyre · Timo Martens · Jessa Bekker · Jesse Davis
[ Hall A-E ]
Abstract
Learning from positive and unlabeled data (PU learning) aims to train a binary classification model when only positive and unlabeled examples are available. Typically, learners assume that there is a labeling mechanism that determines which positive labels are observed. A particularly challenging setting arises when the observed positive labels are a biased sample from the positive distribution. Current approaches either require estimating the propensity scores, which are the instance-specific probabilities that a positive example's label will be observed, or make overly restrictive assumptions about the labeling mechanism. We make a novel assumption about the labeling mechanism which we show is more general than several commonly used existing ones. Moreover, the combination of our novel assumption and theoretical results from robust statistics can simplify the process of learning from biased PU data. Empirically, our approach offers superior predictive and run time performance compared to the state-of-the-art methods.
Poster
Chungpa Lee · Jeongheon Oh · Kibok Lee · Jy-yong Sohn
[ Hall A-E ]
Abstract
Supervised contrastive learning (SupCL) has emerged as a prominent approach in representation learning, leveraging both supervised and self-supervised losses. However, achieving an optimal balance between these losses is challenging; failing to do so can lead to class collapse, reducing discrimination among individual embeddings in the same class. In this paper, we present theoretically grounded guidelines for SupCL to prevent class collapse in learned representations. Specifically, we introduce the Simplex-to-Simplex Embedding Model (SSEM), a theoretical framework that models various embedding structures, including all embeddings that minimize the supervised contrastive loss. Through SSEM, we analyze how hyperparameters affect learned representations, offering practical guidelines for hyperparameter selection to mitigate the risk of class collapse. Our theoretical findings are supported by empirical results across synthetic and real-world datasets.
Poster
Sarah Alnegheimish · Zelin He · Matthew Reimherr · Akash Chandrayan · Abhinav Pradhan · Luca D'Angelo
[ Hall A-E ]
Abstract
With the widespread availability of sensor data across industrial and operational systems, we frequently encounter heterogeneous time series from multiple systems. Anomaly detection is crucial for such systems to facilitate predictive maintenance. However, most existing anomaly detection methods are designed for either univariate or single-system multivariate data, making them insufficient for these complex scenarios. To address this, we introduce M$^2$AD, a framework for unsupervised anomaly detection in multivariate time series data from multiple systems. M$^2$AD employs deep models to capture expected behavior under normal conditions, using the residuals as indicators of potential anomalies. These residuals are then aggregated into a global anomaly score through a Gaussian Mixture Model and Gamma calibration. We theoretically demonstrate that this framework can effectively address heterogeneity and dependencies across sensors and systems. Empirically, M$^2$AD outperforms existing methods in extensive evaluations by 21% on average, and its effectiveness is demonstrated on a large-scale real-world case study on 130 assets in Amazon Fulfillment Centers. Our code and results are available at https://github.com/sarahmish/M2AD.
Poster
Sharmila Duppala · Juan Luque · John Dickerson · Seyed Esmaeili
[ Hall A-E ]
Abstract
We study the canonical fair clustering problem where each cluster is constrained to have close to population-level representation of each group. Despite significant attention, the salient issue of having incomplete knowledge about the group membership of each point has been superficially addressed. In this paper, we consider a setting where the assigned group memberships are noisy. We introduce a simple noise model that requires a small number of parameters to be given by the decision maker. We then present an algorithm for fair clustering with provable \emph{robustness} guarantees. Our framework enables the decision maker to trade off between the robustness and the clustering quality. Unlike previous work, our algorithms are backed by worst-case theoretical guarantees. Finally, we empirically verify the performance of our algorithm on real world datasets and show its superior performance over existing baselines.
Poster
Rahil Morjaria · Saikiran Bulusu · Venkata Gandikota · Sidharth Jaggi
[ Hall A-E ]
Abstract
Group testing is the problem of identifying a small subset of defectives from a large set using as few binary tests as possible. In most current literature on group testing the binary test outcome is $1$ if the pool contains at least one defective, and $0$ otherwise. In this work we initiate the study of a generalized model of group testing that accommodates the physical effects of dilution of infected samples in large pools. In this model the binary test outcome is $1$ with probability $f(\rho)$, where $\rho$ is the density of the defectives in the test, and $f:[0,1]\rightarrow [0,1]$ is a given "test function" that models this dilution process. For a large class of test functions our results establish near-optimal sample complexity bounds, by providing information-theoretic lower bounds on the number of tests necessary to recover the set of defective items, and providing computationally efficient algorithms with sample complexities that match these lower bounds up to constant or logarithmic factors. Furthermore, using tools from real analysis, we extend our results to any "sufficiently well-behaved function" $f:[0,1]\rightarrow [0,1]$.
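The dilution model is easy to simulate; the particular test function $f(\rho)=\sqrt{\rho}$ below is an illustrative choice satisfying $f(0)=0$ (no false positives) and monotone growth in the defective density:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 100
defective = np.zeros(n, dtype=bool)
defective[[3, 42]] = True            # two defective items

f = lambda rho: np.sqrt(rho)         # illustrative dilution curve

def run_test(pool):
    # Binary outcome: 1 with probability f(rho), where rho is the
    # fraction of defectives in the pool.
    rho = defective[pool].mean()
    return rng.random() < f(rho)

# A pool with one defective out of ten fires with probability f(0.1) ~ 0.316.
pool = np.array([3, 0, 1, 2, 4, 5, 6, 7, 8, 9])
hit_rate = sum(run_test(pool) for _ in range(5000)) / 5000
clean_hits = sum(run_test(np.arange(10, 20)) for _ in range(100))
```

The classical noiseless model is recovered by the step function $f(\rho) = \mathbb{1}[\rho > 0]$.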
Poster
Zhiqun Zuo · Ding Zhu · Mahed Abroshan
[ Hall A-E ]
Abstract
This paper presents a post-processing algorithm for training fair neural network regression models that satisfy statistical parity, utilizing an explainable singular value decomposition (SVD) of the weight matrix. We propose a linear transformation of the weight matrix, whereby the singular values derived from the SVD of the transformed matrix directly correspond to the differences in the first and second moments of the output distributions across two groups. Consequently, we can convert the fairness constraints into constraints on the singular values. We analytically solve the problem of finding the optimal weights under these constraints. Experimental validation on various datasets demonstrates that our method achieves a similar or superior fairness-accuracy trade-off compared to the baselines, without using the sensitive attribute at inference time.
Poster
Shogo Iwazaki · Tomohiko Tanabe · Mitsuru Irie · Shion Takeno · Kota Matsui · Yu Inatsu
[ Hall A-E ]
Abstract
We study Bayesian optimization problems where observation of the objective function fails stochastically, e.g., synthesis failures in materials development. Although several heuristic methods have been proposed for this problem, they lack theoretical guarantees and sometimes deteriorate in practice. We propose two algorithms that trade off regret bounds against practical performance. The first is the first no-regret algorithm for this problem. The second shows superior practical performance; however, it requires a modification to obtain a no-regret guarantee, whose bound is slightly worse than that of the first algorithm. We demonstrate the effectiveness of our methods in numerical experiments, including a simulation function motivated by quasi-crystal synthesis.
Poster
Mengfan Xu · Diego Klabjan
[ Hall A-E ]
Abstract
The multi-armed bandit problem motivates methods with provable upper bounds on regret, and the corresponding lower bounds have also been extensively studied in this context. Recently, Multi-agent Multi-armed Bandit has gained significant traction in various domains, where individual clients face bandit problems in a distributed manner and the objective is the overall system performance, typically measured by regret. While efficient algorithms with regret upper bounds have emerged, limited attention has been given to the corresponding regret lower bounds, except for a recent lower bound for adversarial settings, which, however, leaves a gap with the best known upper bounds. To this end, we herein provide the first comprehensive study on regret lower bounds across different settings and establish their tightness. Specifically, when the graphs exhibit good connectivity properties and the rewards are stochastically distributed, we demonstrate a lower bound of order $O(\log T)$ for instance-dependent bounds and $\sqrt{T}$ for mean-gap independent bounds, both of which are tight. Assuming adversarial rewards, we establish a lower bound of $O(T^{\frac{2}{3}})$ for connected graphs, thereby bridging the gap between the lower and upper bounds in the prior work. We also show a linear regret lower bound when the graph is disconnected. These lower bounds are made possible through our newly constructed …
Poster
Da Long · Zhitong Xu · Qiwei Yuan · Yin Yang · Shandian Zhe
[ Hall A-E ]
Abstract
Fourier Neural Operator (FNO) is a powerful and popular operator learning method. However, FNO is mainly used for forward prediction, yet a great many applications rely on solving inverse problems. In this paper, we propose an invertible Fourier Neural Operator (iFNO) for jointly tackling the forward and inverse problems. We develop a series of invertible Fourier blocks in the latent channel space to share model parameters, exchange information, and mutually regularize the learning of the bi-directional tasks. We integrate a variational auto-encoder to capture the intrinsic structures within the input space and to enable posterior inference, so as to mitigate the ill-posedness, data shortage, and noise that are common in inverse problems. We propose a three-step process to combine the invertible blocks and the VAE component for effective training. Evaluations on seven benchmark forward and inverse tasks demonstrate the advantages of our approach. The code is available at https://github.com/BayesianAIGroup/iFNO.
Poster
Anup Rao · Peng Zhang
[ Hall A-E ]
Abstract
We investigate experimental design for randomized controlled trials (RCTs) with both equal and unequal treatment-control assignment probabilities. Our work makes progress on the connection between the distributional discrepancy minimization (DDM) problem introduced by Harshaw et al. (2024) and the design of RCTs. We make two main contributions: First, we prove that approximating the optimal solution of the DDM problem within a certain constant error is NP-hard. Second, we introduce a new Multiplicative Weights Update (MWU) algorithm for the DDM problem, which improves the Gram-Schmidt walk algorithm used by Harshaw et al. (2024) when assignment probabilities are unequal. Building on the framework of Harshaw et al. (2024) and our MWU algorithm, we then develop the MWU design, which reduces the worst-case mean-squared error in estimating the average treatment effect. Finally, we present a comprehensive simulation study comparing our design with commonly used designs.
Poster
Tobias Wegel · Filip Kovačević · Alexandru Tifrea · Fanny Yang
[ Hall A-E ]
Abstract
Simultaneously addressing multiple objectives is becoming increasingly important in modern machine learning. At the same time, data is often high-dimensional and costly to label. For a single objective such as prediction risk, conventional regularization techniques are known to improve generalization when the data exhibits low-dimensional structure like sparsity. However, it is largely unexplored how to leverage this structure in the context of multi-objective learning (MOL) with multiple competing objectives. In this work, we discuss how the application of vanilla regularization approaches can fail, and propose a two-stage MOL framework that can successfully leverage low-dimensional structure. We demonstrate its effectiveness experimentally for multi-distribution learning and fairness-risk trade-offs.
Poster
Vu Hoang · Hung Tran · Sunil Gupta · Vu Nguyen
[ Hall A-E ]
Abstract
Bayesian optimization (BO) is a leading method for optimizing expensive black-box functions and has been successfully applied across various scenarios. However, BO suffers from the curse of dimensionality, making it challenging to scale to high-dimensional problems. Existing work has adopted a variable selection strategy to select and optimize only a subset of variables iteratively. Although this approach can mitigate the high-dimensional challenge in BO, it still leads to sample inefficiency. To address this issue, we introduce a novel method that identifies important variables by estimating the length scales of Gaussian process kernels. Next, we construct an effective search region consisting of multiple subspaces and optimize the acquisition function within this region, focusing on only the important variables. We demonstrate that our proposed method achieves cumulative regret with a sublinear growth rate in the worst case while maintaining computational efficiency. Experiments on high-dimensional synthetic functions and real-world problems show that our method achieves state-of-the-art performance.
Poster
Yilin Xie · Shiqiang Zhang · Joel Paulson · Calvin Tsay
[ Hall A-E ]
Abstract
Bayesian optimization relies on iteratively constructing and optimizing an acquisition function. The latter turns out to be a challenging, non-convex optimization problem itself. Despite the relative importance of this step, most algorithms employ sampling- or gradient-based methods, which do not provably converge to global optima. This work investigates mixed-integer programming (MIP) as a paradigm for *global* acquisition function optimization. Specifically, our Piecewise-linear Kernel Mixed Integer Quadratic Programming (PK-MIQP) formulation introduces a piecewise-linear approximation for Gaussian process kernels and admits a corresponding MIQP representation for acquisition functions. The proposed method is applicable to uncertainty-based acquisition functions for any stationary or dot-product kernel. We analyze the theoretical regret bounds of the proposed approximation, and empirically demonstrate the framework on synthetic functions, constrained benchmarks, and a hyperparameter tuning task.
Poster
Kevin Luo · Yufan Li · Pragya Sur
[ Hall A-E ]
Abstract
Two key tasks in high-dimensional regularized regression are tuning the regularization strength for accurate predictions and estimating the out-of-sample risk. It is known that the standard approach — $k$-fold cross-validation — is inconsistent in modern high-dimensional settings. While leave-one-out and generalized cross-validation remain consistent in some high-dimensional cases, they become inconsistent when samples are dependent or contain heavy-tailed covariates. As a first step towards modeling structured sample dependence and heavy tails, we use right-rotationally invariant covariate distributions — a crucial concept from compressed sensing. In the proportional asymptotics regime where the number of features and samples grow comparably, which is known to better reflect the empirical behavior in moderately sized datasets, we introduce a new framework, ROTI-GCV, for reliably performing cross-validation under these challenging conditions. Along the way, we propose new estimators for the signal-to-noise ratio and noise variance. We conduct experiments that demonstrate the accuracy of our approach in a variety of synthetic and semi-synthetic settings.
Poster
Masahiro Kato · Shinji Ito
[ Hall A-E ]
Abstract
We investigate the \emph{linear contextual bandit problem} with independent and identically distributed (i.i.d.) contexts. In this problem, we aim to develop a \emph{Best-of-Both-Worlds} (BoBW) algorithm with regret upper bounds in both stochastic and adversarial regimes. We develop an algorithm based on \emph{Follow-The-Regularized-Leader} (FTRL) with Tsallis entropy, referred to as the $\alpha$-\emph{Linear-Contextual (LC)-Tsallis-INF}. We show that its regret is at most $O(\log(T))$ in the stochastic regime under the assumption that the suboptimality gap is uniformly bounded from below, and at most $O(\sqrt{T})$ in the adversarial regime. Furthermore, our regret analysis is extended to more general regimes characterized by the \emph{margin condition} with a parameter $\beta \in (1, \infty]$, which imposes a milder assumption on the suboptimality gap than in previous studies. We show that the proposed algorithm achieves $O\left(\log(T)^{\frac{1+\beta}{2+\beta}}T^{\frac{1}{2+\beta}}\right)$ regret under the margin condition.
Poster
Baozhen Wang · Xingye Qiao
[ Hall A-E ]
Abstract
In many real applications of statistical learning, collecting sufficiently many training data is often expensive, time-consuming, or even unrealistic. In this case, a transfer learning approach, which aims to leverage knowledge from a related source domain to improve the learning performance in the target domain, is more beneficial. There have been many transfer learning methods developed under various distributional assumptions. In this article, we study a particular type of classification problem, called conformal prediction, under a new distributional assumption for transfer learning. Classifiers under the conformal prediction framework predict a set of plausible labels instead of one single label for each data instance, affording a more cautious and safer decision. We consider a generalization of the covariate shift with posterior drift setting for transfer learning. Under this setting, we propose a weighted conformal classifier that leverages both the source and target samples, with a coverage guarantee in the target domain. Theoretical studies demonstrate favorable asymptotic properties. Numerical studies further illustrate the usefulness of the proposed method.
Poster
David Martínez-Rubio · Christophe Roux · Christopher Criscitiello · Sebastian Pokutta
[ Hall A-E ]
Abstract
In this work, we study optimization problems of the form $\min_x \max_y f(x, y)$, where $f(x, y)$ is defined on a product Riemannian manifold $\mathcal{M} \times \mathcal{N}$ and is $\mu_x$-strongly geodesically convex (g-convex) in $x$ and $\mu_y$-strongly g-concave in $y$, for $\mu_x, \mu_y \geq 0$. We design accelerated methods when $f$ is $(L_x, L_y, L_{xy})$-smooth and $\mathcal{M}$, $\mathcal{N}$ are Hadamard. To that aim we introduce new g-convex optimization results, of independent interest: we show global linear convergence for metric-projected Riemannian gradient descent and improve existing accelerated methods by reducing geometric constants. Additionally, we complete the analysis of two previous works applying to the Riemannian min-max case by removing an assumption about iterates staying in a pre-specified compact set.
Poster
Siyan Zhao · Daniel Israel · Guy Van den Broeck · Aditya Grover
[ Hall A-E ]
Abstract
During inference for transformer-based large language models (LLMs), prefilling is the computation of the key-value (KV) cache for input tokens in the prompt prior to autoregressive generation. For longer input prompt lengths, prefilling incurs a significant overhead on decoding time. In this work, we highlight the following pitfall of prefilling: for batches containing highly varying prompt lengths, significant computation is wasted by the standard practice of padding sequences to the maximum length. As LLMs increasingly support longer context lengths, potentially up to 10 million tokens, variations in prompt lengths within a batch become more pronounced. To address this, we propose prepacking, a simple yet effective method to optimize prefilling computation. To avoid redundant computation on pad tokens, prepacking combines prompts of varying lengths into a sequence and packs multiple sequences into a compact batch using a bin-packing algorithm. It then modifies the attention mask and positional encoding to compute multiple prefilled KV-caches for multiple prompts within a single sequence. On a standard curated dataset containing prompts of varying lengths, we obtain significant speed and memory efficiency improvements compared to the default padding-based prefilling computation within Huggingface, across a range of base model configurations and inference serving scenarios.
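The batching step can be sketched with first-fit-decreasing bin packing plus per-prompt position ids (the attention-mask modification is omitted; this is a simplified sketch, not the paper's or Huggingface's implementation):

```python
def prepack(prompt_lens, max_len):
    # First-fit-decreasing bin packing: place each prompt (longest first)
    # into the first packed sequence with enough remaining room.
    bins, loads = [], []
    for i in sorted(range(len(prompt_lens)), key=lambda i: -prompt_lens[i]):
        for b, load in enumerate(loads):
            if load + prompt_lens[i] <= max_len:
                bins[b].append(i)
                loads[b] += prompt_lens[i]
                break
        else:
            bins.append([i])
            loads.append(prompt_lens[i])
    return bins

def position_ids(bins, prompt_lens):
    # Positions restart at 0 for each prompt inside a packed sequence,
    # so every prompt sees its own positional encoding.
    return [[p for i in b for p in range(prompt_lens[i])] for b in bins]

lens = [7, 2, 5, 3, 1]
bins = prepack(lens, max_len=8)   # 3 packed sequences of at most 8 tokens,
pos = position_ids(bins, lens)    # versus 5 prompts each padded to length 7
```

On this toy batch, packing needs 24 token slots where padding to the maximum length would need 35.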
Poster
Daniel Csillag · Claudio Struchiner · Guilherme Goedert
[ Hall A-E ]
Abstract
When a machine learning model is deployed, its predictions can alter its environment, as better informed agents strategize to suit their own interests. With such alterations in mind, existing approaches to uncertainty quantification break. In this work we propose a new framework, Strategic Conformal Prediction, which is capable of robust uncertainty quantification in such a setting. Strategic Conformal Prediction is backed by a series of theoretical guarantees spanning marginal coverage, training-conditional coverage, tightness and robustness to misspecification, all of which hold in a distribution-free manner. Experimental analysis further validates our method, showing its remarkable effectiveness in the face of arbitrary strategic alterations, whereas other methods fail.
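For context, the standard split-conformal machinery that a strategic variant would build on looks as follows (this shows only the vanilla procedure on i.i.d. data, not the strategic correction):

```python
import numpy as np

def conformal_quantile(cal_scores, alpha):
    # Split conformal: the ceil((n+1)(1-alpha))/n empirical quantile of
    # the calibration nonconformity scores.
    n = len(cal_scores)
    q = min(np.ceil((n + 1) * (1 - alpha)) / n, 1.0)
    return np.quantile(cal_scores, q, method="higher")

rng = np.random.default_rng(0)
cal = np.abs(rng.normal(size=1000))      # residual-style scores
qhat = conformal_quantile(cal, alpha=0.1)
test = np.abs(rng.normal(size=5000))
coverage = float((test <= qhat).mean())  # close to 1 - alpha on i.i.d. data
```

The coverage guarantee above is exactly what breaks once agents strategically shift the test distribution, which is the failure mode the abstract addresses.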
Poster
Anica Kostic · Vincent Runge · Charles Truong
[ Hall A-E ]
Abstract
Time series analysis of non-Euclidean data is highly challenging and crucial for many real-world applications. We address the problem of detecting multiple changes in time series within these complex data spaces. Hadamard spaces, which encompass important data spaces like positive semidefinite matrices, certain Wasserstein spaces, and hyperbolic spaces, provide the right general framework to address this complexity. We propose a computationally efficient two-step iterative optimization algorithm called HOP (Hadamard Optimal Partitioning) that detects changes in the sequence of so-called Fréchet means. Under mild conditions, the proposed method consistently estimates the change point locations. HOP is highly versatile, accommodating structural assumptions such as cyclic patterns and epidemic settings, making it unique in the literature. We validate its performance in synthetic and real-world scenarios, including applications in human gait analysis using EMG data with low SNR and behavioral analysis of animal motion.
Poster
Anshul Thakur · Elena Gal · Soheila Molaei · Xiao Gu · Patrick Schwab · Danielle Belgrave · Kim Branson · David Clifton
[ Hall A-E ]
Abstract
This paper presents Adaptive Parameter Optimisation (APO), a novel framework for optimising shared models across multiple clinical tasks, addressing the challenges of balancing strict parameter sharing—often leading to task conflicts—and soft parameter sharing, which may limit effective cross-task information exchange. The proposed APO framework leverages insights from the lazy behaviour observed in over-parameterised neural networks, where only a small subset of parameters undergo any substantial updates during training. APO dynamically identifies and updates task-specific parameters while treating parameters associated with other tasks as protected, limiting their modification to prevent interference. The remaining unassigned parameters remain unchanged, embodying the lazy training phenomenon. This dynamic management of task-specific, protected, and unclaimed parameters across tasks enables effective information sharing, preserves task-specific adaptability, and mitigates gradient conflicts without enforcing a uniform representation. Experimental results across diverse healthcare datasets demonstrate that APO surpasses traditional information-sharing approaches, such as multi-task learning and model-agnostic meta-learning, in improving task performance.
Poster
Jake Fawkes · Michael O'Riordan · Athanasios Vlontzos · Oriol Corcoll · Ciarán Gilligan-Lee
[ Hall A-E ]
Abstract
Observational data is often readily available in large quantities, but can lead to biased causal effect estimates due to the presence of unobserved confounding. Recent works attempt to remove this bias by supplementing observational data with experimental data, which, when available, is typically on a smaller scale due to the time and cost involved in running a randomised controlled trial. In this work, we prove a theorem that places fundamental limits on this ``best of both worlds'' approach. Using the framework of impossible inference, we show that although it is possible to use experimental data to \emph{falsify} causal effect estimates from observational data, in general it is not possible to \emph{validate} such estimates. Our theorem proves that while experimental data can be used to detect bias in observational studies, without additional assumptions on the smoothness of the correction function, it can not be used to remove it. We provide a practical example of such an assumption, developing a novel Gaussian Process based approach to construct intervals which contain the true treatment effect with high probability, both inside and outside of the support of the experimental data. We demonstrate our methodology on both simulated and semi-synthetic datasets and make the \href{https://github.com/Jakefawkes/Obs_and_exp_data}{code …
Poster
Matthew Werenski · Brendan Mallery · Shuchin Aeron · James Murphy
[ Hall A-E ]
Abstract
We propose the linear barycentric coding model (LBCM) which utilizes the linear optimal transport (LOT) metric for analysis and synthesis of probability measures. We provide a closed-form solution to the variational problem characterizing the probability measures in the LBCM and establish equivalence of the LBCM to the set of 2-Wasserstein barycenters in the special case of compatible measures. Computational methods for synthesizing and analyzing measures in the LBCM are developed with finite sample guarantees. One of our main theoretical contributions is to identify an LBCM, expressed in terms of a simple family, which is sufficient to express all probability measures on the closed unit interval. We show that a natural analogous construction of an LBCM in 2 dimensions fails, and we leave it as an open problem to identify the proper extension in more than 1 dimension. We conclude by demonstrating the utility of LBCM for covariance estimation and data imputation.
Poster
Di Wu · Chengshuai Shi · Ruida Zhou · Cong Shen
[ Hall A-E ]
Abstract
Pure exploration is one of the fundamental problems in multi-armed bandits (MAB). However, existing works mostly focus on specific pure exploration tasks, without a holistic view of the general pure exploration problem. This work fills this gap by introducing a versatile framework to study pure exploration, with a focus on identifying the pairwise relationships between targeted arm pairs. Moreover, unlike existing works that only optimize the stopping time (i.e., sample complexity), this work considers that arms are associated with potentially different costs and aims to optimize the cumulative cost incurred during learning. Under the general framework of pairwise pure exploration with arm-specific costs, a performance lower bound is derived. Then, a novel algorithm, termed CAET (Cost-Aware Pairwise Exploration Task), is proposed. CAET builds on the track-and-stop principle with a novel design to handle the arm-specific costs, which can potentially be zero and thus represent a very challenging case. Theoretical analyses prove that the performance of CAET approaches the lower bound asymptotically. Special cases are further discussed, including an extension to regret minimization, which is another major focus of MAB. The effectiveness and efficiency of CAET are also verified through experimental results under various settings.
Poster
Taole Sha · Michael Zhang
[ Hall A-E ]
Abstract
Mixture-of-experts (MoE) models are popular methods in machine learning, since they can model heterogeneous behaviour across the space of the data using an ensemble collection of learners. These models are especially useful for modelling dynamic data, as time-dependent data often exhibit non-stationarity and heavy-tailed errors, which may be inappropriate to model with a typical single expert model. We propose a mixture of Student-$t$ processes with an adaptive structure for the covariance and noise behaviour of each mixture component. Moreover, we use a sequential Monte Carlo (SMC) sampler to perform online inference as data arrive in real time. We demonstrate the superiority of our proposed approach over other models on synthetic and real-world datasets, illustrating the necessity of the proposed method.
Poster
Moule Lin · Shuhao Guan · Weipeng Jing · Goetz Botterweck · Andrea Patane
[ Hall A-E ]
Abstract
While Bayesian Neural Networks (BNNs) offer a principled framework for uncertainty quantification in deep learning, their employment is still constrained by increased computational requirements and convergence difficulties when training very deep, state-of-the-art architectures. In this work, we reinterpret weight-sharing quantization techniques from a stochastic perspective in the context of training and inference with BNNs. Specifically, we leverage 2D-adaptive Gaussian distributions, Wasserstein distance estimations, and alpha-blending to encode the stochastic behavior of a BNN in a lower-dimensional, soft Gaussian representation. Through extensive empirical investigation, we demonstrate that our approach significantly reduces the computational overhead inherent in Bayesian learning by several orders of magnitude, enabling efficient Bayesian training of large-scale models, such as ResNet-101 and Vision Transformer (ViT). On various computer vision benchmarks—including CIFAR-10, CIFAR-100, and ImageNet1k—our approach compresses model parameters by approximately 50× and reduces model size by 75% while achieving accuracy and uncertainty estimates comparable to the state of the art.
Poster
Xianwen Deng · Yijun Wang · Zhi Xue
[ Hall A-E ]
Abstract
Source-free domain adaptation (SFDA) aims to adapt a source model, initially trained on a fully-labeled source domain, to an unlabeled target domain. Previous works assume that the statistics of Batch Normalization layers in the source model capture domain-specific knowledge and directly replace them with target domain-related statistics during training. However, our observations indicate that *source-like* samples in target data exhibit less deviation in the feature space of the source model when preserving the source domain-relevant statistics. In this paper, we propose co-training the source model with frozen Batch Normalization layers as part of the domain adaptation process. Specifically, we combine the source model and the target model to produce more robust pseudo-labels for *global* class clustering and to identify more precise neighbor samples for *local* neighbor clustering. Extensive experiments validate the effectiveness of our approach, showcasing its superiority over current state-of-the-art methods on three standard benchmarks. Our codes are available on https://github.com/SJTU-dxw/BN-SFDA.
Poster
Chengzhi Shi · Gözde Özcan · Miquel Sirera Perelló · Yuanyuan Li · Nina I. Shamsi · Stratis Ioannidis
[ Hall A-E ]
Abstract
We study pixel-wise regression problems with sparsely annotated images. Traditional regression methods based on mean squared error emphasize pixels with labels, leading to distorted predictions in unlabeled areas. To address this limitation, we introduce Neural Point Processes, a novel approach that combines 2D Gaussian Processes with neural networks to leverage spatial correlations between sparse labels on images. This approach offers two key advantages: it imposes smoothness constraints on the model output and enables conditional predictions when sparse labels are available at inference time. Empirical results on synthetic and real-world datasets demonstrate a substantial improvement in mean-squared error and $R^2$ scores, outperforming standard regression techniques. On the real-world dataset COWC, we achieve an $R^2$ of $0.769$ with $81$ out of $40,000$ ($0.2$%) points labeled, while standard regression loss (MSE) results in an $R^2$ of $0.060$.
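The conditioning mechanism the abstract describes can be illustrated with plain 2D Gaussian process regression on pixel coordinates. This is only a sketch of the GP half of the model (the actual method couples the GP with a neural network), and the locations, values, and lengthscale below are illustrative assumptions.

```python
import numpy as np

def rbf_kernel(X, Y, lengthscale=1.0):
    # Squared-exponential kernel on 2D pixel coordinates
    d2 = ((X[:, None, :] - Y[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * d2 / lengthscale**2)

# A few sparsely labeled pixel locations and their values (illustrative)
X_obs = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]])
y_obs = np.array([1.0, 2.0, 3.0])

# An unlabeled query pixel: prediction is conditioned on the sparse labels
X_query = np.array([[0.5, 0.5]])

K = rbf_kernel(X_obs, X_obs) + 1e-6 * np.eye(len(X_obs))  # jitter for stability
k_star = rbf_kernel(X_query, X_obs)
posterior_mean = k_star @ np.linalg.solve(K, y_obs)  # GP posterior mean
print(posterior_mean.shape)  # (1,)
```

The posterior mean interpolates smoothly between the sparse labels, which is the smoothness constraint the spatial correlation structure imposes on unlabeled regions.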
Poster
Seiyun Shin · Ilan Shomorony · Peter Macgregor
[ Hall A-E ]
Abstract
We propose a fast and dynamic algorithm for Density-Based Spatial Clustering of Applications with Noise (DBSCAN) that efficiently supports online updates. Traditional DBSCAN algorithms, designed for batch processing, become computationally expensive when applied to dynamic datasets, particularly in large-scale applications where data continuously evolves. To address this challenge, our algorithm leverages the Euler Tour Trees data structure, enabling dynamic clustering updates without the need to reprocess the entire dataset. This approach preserves a near-optimal accuracy in density estimation, as achieved by the state-of-the-art static DBSCAN method (Esfandiari et al., 2021). Our method achieves an improved time complexity of $O(d \log^3(n) + \log^4(n))$ for every data point insertion and deletion, where $n$ and $d$ denote the total number of updates and the data dimension, respectively. Empirical studies also demonstrate significant speedups over conventional DBSCAN in real-time clustering of dynamic datasets, while maintaining comparable or superior clustering quality.
Poster
Camille Castera · Peter Ochs
[ Hall A-E ]
Abstract
Towards designing learned optimization algorithms that are usable beyond their training setting, we identify key principles that classical algorithms obey but that have, up to now, not been used for Learning to Optimize (L2O). Following these principles, we provide a general design pipeline, taking into account data, architecture and learning strategy, and thereby enabling a synergy between classical optimization and L2O, resulting in a philosophy of Learning Optimization Algorithms. As a consequence, our learned algorithms perform well far beyond problems from the training distribution. We demonstrate the success of these novel principles by designing a new learning-enhanced BFGS algorithm and provide numerical experiments evidencing its adaptation to many settings at test time.
Poster
Chase Walker · Md Rubel Ahmed · Sumit Kumar Jha · Rickard Ewetz
[ Hall A-E ]
Abstract
Computer vision models can be explained by attributing the output decision to the input pixels. While effective methods for explaining convolutional neural networks have been proposed, these methods often produce low-quality attributions when applied to vision transformers (ViTs). State-of-the-art methods for explaining ViTs capture the flow of patch information using transition matrices. However, we observe that transition matrices alone are not sufficiently expressive to accurately explain ViT models. In this paper, we define a theoretical approach to creating explanations for ViTs called InFlow. The framework models the patch-to-patch information flow using a combination of transition matrices and patch embeddings. Moreover, we define an algebra for updating the transition matrices of series connected components, diverging paths, and converging paths in the ViT model. This algebra allows the InFlow framework to produce high quality attributions which explain ViT decision making. In experimental evaluation on ImageNet, with three models, InFlow outperforms six ViT attribution methods in the standard insertion, deletion, SIC and AIC metrics by up to 18%. Qualitative results demonstrate InFlow produces more relevant and sharper explanations. Code is publicly available at https://github.com/chasewalker26/InFlow-ViT-Explanation.
Poster
Adam Elmachtoub · Henry Lam · Haixiang Lan · Haofeng Zhang
[ Hall A-E ]
Abstract
Data-driven optimization aims to translate a machine learning model into decision-making by optimizing decisions on estimated costs. Such a pipeline can be conducted by fitting a distributional model which is then plugged into the target optimization problem. While this fitting can utilize traditional methods such as maximum likelihood, a more recent approach uses estimation-optimization integration that minimizes decision error instead of estimation error. Although intuitive, the statistical benefit of the latter approach is not well understood, yet it is important to guide the prescriptive usage of machine learning. In this paper, we dissect the performance comparisons between these approaches in terms of the amount of model misspecification. In particular, we show how the integrated approach offers a ``universal double benefit'' on the top two dominating terms of regret when the underlying model is misspecified, while the traditional approach can be advantageous when the model is nearly well-specified. Our comparison is powered by finite-sample tail regret bounds that are derived via new higher-order expansions of regrets and the leveraging of a recent Berry-Esseen theorem.
Poster
Antonio Gois · Mehrnaz Mofakhami · Fernando Santos · Simon Lacoste-Julien · Gauthier Gidel
[ Hall A-E ]
Abstract
Agents often have individual goals which depend on a group's actions. If agents trust a forecast of collective action and adapt strategically, such prediction can influence outcomes non-trivially, resulting in a form of performative prediction. This effect is ubiquitous in scenarios ranging from pandemic predictions to election polls, but existing work has ignored interdependencies among predicted agents. As a first step in this direction, we study a collective risk dilemma where agents dynamically decide whether to trust predictions based on past accuracy. As predictions shape collective outcomes, social welfare arises naturally as a metric of concern. We explore the resulting interplay between accuracy and welfare, and demonstrate that searching for stable accurate predictions can minimize social welfare with high probability in our setting. By assuming knowledge of a Bayesian agent behavior model, we then show how to achieve better trade-offs and use them for mechanism design.
Poster
Zixin Kuang · Meng-Fen Chiang · Wang-Chien Lee
[ Hall A-E ]
Abstract
Rationale learning aims to automatically uncover the underlying explanations for NLP predictions. Previous studies in rationale learning mainly focus on the relevance of independent tokens with the predictions without considering their marginal contribution and the collective readability of extracted rationales. Through an empirical analysis, we argue that the sufficiency, informativeness, and readability of rationales are essential for explaining diverse end-task predictions. Accordingly, we propose Shapley-value Guided Rationale Editor (SHARE), an unsupervised approach that refines editable rationales while predicting task outcomes. SHARE extracts a sequence of tokens as a rationale, providing a collective explanation that is sufficient, informative, and readable. SHARE is highly adaptable for tasks like sentiment analysis, claim verification, and question answering, and can integrate seamlessly with various language models to provide explainability. Extensive experiments demonstrate its effectiveness in balancing sufficiency, informativeness, and readability across diverse applications. Our code and datasets are available at https://github.com/zixinK/SHARE.
Poster
Kun Zhao · Jiayi Wang · Yifei Lou
[ Hall A-E ]
Abstract
This paper focuses on recovering an underlying matrix from its noisy partial entries, a problem commonly known as matrix completion. We delve into the investigation of a non-convex regularization, referred to as transformed $L_1$ (TL1), which interpolates between the rank and the nuclear norm of matrices through a hyper-parameter $a \in (0, \infty)$. While some literature adopts such regularization for matrix completion, it primarily addresses scenarios with uniformly missing entries and focuses on algorithmic advances. To fill in the gap in the current literature, we provide a comprehensive statistical analysis for the estimator from a TL1-regularized recovery model under general sampling distribution. In particular, we show that when $a$ is sufficiently large, the matrix recovered by the TL1-based model enjoys a convergence rate measured by the Frobenius norm, comparable to that of the model based on the nuclear norm, despite the challenges posed by the non-convexity of the TL1 regularization. When $a$ is small enough, we show that the rank of the estimated matrix remains a constant order when the true matrix is exactly low-rank. A trade-off between controlling the error and the rank is established through different choices of tuning parameters. The appealing practical performance of TL1 regularization is …
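The interpolation behavior of the TL1 penalty is easy to check numerically. Applied to a singular value $\sigma \ge 0$, a common form of the penalty is $\rho_a(\sigma) = (a+1)\sigma/(a+\sigma)$; the exact normalization used in the paper may differ, so treat this as an illustrative sketch.

```python
def tl1(sigma, a):
    """Transformed L1 penalty on a singular value sigma >= 0."""
    return (a + 1.0) * sigma / (a + sigma)

# Large a: the penalty approaches sigma itself, mimicking the nuclear norm
print(round(tl1(2.0, a=1e6), 4))   # ~2.0
# Small a: the penalty approaches 1 for any sigma > 0, mimicking the rank
print(round(tl1(2.0, a=1e-8), 4))  # ~1.0
# Zero singular values are never penalized
print(tl1(0.0, a=0.5))             # 0.0
```

This is the interpolation through the hyper-parameter $a$ that the abstract refers to: large $a$ recovers nuclear-norm-like behavior, while small $a$ counts nonzero singular values like the rank.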
Poster
Gen Li · Zhihan Huang · Yuting Wei
[ Hall A-E ]
Abstract
Consistency models, which were proposed to mitigate the high computational overhead during the sampling phase of diffusion models, facilitate single-step sampling while attaining state-of-the-art empirical performance. When integrated into the training phase, consistency models attempt to train a sequence of consistency functions capable of mapping any point at any time step of the diffusion process to its starting point. Despite the empirical success, a comprehensive theoretical understanding of consistency training remains elusive. This paper takes a first step towards establishing theoretical underpinnings for consistency models. We demonstrate that, in order to generate samples within $\varepsilon$ proximity to the target in distribution (measured by some Wasserstein metric), it suffices for the number of steps in consistency learning to exceed the order of $d^{5/2}/\varepsilon$, with $d$ the data dimension. Our theory offers rigorous insights into the validity and efficacy of consistency models, illuminating their utility in downstream inference tasks.
Poster
Cheng Peng · Stan Uryasev
[ Hall A-E ]
Abstract
This paper proposes a new approach to estimating the distribution of a response variable conditioned on factors. We model the conditional quantile function as a mixture (weighted sum) of basis quantile functions, with weights depending on these factors. The estimation problem is formulated as a convex optimization problem. The objective function is equivalent to the continuous ranked probability score (CRPS). This approach can be viewed as conducting quantile regressions for all confidence levels simultaneously while inherently avoiding quantile crossing. We use spline functions of factors as a primary example for the weight function. We prove an approximation property of the model. To address computational challenges, we propose a dimensionality reduction method using tensor decomposition and an alternating algorithm. Our approach offers flexibility, interpretability, tractability, and extendability. Numerical experiments demonstrate its effectiveness.
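The equivalence between the CRPS and quantile regression at all confidence levels can be checked numerically via the identity $\mathrm{CRPS}(Q, y) = 2\int_0^1 \rho_\tau\bigl(y - Q(\tau)\bigr)\,d\tau$, where $\rho_\tau$ is the pinball loss. The code below is an illustrative sketch of that identity, not the paper's estimator; `crps_from_quantiles` is a hypothetical name.

```python
import numpy as np

def pinball(u, tau):
    # Pinball (quantile) loss at confidence level tau
    return u * (tau - (u < 0))

def crps_from_quantiles(Q, y, n_grid=1000):
    # Approximate CRPS as twice the average pinball loss over levels,
    # discretizing the integral on a midpoint grid of quantile levels
    taus = (np.arange(n_grid) + 0.5) / n_grid
    return 2.0 * np.mean(pinball(y - Q(taus), taus))

# Degenerate forecast Q(tau) = 1: CRPS reduces to |y - 1|
print(round(crps_from_quantiles(lambda t: np.ones_like(t), y=3.0), 4))  # 2.0
# Uniform(0,1) forecast against y = 0: CRPS = 1/3
print(round(crps_from_quantiles(lambda t: t, y=0.0), 4))  # ~0.3333
```

Minimizing this objective over a weighted sum of basis quantile functions, with nonnegative weights, also keeps the fitted quantile function monotone in $\tau$, which is why quantile crossing is avoided by construction.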