The Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025) brings together researchers and practitioners from around the world to present advances in artificial intelligence, machine learning and computational neuroscience.

This year, Penn researchers contributed more than 100 papers across the main conference and workshops, covering topics such as large language models, generative AI, reinforcement learning, trustworthy AI, computational biology, neuroscience and applications in healthcare and the physical sciences. Below is the list of Penn-affiliated papers and collaborators presented at NeurIPS 2025.

Conference Spotlights

When Data Can’t Meet: Estimating Correlation Across Privacy Barriers
Abhinav Chakraborty, Arnab Auddy, T. Tony Cai

Does Object Binding Naturally Emerge in Large Pretrained Vision Transformers?
Yihao Li, Saeed Salehi, Lyle Ungar, Konrad Kording

Mitigating the Privacy–Utility Trade-off in Decentralized Federated Learning via f-Differential Privacy
Xiang Li, Chendi Wang, Buxin Su, Qi Long, Weijie J. Su

Optimal Neural Compressors for the Rate-Distortion-Perception Tradeoff
Eric Lei, Hamed Hassani, Shirin Saeedi Bidokhti

ESCA: Contextualizing Embodied Agents via Scene-Graph Generation
Jiani Huang, Amish Sethi, Matthew Kuo, Mayank Keoliya, Neelay Velingker, Jungho Jung, Ser-Nam Lim, Ziyang Li, Mayur Naik

STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis
Jiatao Gu, Tianrong Chen, David Berthelot, Huangjie Zheng, Yuyang Wang, Ruixiang Zhang, Laurent Dinh, Miguel Ángel Bautista, Joshua M. Susskind, Shuangfei Zhai

On the Empirical Power of Goodness-of-Fit Tests in Watermark Detection
Weiqing He, Xiang Li, Tianqi Shang, Li Shen, Weijie J. Su, Qi Long

Conference Posters

Flexible Language Modeling in Continuous Space with Transformer-based Autoregressive Flows
Ruixiang Zhang, Shuangfei Zhai, Jiatao Gu, Yizhe Zhang, Huangjie Zheng, Tianrong Chen, Miguel Ángel Bautista, Joshua M. Susskind, Navdeep Jaitly

FORLA: Federated Object-centric Representation Learning with Slot Attention
Guiqiu Liao, Matjaz Jogan, Eric Eaton, Daniel A. Hashimoto

Let Me Think! A Long Chain of Thought Can Be Worth Exponentially Many Short Ones
Parsa Mirtaheri, Ezra Edelman, Samy Jelassi, Eran Malach, Enric Boix-Adserà

Generalizable, Real-time Neural Decoding with Hybrid State-space Models
Avery Hee-Woon Ryoo, Nanda H. Krishna, Ximeng Mao, Mehdi Azabou, Eva L. Dyer, Matthew G. Perich, Guillaume Lajoie

Uncertainty-Calibrated Prediction of Randomly Timed Biomarker Trajectories with Conformal Bands
Vasiliki Tassopoulou, Charis Stamouli, Haochang Shou, George J. Pappas, Christos Davatzikos

Alignment of Large Language Models with Constrained Learning
Botong Zhang, Shuo Li, Ignacio Hounie, Osbert Bastani, Dongsheng Ding, Alejandro Ribeiro

Know Thyself by Knowing Others: Learning Neuron Identity from Population Context
Vinam Arora, Divyansha Lachi, Ian Jarratt Knight, Mehdi Azabou, Blake Aaron Richards, Cole Lincoln Hurwitz, Josh Siegle, Eva L. Dyer

Conformal Prediction Beyond the Seen: A Missing Mass Perspective for Uncertainty Quantification in Generative Models
Sima Noorani, Shayan Kiyani, George J. Pappas, Hamed Hassani

Composition and Alignment of Diffusion Models Using Constrained Learning
Shervin Khalafi, Ignacio Hounie, Dongsheng Ding, Alejandro Ribeiro

Real-world Reinforcement Learning of Active Perception Behaviors
Edward S. Hu, Jie Wang, Xingfang Yuan, Fiona Luo, Muyao Li, Gaspard Lambrechts, Oleh Rybkin, Dinesh Jayaraman

Transformers Provably Learn Chain-of-Thought Reasoning with Length Generalization
Yu Huang, Zixin Wen, Aarti Singh, Yuejie Chi, Yuxin Chen

Stochastic Regret Guarantees for Online Zeroth- and First-order Bilevel Optimization
Parvin Nazari, Bojian Hou, Davoud Ataee Tarzanagh, Li Shen, George Michailidis

A Scalable, Causal, and Energy-efficient Framework for Neural Decoding with Spiking Neural Networks
Georgios Mentzelopoulos, Ioannis Asmanis, Konrad Kording, Eva L. Dyer, Kostas Daniilidis, Flavia Vitale

REMI: Reconstructing Episodic Memory During Internally Driven Path Planning
Zhaoze Wang, Genela Morris, Dori Derdikman, Pratik Chaudhari, Vijay Balasubramanian

Synthetic-powered Predictive Inference
Meshi Bashari, Roy Maor Lotan, Yonghoon Lee, Edgar Dobriban, Yaniv Romano

Statistical Inference for Gradient Boosting Regression
Haimo Fang, Kevin Tan, Giles Hooker

Imbalances in Neurosymbolic Learning: Characterization and Mitigating Strategies
Efthymia Tsamoura, Kaifu Wang, Dan Roth

Foundations of Top-k Decoding for Language Models
Georgy Noarov, Soham Mallick, Tao Wang, Sunay Joshi, Yan Sun, Yangxinyu Xie, Mengxin Yu, Edgar Dobriban

Probabilistic Stability Guarantees for Feature Attributions
Helen Jin, Anton Xue, Weiqiu You, Surbhi Goel, Eric Wong

MotionBind: Multi-modal Human Motion Alignment for Retrieval, Recognition, and Generation
Kaleab A. Kinfu, Rene Vidal

Convergence Rates for Gradient Descent on the Edge of Stability for Overparameterized Least Squares
Lachlan Ewen Macdonald, Hancheng Min, Leandro Palma, Salma Tarmoun, Ziqing Xu, Rene Vidal

TADA: Improved Diffusion Sampling with Training-free Augmented Dynamics
Tianrong Chen, Huangjie Zheng, David Berthelot, Jiatao Gu, Joshua M. Susskind, Shuangfei Zhai

Efficient PAC Learning for Realizable-statistic Models via Convex Surrogates
Shivani Agarwal

Gradient Alignment in Physics-informed Neural Networks: A Second-order Optimization Perspective
Sifan Wang, Ananyae Kumar Bhartari, Bowen Li, Paris Perdikaris

Disentangling Misreporting from Genuine Adaptation in Strategic Settings: A Causal Approach
Dylan Zapzalka, Trenton Chang, Lindsay Warrenburg, Sae-Hwan Park, Daniel K. Shenfeld, Ravi B. Parikh, Jenna Wiens, Maggie Makar

Once Upon an Input: Reasoning via Per-instance Program Synthesis
Adam Stein, Neelay Velingker, Mayur Naik, Eric Wong

Neural Collapse Under Gradient Flow on Shallow ReLU Networks for Orthogonally Separable Data
Hancheng Min, Zhihui Zhu, Rene Vidal

BetaConform: Efficient MAP Estimation of LLM Ensemble Judgment Performance with Prior Transfer
Huaizhi Qu, Inyoung Choi, Zhen Tan, Song Wang, Sukwon Yun, Qi Long, Faizan Siddiqui, Kwonjoon Lee, Tianlong Chen

Conformal Information Pursuit for Interactively Guiding Large Language Models
Kwan Ho Ryan Chan, Yuyan Ge, Edgar Dobriban, Hamed Hassani, Rene Vidal

Deployment-efficient Reward-free Exploration with Linear Function Approximation
Zihan Zhang, Yuxin Chen, Jason D. Lee, Simon Shaolei Du, Lin Yang, Ruosong Wang

CTSketch: Compositional Tensor Sketching for Scalable Neurosymbolic Learning
Seewon Choi, Alaia Solko-Breslin, Rajeev Alur, Eric Wong

Covering Multiple Objectives with a Small Set of Solutions Using Bayesian Optimization
Natalie Maus, Kyurae Kim, Yimeng Zeng, Haydn Thomas Jones, Fangping Wan, Marcelo Der Torossian Torres, Cesar De La Fuente-Nunez, Jacob R. Gardner

Conformal Inference Under High-dimensional Covariate Shifts via Likelihood-ratio Regularization
Sunay Joshi, Shayan Kiyani, George J. Pappas, Edgar Dobriban, Hamed Hassani

Nearly Dimension-independent Convergence of Mean-field Black-box Variational Inference
Kyurae Kim, Yian Ma, Trevor Campbell, Jacob R. Gardner

PhysCtrl: Generative Physics for Controllable and Physics-grounded Video Generation
Chen Wang, Chuhao Chen, Yiming Huang, Zhiyang Dou, Yuan Liu, Jiatao Gu, Lingjie Liu

Non-line-of-sight 3D Reconstruction with Radar
Haowen Lai, Zitong Lan, Mingmin Zhao

Resounding Acoustic Fields with Reciprocity
Zitong Lan, Yiduo Hao, Mingmin Zhao

SECA: Semantically Equivalent and Coherent Attacks for Eliciting LLM Hallucinations
Buyun Liang, Liangzu Peng, Jinqi Luo, Darshan Thaker, Kwan Ho Ryan Chan, Rene Vidal

On the Mechanisms of Weak-to-strong Generalization: A Theoretical Perspective
Behrad Moniri, Hamed Hassani

A Dataset for Distilling Knowledge Priors from Literature for Therapeutic Design
Haydn Thomas Jones, Natalie Maus, Josh Magnus Ludan, Maggie Ziyu Huan, Jiaming Liang, Marcelo Der Torossian Torres, Jiatao Liang, Zachary Ives, Yoseph Barash, Cesar De La Fuente-Nunez, Jacob R. Gardner, Mark Yatskar

moPPIt-v3: Motif-specific Peptides Generated via Multi-objective-guided Discrete Flow Matching
Tong Chen, Zachary Quinn, Yinuo Zhang, Pranam Chatterjee

Workshop Papers

High-Throughput Protein Perturbation Screens with AI-Designed Degraders
Lin Zhao, Aastha Pal, Tong Chen, Pranam Chatterjee

PrimateFace: A Resource for Generalizable Cross-Species Facial Analysis
Felipe Parodi, Konrad Kording

Unified Pretraining on Mixed Optophysiology and Electrophysiology Data Across Brain Regions
Ian Jarratt Knight, Vinam Arora, Mehdi Azabou, Eva L. Dyer

A Scalable Self-Supervised Method for Modeling Human Intracranial Recordings During Natural Behavior
Shivashriganesh P. Mahato, Jingyun Xiao, Alexandre Andre, Geeling Chau, Wenrui Ma, Ian Jarratt Knight, Duy Nguyen, Lawrence Jianqiao Hu, Bingni W. Brunton, Michael S. Beauchamp, Bijan Pesaran, Sergey A. Shuvaev, Eva L. Dyer

Disentangling Misreporting from Genuine Adaptation in Strategic Settings: A Causal Approach
Dylan Zapzalka, Trenton Chang, Lindsay Warrenburg, Sae-Hwan Park, Daniel K. Shenfeld, Ravi B. Parikh, Jenna Wiens, Maggie Makar

Predicting and Generating Antibiotics Against Future Pathogens with ApexOracle
Tianang Leng, Fangping Wan, Marcelo Der Torossian Torres, Cesar De La Fuente-Nunez

Token-Level Guided Discrete Diffusion for Membrane Protein Design
Shrey Goel, Peregrine Michael Schray, Yinuo Zhang, Sophia Vincoff, Huong T. Kratochvil, Pranam Chatterjee

Explaining Temporal Effects in Sepsis Prediction
Chaehyeon Kim, Eric Wong

Instruction Following by Boosting Attention of Large Language Models
Vitoria Guardieiro, Adam Stein, Avishree Khare, Eric Wong

Where’s the Bug? Attention Probing for Scalable Fault Localization
Adam Stein, Arthur Wayne, Aaditya Naik, Mayur Naik, Eric Wong

SuperActivators: Only the Tail of the Distribution Contains Reliable Concept Signals
Cassandra Goldberg, Chaehyeon Kim, Adam Stein, Eric Wong

Learning from Frustration: Torsor CNNs on Graphs
Shreya Arya, Robert Ghrist, Daiyuan Li

The Binding Problem in Vision Models: Geometric, Functional, and Behavioral Approaches
Lianghuan Huang, Yihao Li, Yingshan Chang, Saeed Salehi, Konrad Kording

Exploiting All Laplacian Eigenvectors for Node Classification with Graph Transformers
Vinam Arora, Divyansha Lachi, Shivashriganesh P. Mahato, Mehdi Azabou, Zihao Chen, Eva L. Dyer

Transferability of Graph Transformers with Convolutional Positional Encodings
Javier Porras-Valenzuela, Zhiyang Wang, Xiaotao Shang, Alejandro Ribeiro

GNN-Parametrized Diffusion Policies for Wireless Resource Allocation
Yigit Berkay Uslu, Samar Hadou, Shirin Saeedi Bidokhti, Alejandro Ribeiro

RELATE: A Schema-Agnostic Cross-Attention Encoder for Multimodal Relational Graphs
Joe Meyer, Divyansha Lachi, Mahmoud Mohammadi, Roshan Reddy Upendra, Eva L. Dyer, Minghua Li, Tom Palczewski

Explaining Temporal Effects in Sepsis Prediction
Chaehyeon Kim, Eric Wong

Integrating Slow Neural Oscillations and Physiological Burden for Trait Anxiety Prediction
Jungyoun Janice Min, Jiong Chen, Jingxuan Bao, Shu Yang, Yize Zhao, Li Shen, Duy Duong-Tran

ScooBDoob: Schrödinger Bridge with Doob’s h-transform for Molecular Dynamics
Yinuo Zhang, Sophia Tang, Pranam Chatterjee

Entangled Schrödinger Bridge Matching
Sophia Tang, Yinuo Zhang, Pranam Chatterjee

Multi-objective Nanobody Design via Masked Discrete Diffusion with Simplex Refinement
Rosie Zhang, Pranam Chatterjee

Collaborators

Aalto University, Allen Institute, Amirkabir University of Technology, Arizona State University, Beijing Normal University, Beijing University of Posts and Telecommunications, Bernstein Center for Computational Neuroscience, Carnegie Mellon University, Center for AI Safety, Chinese University of Hong Kong, City University of Hong Kong, Columbia University, Dalian University of Technology, Donghua University, Shanghai, Emory University, Federal Institute of Ceará, Fudan University, Georgia Institute of Technology, Harvard University, Hong Kong University of Science and Technology, Huazhong University of Science and Technology, Ludwig-Maximilians-Universität München (University of Munich), Macau University of Science and Technology, McGill University, Mila – Quebec AI Institute, Nanyang Technological University, New Jersey Institute of Technology, New York University, Ohio State University, Peking University, Physical Intelligence, Princeton University, Rensselaer Polytechnic Institute, Shanghai AI Laboratory, Shanghai Jiao Tong University, Shanghaitech University, Southeast University, Stanford University, Sun Yat-sen University, Technion – Israel Institute of Technology, Technische Universität Berlin, Tel Aviv University, Texas A&M University – College Station, Université de Liège, Université de Montréal, University of British Columbia, University of California, Berkeley, University of California, Los Angeles, University of California, San Diego, University of Central Florida, University of Chicago, University of Electronic Science and Technology of China, University of Hong Kong, University of Illinois Urbana–Champaign, University of Macau, University of Michigan – Ann Arbor, University of North Carolina at Chapel Hill, University of Ottawa, University of Science and Technology of China, University of Toronto, University of Virginia, University of Washington, University of Wisconsin–Madison, Westlake University, Western Galilee College, Wuhan University, Yale University, Zhejiang University

Air Liquide, Amazon, Apple, Cerebras Systems, Inc., Google, Hengxin Technology Ltd., Honda, Huawei Technologies Ltd., Meta, Microsoft, Samsung, Stealth Mode Startup, Xiaomi Corporation