Manzil Zaheer

Research Areas

Authored Publications

Google Publications

Other Publications

Generalization Properties of Retrieval-based Models

Ankit Singh Rawat

Manzil Zaheer

Soumya Basu

ICML 2023 (to appear)

Teacher Guided Training: An Efficient Framework for Knowledge Transfer

Manzil Zaheer

Ankit Singh Rawat

Seungyeon Kim

Chong You

Himanshu Jain

Andreas Veit

Rob Fergus

Sanjiv Kumar

International Conference on Learning Representations (2023) (to appear)

Compositional Generalization and Decomposition in Neural Program Synthesis

Kensen Shi

Joey Hong

Manzil Zaheer

Pengcheng Yin

Charles Sutton

Deep Learning for Code (DL4C) Workshop at ICLR'22 (2022)

A Context Integrated Transformer-based Neural Network for Auction Design

Zhijian Duan

Jingwu Tang

Yutong Yin

Zhe Feng

Xiang Yan

Manzil Zaheer

Xiaotie Deng

The Thirty-ninth International Conference on Machine Learning (ICML'22) (2022)

Thompson Sampling with a Mixture Prior

Joey Hong

Branislav Kveton

Manzil Zaheer

Mohammad Ghavamzadeh

Craig Boutilier

Proceedings of The 25th International Conference on Artificial Intelligence and Statistics (AI-Stats-22) (2022), pp. 7565-7586

A Fourier Approach to Mixture Learning

Mingda Qiao

Guru Prashanth Guruganesh

Ankit Singh Rawat

Avinava Dubey

Manzil Zaheer

Conference on Neural Information Processing Systems (2022) (to appear)

Scalable Hierarchical Agglomerative Clustering

Nick Monath

Avinava Dubey

Guru Prashanth Guruganesh

Manzil Zaheer

Amr Mahmoud El Houssieny Ahmed

Andrew McCallum

Gokhan Mergen

Marc Najork

Mert Terzihan

Bryon Tjanaka

Yuan Wang

Yuchen Wu

Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (2021), 1245–1255

Adaptive Federated Optimization

Sashank Reddi

Zachary Burr Charles

Manzil Zaheer

Zachary Garrett

Keith Rush

Jakub Konečný

Sanjiv Kumar

Brendan McMahan

(2021)

Meta-Thompson Sampling

Branislav Kveton

Michael Konobeev

Manzil Zaheer

Martin Mladenov

Craig Boutilier

Chih-wei Hsu

Csaba Szepesvari

Proceedings of the 38th International Conference on Machine Learning (ICML 2021), pp. 5884-5893

Non-Stationary Off-policy Optimization

Joey Hong

Branislav Kveton

Manzil Zaheer

Yinlam Chow

Amr Mahmoud El Houssieny Ahmed

International Conference on Artificial Intelligence and Statistics (AISTATS) (2021)

Exact and Approximate Hierarchical Clustering Using A*

Craig Greenberg

Sebastian Macaluso

Nicholas Monath

Avinava Dubey

Patrick Flaherty

Manzil Zaheer

Amr Mahmoud El Houssieny Ahmed

Kyle Cranmer

Andrew McCallum

Uncertainty in Artificial Intelligence (2021)

DAG-structured Clustering by Nearest-Neighbors

Nicholas Monath

Manzil Zaheer

Avinava Dubey

Amr Mahmoud El Houssieny Ahmed

Andrew McCallum

International Conference on Artificial Intelligence and Statistics (2021)

A Field Guide to Federated Optimization

Jianyu Wang

Zachary Burr Charles

Zheng Xu

Gauri Joshi

Brendan McMahan

Blaise Hilary Aguera-Arcas

Maruan Al-Shedivat

Galen Andrew

A. Salman Avestimehr

Katharine Daly

Deepesh Data

Suhas Diggavi

Hubert Eichner

Advait Gadhikar

Zachary Garrett

Antonious M. Girgis

Filip Hanzely

Andrew Hard

Chaoyang He

Samuel Horvath

Zhouyuan Huo

Alex Ingerman

Martin Jaggi

Tara Javidi

Peter Kairouz

Satyen Chandrakant Kale

Sai Praneeth Karimireddy

Jakub Konečný

Sanmi Koyejo

Tian Li

Luyang Liu

Mehryar Mohri

Hang Qi

Sashank Reddi

Peter Richtarik

Karan Singhal

Virginia Smith

Mahdi Soltanolkotabi

Weikang Song

Ananda Theertha Suresh

Sebastian Stich

Ameet Talwalkar

Hongyi Wang

Blake Woodworth

Shanshan Wu

Felix Yu

Honglin Yuan

Manzil Zaheer

Mi Zhang

Tong Zhang

Chunxiang (Jake) Zheng

Chen Zhu

Wennan Zhu

arxiv (2021)

Latent Programmer: Discrete Latent Codes for Program Synthesis

Joey Hong

David Martin Dohan

Rishabh Singh

Charles Sutton

Manzil Zaheer

International Conference on Machine Learning (ICML) (2021)

No Regrets for Learning the Prior in Bandits

Branislav Kveton

Csaba Szepesvari

Manzil Zaheer

Soumya Basu

NeurIPS 2021

Modifying Memories in Transformer Models

Ankit Singh Rawat

Chen Zhu

Daliang Li

Felix Yu

Manzil Zaheer

Sanjiv Kumar

Srinadh Bhojanapalli

International Conference on Machine Learning (ICML) 2021 (2020)

DIFFERENTIABLE MULTI-HOP REASONING OVER A VIRTUAL KNOWLEDGE BASE

Bhuwan Dhingra

Graham Neubig

Manzil Zaheer

Ruslan Salakhutdinov

Vidhisha Balachandran

William Weston Cohen

ICLR (2020) (to appear)

Big Bird: Transformers for Longer Sequences

Manzil Zaheer

Guru Prashanth Guruganesh

Avinava Dubey

Joshua Ainslie

Chris Alberti

Santiago Ontanon

Philip Minh Pham

Anirudh Ravula

Qifan Wang

Li Yang

Amr Mahmoud El Houssieny Ahmed

NeurIPS (2020)

Latent Bandits Revisited

Joey Hong

Branislav Kveton

Manzil Zaheer

Yinlam Chow

Amr Ahmed

Craig Boutilier

Advances in Neural Information Processing Systems 33 (NeurIPS 2020), pp. 13423-13433

Differentiable Meta-Learning of Bandit Policies

Craig Boutilier

Chih-wei Hsu

Branislav Kveton

Martin Mladenov

Csaba Szepesvari

Manzil Zaheer

Advances in Neural Information Processing Systems 33 (NeurIPS 2020), pp. 2122-2134

Randomized Exploration in Generalized Linear Bandits

Branislav Kveton

Manzil Zaheer

Csaba Szepesvari

Lihong Li

Mohammad Ghavamzadeh

Craig Boutilier

23rd International Conference on Artificial Intelligence and Statistics (2020)

Multi-step Retriever-Reader Interaction for Scalable Open-domain Question Answering

Andrew McCallum

Manzil Zaheer

Rajarshi Das

Shehzaad Dhuliawala

ICLR (2019)

Anchor & Transform: Learning Sparse Representations of Discrete Objects

Amr Mahmoud El Houssieny Ahmed

Manzil Zaheer

Paul Pu Liang

Yuan Wang

ICLR (2019)

Towards Gradient Free and Projection Free Stochastic Optimization

Anit Kumar Sahu

Manzil Zaheer

Soummya Kar

AISTATS in Proceedings of Machine Learning Research (2019)

Gradient-based Hierarchical Clustering using Continuous Representations of Trees in Hyperbolic Space

Nick Monath

Manzil Zaheer

Daniel Silva

Andrew McCallum

Amr Ahmed

The 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD ’19) (2019)

Uncovering Hidden Structure in Sequence Data via Threading Recurrent Models

Manzil Zaheer

Amr Ahmed

Yuan Wang

Daniel Silva

Marc Najork

Yuchen Wu

Shibani Sanan

Surojit Chatterjee

Proceedings of the 12 ACM International Conference on Web Search and Data Mining (2019), pp. 186-194

Adaptive Methods for Nonconvex Optimization

Manzil Zaheer

Sashank Reddi

Devendra Singh Sachan

Satyen Kale

Sanjiv Kumar

NIPS (2018)

Latent LSTM Allocation: Joint clustering and non-linear dynamic modeling of sequence data

Manzil Zaheer

Amr Ahmed

Alexander Smola

WSDM, ACM (2017)

State Space LSTM Models with Particle MCMC Inference

Xun Zheng

Manzil Zaheer

Amr Ahmed

Yuan Wang

Alex J. Smola

Eric Xing

The Thirty-first Annual Conference on Neural Information Processing Systems (NIPS) workshop on Bayesian Deep Learning. (2017)

Canopy --- Fast Sampling with Cover Trees

Manzil Zaheer

Satwik Kottur

Amr Ahmed

Jose Moura

Alex J. Smola

ICML 2017 (2017)

No Results Found

Search on Google Scholar

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations  & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Manzil Zaheer

Research Areas

Join us

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Manzil Zaheer

Research Areas

Filter by:

Year

Team

Research Area

Join us

AI/ML Foundations  & Capabilities