no code implementations • 7 May 2024 • Hanlin Zhu, Baihe Huang, Shaolun Zhang, Michael Jordan, Jiantao Jiao, Yuandong Tian, Stuart Russell
Auto-regressive large language models (LLMs) show impressive capabilities on many complex reasoning tasks while struggling with some simple logical reasoning tasks such as inverse search: when trained on "A is B", an LLM fails to directly conclude "B is A" during inference, a phenomenon known as the "reversal curse" (Berglund et al., 2023).
no code implementations • 20 Mar 2024 • Charles Lu, Baihe Huang, Sai Praneeth Karimireddy, Praneeth Vepakomma, Michael Jordan, Ramesh Raskar
Acquiring high-quality training data is essential for current machine learning models.
1 code implementation • 7 Mar 2024 • Wei-Lin Chiang, Lianmin Zheng, Ying Sheng, Anastasios Nikolas Angelopoulos, Tianle Li, Dacheng Li, Hao Zhang, Banghua Zhu, Michael Jordan, Joseph E. Gonzalez, Ion Stoica
To address this issue, we introduce Chatbot Arena, an open platform for evaluating LLMs based on human preferences.
no code implementations • 21 Nov 2023 • Eugene Berta, Francis Bach, Michael Jordan
IR acts as an adaptive binning procedure that can achieve a calibration error of zero, but it leaves open the question of its effect on predictive performance.
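For intuition, here is a minimal sketch of recalibration with isotonic regression via scikit-learn's IsotonicRegression; the toy scores and labels are illustrative, not the paper's setup:

```python
# Illustrative sketch: recalibrating a binary classifier's scores with
# isotonic regression, which acts as an adaptive binning of the scores.
# Toy data, not the paper's experiments.
import numpy as np
from sklearn.isotonic import IsotonicRegression

rng = np.random.default_rng(0)
scores = rng.uniform(size=2000)                               # raw model scores
labels = (rng.uniform(size=2000) < scores**2).astype(float)   # miscalibrated: P(y=1|s) = s^2

# Fit a monotone map from scores to empirical frequencies.
iso = IsotonicRegression(y_min=0.0, y_max=1.0, out_of_bounds="clip")
calibrated = iso.fit_transform(scores, labels)

# The piecewise-constant fit matches the empirical frequency within each
# adaptive bin, driving calibration error to (near) zero on this data;
# the open question is the effect on sharpness/performance.
print(calibrated[:5])
```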
1 code implementation • 14 Jun 2023 • Mariel Werner, Lie He, Michael Jordan, Martin Jaggi, Sai Praneeth Karimireddy
Identifying clients with similar objectives and learning a model-per-cluster is an intuitive and interpretable approach to personalization in federated learning.
1 code implementation • 1 Jun 2023 • Banghua Zhu, Mingyu Ding, Philip Jacobson, Ming Wu, Wei Zhan, Michael Jordan, Jiantao Jiao
Self-training is an important technique for solving semi-supervised learning problems.
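As a point of reference, the generic pseudo-labeling loop that "self-training" usually denotes can be sketched as follows; the model, threshold, and loop structure here are illustrative, not the paper's algorithm:

```python
# Generic self-training via pseudo-labeling (illustrative sketch):
# iteratively add confident predictions on unlabeled data to the
# training set and refit.
import numpy as np
from sklearn.linear_model import LogisticRegression

def self_train(X_lab, y_lab, X_unlab, rounds=5, threshold=0.95):
    X, y = X_lab.copy(), y_lab.copy()
    for _ in range(rounds):
        model = LogisticRegression(max_iter=1000).fit(X, y)
        if len(X_unlab) == 0:
            break
        proba = model.predict_proba(X_unlab)
        keep = proba.max(axis=1) >= threshold      # trust only confident pseudo-labels
        if not keep.any():
            break
        X = np.vstack([X, X_unlab[keep]])
        y = np.concatenate([y, proba[keep].argmax(axis=1)])
        X_unlab = X_unlab[~keep]                   # remove pseudo-labeled points
    return model
```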
no code implementations • CVPR 2023 • Ruili Feng, Kecheng Zheng, Kai Zhu, Yujun Shen, Jian Zhao, Yukun Huang, Deli Zhao, Jingren Zhou, Michael Jordan, Zheng-Jun Zha
Through investigating the properties of the problem solution, we confirm that neural dependency is guaranteed by a redundant logit covariance matrix, a condition easily met when the number of categories is large, and that neural dependency is highly sparse, implying that each category correlates with only a few others.
no code implementations • 13 Jun 2022 • Ruili Feng, Kecheng Zheng, Yukun Huang, Deli Zhao, Michael Jordan, Zheng-Jun Zha
By virtue of our numerical tools, we provide the first empirical analysis of the per-layer behavior of network rank in practical settings, i.e., ResNets, deep MLPs, and Transformers on ImageNet.
no code implementations • 29 Sep 2021 • Han Zhong, Zhuoran Yang, Zhaoran Wang, Michael Jordan
To the best of our knowledge, we establish the first provably efficient RL algorithms for finding Stackelberg-Nash equilibria (SNE) in general-sum Markov games with leader-controlled state transitions.
no code implementations • NeurIPS 2020 • Zhuoran Yang, Chi Jin, Zhaoran Wang, Mengdi Wang, Michael Jordan
Reinforcement learning (RL) algorithms combined with modern function approximators such as kernel functions and deep neural networks have achieved significant empirical successes in large-scale application problems with a massive number of states.
no code implementations • Approximate Inference (AABI) Symposium 2021 • Ghassen Jerfel, Serena Lutong Wang, Clara Fannjiang, Katherine A Heller, Yian Ma, Michael Jordan
Variational Inference (VI) is a popular alternative to asymptotically exact sampling in Bayesian inference.
2 code implementations • International Conference on Machine Learning 2019 • Kaichao You, Ximei Wang, Mingsheng Long, Michael Jordan
Deep unsupervised domain adaptation (Deep UDA) methods successfully leverage rich labeled data in a source domain to boost the performance on related but unlabeled data in a target domain.
1 code implementation • ICML 2018 • Aaditya Ramdas, Tijana Zrnic, Martin Wainwright, Michael Jordan
However, unlike older methods, SAFFRON's threshold sequence is based on a novel estimate of the alpha fraction that it allocates to true null hypotheses.
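SAFFRON is the online analogue of Storey-style adaptive procedures; for intuition, the offline Storey estimate of the true-null fraction can be sketched as follows (a generic illustration of the underlying idea, not the SAFFRON procedure itself):

```python
# Storey-style estimate of the fraction of true nulls in a batch of
# p-values: p-values above a user-chosen lambda are mostly nulls, and
# dividing by (1 - lambda) corrects for nulls falling below it.
# Illustrative only; SAFFRON adapts this idea to the online setting.
import numpy as np

def storey_null_fraction(p_values, lam=0.5):
    p = np.asarray(p_values)
    return (1.0 + np.sum(p > lam)) / (len(p) * (1.0 - lam))

p = np.concatenate([np.random.uniform(size=80),         # 80 true nulls
                    np.random.beta(0.2, 5.0, size=20)]) # 20 non-nulls (small p)
print(storey_null_fraction(p))  # roughly 0.8 in expectation
```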
no code implementations • 15 Dec 2017 • Ion Stoica, Dawn Song, Raluca Ada Popa, David Patterson, Michael W. Mahoney, Randy Katz, Anthony D. Joseph, Michael Jordan, Joseph M. Hellerstein, Joseph E. Gonzalez, Ken Goldberg, Ali Ghodsi, David Culler, Pieter Abbeel
With the increasing commoditization of computer vision, speech recognition and machine translation systems and the widespread deployment of learning-based back-end technologies such as digital advertising and intelligent infrastructures, AI (Artificial Intelligence) has moved from research labs to production.
no code implementations • 13 Oct 2017 • Romain Lopez, Jeffrey Regier, Michael Cole, Michael Jordan, Nir Yosef
We propose a probabilistic model for interpreting gene expression levels that are observed through single-cell RNA sequencing.
2 code implementations • 7 Sep 2017 • Romain Lopez, Jeffrey Regier, Michael Cole, Michael Jordan, Nir Yosef
We also extend our framework to account for batch effects and other confounding factors, and propose a Bayesian hypothesis test for differential expression that outperforms DESeq2.
no code implementations • 21 Sep 2016 • Nicholas Boyd, Trevor Hastie, Stephen Boyd, Benjamin Recht, Michael Jordan
We extend the adaptive regression spline model by incorporating saturation, the natural requirement that a function extend as a constant outside a certain range.
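One simple way to encode that requirement is a basis of hinges clipped at the next knot, so every basis function, and hence the fit, is constant outside the knot range. A hypothetical sketch of that construction (the paper fits this class with a specialized convex program, not the plain Lasso used here):

```python
# Illustrative "saturating" spline basis: each feature rises linearly
# between consecutive knots and then saturates, so the fitted function
# extends as a constant outside the knot range. Sketch only.
import numpy as np
from sklearn.linear_model import Lasso

def saturating_basis(x, knots):
    # min(max(x - t_j, 0), t_{j+1} - t_j): linear between knots, then flat
    cols = [np.clip(x - knots[j], 0.0, knots[j + 1] - knots[j])
            for j in range(len(knots) - 1)]
    return np.column_stack(cols)

x = np.linspace(-2, 2, 400)
y = np.tanh(3 * x) + 0.1 * np.random.randn(400)   # target that saturates
knots = np.linspace(-1, 1, 11)
model = Lasso(alpha=1e-3).fit(saturating_basis(x, knots), y)
y_hat = model.predict(saturating_basis(x, knots))
# y_hat is constant for x < knots[0] and x > knots[-1], by construction.
```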
no code implementations • NeurIPS 2016 • Chi Jin, Yuchen Zhang, Sivaraman Balakrishnan, Martin J. Wainwright, Michael Jordan
Our first main result shows that the population likelihood function has bad local maxima even in the special case of equally-weighted mixtures of well-separated and spherical Gaussians.
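The phenomenon is easy to reproduce empirically; here is a toy sketch (illustrative, not the paper's construction) in which EM from a bad initialization converges to a spurious maximum:

```python
# Toy reproduction of EM getting trapped: an equally weighted mixture of
# three well-separated 1-D Gaussians, with a bad initialization that
# piles all components on one cluster. Illustrative sketch only.
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(1)
X = np.concatenate([rng.normal(m, 1.0, 500)
                    for m in (-10.0, 0.0, 10.0)]).reshape(-1, 1)

def em(means):
    return GaussianMixture(
        n_components=3,
        weights_init=[1 / 3] * 3,
        means_init=[[m] for m in means],
        precisions_init=[[[1.0]]] * 3,
        max_iter=500,
    ).fit(X)

good = em([-10.0, 0.0, 10.0])
bad = em([-10.0, -9.0, -8.0])   # all components start on the left cluster

# The bad run typically stalls at a spurious local maximum with a
# strictly lower average log-likelihood than the good run.
print(good.score(X), bad.score(X))
```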
no code implementations • 23 Jun 2016 • Ryan Giordano, Tamara Broderick, Rachael Meager, Jonathan Huggins, Michael Jordan
Bayesian hierarchical models are increasingly popular in economics.
1 code implementation • NeurIPS 2015 • Ryan Giordano, Tamara Broderick, Michael Jordan
We call our method linear response variational Bayes (LRVB).
17 code implementations • 8 Jun 2015 • John Schulman, Philipp Moritz, Sergey Levine, Michael Jordan, Pieter Abbeel
Policy gradient methods are an appealing approach in reinforcement learning because they directly optimize the cumulative reward and can straightforwardly be used with nonlinear function approximators such as neural networks.
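For reference, the vanilla policy gradient (REINFORCE) estimator that such methods build on, in a minimal numpy sketch on a toy two-armed bandit; this is illustrative background, not TRPO itself, which additionally constrains each update with a KL trust region:

```python
# Vanilla policy gradient (REINFORCE) on a toy 2-armed bandit:
#   grad J(theta) = E[ grad log pi(a; theta) * reward ].
# Illustrative sketch; TRPO adds a trust-region constraint on the update.
import numpy as np

rng = np.random.default_rng(0)
theta = np.zeros(2)                    # logits over the two arms
true_means = np.array([0.2, 0.8])      # arm 1 has higher expected reward

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

lr = 0.1
for step in range(2000):
    pi = softmax(theta)
    a = rng.choice(2, p=pi)
    r = rng.normal(true_means[a], 0.1)   # sampled reward
    grad_logpi = -pi                     # d log softmax(theta)_a / d theta_k
    grad_logpi[a] += 1.0                 #   = 1{k=a} - pi_k
    theta += lr * r * grad_logpi         # REINFORCE ascent step

print(softmax(theta))   # probability mass concentrates on the better arm
```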
no code implementations • 27 Jun 2012 • John Paisley, David Blei, Michael Jordan
This requires the ability to integrate a sum of terms in the log joint likelihood against the factorized variational distribution.
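Concretely, the quantity in question is the evidence lower bound (ELBO) under a mean-field factorization; in standard notation (not verbatim from the paper):

```latex
% ELBO under a mean-field factorization q(z) = \prod_i q_i(z_i).
% Writing the log joint as a sum of terms, \log p(x, z) = \sum_j f_j(x, z):
\mathcal{L}(q)
  = \mathbb{E}_q\!\left[\log p(x, z)\right] - \mathbb{E}_q\!\left[\log q(z)\right]
  = \sum_j \mathbb{E}_q\!\left[f_j(x, z)\right] + \mathbb{H}[q]
% Maximizing L requires integrating each f_j against the factorized q;
% stochastic search replaces the intractable expectations among these
% terms with Monte Carlo estimates.
```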
no code implementations • 27 Jun 2012 • Purnamrita Sarkar, Deepayan Chakrabarti, Michael Jordan
We propose a non-parametric link prediction algorithm for a sequence of graph snapshots over time.
no code implementations • 6 Sep 2011 • Purnamrita Sarkar, Deepayan Chakrabarti, Michael Jordan
We propose a nonparametric approach to link prediction in large-scale dynamic networks.
1 code implementation • 1 Aug 2009 • Percy Liang, Michael Jordan, Dan Klein
A central problem in grounded language acquisition is learning the correspondences between a rich world state and a stream of text which references that world state.