no code implementations • 23 Apr 2024 • Sachin Garg, Albert S. Berahas, Michał Dereziński
We show that, for finite-sum minimization problems, incorporating partial second-order information of the objective function can dramatically improve the robustness to mini-batch size of variance-reduced stochastic gradient methods, making them more scalable while retaining their benefits over traditional Newton-type approaches.
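As a rough, self-contained illustration of this idea (not the paper's algorithm), the sketch below combines an SVRG-style variance-reduced gradient with a subsampled-Hessian preconditioner on a synthetic least-squares finite sum; the problem setup, batch sizes, and helper names (`minibatch_grad`, `full_grad`) are all assumptions.

```python
# Illustrative sketch only: SVRG-style variance reduction plus a
# subsampled-Hessian preconditioner on a synthetic least-squares finite sum.
import numpy as np

rng = np.random.default_rng(0)
n, d = 500, 20
A = rng.standard_normal((n, d))
b = A @ rng.standard_normal(d) + 0.1 * rng.standard_normal(n)

def minibatch_grad(w, idx):
    """Gradient of the finite-sum least-squares loss restricted to rows idx."""
    Ai = A[idx]
    return Ai.T @ (Ai @ w - b[idx]) / len(idx)

def full_grad(w):
    return A.T @ (A @ w - b) / n

w = np.zeros(d)
batch, hess_batch, step = 10, 100, 1.0
for epoch in range(20):
    w_ref = w.copy()
    g_ref = full_grad(w_ref)                       # full gradient at the reference point
    idx_H = rng.choice(n, hess_batch, replace=False)
    H = A[idx_H].T @ A[idx_H] / hess_batch + 1e-3 * np.eye(d)  # subsampled Hessian
    for _ in range(n // batch):
        idx = rng.choice(n, batch, replace=False)
        # SVRG-style variance-reduced gradient estimate
        g = minibatch_grad(w, idx) - minibatch_grad(w_ref, idx) + g_ref
        # partial second-order information: precondition the step with H
        w -= step * np.linalg.solve(H, g)

print("norm of full gradient at the final iterate:", np.linalg.norm(full_grad(w)))
```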
no code implementations • 15 Nov 2023 • Albert S. Berahas, Lindon Roberts, Fred Roosta
The analysis of gradient descent-type methods typically relies on the Lipschitz continuity of the objective gradient.
no code implementations • 6 Sep 2023 • Suhail M. Shah, Albert S. Berahas, Raghu Bollapragada
We consider network-based decentralized optimization problems, where each node in the network possesses a local function and the objective is to collectively attain a consensus solution that minimizes the sum of all the local functions.
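For a minimal picture of this setting (not the method proposed in the paper), the sketch below runs decentralized gradient descent on a ring of five nodes, each holding a scalar quadratic, with a doubly stochastic mixing matrix; with a constant step size the local copies settle near the consensus minimizer (the mean of the local targets). The network, objectives, and names are assumptions.

```python
# Toy decentralized gradient descent (DGD) on a ring network; setup is assumed.
import numpy as np

num_nodes = 5
c = np.array([1.0, 2.0, 3.0, 4.0, 5.0])   # node i holds f_i(x) = 0.5 * (x - c[i])**2
x = np.zeros(num_nodes)                    # each node's local copy of the decision variable

# Doubly stochastic mixing matrix for a ring: average with both neighbors.
W = np.zeros((num_nodes, num_nodes))
for i in range(num_nodes):
    for j in (i - 1, i, i + 1):
        W[i, j % num_nodes] = 1.0 / 3.0

step = 0.05
for k in range(2000):
    local_grads = x - c                    # gradient of f_i at node i's local copy
    x = W @ x - step * local_grads         # gossip with neighbors, then a local step

print("local iterates:", np.round(x, 3))
print("minimizer of the sum (mean of c):", c.mean())
```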
1 code implementation • 25 Jun 2023 • Xubo Yue, Raed Al Kontar, Albert S. Berahas, Yang Liu, Blake N. Johnson
Empirically, through simulated datasets and a real-world collaborative sensor design experiment, we show that our framework can effectively accelerate and improve the optimal design process and benefit all participants.
no code implementations • 1 Jan 2023 • Albert S. Berahas, Miaolan Xie, Baoyu Zhou
A step-search sequential quadratic programming method is proposed for solving nonlinear equality constrained stochastic optimization problems.
1 code implementation • 24 Jun 2021 • Albert S. Berahas, Frank E. Curtis, Michael J. O'Neill, Daniel P. Robinson
A sequential quadratic optimization algorithm is proposed for solving smooth nonlinear equality constrained optimization problems in which the objective function is defined by an expectation of a stochastic function.
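As a toy illustration of the setting (not the paper's algorithm or its step-size control), the sketch below performs bare-bones stochastic SQP iterations on minimize 0.5*||x||^2 subject to sum(x) = 1: at each step it solves the KKT system with a noisy objective gradient and a linearized constraint, then takes a damped step. The test problem, noise level, and names are assumptions.

```python
# Toy stochastic SQP iteration on an equality-constrained problem (assumed setup).
import numpy as np

rng = np.random.default_rng(1)
d = 4
x = rng.standard_normal(d)

def constraint(x):
    return np.array([x.sum() - 1.0])       # c(x) = sum(x) - 1 = 0

J = np.ones((1, d))                        # constant constraint Jacobian
H = np.eye(d)                              # model of the Hessian of the Lagrangian
alpha, noise = 0.5, 0.1                    # step size and gradient-noise level

for k in range(300):
    g = x + noise * rng.standard_normal(d)  # stochastic gradient of f(x) = 0.5*||x||^2
    # SQP step: solve the KKT system
    #   [H  J^T] [p]     [-g    ]
    #   [J   0 ] [y]  =  [-c(x) ]
    K = np.block([[H, J.T], [J, np.zeros((1, 1))]])
    rhs = np.concatenate([-g, -constraint(x)])
    p = np.linalg.solve(K, rhs)[:d]
    x = x + alpha * p

print("x:", np.round(x, 3), "(true solution is 0.25 in every coordinate)")
print("constraint violation:", constraint(x))
```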
no code implementations • 6 Jun 2020 • Majid Jahani, MohammadReza Nazari, Rachael Tappenden, Albert S. Berahas, Martin Takáč
This work presents a new algorithm for empirical risk minimization.
no code implementations • 2 Jun 2020 • Zheng Shi, Nur Sila Gulgec, Albert S. Berahas, Shamim N. Pakzad, Martin Takáč
Discovering the underlying behavior of complex systems is an important topic in many science and engineering disciplines.
no code implementations • 30 May 2019 • Majid Jahani, MohammadReza Nazari, Sergey Rusakov, Albert S. Berahas, Martin Takáč
In this paper, we present a scalable distributed implementation of the Sampled Limited-memory Symmetric Rank-1 (S-LSR1) algorithm.
no code implementations • 29 May 2019 • Albert S. Berahas, Liyuan Cao, Krzysztof Choromanski, Katya Scheinberg
We then demonstrate, via a rigorous analysis of the variance and numerical comparisons on reinforcement learning tasks, that the Gaussian sampling method used in [Salimans et al. 2016] is significantly inferior to the orthogonal sampling used in [Choromanski et al. 2018], as well as to more general interpolation methods.
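The contrast can be seen on a toy example (this is not a reproduction of the paper's analysis or experiments): the sketch below compares forward-difference gradient estimates built from i.i.d. Gaussian directions with estimates built from an orthonormal set of directions on a simple quadratic; the estimator forms, test function, and dimensions are assumptions.

```python
# Toy comparison of Gaussian vs. orthogonal sampling for derivative-free
# gradient estimation on f(x) = ||x||^2; setup and names are assumed.
import numpy as np

rng = np.random.default_rng(2)
d, h, trials = 20, 1e-5, 500
f = lambda x: float(x @ x)                 # simple quadratic test function
x = rng.standard_normal(d)
true_grad = 2 * x

def gaussian_estimate():
    """Average of d forward differences along i.i.d. Gaussian directions."""
    U = rng.standard_normal((d, d))
    return sum(u * (f(x + h * u) - f(x)) / h for u in U) / d

def orthogonal_estimate():
    """Forward differences along the d orthonormal columns of a random Q."""
    Q, _ = np.linalg.qr(rng.standard_normal((d, d)))
    return sum(q * (f(x + h * q) - f(x)) / h for q in Q.T)

err = lambda est: np.mean([np.linalg.norm(est() - true_grad) for _ in range(trials)])
print("Gaussian directions, mean error:  ", err(gaussian_estimate))
print("orthogonal directions, mean error:", err(orthogonal_estimate))
```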
no code implementations • 3 May 2019 • Albert S. Berahas, Liyuan Cao, Krzysztof Choromanski, Katya Scheinberg
To this end, we use the results in [Berahas et al., 2019] and show how each method can satisfy the sufficient conditions, possibly only with some sufficiently large probability at each iteration, as happens to be the case with Gaussian smoothing and smoothing on a sphere.
1 code implementation • 28 Jan 2019 • Albert S. Berahas, Majid Jahani, Peter Richtárik, Martin Takáč
We present two sampled quasi-Newton methods (sampled LBFGS and sampled LSR1) for solving empirical risk minimization problems that arise in machine learning.
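Very loosely, the "sampled" idea is to build curvature information from points sampled around the current iterate rather than from past iterates. The sketch below is only a toy illustration under assumed names and an assumed quadratic objective: it forms (s, y) pairs from gradient differences at sampled nearby points and feeds them to the standard L-BFGS two-loop recursion with Armijo backtracking; it is not the paper's sampled LBFGS or LSR1 algorithm.

```python
# Toy "sampled" quasi-Newton step on a quadratic; not the paper's method.
import numpy as np

rng = np.random.default_rng(3)
d, m = 10, 5                                  # dimension and number of sampled pairs
M = rng.standard_normal((d, d))
A = M @ M.T + np.eye(d)                       # SPD Hessian of the quadratic test problem
b = rng.standard_normal(d)
f = lambda w: 0.5 * w @ A @ w - b @ w
grad = lambda w: A @ w - b

def two_loop(g, S, Y):
    """Standard L-BFGS two-loop recursion using the sampled pairs (S, Y)."""
    rho = [1.0 / (Y[j] @ S[j]) for j in range(len(S))]
    q, alphas = g.copy(), []
    for j in reversed(range(len(S))):
        a = rho[j] * (S[j] @ q)
        alphas.append(a)
        q -= a * Y[j]
    gamma = (S[-1] @ Y[-1]) / (Y[-1] @ Y[-1]) # scaled identity as the initial matrix
    r = gamma * q
    for j, a in zip(range(len(S)), reversed(alphas)):
        beta = rho[j] * (Y[j] @ r)
        r += (a - beta) * S[j]
    return r                                  # approximates (Hessian)^{-1} g

w = rng.standard_normal(d)
for k in range(50):
    g = grad(w)
    # sample fresh curvature pairs around the current iterate
    S = [0.1 * rng.standard_normal(d) for _ in range(m)]
    Y = [grad(w + s) - g for s in S]          # equals A @ s on this quadratic
    p = -two_loop(g, S, Y)
    t = 1.0                                   # Armijo backtracking line search
    while f(w + t * p) > f(w) + 1e-4 * t * (g @ p):
        t *= 0.5
    w += t * p

print("final gradient norm:", np.linalg.norm(grad(w)))
```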
no code implementations • 26 Jul 2017 • Albert S. Berahas, Martin Takáč
This paper describes an implementation of the L-BFGS method designed to deal with two adversarial situations.
no code implementations • 17 May 2017 • Albert S. Berahas, Raghu Bollapragada, Jorge Nocedal
Sketching, a dimensionality reduction technique, has received much attention in the statistics community.
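In the simplest least-squares case, a sketched Newton-type step replaces the Hessian A^T A by (S A)^T (S A) for a small random sketch S. The toy iteration below illustrates this; the sizes, the dense Gaussian sketch, and the names are assumptions rather than the paper's setup.

```python
# Toy sketched-Hessian Newton iteration for least squares; setup is assumed.
import numpy as np

rng = np.random.default_rng(4)
n, d, m = 2000, 30, 200            # many rows, few columns, sketch size m << n
A = rng.standard_normal((n, d))
b = A @ rng.standard_normal(d) + 0.05 * rng.standard_normal(n)

def newton_sketch_step(w):
    """One Newton-type step for 0.5*||A w - b||^2 with a sketched Hessian."""
    g = A.T @ (A @ w - b)                         # exact gradient
    S = rng.standard_normal((m, n)) / np.sqrt(m)  # Gaussian sketching matrix
    SA = S @ A
    H_sketch = SA.T @ SA                          # (S A)^T (S A) approximates A^T A
    return w - np.linalg.solve(H_sketch, g)

w = np.zeros(d)
for k in range(10):
    w = newton_sketch_step(w)
    print(f"iter {k}: gradient norm = {np.linalg.norm(A.T @ (A @ w - b)):.3e}")
```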
no code implementations • NeurIPS 2016 • Albert S. Berahas, Jorge Nocedal, Martin Takáč
The question of how to parallelize the stochastic gradient descent (SGD) method has received much attention in the literature.
no code implementations • 4 Nov 2015 • Nitish Shirish Keskar, Albert S. Berahas
In this paper, we present adaQN, a stochastic quasi-Newton algorithm for training RNNs.