no code implementations • 16 Apr 2021 • Ezra Tampubolon, Holger Boche
In this work, we propose an axiomatic approach for measuring the performance/welfare of a system consisting of concurrent agents in a resource-driven system.
no code implementations • 21 Oct 2020 • Ezra Tampubolon, Haris Ceribasic, Holger Boche
In this work, we study the system of interacting non-cooperative two Q-learning agents, where one agent has the privilege of observing the other's actions.
no code implementations • 21 Oct 2020 • Ezra Tampubolon, Holger Boche
Competitive non-cooperative online decision-making agents whose actions increase congestion of scarce resources constitute a model for widespread modern large-scale applications.
no code implementations • 21 Oct 2019 • Ezra Tampubolon, Holger Boche
Online-learning literature has focused on designing algorithms that ensure sub-linear growth of the cumulative long-term constraint violations.
no code implementations • 21 Oct 2019 • Ezra Tampubolon, Holger Boche
We give a condition on the step sizes and the degree of the augmentation of the Lagrangian, such that the proposed algorithm converges to a generalized Nash equilibrium.
no code implementations • 21 Oct 2019 • Ezra Tampubolon, Holger Boche
In case that the noise is persistent, and for several choices of the intrinsic parameter of the agents, such as their learning rate, and of the mechanism parameters, such as the learning rate of -, the progressivity of the price-setters, and the extrinsic price sensitivity of the agents, we show that the accumulative violation of the resource constraints of the resulted iterates is sub-linear w. r. t.