A Best-of-Both-Worlds Proof for Tsallis-INF without Fenchel Conjugates

arXiv (stat.ML) | Monday, November 17, 2025 at 5:00:00 AM
arXiv:2511.11211v1 (announce type: cross). Abstract: In this short note, we present a simple derivation of the best-of-both-worlds guarantee for the Tsallis-INF multi-armed bandit algorithm of Zimmert and Seldin (J. Zimmert and Y. Seldin. Tsallis-INF: An optimal algorithm for stochastic and adversarial bandits. Journal of Machine Learning Research, 22(28):1-49, 2021. URL https://jmlr.csail.mit.edu/papers/volume22/19-753/19-753.pdf). In particular, the proof uses modern tools from online convex optimization and avoids the use of conjugate functions. We do not optimize the constants in the bounds, preferring a slimmer proof.
— via World Pulse Now AI Editorial System
