jeremykun.com
Adversarial Bandits and the Exp3 Algorithm
In the last twenty years there has been a lot of research in a subfield of machine learning called Bandit Learning. The name comes from the problem of being faced with a large sequence of slot mach…