no code implementations • NeurIPS 2013 • Eshcar Hillel, Zohar Karnin, Tomer Koren, Ronny Lempel, Oren Somekh
That is, distributing learning to $k$ players gives rise to a factor $\sqrt{k}$ parallel speed-up.
Multi-Armed Bandits