Daily Shaarli

All links of one day in a single page.

May 28, 2019

20 lines of code that will beat A/B testing every time

Algorithm to the multi-armed bandit.

One strategy that has been shown to perform well time after time in practical problems is the epsilon-greedy method. We always keep track of the number of pulls of the lever and the amount of rewards we have received from that lever. 10% of the time, we choose a lever at random. The other 90% of the time, we choose the lever that has the highest expectation of rewards.