Daily - May 28, 2019 - Exploring the web, one link at a time

Delete Set public Set private Add tags Delete tags

Daily Weekly Monthly

Daily Shaarli

Previous day

All links of one day in a single page.

Next day

May 28, 2019

20 lines of code that will beat A/B testing every time

Algorithm to the multi-armed bandit.

One strategy that has been shown to perform well time after time in practical problems is the epsilon-greedy method. We always keep track of the number of pulls of the lever and the amount of rewards we have received from that lever. 10% of the time, we choose a lever at random. The other 90% of the time, we choose the lever that has the highest expectation of rewards.

algorithm testing statistics