Highest Rated Comments


livinGoat (2 karma)

How much of the research done on bandit problems is useful in practice? Every year a lot of papers are published on this topic with small variations on existing settings. Seb Bubeck wrote in a blog post that at some point he thought there was not much left to do in bandits, but new ideas keep arising. What do you see as future directions that could be relevant in practice? What do you think about the model selection problem in contextual bandits?