• Bookmarks

    Bookmarks

  • Concepts

    Concepts

  • Activity

    Activity

  • Courses

    Courses


Thompson Sampling is a Bayesian approach to the multi-armed bandit problem that balances exploration and exploitation by sampling from the posterior distribution of each action's expected reward. It is particularly effective in online decision-making scenarios where uncertainty about action outcomes needs to be reduced over time through adaptive learning.
3