[go: nahoru, domu]

Skip to content

tangzhenyu/Multi-arm-Bandit

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Multi-arm-Bandit

resources of multi-arm bandit

The features of a multi-arm bandit problem:

  • (F1) only one machine is operated at each time instant. The evolution of the machine that is being operated is uncontrolled; that is, the processor chooses which machine to operate but not how to operate it;
  • (F2) machines that are not operated remain frozen;
  • (F3) machines are independent;
  • (F4) frozen machines contribute no reward.

Algorithms:

  • Epsilon-Greedy
  • UCB
  • Contextual Bandits
    • LinUCB
    • CoLin
    • hLinUCB
    • FactorUCB
  • Thompson Sampling (Bayesian)
    • Bernoulli, Binomial <=> Beta Distributions

Books and Book Chapters

Academic Articles

Blog Posts

Presentations

Resources

scholars

About

resources of multi-arm bandit

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages