  • Faculty of Social Sciences
  • Department of Information Science and Media Studies

Multi-Armed Bandit Networks: Exploring Online Learning with Networks

Hansen, Viktor
Master thesis
master thesis (2.649Mb)
Permanent link
http://hdl.handle.net/1956/18665
Publication date
2018-06-26
Collections
  • Department of Information Science and Media Studies [1006]
Abstract
Classical Multi-Armed Bandit (MAB) solutions often assume independent arms as a simplification of the problem. This has produced strong results in many fields of practice, but in some cases it presumably leaves potential untapped. In this paper I explore network-based MAB solutions that use explore-exploit algorithms as nodes, in order to further minimize regret and take advantage of inter-Bandit dependencies. I explore two network approaches, Hierarchical and Flat, as well as a special case of the Bernoulli Bandit with dependent arms, referred to as the Symbiotic Bandit. The results show that some networked solutions outperform their single-node counterparts in terms of regret on both the Bernoulli Bandit and the Symbiotic Bandit.
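For readers unfamiliar with the terms, the following is a minimal sketch of the classical single-node baseline the abstract refers to: an explore-exploit algorithm (epsilon-greedy is chosen here purely as an illustration) playing a Bernoulli Bandit with independent arms, with regret measured against the best arm. The function name `run_epsilon_greedy`, its parameters, and the arm probabilities are assumptions made for this example and are not taken from the thesis.

```python
import random


def run_epsilon_greedy(arm_probs, steps, epsilon=0.1, seed=0):
    """Play a Bernoulli bandit with epsilon-greedy (illustrative baseline only).

    arm_probs: hypothetical success probability of each independent arm.
    Returns (cumulative expected regret, number of pulls per arm).
    """
    rng = random.Random(seed)
    n_arms = len(arm_probs)
    pulls = [0] * n_arms      # how often each arm was played
    means = [0.0] * n_arms    # running estimate of each arm's reward
    best = max(arm_probs)     # expected reward of the optimal arm
    regret = 0.0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest estimated reward.
            arm = max(range(n_arms), key=lambda a: means[a])

        # Bernoulli reward: 1 with probability arm_probs[arm], else 0.
        reward = 1 if rng.random() < arm_probs[arm] else 0

        # Incremental update of the sample mean for the chosen arm.
        pulls[arm] += 1
        means[arm] += (reward - means[arm]) / pulls[arm]

        # Expected regret: gap between the best arm and the chosen arm.
        regret += best - arm_probs[arm]

    return regret, pulls


if __name__ == "__main__":
    total_regret, pull_counts = run_epsilon_greedy([0.2, 0.5, 0.8], steps=1000)
    print("cumulative regret:", total_regret)
    print("pulls per arm:", pull_counts)
```

This sketch treats every arm as independent, which is exactly the simplification the abstract questions. It does not implement the Hierarchical or Flat networks or the Symbiotic Bandit described in the thesis.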
Publisher
The University of Bergen
Copyright
Copyright the author. All rights reserved.
