Search results: 1-1 of 1 record found for "Fuzzy Mathematics Markovian Rewards". Query time: 0.078 seconds.
On the Combinatorial Multi-Armed Bandit Problem with Markovian Rewards
Combinatorial Multi-Armed Bandit Problem; Markovian Rewards
2011/1/21
We consider a combinatorial generalization of the classical multi-armed bandit problem that is defined as follows. There is a given bipartite graph of M users and N ≥ M resources.