Dynamic Spectrum Access in Time-varying Environment: Distributed Learning Beyond Expectation Optimization

28 Jul 2017  ·  Xu Yuhua, Wang Jinlong, Wu Qihui, Zheng Jianchao, Shen Liang, Anpalagan Alagan ·

This article investigates the problem of dynamic spectrum access for canonical wireless networks, in which the channel states are time-varying. In the most existing work, the commonly used optimization objective is to maximize the expectation of a certain metric (e.g., throughput or achievable rate). However, it is realized that expectation alone is not enough since some applications are sensitive to fluctuations. Effective capacity is a promising metric for time-varying service process since it characterizes the packet delay violating probability (regarded as an important statistical QoS index), by taking into account not only the expectation but also other high-order statistic. Therefore, we formulate the interactions among the users in the time-varying environment as a non-cooperative game, in which the utility function is defined as the achieved effective capacity. We prove that it is an ordinal potential game which has at least one pure strategy Nash equilibrium. Based on an approximated utility function, we propose a multi-agent learning algorithm which is proved to achieve stable solutions with dynamic and incomplete information constraints. The convergence of the proposed learning algorithm is verified by simulation results. Also, it is shown that the proposed multi-agent learning algorithm achieves satisfactory performance.

PDF Abstract
No code implementations yet. Submit your code now

Categories


Information Theory Computer Science and Game Theory Information Theory

Datasets


  Add Datasets introduced or used in this paper