Network Selection Method Based on a Q-Learning Algorithm

Invention publication (published patent application)


Patent title: Network selection method based on Q-learning algorithm
Patent title (original Chinese): 一种基于Q学习算法的网络选择方法
Application number: CN201710915293.X

Filing date: 2017-09-30
Publication (announcement) number: CN107690176A

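Publication (announcement) date: 2018-02-13
Inventors: Li Yang (李洋), Feng Bao (冯宝), Liu Jinsuo (刘金锁), Zhao Gaofeng (赵高峰), Zhang Liwu (张立武), Cai Shilong (蔡世龙), Liu Wengui (刘文贵), Wanyan Shaopeng (完颜绍澎), Bian Yuxiang (卞宇翔), Ma Tao (马涛), Ding Chenyang (丁晨阳), Hu Yang (胡阳), Kuai Benlian (蒯本链)
Applicants: Nanjing NARI Group Corporation (南京南瑞集团公司), Nanjing NARI Information and Communication Technology Co., Ltd. (南京南瑞信息通信科技有限公司)
Applicant address: No. 8 Nanrui Road, Gulou District, Nanjing, Jiangsu Province
Patentees: Nanjing NARI Group Corporation, Nanjing NARI Information and Communication Technology Co., Ltd.
Current patentees: Nanjing NARI Group Corporation, Nanjing NARI Information and Communication Technology Co., Ltd.
Current patentee address: No. 8 Nanrui Road, Gulou District, Nanjing, Jiangsu Province
Agency: Nanjing Zongheng Intellectual Property Agency Co., Ltd. (南京纵横知识产权代理有限公司)
Agents: Yao Lanlan (姚兰兰); Dong Jianlin (董建林)
Main classification: H04W48/18
IPC classifications: H04W48/18; H04W24/06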

Abstract:

The invention discloses a network selection method based on the Q-learning algorithm, comprising the following steps:
(1) initialize the Q-value table and set the discount factor γ and the learning rate α;
(2) determine the type k of the service arriving at the set time and the current load rates BW_i of the two networks, which together give the current state s_n;
(3) select an available action from the action set A, and record that action together with the next network state s_{n+1};
(4) compute the immediate reward function r from the network state that results from executing the selected action;
(5) update the Q-value function Q_n(s, a), with the learning rate α decreasing gradually to 0 following an inverse-proportional function;
(6) repeat steps (2)-(5) until the Q values converge, that is, until the difference between the Q values before and after an update is smaller than a threshold;
(7) return to step (3) to select an action and access the best network.
The method can effectively reduce the blocking rate of voice services and the packet loss rate of data services, and increase the average network throughput.
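The seven steps of the abstract can be sketched as a small tabular Q-learning loop. This is only an illustrative sketch, not the patented implementation: the abstract does not give the reward formula, the load dynamics, the exploration policy, or any numeric values, so `reward`, `step`, the epsilon-greedy choice, the `patience` counter, and the constants `GAMMA`, `THRESHOLD`, and `LEVELS` below are all assumptions introduced for illustration. Only the overall loop, the inverse-proportional learning rate, and the threshold test on the Q-value change follow the text.

```python
import random
from collections import defaultdict

GAMMA = 0.8        # discount factor (assumed value)
THRESHOLD = 1e-3   # convergence threshold on |Q_new - Q_old| (assumed value)
LEVELS = 4         # quantized load levels per network (assumed discretization)
ACTIONS = (0, 1)   # action: admit the service to network 0 or network 1
SERVICES = (0, 1)  # service type k: 0 = voice, 1 = data


def learning_rate(visits):
    """Inverse-proportional schedule: alpha = 1 / (1 + visits), tending to 0."""
    return 1.0 / (1.0 + visits)


def reward(k, action, loads):
    """Hypothetical reward: service/network match, scaled down by the load."""
    preference = 1.0 if action == k else 0.5
    return preference * (1.0 - loads[action] / LEVELS)


def step(loads, action, rng):
    """Toy dynamics: the chosen network gains one load level, then each
    network sheds one level with probability 0.5 (assumed departures)."""
    new = list(loads)
    new[action] = min(LEVELS - 1, new[action] + 1)
    for i in range(len(new)):
        if rng.random() < 0.5:
            new[i] = max(0, new[i] - 1)
    return tuple(new)


def q_update(q, visits, s, a, r, s_next):
    """Step (5): one Q-learning update; returns the absolute change in Q(s, a)."""
    alpha = learning_rate(visits[(s, a)])
    visits[(s, a)] += 1
    target = r + GAMMA * max(q[(s_next, b)] for b in ACTIONS)
    old = q[(s, a)]
    q[(s, a)] = old + alpha * (target - old)
    return abs(q[(s, a)] - old)


def select_network(q, k, loads):
    """Step (7): greedy choice of the network with the largest Q value."""
    s = (k, tuple(loads))
    return max(ACTIONS, key=lambda a: q[(s, a)])


def train(max_steps=50_000, epsilon=0.1, patience=200, seed=0):
    rng = random.Random(seed)
    q = defaultdict(float)      # step (1): Q table starts at zero
    visits = defaultdict(int)   # per-(state, action) counts drive alpha
    k, loads, calm = rng.choice(SERVICES), (0, 0), 0
    for _ in range(max_steps):  # step (6): repeat steps (2)-(5)
        s = (k, loads)          # step (2): state = (service type, loads)
        # step (3): epsilon-greedy exploration (an assumption of this sketch)
        if rng.random() < epsilon:
            a = rng.choice(ACTIONS)
        else:
            a = select_network(q, k, loads)
        loads = step(loads, a, rng)
        r = reward(k, a, loads)  # step (4): reward from the resulting state
        k = rng.choice(SERVICES)  # type of the next arriving service
        delta = q_update(q, visits, s, a, r, (k, loads))
        # Stop after `patience` consecutive small updates (assumed criterion).
        calm = calm + 1 if delta < THRESHOLD else 0
        if calm >= patience:
            break
    return q


if __name__ == "__main__":
    table = train()
    print("voice, both networks idle ->", select_network(table, 0, (0, 0)))
```

Counting visits per state-action pair is one common way to realize a learning rate that decays as 1/n; a single global step counter would also match the wording of step (5).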

Published/granted documents

CN107690176B Network selection method based on Q-learning algorithm. Publication/grant date: 2020-08-25


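IPC classification:

H	Electricity
H04	Electric communication technique
H04W	Wireless communication networks (broadcast communication, see H04H; communication systems using wireless links for non-selective communication, e.g. wireless extensions, see H04M1/72)
H04W48/00	Access restriction (access security to prevent unauthorised access, see H04W12/08); network selection; access point selection
H04W48/18	. Selecting a network or a communication service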