系统工程理论与实践
繫統工程理論與實踐
계통공정이론여실천
Systems Engineering—Theory & Practice
2007年
7期
160~165
,共null页
运筹学 动态武器目标分配 算法 策略优化 马尔可夫决策过程
運籌學 動態武器目標分配 算法 策略優化 馬爾可伕決策過程
운주학 동태무기목표분배 산법 책략우화 마이가부결책과정
operations research; dynamic weapon target assignment; algorithm; policy optimization; Markovdecision process (MDP) ; mathematical model
动态武器目标分配(Weapon Target Assignment,WTA)中的目标选择策略问题可以通过建立马尔可夫决策过程(Markov decision pmcesses,MDP)模型进行研究,但目前尚无有效求解此类较大规模的MDP问题中最优策略的算法.通过分析动态WTA问题的MDP模型特点,给出了求解该问题最优策略的改进算法.该算法主要在初始策略选取规则、策略改进规则以及最优策略的判断准则等方面进行了改进.该算法具有计算量小,节省内存,并可得到最优解等优点.最后,通过算例将该算法与传统算法进行了比较.改进算法可以用于解决较大规模的动态WTA中的策略优化问题。
動態武器目標分配(Weapon Target Assignment,WTA)中的目標選擇策略問題可以通過建立馬爾可伕決策過程(Markov decision pmcesses,MDP)模型進行研究,但目前尚無有效求解此類較大規模的MDP問題中最優策略的算法.通過分析動態WTA問題的MDP模型特點,給齣瞭求解該問題最優策略的改進算法.該算法主要在初始策略選取規則、策略改進規則以及最優策略的判斷準則等方麵進行瞭改進.該算法具有計算量小,節省內存,併可得到最優解等優點.最後,通過算例將該算法與傳統算法進行瞭比較.改進算法可以用于解決較大規模的動態WTA中的策略優化問題。
동태무기목표분배(Weapon Target Assignment,WTA)중적목표선택책략문제가이통과건립마이가부결책과정(Markov decision pmcesses,MDP)모형진행연구,단목전상무유효구해차류교대규모적MDP문제중최우책략적산법.통과분석동태WTA문제적MDP모형특점,급출료구해해문제최우책략적개진산법.해산법주요재초시책략선취규칙、책략개진규칙이급최우책략적판단준칙등방면진행료개진.해산법구유계산량소,절성내존,병가득도최우해등우점.최후,통과산례장해산법여전통산법진행료비교.개진산법가이용우해결교대규모적동태WTA중적책략우화문제。
The policies optimization problem of dynamic weapon target assignment (WTA) could be modeled with Markov decision processes (MDP); however, there have been no effective algorithms to solve the optimal policies of such large-scale problems by now. The characteristics of the MDP are analyzed, and the improved algorithm to solve optimal policies of the problem is proposed correspondingly. The algorithm is mainly improved in the selection rnle of initial policy, the improvement rnle of policy and the evaluation criterion of optimal policies, so both the storage space and computing time are reduced. Meanwhile the optimal solution of the MDP problem could be obtained by the improved algorithm. Finally, a simple comparison between the improved algorithm and conventional algorithm is given through an example. It can be concluded that the improvement algorithm is suitable to solve large-scale problems such as the policies optimization problem of dynamic WTA.