policy improvement iteration
同义词释义
- 1)policy improvement iteration,策略改进[迭代]2)policy iteration,策略迭代3)Iterative strategy,迭代策略4)Iterative improvement,迭代改进5)improvement,改进策略6)improvement strategy,改进策略
用法例句
Tradional algorithms to solve POMPDs are value iteration algorithm and policy iteration algorithm.
介绍了部分可观察Markov决策过程的基本原理和决策过程,提出一种基于策略迭代和值迭代的部分可观察Markov决策算法,该算法利用线性规划和动态规划的思想,解决当信念状态空间较大时出现的"维数灾"问题,得到Markov决策的逼近最优解。
By the equivalent Markov process, formulas of performance potentials and average-cost optimality equations for SMCPs are derived, and a policy iteration algorithm and a value iteration algorithm are proposed, which can lead to an optimal or suboptimal stationary policy in a finite number of iterations.
利用等价Markov过程的方法,导出了SMCP的性能势公式和平均代价最优性方程,给出了求解最优或次最优平稳策略的策略迭代算法和数值迭代算法,并证明了算法的收敛性。
An appropriate selection of basis function directly in?uences the learning performance of a policy iteration method during the value function approximation.
在策略迭代结强化学习方法的值函数逼近过程中,基函数的合理选择直接影响方法的性能。
The al-gorithm is a integration of progressive alignment approach and iterative strategy.
该算法先用渐进方法进行多序列比对,然后通过迭代策略,利用上一轮多序列比对结果修正指导树,产生新一轮比对。
Aiming at difficulties of the method in solving parameters, this study carries through the iterative improvement.
针对二项式产能方程法在求解参数过程中存在的难点,对该方法进行了迭代改进,并编写了相应程序,提高了计算效率及精度。
The Obstruction of Census Register System in the Realization of Education Equity and Reflections on Its Tactical Improvement;
户籍制度在实现教育公平中的阻滞作用及其改进策略思考
Problem and Improvement Strategy on the Legal System Education in the Universities and Colleges;
高校法制教育中的问题及改进策略研究
The paper addresses an efficiency measurement and an improvement strategy for the Formal Software Evolution Process Model(FSEPM).
提出了一种支持形式化的软件演化过程模型的效率度量方法和改进策略。
Based on the datas of the investigation, the author proposed an improvement strategy and creatively applied visual instruments on recitation.
笔者首先对L外国语中学三个年级300名学生进行了随机问卷调查,从态度、认知和实际应用三个层面进行调查分析并着重从应用层面提出相应的改进策略,其中创造性地提出将可视化工具运用于背诵以增进信息的编码与加工。
Recursive Least Squares Policy Iteration Based on Geodesic Gaussian Basis Function
基于测地高斯基函数的递归最小二乘策略迭代
Iterative Digitising Strategies for Continuous Scanning Probes in Reverse Engineering;
逆向工程中连续扫描测头的迭代数字化策略
rQrRi--A Rsrouce Quality Rating and Reputation Iterating Strategy for p2p Systems;
rQrRi—p2p系统的资源质量评价和信用迭代策略
A cooperative communication scheme based on PDA+LDPC iterative detection algorithm
一种基于PDA+LDPC迭代检测算法的协作通信策略
Distributed cooperative communication scheme based on the PDA+LDPC iterative receivers
基于PDA+LDPC迭代接收机的分布式协作通信策略
A novel iterative learning control for product quality control in batch process
一种间歇过程产品质量迭代学习控制策略
Metaheuristic Strategy Based K-Means with the Iterative Self-Learning Framework
一种基于元启发式策略的迭代自学习K-Means算法
An Iteratively Bargaining-based Strategy for Optimizing Service Composition Execution Path
基于迭代Bargaining策略优化服务合成执行路径
Research of initial iterative learning control strategy based on BP neural network
基于BP神经网络的迭代学习初始控制策略研究
Telecommunications Network Traffic Flow Distribution Based on Markov-processes and Improvement of the Policy Iteration;
通信网流量分配的马氏决策及策略改善迭代计算
Research of Energy Saving in Air Conditioning Systems Based on Flexible ILC;
基于柔性迭代学习控制的空调系统节能策略研究
Study of Initial Iterative Learning Control Strategy Based on Adaptive Neural-Fuzzy Inference System
基于自适应神经模糊推理系统的迭代学习控制初始控制策略研究
Study on Energy Saving of Air Conditioning System Based on Flexible ILC with Feedback Compensation
基于柔性迭代学习控制和反馈补偿的空调系统节能策略
On Translation Strategies of Gladys Yang's the Border Town
戴乃迭《边城》英译本的翻译策略研究
Iteration and Iterative Roots for Polygonal Functions and Multifunctions;
折线函数和集值函数的迭代与迭代根
The Convergence of Mann Iteration is Equivalent to the Convergence of Ishikawa Iteration with Errors;
Mann迭代和带误差的Ishikawa迭代的等价性
On the Traditional Iteration and Elongate Iteration in the 3N + 1 Conjecture;
3N+1猜想中的通常迭代与伸长迭代
Mann Iterates and Ishikawa Iterates in Banach Space
Banach空间中的Mann迭代和Ishikawa迭代
相关英语词汇
- penalized likelihood function是什么意思
- phase oscillation是什么意思
- pressure dispersion是什么意思
- principle of harmonic measure是什么意思
- penning ion source是什么意思
- pseudoproduct是什么意思
- polar of nonnegative convex function是什么意思
- periodicity interval是什么意思
- proper convex body是什么意思
- pivot column是什么意思
- polyhedral group是什么意思
- parallel coordinates是什么意思
- pre hilbert space是什么意思
- piecewise smooth是什么意思
- profile monitor是什么意思
- principal minor是什么意思
- projective metric是什么意思
- point slope form是什么意思
- proof with the deductive method是什么意思
- proof是什么意思
- polyhedron是什么意思
- paramagnetic element是什么意思
- purely transcendental extension是什么意思
- pseudodifferential operator是什么意思
- pseudoconformal mapping是什么意思
- probable ellipsoid是什么意思
- projective conic是什么意思
- photographic photometry是什么意思
- positive eyepiece是什么意思
- periodicity of function是什么意思