IAES International Journal of Robotics and Automation (IJRA)
Vol 7, No 1: March 2018

An Actor-critic Algorithm Using Cross Evaluation of Value Functions

Hui Wang (Soochow University)
Peng Zhang (Soochow University)
Quan Liu (Soochow University)



Article Info

Publish Date
01 Mar 2018

Abstract

In order to overcome the difficulty of learning a global optimal policy caused by maximization bias in a continuous space, an actor-critic algorithm for cross evaluation of double value function is proposed. Two independent value functions make the critique closer to the real value function. And the actor is guided by a crossover function to choose its optimal actions. Cross evaluation of value functions avoids the policy jitter phenomenon behaved by greedy optimization methods in continuous spaces. The algorithm is more robust than CACLA learning algorithm, and the experimental results show that our algorithm is smoother and the stability of policy is improved obviously under the condition that the computation remains almost unchanged.

Copyrights © 2018






Journal Info

Abbrev

IJRA

Publisher

Subject

Automotive Engineering Electrical & Electronics Engineering

Description

Robots are becoming part of people's everyday social lives and will increasingly become so. In future years, robots may become caretaker assistants for the elderly, or academic tutors for our children, or medical assistants, day care assistants, or psychological counselors. Robots may become our ...