News
In this work, a novel value function-based reinforcement learning (RL) approach, descending dynamic policy programming (DDPP), is proposed to address the issues of sample efficiency and learning ...
Sorting tasks are typical applications in the product and industrial domains. When facing new settings, such as different types of objects and their positions, the system has to be reprogrammed by ...