On the other hand, in reinforcement learning, agents learn estimated values for the actions available in their states. An agent's state generally consists of its observations of the other agents. Each agent observes itself from a fixed viewpoint, whereas the other agents observe it from arbitrary positions; their state-spaces must therefore differ in order to distinguish one agent from the others. Even when the agents learn the same task, estimated values defined on different state-spaces are not interchangeable, so an agent cannot directly share its learned results with the others.
We propose here a method for transmitting one agent's learned estimated values to the other agents. We assume that any two agents have at least one commonly observed agent among those that constitute their state-spaces. We also present an experimental result of applying this method to a pursuit problem, in which multiple agents cooperate to capture a prey.
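To make the idea concrete, here is a minimal sketch, not the exact algorithm proposed above: suppose each agent's state is a tuple of observations of the other agents, ordered by the observer's own indexing, and Q-values are stored in a table keyed by (state, action). If a receiving agent knows how the sender's observation slots correspond to its own (which a commonly observed agent makes possible), it can relabel the sender's states into its own state-space before importing the values. The function `translate_q_table` and the example states below are hypothetical illustrations.

```python
def translate_q_table(q_src, relabel):
    """Relabel a sender's Q-table into the receiver's state-space.

    q_src   : dict mapping (state, action) -> estimated value, where
              state is a tuple of per-agent observations in the
              sender's slot order.
    relabel : list where relabel[j] gives the sender's slot that
              fills the receiver's slot j.
    """
    q_dst = {}
    for (state, action), value in q_src.items():
        # Reorder the sender's observation tuple into the
        # receiver's slot order; the value itself is unchanged.
        new_state = tuple(state[relabel[j]] for j in range(len(relabel)))
        q_dst[(new_state, action)] = value
    return q_dst


# Hypothetical example: the sender observes two agents in slot order
# (agent1, agent2); the receiver observes the same agents but in
# order (agent2, agent1), so relabel = [1, 0].
q_src = {(((1, 0), (0, 1)), "north"): 0.8}
q_dst = translate_q_table(q_src, relabel=[1, 0])
# The same value is now keyed by the receiver's state ordering.
```

The essential point is that only the indexing of states changes; the estimated values themselves are carried over unchanged, which is why at least one commonly observed agent is needed to anchor the correspondence between slot orderings.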