An information processing apparatus includes a first recurrent neural network
(RNN) for performing processing which corresponds to a time-series and a second
RNN for processing another correlated time-series. The difference between a context
set output by the first RNN and a context set output by the second RNN is computed
by a subtractor, and the obtained difference is used as a prediction error. Backpropagation
is performed based on the prediction error, thus determining a coefficient for
each neuron of an output layer, an intermediate layer, and an input layer.