Research Coagent Networks Training a neural network without backpropagation Thinker Learning to plan and act by augmenting the environment