OpenAI’s article on evolution strategies algorithm as a scalable alternative to reinforcement learning. For unsupervised learning, the gradients can be computed with finite differences instead of backpropagation.
Apparently, using this algorithm in tensorflow which works with denormal numbers off, nonlinearity arises from deep linear networks.