Yarin Gal has published his PhD thesis on deep learning. It offers some great insights and serves as an introduction to Bayesian Deep Learning. The full thesis runs to 120 pages, but the introduction alone is already a nice read.