Applied Maths of Neural Networks -- books and papers?

AI Thread Summary
The discussion centers on the search for up-to-date books and research papers focused on the mathematics of neural networks, specifically seeking generalized theorems related to optimizing neuron placement and weight calculations. A user mentions having read "Intro to the Theory of Neural Computation" by John Hertz but finds it outdated. They express interest in rigorous texts that include proofs, particularly those that delve into geometry and differential equations, including fractional applications. Recommendations include "Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow," which, while practical, may lack detailed mathematical rigor, and "Algorithms for Optimization," noted for its coverage of machine learning algorithms and written in Julia. The conversation highlights a desire for resources that balance theoretical depth with practical application in neural network optimization.
s00mb
Messages
33
Reaction score
10
Does anyone know any good up to date books on the mathematics of neural networks? If you know any good research papers that would be cool too. I've read through "Intro to the theory of neural computation" by John Hertz but it's a 1991 book. I'm looking for generalized theorems on optimizing/minimizing the placement and number of neurons in the network assuming some given way of calculating the weights. Also, if you know any cool research papers I'd like to know them. I've already seen the one on changing one pixel input to confuse a NN though. Geometry and Differential Equations (especially fractional applications) a plus. I've been browsing the internet for it and figured I'd try to pick your brains for it while I am at it. Thanks in advance.
 
Physics news on Phys.org
There's a lot of books out there. The one I like is:

Hands-on Machine Learning with Scikit-Learn, Keras and Tensorflow

It likely has some interesting math in it although most books skip past the details and go for the practice more. People want to use the algorithms to do cool stuff and leave the math to the developers of these packages.

Another book of interest would be:

Algorithms for Optimization

which covers many of the algorithms used in machine learning. All the code is written in Julia.
 
Yeah, I've read through most of the "practical" books, I was looking for rigorous books with proofs. Ohhh bonus points for using Julia I like that I'll have to check it out just for that :)
 
Last edited:
The book is fascinating. If your education includes a typical math degree curriculum, with Lebesgue integration, functional analysis, etc, it teaches QFT with only a passing acquaintance of ordinary QM you would get at HS. However, I would read Lenny Susskind's book on QM first. Purchased a copy straight away, but it will not arrive until the end of December; however, Scribd has a PDF I am now studying. The first part introduces distribution theory (and other related concepts), which...
I've gone through the Standard turbulence textbooks such as Pope's Turbulent Flows and Wilcox' Turbulent modelling for CFD which mostly Covers RANS and the closure models. I want to jump more into DNS but most of the work i've been able to come across is too "practical" and not much explanation of the theory behind it. I wonder if there is a book that takes a theoretical approach to Turbulence starting from the full Navier Stokes Equations and developing from there, instead of jumping from...

Similar threads

Back
Top