Maths of Neural Networks -- books and papers?

Click For Summary
SUMMARY

The discussion centers on the search for contemporary literature on the mathematics of neural networks, specifically seeking rigorous texts that provide generalized theorems for optimizing neuron placement and weight calculations. Notable recommendations include "Hands-on Machine Learning with Scikit-Learn, Keras and TensorFlow," which, while practical, may lack in-depth mathematical detail, and "Algorithms for Optimization," which offers insights into machine learning algorithms with code examples in Julia. The participant expresses a preference for materials that include proofs and advanced mathematical concepts, particularly in geometry and differential equations.

PREREQUISITES
  • Understanding of neural network architecture and design
  • Familiarity with optimization algorithms in machine learning
  • Knowledge of mathematical concepts in geometry and differential equations
  • Proficiency in Julia programming language for implementation
NEXT STEPS
  • Research "Mathematics for Machine Learning" for foundational concepts
  • Explore "Deep Learning" by Ian Goodfellow for advanced neural network theories
  • Investigate "Neural Networks and Deep Learning" online course for practical applications
  • Study "Fractional Differential Equations" for advanced mathematical techniques
USEFUL FOR

Researchers, machine learning practitioners, and students seeking a deeper mathematical understanding of neural networks and optimization techniques.

s00mb
Messages
33
Reaction score
10
Does anyone know any good up to date books on the mathematics of neural networks? If you know any good research papers that would be cool too. I've read through "Intro to the theory of neural computation" by John Hertz but it's a 1991 book. I'm looking for generalized theorems on optimizing/minimizing the placement and number of neurons in the network assuming some given way of calculating the weights. Also, if you know any cool research papers I'd like to know them. I've already seen the one on changing one pixel input to confuse a NN though. Geometry and Differential Equations (especially fractional applications) a plus. I've been browsing the internet for it and figured I'd try to pick your brains for it while I am at it. Thanks in advance.
 
Physics news on Phys.org
There's a lot of books out there. The one I like is:

Hands-on Machine Learning with Scikit-Learn, Keras and Tensorflow

It likely has some interesting math in it although most books skip past the details and go for the practice more. People want to use the algorithms to do cool stuff and leave the math to the developers of these packages.

Another book of interest would be:

Algorithms for Optimization

which covers many of the algorithms used in machine learning. All the code is written in Julia.
 
Yeah, I've read through most of the "practical" books, I was looking for rigorous books with proofs. Ohhh bonus points for using Julia I like that I'll have to check it out just for that :)
 
Last edited:

Similar threads

  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 14 ·
Replies
14
Views
4K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 5 ·
Replies
5
Views
2K
  • Sticky
  • · Replies 16 ·
Replies
16
Views
12K
  • · Replies 2 ·
Replies
2
Views
3K