Any good math-theory-focused books on neural networks and data science?

Click For Summary
SUMMARY

This discussion focuses on the search for mathematically rigorous books on neural networks and data science. Key recommendations include "Algorithms for Optimization" by Kochenderfer, which utilizes Julia for examples, and "The 100 Page Machine Learning Book" by Burkov, available as a try-and-buy online. Additionally, "Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow" by Geron is mentioned for its practical insights, though it lacks the depth of mathematical rigor sought. The discussion also references academic papers on function approximation, gradient descent effectiveness, and error estimation for further exploration.

PREREQUISITES
  • Understanding of neural networks and their applications in data science
  • Familiarity with optimization theorems and techniques
  • Basic knowledge of the Julia programming language
  • Awareness of machine learning frameworks such as Scikit-Learn, Keras, and TensorFlow
NEXT STEPS
  • Research "Algorithms for Optimization" by Kochenderfer for mathematical approaches in ML
  • Explore "The 100 Page Machine Learning Book" by Burkov for concise ML concepts
  • Investigate academic papers on function approximation and gradient descent effectiveness
  • Study "Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow" by Geron for practical applications
USEFUL FOR

Data scientists, machine learning practitioners, and students seeking a deeper mathematical understanding of neural networks and optimization techniques.

s00mb
Messages
33
Reaction score
10
Hi. I'm looking for books on data science, preferably leaning towards neural networks, that focus on mathematical rigor. For example, theorems on optimization, minimum number of layers to accomplish a task efficiently, etc. Most books I've seen seem to hand wave this stuff. Anyone know any juicy books on the topic?
 
Physics news on Phys.org
There's a rather recent book by Kochenderfer called Algorithms for Optimization with many examples written in Julia, a hot programming language from MIT that folks are using for numerical work in diverse fields including ML and Data Science.

https://www.amazon.com/dp/0262039427/?tag=pfamazon01-20

There's also the 100 page ML book by Burkov:

https://www.amazon.com/dp/199957950X/?tag=pfamazon01-20

which is available online as a try and buy book.

Lastly, Geron's book Hands-on ML with Scikit-Learn, Keras and Tensorflow:

https://www.amazon.com/dp/1492032646/?tag=pfamazon01-20

All are good books that discuss the math behind the ML although not at the rigor you're looking for.
 

Similar threads

  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 5 ·
Replies
5
Views
6K
  • · Replies 5 ·
Replies
5
Views
7K
  • · Replies 13 ·
Replies
13
Views
5K
Replies
10
Views
5K
  • · Replies 11 ·
Replies
11
Views
10K
Replies
15
Views
41K
  • · Replies 13 ·
Replies
13
Views
3K
  • · Replies 2 ·
Replies
2
Views
3K
Replies
10
Views
3K