Prob/Stats Any good math-theory-focused books on neural networks and data science?

AI Thread Summary
The discussion centers on finding books that provide a mathematically rigorous approach to data science, particularly in the context of neural networks and optimization theorems. Recommendations include "Algorithms for Optimization" by Kochenderfer, which features practical examples in Julia, and "The Hundred-Page Machine Learning Book" by Burkov, available as a try-and-buy online option. Additionally, "Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow" by Geron is noted for its mathematical discussions, though it may not meet the desired level of rigor. The conversation acknowledges that many rigorous insights are currently found in academic papers rather than textbooks. Several relevant research papers on function approximation, gradient descent effectiveness, and error estimation are also shared, indicating ongoing advancements in the field.
s00mb
Messages
33
Reaction score
10
Hi. I'm looking for books on data science, preferably leaning towards neural networks, that focus on mathematical rigor. For example, theorems on optimization, minimum number of layers to accomplish a task efficiently, etc. Most books I've seen seem to hand wave this stuff. Anyone know any juicy books on the topic?
 
Physics news on Phys.org
There's a rather recent book by Kochenderfer called Algorithms for Optimization with many examples written in Julia, a hot programming language from MIT that folks are using for numerical work in diverse fields including ML and Data Science.

https://www.amazon.com/dp/0262039427/?tag=pfamazon01-20

There's also the 100 page ML book by Burkov:

https://www.amazon.com/dp/199957950X/?tag=pfamazon01-20

which is available online as a try and buy book.

Lastly, Geron's book Hands-on ML with Scikit-Learn, Keras and Tensorflow:

https://www.amazon.com/dp/1492032646/?tag=pfamazon01-20

All are good books that discuss the math behind the ML although not at the rigor you're looking for.
 
I looked through "Data mining" by Witten, Frank, Hall and, Pal which covers most of these, but it isn't rigorous like the good old math analysis books. No wonder for such a rapidly developing (practical) area.
 
For the following four books, has anyone used them in a course or for self study? Compiler Construction Principles and Practice 1st Edition by Kenneth C Louden Programming Languages Principles and Practices 3rd Edition by Kenneth C Louden, and Kenneth A Lambert Programming Languages 2nd Edition by Allen B Tucker, Robert E Noonan Concepts of Programming Languages 9th Edition by Robert W Sebesta If yes to either, can you share your opinions about your personal experience using them. I...
Hi, I have notice that Ashcroft, Mermin and Wei worked at a revised edition of the original solid state physics book (here). The book, however, seems to be never available. I have also read that the reason is related to some disputes related to copyright. Do you have any further information about it? Did you have the opportunity to get your hands on this revised edition? I am really curious about it, also considering that I am planning to buy the book in the near future... Thanks!
I’ve heard that in some countries (for example, Argentina), the curriculum is structured differently from the typical American program. In the U.S., students usually take a general physics course first, then move on to a textbook like Griffiths, and only encounter Jackson at the graduate level. In contrast, in those countries students go through a general physics course (such as Resnick-Halliday) and then proceed directly to Jackson. If the slower, more gradual approach is considered...
Back
Top