Good introductory book on statistical/data analysis?

  • #1
HAYAO
Science Advisor
Gold Member
370
233
TL;DR Summary: I'm looking for a book on statistical/data analysis.

Hey all. I've been doing statistical analysis in my research (such as using PCA and LDA), but I have never received a formal education on statistical analysis or data mining, and what I know about analysis is quite scattered and unorganized.

I think it is about time I get a good introductory textbook to get a broader and well-organized understanding on the topic. Could you guys recommend me a book that would be good for someone like me?

Thank you.
 

Answers and Replies

  • #2
WWGD
Science Advisor
Gold Member
6,291
8,180
If you have access to a college or otherwise major library, I suggest you drop by , browse a few books and see which feels right for you. I use a rule of thumb of seeing that the book has a carefully written index, list of notation, as a reflection of having put care into writing the book.
 
  • Like
Likes HAYAO and The Bill
  • #3
14,194
8,184
There’s a,couple of machine learning books:
- 100 pg machine learning book by burkiv
- Hands on Machine Learning with Scikit Learn, … by Geron

that have chapters on statistical / data analysis as this is a central theme of machine learning aka statistical learning.

The 100 page book is available online from the author as a kind of try and buy scheme.

The Hands-On book has some good implementation outlines at the end to help in setting and running a machine learning project. Many of the steps would be used in data mining as well.
 
  • Like
Likes PhDeezNutz and HAYAO
  • #4
HAYAO
Science Advisor
Gold Member
370
233
If you have access to a college or otherwise major library, I suggest you drop by , browse a few books and see which feels right for you. I use a rule of thumb of seeing that the book has a carefully written index, list of notation, as a reflection of having put care into writing the book.
My gosh, why have I not thought of this lol. Thanks.

I'll go and see what the library has in there. The problem is, I live in Japan and many of the books are in Japanese. Of course, there are English books in there, but is probably somewhat limited compared to what you guys have in English-speaking countries.

There’s a,couple of machine learning books:
- 100 pg machine learning book by burkiv
- Hands on Machine Learning with Scikit Learn, … by Geron

that have chapters on statistical / data analysis as this is a central theme of machine learning aka statistical learning.

The 100 page book is available online from the author as a kind of try and buy scheme.

The Hands-On book has some good implementation outlines at the end to help in setting and running a machine learning project. Many of the steps would be used in data mining as well.
Thanks. Yeah, machine learning and stuff is definitely related, but I would like to keep it more introductory. But good point about some of these machine learning books contain chapters on statistical/data analysis. Thanks for the suggestion!
 
  • #5
gleem
Science Advisor
Education Advisor
2,085
1,521
From your OP it was not clear to me that machine learning was your interest. But I stumbled on this PDF about applications of statistical learning that include PCA and LDA approaches.

https://www.ime.unicamp.br/~dias/Intoduction to Statistical Learning.pdf

From the introduction
Who Should Read This Book? This book is intended for anyone who is interested in using modern statistical methods for modeling and prediction from data. This group includes scientists, engineers, data analysts, or quants, but also less technical individuals with degrees in non-quantitative fields such as the social sciences or business. We expect that the reader will have had at least one elementary course in statistics. Background in linear regression is also useful, though not required, since we review the key concepts behind linear regression in Chapter 3. The mathematical level of this book is modest, and a detailed knowledge of matrix operations is not required. This book provides an introduction to the statistical programming language R. Previous exposure to a programming language, such as MATLAB or Python, is useful but not required. We have successfully taught material at this level to master’s and PhD students in business, computer science, biology, earth sciences, psychology, and many other areas of the physical and social sciences. This book could also be appropriate for advanced undergraduates who have already taken a course on linear regression. In the context of a more mathematically rigorous course in which ESL serves as the primary textbook, ISL could be used as a supplementary text for teaching computational aspects of the various approaches.
 
  • #6
HAYAO
Science Advisor
Gold Member
370
233
From your OP it was not clear to me that machine learning was your interest. But I stumbled on this PDF about applications of statistical learning that include PCA and LDA approaches.

https://www.ime.unicamp.br/~dias/Intoduction to Statistical Learning.pdf

From the introduction
This is awesome! Thank you very much.

I'm skimming through what I have in the Library, but this text covers many of what I want to learn.
 
  • Like
Likes gleem and jedishrfu

Suggested for: Good introductory book on statistical/data analysis?

  • Last Post
Replies
4
Views
583
  • Last Post
Replies
3
Views
447
Replies
21
Views
1K
  • Last Post
Replies
17
Views
940
Replies
13
Views
589
Replies
5
Views
297
  • Last Post
Replies
2
Views
859
Replies
3
Views
379
Replies
6
Views
729
Top