Resources for Data Science, Statistical Analysis, ML & Scientific Computing

In summary, becoming a data scientist requires a strong understanding of various fields such as distributed computing, statistical analysis, numerical analysis, complexity measures, wavelets, data compression, and machine learning. Resources for learning about these topics can be found at various websites such as Code Google, AIMA, netlib, and GitHub. Additionally, there are online repositories for scientific computing and high performance computing, as well as online communities and forums for discussing and sharing information on these topics.
  • #1
Simfish
Gold Member
823
2
http://www.quora.com/How-do-I-become-a-data-scientist

Huge collections of resources can be found at http://www.quora.com/Alex-Kamil/answers (you can edit them too)

For example,

http://www.quora.com/What-are-some-good-resources-for-learning-about-distributed-computing
http://www.quora.com/What-are-some-good-resources-for-learning-about-statistical-analysis
http://www.quora.com/What-are-some-good-resources-for-learning-about-numerical-analysis
http://www.quora.com/What-are-some-measures-of-complexity
http://www.quora.com/What-are-some-good-resources-for-learning-about-wavelets
http://www.quora.com/What-are-some-good-resources-for-learning-about-data-compression
http://www.quora.com/What-are-some-alternatives-to-Bishops-PRML-textbook
http://www.quora.com/Machine-Learni...s-for-learning-about-dimensionality-reduction
http://www.quora.com/What-are-the-b...-edge-technologies-and-recent-research-trends
http://www.quora.com/What-are-some-good-resources-for-learning-about-machine-learning

Now, as for scientific computing...

http://www.code.google.com (you can search for a lot of code there). I'm sure there are better repositories somewhere else though.
http://aima.cs.berkeley.edu/code.html (online code repository for the Russell and Norvig AI textbook)
http://www.sai.msu.su/sal/B/1/ (numerical analysis repositories)
http://www.astro.psu.edu/statcodes/ (online statistical software for astronomy and other fields)
http://www.josemiguelpasini.name/links/scientific_computing.php (a few scientific computing links)
http://www.netlib.org/ (netlib repository, seems to be highly regarded from the other websites)
http://www.codecogs.com/ (open source scientific library, not sure how useful this is yet though)
http://www.cisl.ucar.edu/css/software/spherepack/ ("SPHEREPACK 3.2 is a collection of FORTRAN programs that facilitates computer modeling of geophysical processes. The package contains programs for computing certain common differential operators including divergence, vorticity, gradients, and the Laplacian of both scalar and vector functions.")

http://www.delicious.com/tag/scientific-computing (delicious bookmarks, will be very hit and miss)

You can also occasionally use the filetype: operator in google search to find source code in a particular language. So filetype:c, or filetype:m, or filetype:py, etc...
 
Last edited by a moderator:
Mathematics news on Phys.org
  • #4
http://shootout.alioth.debian.org/ - "Compare the performance of ≈24 programming languages for 4 different combinations of OS/machine. Contribute faster more elegant programs. And please don't jump to conclusions!"

Also check out the tags (on these forums) that correspond to computational and SciComputing

http://vizsage.com/other/leastsquaresexcel/ - Least Squares Error Fitting with errors in both coordinates

http://www.mathworks.com/matlabcentral/fileexchange/ - MATLAB File Exchange

http://courses.washington.edu/matlab2/lessons.html - MATLAB Lessons - pretty advanced features here
 
Last edited by a moderator:
  • #8
wow very nice sharing and the links are working properly thanks for the sharing
 

1. What are some popular programming languages used in data science and statistical analysis?

Some popular programming languages used in data science and statistical analysis include Python, R, and SQL. These languages have robust libraries and tools specifically designed for data analysis and machine learning.

2. Are there any free resources available for learning data science and statistical analysis?

Yes, there are many free resources available for learning data science and statistical analysis. Some popular ones include online courses on platforms like Coursera and Udemy, open-source libraries and tools, and free tutorials and guides on websites like Kaggle and Towards Data Science.

3. What is the difference between data science and statistical analysis?

Data science and statistical analysis are closely related, but there are some key differences. Data science involves using various techniques and methods to extract insights and knowledge from large and complex datasets. Statistical analysis focuses on using statistical methods and models to analyze and interpret data and make predictions.

4. What are some essential skills for a data scientist?

Some essential skills for a data scientist include programming skills (especially in languages like Python and R), statistical knowledge, data wrangling and cleaning, machine learning, and data visualization. Strong communication and problem-solving skills are also important for effectively communicating insights and solving complex data problems.

5. Are there any online communities or forums for data scientists and statisticians to connect and collaborate?

Yes, there are many online communities and forums for data scientists and statisticians to connect and collaborate. Some popular ones include Kaggle, Data Science Stack Exchange, and DataCamp Community. These communities are great for networking, sharing knowledge and resources, and discussing various data-related topics and challenges.

Similar threads

  • Science and Math Textbooks
Replies
6
Views
1K
  • STEM Academic Advising
Replies
10
Views
1K
Replies
1
Views
781
  • Programming and Computer Science
Replies
2
Views
1K
  • Programming and Computer Science
4
Replies
107
Views
5K
  • STEM Academic Advising
Replies
1
Views
1K
Replies
4
Views
2K
  • STEM Academic Advising
Replies
7
Views
2K
Replies
2
Views
74
Back
Top