Resources for Data Science, Statistical Analysis, ML & Scientific Computing

Click For Summary

Discussion Overview

The discussion focuses on resources related to data science, statistical analysis, machine learning, and scientific computing. Participants share various links to repositories, software, and educational materials that may aid in these fields.

Discussion Character

  • Exploratory
  • Technical explanation
  • Resource sharing

Main Points Raised

  • One participant shares a collection of links from Quora for learning about various topics in data science and scientific computing, including distributed computing and statistical analysis.
  • Another participant suggests GitHub and Bitbucket as repositories for finding code and resources.
  • A link to Gelman's public research software is provided, along with additional resources from his website.
  • Performance comparisons of programming languages are discussed, with a link to a benchmarking site.
  • Participants mention various forums and communities, such as Reddit and Quora, for MATLAB and Mathematica discussions.
  • A textbook on high performance computing is referenced as a potential resource.
  • Links to a special issue of Science dealing with data are shared, indicating a focus on current research topics.
  • Parallel computing resources are highlighted, including links to Berkeley's research papers and projects.
  • One participant expresses appreciation for the shared links and their functionality.

Areas of Agreement / Disagreement

Participants generally agree on the value of sharing resources, but there is no consensus on which specific resources are the most useful or comprehensive. Multiple viewpoints on the best platforms and materials remain present.

Contextual Notes

Some links may vary in quality and relevance, and the effectiveness of resources can depend on individual needs and contexts. There is no resolution on the best practices for selecting resources.

Who May Find This Useful

Individuals interested in data science, statistical analysis, machine learning, and scientific computing may find the shared resources beneficial for their studies or projects.

Simfish
Gold Member
Messages
811
Reaction score
2
http://www.quora.com/How-do-I-become-a-data-scientist

Huge collections of resources can be found at http://www.quora.com/Alex-Kamil/answers (you can edit them too)

For example,

http://www.quora.com/What-are-some-good-resources-for-learning-about-distributed-computing
http://www.quora.com/What-are-some-good-resources-for-learning-about-statistical-analysis
http://www.quora.com/What-are-some-good-resources-for-learning-about-numerical-analysis
http://www.quora.com/What-are-some-measures-of-complexity
http://www.quora.com/What-are-some-good-resources-for-learning-about-wavelets
http://www.quora.com/What-are-some-good-resources-for-learning-about-data-compression
http://www.quora.com/What-are-some-alternatives-to-Bishops-PRML-textbook
http://www.quora.com/Machine-Learni...s-for-learning-about-dimensionality-reduction
http://www.quora.com/What-are-the-b...-edge-technologies-and-recent-research-trends
http://www.quora.com/What-are-some-good-resources-for-learning-about-machine-learning

Now, as for scientific computing...

http://www.code.google.com (you can search for a lot of code there). I'm sure there are better repositories somewhere else though.
http://aima.cs.berkeley.edu/code.html (online code repository for the Russell and Norvig AI textbook)
http://www.sai.msu.su/sal/B/1/ (numerical analysis repositories)
http://www.astro.psu.edu/statcodes/ (online statistical software for astronomy and other fields)
http://www.josemiguelpasini.name/links/scientific_computing.php (a few scientific computing links)
http://www.netlib.org/ (netlib repository, seems to be highly regarded from the other websites)
http://www.codecogs.com/ (open source scientific library, not sure how useful this is yet though)
http://www.cisl.ucar.edu/css/software/spherepack/ ("SPHEREPACK 3.2 is a collection of FORTRAN programs that facilitates computer modeling of geophysical processes. The package contains programs for computing certain common differential operators including divergence, vorticity, gradients, and the Laplacian of both scalar and vector functions.")

http://www.delicious.com/tag/scientific-computing (delicious bookmarks, will be very hit and miss)

You can also occasionally use the filetype: operator in google search to find source code in a particular language. So filetype:c, or filetype:m, or filetype:py, etc...
 
Last edited by a moderator:
Physics news on Phys.org
Last edited by a moderator:
http://shootout.alioth.debian.org/ - "Compare the performance of ≈24 programming languages for 4 different combinations of OS/machine. Contribute faster more elegant programs. And please don't jump to conclusions!"

Also check out the tags (on these forums) that correspond to computational and SciComputing

http://vizsage.com/other/leastsquaresexcel/ - Least Squares Error Fitting with errors in both coordinates

http://www.mathworks.com/matlabcentral/fileexchange/ - MATLAB File Exchange

http://courses.washington.edu/matlab2/lessons.html - MATLAB Lessons - pretty advanced features here
 
Last edited by a moderator:
wow very nice sharing and the links are working properly thanks for the sharing
 

Similar threads

  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 6 ·
Replies
6
Views
3K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 10 ·
Replies
10
Views
3K
Replies
1
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 10 ·
Replies
10
Views
9K
  • · Replies 133 ·
5
Replies
133
Views
11K