Getting into data science from computational physics?

Click For Summary

Discussion Overview

The discussion revolves around the transition from computational physics to data science, exploring the skills and experiences that may facilitate this shift. Participants share their perspectives on the relevance of computational and experimental physics backgrounds to data science roles, as well as the qualifications needed for entry-level positions in the field.

Discussion Character

  • Exploratory
  • Debate/contested
  • Technical explanation

Main Points Raised

  • One participant notes that some physics PhDs have successfully transitioned to data science roles despite a background in computational astrophysics, raising questions about the skills required for such a move.
  • Another participant challenges the assumption that only experimentalists are suited for data science, suggesting that computational physicists can also acquire relevant skills.
  • There is a discussion about the vagueness of the term "data scientist," with some participants emphasizing the importance of database architecture and data analysis skills such as SQL, SAS, and Excel.
  • One participant mentions that students in experimental physics are expected to analyze large data sets during their graduate studies, while questioning how computational physics students can gain similar experience.
  • A participant shares their personal experience of working with data organization and analysis in a laboratory setting, highlighting the relevance of these skills to data science.
  • A reference is made to a friend's transition from computational biophysics to bioinformatics, suggesting potential pathways for others in similar fields.

Areas of Agreement / Disagreement

Participants express differing views on the skills acquired in computational versus experimental physics, with some asserting that experimentalists are better prepared for data science roles, while others argue that computational physicists can also develop necessary competencies. The discussion remains unresolved regarding the specific pathways and skills that facilitate the transition to data science.

Contextual Notes

There are limitations in the discussion regarding the specific skills and experiences that computational physicists may lack compared to their experimental counterparts, as well as the varying definitions and expectations of data science roles across different industries.

1a2
Messages
20
Reaction score
0
I'm currently working on a master's degree in physics where my project uses C++. I have read about how some physics phD's were able to get data scientist roles despite working on computational astrophysics. This is a little surprising to me since I thought experimentalists would be more suited to data science since they work with data and data analysis more than computationists.

If anyone here has gone from computational physics to data science, or know anyone who made the switch, can you explain how? Just go to kaggle and work on data sets?

Also, do companies only hire phD's for entry-level data scientist roles? With just a master's, if I can't get a data scientist role, what other similar jobs are available?
 
Physics news on Phys.org
Where did you read that about moving from computational astrophysics to data science?
 
Locrian said:
Where did you read that about moving from computational astrophysics to data science?

I browsed a few profiles of former computational astrophysics students (for example, by checking the 'previous students' of some profs) and read that they now work as data scientists
 
Ah okay, was wondering if there was an article, etc. out there about it.

As far as I know, one does not typically obtain many of the requisite skills for handling or analyzing large data sets in astrophysics, experimental or otherwise. Of course, anyone who gets a physics degree hopefully picks up the ability to learn new and difficult things quickly and without much external help.

"Data scientist" is a pretty vague term that seems to mean different things to different people. As far as I can tell, it comes down to an understanding of database architecture, the skills needed to retrieve and analyze that data (SQL, SAS, Access, Excel, etc.) and some knowledge of more advanced data mining techniques.

SQL would be a very good place to start. Play around with some cloud data analytics and maybe some student versions of TOAD, SAS or other query platforms. Ultimately anyone who hires you will have their own (sometimes very unusual) process for data extraction and analysis, so be prepared to be flexible.
 
Locrian said:
Ah okay, was wondering if there was an article, etc. out there about it.

As far as I know, one does not typically obtain many of the requisite skills for handling or analyzing large data sets in astrophysics, experimental or otherwise. Of course, anyone who gets a physics degree hopefully picks up the ability to learn new and difficult things quickly and without much external help.

From what I understand, that is only true for those who have completed an undergraduate degree in physics. Once you go further into graduate studies, particularly a PhD but even masters level studies, students in experimental areas of physics are expected to analyze large data sets as part of their research work. At least that is how it was explained to me by my friends who ended up pursuing graduate studies in physics.

"Data scientist" is a pretty vague term that seems to mean different things to different people. As far as I can tell, it comes down to an understanding of database architecture, the skills needed to retrieve and analyze that data (SQL, SAS, Access, Excel, etc.) and some knowledge of more advanced data mining techniques.

SQL would be a very good place to start. Play around with some cloud data analytics and maybe some student versions of TOAD, SAS or other query platforms. Ultimately anyone who hires you will have their own (sometimes very unusual) process for data extraction and analysis, so be prepared to be flexible.

"Data science" is as much involved with the statistical analysis of high-dimensional data that is generated in a variety of different areas (e.g. marketing data, sensor data from GIS, genomics/proteomics data, times series data of financial transactions, medical data, etc.) as understanding the underlying database architecture. Certainly understanding how to retrieve the data using SQL, SAS, or Access is important, but that is only a part of the piece. In essence, I regard "data science" as a fancy re-labelling of statistical analysis.
 
Once you go further into graduate studies, particularly a PhD but even masters level studies, students in experimental areas of physics are expected to analyze large data sets as part of their research work.

So then how can students in computational physics get to analyze large data sets?
 
StatGuy2000 said:
From what I understand, that is only true for those who have completed an undergraduate degree in physics. Once you go further into graduate studies, particularly a PhD but even masters level studies, students in experimental areas of physics are expected to analyze large data sets as part of their research work. At least that is how it was explained to me by my friends who ended up pursuing graduate studies in physics.

Hah yea, that is a good description of what they think.
 
StatGuy2000 said:
...students in experimental areas of physics are expected to analyze large data sets as part of their research work.
.

Yes - that was one of my standard explanations when somebody asked how a physicist could transition into 'IT'. I worked on the optimization of the properties of thin films, thus varied different experimental parameters when manufacturing those films and then measured electrical and optical properties. Making sense of the effect of the impacts of experiments effectively meant navigating some area in a multi-dimensional space of parameters and properties .

Another thing, mundane as it was, was organizing data in a database and 'normalizing ambiguous data'. I learned about relational databases when trying to keep track of tons of STEM / TEM images and important findings, at that time usually documented only by hand-written comments in a lab notebook.
 
A friend of mine did his PhD in computational biophysics and is currently doing a post-doc in bioinformatics. There may be a similar sort of strategy for you although nothing springs to mind.
 

Similar threads

  • · Replies 3 ·
Replies
3
Views
5K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 4 ·
Replies
4
Views
5K
  • · Replies 10 ·
Replies
10
Views
3K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 2 ·
Replies
2
Views
3K
  • · Replies 14 ·
Replies
14
Views
4K
  • · Replies 68 ·
3
Replies
68
Views
11K
  • · Replies 18 ·
Replies
18
Views
7K