The coming data explosion and what this means for employable skills

Click For Summary

Discussion Overview

The discussion revolves around the implications of the increasing volume of data in scientific discovery and the skills that may become valuable in the job market as a result. It touches on the transition from traditional paradigms of experiment and theory to data-intensive approaches, and explores the potential applications in various fields, including astrophysics and biological sciences.

Discussion Character

  • Exploratory
  • Debate/contested
  • Conceptual clarification

Main Points Raised

  • Some participants note that the evolution of scientific paradigms now includes data-intensive discovery, which can yield results without a priori assumptions, challenging traditional scientific methods.
  • There is a suggestion that skills such as data mining, pattern recognition, and the ability to analyze large datasets will be increasingly employable in the near future.
  • One participant expresses a desire to pursue a PhD in astrophysics, questioning whether the skills gained would be recognized as valuable in other fields.
  • Another participant emphasizes the importance of programming skills and the ability to summarize complex data succinctly, suggesting that communication skills are essential in dealing with large datasets.
  • Concerns are raised about the bottlenecks in software and social systems, indicating that data alone lacks value without context.
  • There is a comparison made between modern data analysis and historical methods, such as Kepler's Laws, with a focus on the advancements in computational power.
  • Some participants argue that the current data revolution is already underway, rather than being a future event, and that all fields are being impacted by advancements in technology.

Areas of Agreement / Disagreement

Participants express a range of views on the nature of the data revolution, with some seeing it as a current phenomenon while others suggest it is forthcoming. There is no consensus on the qualitative differences between modern data analysis methods and historical approaches to scientific discovery.

Contextual Notes

Participants highlight limitations related to the social context required for data interpretation and the dependency on software capabilities, which remain unresolved in the discussion.

Simfish
Gold Member
Messages
811
Reaction score
2
http://www.nytimes.com/external/rea...writeweb-the-coming-data-explosion-13154.html

An interesting book:
http://research.microsoft.com/en-us/collaboration/fourthparadigm/

So basically, the two main paradigms used to be experiment and theory. Then in the 1950s came simulations, and now we have data-intensive scientific discovery. Some people have recently written programs that can derive physical formulas from massive amounts of data. Such methods can produce true results without an a priori basis for scientific discovery, which runs counter to the scientific method.

==

Anyways, so I'm seeing that there are several skillsets that will become valuable quite soon. (a) working with better sensors that have additional dimensions of physical data, (b) data mining/pattern recognition, (c) finding ways to efficiently analyze mass amounts of physical data, (d) intuition with respect to finding patterns out of massive datasets (or finding algorithms that find the best patterns out of them)

So the question here, is, do you see these skillsets as extremely employable in the near future (perhaps more employable than many other skillsets)? And what would people look for if they look for people with such skillsets?

For instance, I would like to go for a PhD in astrophysics. Astrophysics, of course, is one beneficiary of this revolution, as we get better sensors (telescopes/CCDs) and massive amounts of data to analyze. But I have many scientific interests, and I'm especially interested in other applications of this upcoming revolution (especially as it applies to the biological sciences, which are also in the process of an upcoming revolution - this revolution may depend on training different from the types of training biologists have traditionally received). Anyways, would people in other fields be convinced that astrophysics would provide me with the skills to go into this?
 
Physics news on Phys.org
Simfish said:
So the question here, is, do you see these skillsets as extremely employable in the near future (perhaps more employable than many other skillsets)? And what would people look for if they look for people with such skillsets?

Learn to problem. If you are good at programming computers, it's like being about to read English. Also study history and philosophy. Technology changes quickly, but humans change rather slowly, and in looking at patterns, it's a good idea to look at human patterns.

Also, it's not a "coming revolution" it's a current one.

For instance, I would like to go for a PhD in astrophysics. Astrophysics, of course, is one beneficiary of this revolution, as we get better sensors (telescopes/CCDs) and massive amounts of data to analyze.

One thing about the massive amounts of data is that it's much too much for anyone human being to understand, so a lot of dealing with complex problems involves having cross-disciplinary teams. Just find a subject that you like and go with it.

The other thing is to develop basic communications and education skills. One key skill is to be able to take several exabytes of data and summarize it all in two sentences. You need a human to do that.

Anyways, would people in other fields be convinced that astrophysics would provide me with the skills to go into this?

A lot of what matters is to be able to give someone the key google term that they need. The word you are looking for is "bioinformatics." In any event, because computers are touching everything, what field you go into isn't that important since they are all getting hit by cheap computer power, and a lot of the basic techniques are field independent.
 
Simfish said:
So basically, the two main paradigms used to be experiment and theory. Then in the 1950s came simulations, and now we have data-intensive scientific discovery. Some people have recently written programs that can derive physical formulas from massive amounts of data. Such methods can produce true results without an a priori basis for scientific discovery, which runs counter to the scientific method.

How is this qualitatively different from, say, Kepler's Laws of planetary motion? His laws were derived from observation, without regard to theory, model or explanation.
 
DaveC426913 said:
How is this qualitatively different from, say, Kepler's Laws of planetary motion? His laws were derived from observation, without regard to theory, model or explanation.

It's really not, except that now we have power tools rather than hand tools. Kepler took 19 years to figure out his three laws. What he did could be done by modern computers in about an hour.

The amount of data and hardware out there is incredibly but the bottle necks are the software and the social systems. Data is useless without a social context to make sense out of it.
 
twofish-quant said:
The amount of data and hardware out there is incredibly but the bottle necks are the software and the social systems. Data is useless without a social context to make sense out of it.
Yep. That is the central theme of Web 2.0 the Semantic Web initiative, and why HTML5 has been released with all sorts of new features to enable semantic interpretation.
 

Similar threads

  • · Replies 29 ·
Replies
29
Views
7K
  • · Replies 18 ·
Replies
18
Views
4K
  • · Replies 2 ·
Replies
2
Views
4K
Replies
1
Views
3K
  • · Replies 108 ·
4
Replies
108
Views
20K
  • · Replies 9 ·
Replies
9
Views
2K
Replies
127
Views
23K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 11 ·
Replies
11
Views
6K
  • · Replies 19 ·
Replies
19
Views
5K