What is Data: Definition and 998 Discussions

Data are units of information, often numeric, that are collected through observation. In a more technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects, while a datum (singular of data) is a single value of a single variable. Although the terms "data" and "information" are often used interchangeably, these terms have distinct meanings. In some popular publications, data are sometimes said to be transformed into information when they are viewed in context or in post-analysis. However, in academic treatments of the subject, data are simply units of information. Data are used in scientific research, business management (e.g., sales data, revenue, profits, stock price), finance, governance (e.g., crime rates, unemployment rates, literacy rates), and in virtually every other form of human organizational activity (e.g., censuses of the number of homeless people by non-profit organizations).
Data are measured, collected, reported, and analyzed, and from them visualizations such as graphs, tables or images are produced. Data as a general concept refers to the fact that some existing information or knowledge is represented or coded in some form suitable for better usage or processing. Raw data ("unprocessed data") is a collection of numbers or characters before it has been "cleaned" and corrected by researchers. Raw data needs to be corrected to remove outliers or obvious instrument or data entry errors (e.g., a thermometer reading from an outdoor Arctic location recording a tropical temperature). Data processing commonly occurs by stages, and the "processed data" from one stage may be considered the "raw data" of the next stage. Field data is raw data that is collected in an uncontrolled "in situ" environment. Experimental data is data that is generated within the context of a scientific investigation by observation and recording.
Data has been described as the new oil of the digital economy.
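The cleaning and staging ideas described above can be illustrated with a minimal sketch. It assumes a made-up list of Arctic thermometer readings and an arbitrary plausibility range; the function names, the thresholds, and the sample values are all hypothetical and chosen only for illustration, not taken from any real dataset or standard.

```python
from statistics import median


def drop_implausible(readings, low, high):
    """Stage 1: discard readings outside an assumed plausible range."""
    return [r for r in readings if low <= r <= high]


def summarize(cleaned):
    """Stage 2: treats the processed output of stage 1 as its own raw input."""
    return {"n": len(cleaned), "median": median(cleaned)}


# Hypothetical raw data: one reading (27.5 °C) is an obvious instrument
# or data entry error for an outdoor Arctic location.
raw_arctic_temps_c = [-31.2, -29.8, 27.5, -30.4, -28.9]

# The range below is an assumption made for this example only.
cleaned = drop_implausible(raw_arctic_temps_c, low=-70.0, high=10.0)
summary = summarize(cleaned)

print(cleaned)
print(summary)
```

A fixed range check is only one of several possible cleaning rules; whether a value counts as an error or as a genuine outlier depends on the instrument and the study, so the thresholds should come from domain knowledge rather than from this sketch.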

View More On Wikipedia.org
  1. F

    A What are the typical uncertainties of data in Astrophysics?

    Hello, everybody. I'm conducting an investigation in planetary sciences. Specifically in exoplanet detection by the radial velocity method and I'm stuck because I need to know how to defend the data that I'm using. I'm using the data from http://exoplanet.eu/catalog/ and I have found some data...
  2. P

    A Recently, I want to write something about data in physics

    Nowadays, machine learning in computer science is hot. It is based on data and driven by data. Thus, a question naturally arises: the data in physics, and the models of data. I think it is a really empirical way to know how physicists do the same thing as the computer scientists. So...
  3. MatthijsRog

    I Fitting a curve over noisy data

    Hi all, I performed a resonance experiment over the past two weeks, in which I collected the intensity of a Fabry-Perot cavity whilst adjusting the mirror distance with a piezo-element (the specific setup of the experiment is fairly detached from the question I will ask). My raw data is...
  4. W

    Inserting Missing Data into Adventure Works 2014

    Hi All, Somehow the copy of AdventureWorks2014 I downloaded (saved and restored) is missing the data, i.e., it contains the schema (tables, indexes, etc.), but many tables are missing the actual data. I have not done any delete/truncate/drop operations, so this may have been a problem with my...
  5. I

    I Data Model of Kepler's Second Law of Planetary Motion

    Hello, I am completing a research project for differential equations class. I am to derive Kepler's three laws and then compare the results of the derivation with real-world data. For Kepler's second law (a planet sweeps out an equal area in an equal time), I was hoping to find orbital data for...
  6. D

    I Tail in data of Duane-Hunt limit experiment

    https://photos-5.dropbox.com/t/2/AAC1PAsxThHE7dTxxumANssxIDSrZGA0wi9u1T2alieA9g/12/217355121/png/32x32/1/_/1/2/Screen%20Shot%202018-04-24%20at%2014.40.53.png/EJ6fyaMBGOQEIAIoAg/zVJasOZ8quUZpWc6eN6tzuO7YSmC-VjpQ4ikXIkpC8A?preserve_transparency=1&size=2048x1536&size_mode=3 So in looking at the...
  7. U

    How to understand dimensionless data of MacCormack's method

    Dear everyone, I am doing a numerical investigation of unsteady heat transfer in a Newtonian fluid occupying a square cavity. I solved numerically using HSMAC (Highly Simplified Marker and Cell) finite difference approach as the governing dimensionless equation. The fluid flow is simulated...
  8. T

    Differences between Zone Sizing Data and Terminal Unit Data in HAP

    What's the difference between zone sizing data and terminal unit sizing data in HAP? And is it possible for the maximum cooling sensible load for the zone to be greater than the sensible coil load of the terminal unit, as indicated in the attached system design report? Note that it is a single space...
  9. Jakub

    Bi-exponential function fitting in Origin Lab SW

    I can't understand the exponential function fit for this set of data works well: ExpDec2 exponential function fit 0 3,04 10 2,77 20 2,52 30 2,27 40 2,09 50 1,92 60 1,75 70 1,62 80 1,51 90 1,43 100 1,36 110 1,29 120 1,24 130 1,19 140 1,14 150 1,09 160 1,05 170 1,02 180 0,99 190 0,97 200 0,95 210...
  10. ChrisVer

    A Question about data & Monte Carlo statistical uncertainties

    Hi I was wondering the following/feeling uneasy about it: Does it make sense to separate the statistical uncertainties of data and Monte Carlo? For example assume infinite statistics in your MC (uncertainty-->0) while your data is finite : so they come with some "uncertainty" (if that makes...
  11. Vital

    I Reading the vertical data in a frequency histogram polygon

    Hello. Please, take a look at the screenshot from the textbook. They say in the textbook that there are in total 48 data observations, 20 of which lie in the interval 0 - 2, and 6 lie in the interval 2 - 4. Yes, both 20 and 6 are more or less clear on the graph, but how did they come up with 48...
  12. F

    Edge Data Center Size Cost Estimation in Bakersfield, CA

    I want to do a cost estimation for an edge data center in Bakersfield, California. I don't know how big the center should be, and I do not know how I can view the data traffic. Can anyone help?
  13. Arman777

    Project Euler-Problem 11 data taking

    1. The problem statement, all variables, and given/known data https://projecteuler.net/problem=11 2. Homework Equations 3. The Attempt at a Solution I need to take those data and I want to convert them into a string to proceed. But I can't turn it into the string since there are gaps between...
  14. quasarLie

    MATLAB Creating a Histogram with Data from a File.dat in MATLAB

    Hello everyone, I'm trying to make a MATLAB program which reads a file.dat and then makes a histogram. This is what I did: Data2=importdata('Ma.DAT'); R1=Data2.data(:,17) R1(R1>-9.9) L = 0:0.1:8; histc(R1,L) bar(L,histc(R1,L),'histc') xlabel('R1') ylabel('counts') I want to eliminate all the number...
  15. quasarLie

    MATLAB How to Plot Data into Bins in MATLAB: A Step-by-Step Guide

    Hello, I am writing a program in MATLAB that reads a file.dat. I also want to plot by bin of z and grade the color; for example, for z between 0 and 1 use red, and for z between 1 and 2 use blue. But I do not know how to do it. Here is my code: Data2=importdata('fichier.dat')...
  16. S

    Job Skills Data Analyst at startup for first job?

    I've been trying to figure out what jobs I can use my BS in physics for and I see data analyst being recommended a lot. I have limited python and SQL experience from college. I was thinking of trying for startups in NYC, but now I see articles saying startups want people that need no training so...
  17. FallenApple

    Job Skills Data Scientist Phone interview at Apple

    So I have a phone interview at Apple tomorrow. Are there any tips on what part of statistics I should review? I guess it wouldn't be helpful to do computations with matrix equations or derivations, but focus more on the concepts? I suppose I can review linear regression and logistic...
  18. L

    B Find the missing energy value given a set of data (Hypothetical question)

    Hypothetical question: Let's say there's a granite cube with a volume of 60 cubic centimeters. A cannon ball is shot at it, reducing it to rubble, and the kinetic energy of the ball was 10,000 joules. We use the same ball with the same kinetic energy output on a stronger unknown substance with...
  19. S

    I Calculate Voyager trajectory using JPL data

    Hey everyone, I've recently programmed an animated simulation of the main elements (core planets and Sun) of our solar system: by using the initial coordinates from the JPL database, and then calculating the combined gravitational perturbations and the relativistic effects from the Sun...
  20. F

    Securely Erase Data: A Guide to Effective Techniques and Tools

    I have read in several places, I don't remember exactly where, that to effectively and securely erase stored data from most modern digital storage devices, it is necessary (or at least this is one often-used technique) to repeatedly, in several, maybe dozens, of passes, write randomly selected...
  21. R

    Divide observational data into normal periods for study

    When analyzing hydrological and climatological timeseries/observations it is common practice to compare statistics made on normal periods, which the WMO calls "WMO Climatological Normals". These periods normally consist of 30 years of data. If you want to compare two normal periods with each...
  22. N

    Engineering Find the equivalent resistance of this circuit w/o Data?

    Homework Statement Find the equivalent resistance of this circuit between points a and b https://imgur.com/a/LWcIC Homework Equations The Attempt at a Solution I am not sure how to complete this problem without any variables. I tried replacing the numbers with the variables which is what I...
  23. Eclair_de_XII

    Math Should I try for this paid Data Analyst internship?

    The job basically entails collecting data and conducting analyses to translate into "business insights". I'm apprehensive about posting the whole job description, but basically, it's available for students entering their senior year in college who are enrolled in a program "with a focus on...
  24. B

    I Continuous Lensing Models: Discrete Data

    Hello, I am not sure if this question is better suited to the mathematics section, but I thought it would be easier to explain the problem here. In Schneider, Kochanek and Wambsganss's "Gravitational Lensing: Strong Weak and Micro" pages 279-280, they derive a relation for determining the...
  25. D

    IC Engine Connecting Rod Mass Moment of Inertia Measurements

    Does anyone know of a source, on the Internet preferably, for component inertial data for a real IC engine? In particular, what I'd like to obtain (for some real engine, any engine) are these items: 1. Piston weight (including wrist pin) 2. Cylinder bore dimension 3. Connecting rod length...
  26. E

    Other Writing a resume for data science positions

    Hello all, I revised a resume for data science positions. I emphasized skills that are needed for the positions like statistics/mathematics and programming. I mention that I am familiar with the theory of many machine learning algorithms and Python libraries used for data analysis. I still...
  27. DaveC426913

    B When will ball 25 drop? Predicting future observations using data

    I've got a collection of data that contains observations over time. I want to predict when a given future observation is likely to occur. As a simple example: Say I'm watching billiard balls drop into a pocket. The billiard balls drop in with approximate regularity. My dataset: 1 ball 0:00...
  28. M

    Feeding temperature profile from experimental data into MAXWELL

    Hi All, my aim is to find the thermal conductivity of litz wire and also convection coefficient of litz wire and air. So, I am running experimental analysis (10A current into litz wire) and capturing the temperature of litz wire. So, I have temperature profile versus time for litz wire. Would...
  29. E

    Python Projects involving data science

    Hello, I am trying to do some projects on data science/machine learning using Python, but I am not sure what to do. I downloaded a very simple dataset from WHO, and I am trying to do something with it, but most of (actually all) what I can do with it is data manipulation and exploratory data...
  30. F

    Job Skills Can Data Science Skills Lead to Careers in Physics?

    Hello everyone, sorry in advance for the long post I am about to finish up with a masters degree in Data Analytics within a year and will soon be switching careers from being a Systems Admin, which I am quickly burning out on. As an undergrad, I wanted to major in Physics, but the college I...
  31. E

    Bash shell scripting and data science

    Hello, I've noticed that a number of employers ask for knowledge in Linux and shell scripting for data science positions. How does bash shell scripting help in the field of data science, or in general why learn it? Thanks
  32. C

    I Calculating signal to noise ratio for DEIMOS data

    I'm currently working with the 1d DEEP2 DEIMOS fits files (see http://deep.ps.uci.edu/deep3/specprimer.html) and am trying to define some exclusion criteria in Python 2.7 for the data based on the noise. What's the best way to quantify the noise in order to do this? Thank you!
  33. E

    Job Skills Are internships necessary for transitioning into data science?

    Hello, I want to switch careers from EE to data science. I am taking courses in the field, reading books, and will do some online projects. Do I need to take internships on top of that? If yes, how willing are the companies to take interns? Would they respond if I emailed them without...
  34. Steven Ellet

    Rapid data retrieval from Excel (or similar)

    I have a spreadsheet with quite a bit of information. This spreadsheet is designed to encode words and create an encrypted message. Unfortunately, the process of coding and decoding is long and tedious. I am looking for a way to make this process quick (depending upon message length) and easy...
  35. K

    Other Is learning C necessary for becoming a data scientist?

    Greetings people from PF! I want to start my journey as a Data Scientist and currently I'm doing baby steps. I heard from people that every Data Scientist should learn Python, but problem is that I'm currently learning Java from the book Head First Java where they intuitively explain object...
  36. Dr Eve Wildman

    A AC transport data query (using a PPMS system)

    Hello and thanks for looking at this question. I have a semi-conducting sample which has been run on a PPMS system, measuring its resistivity as a function of temperature. I switched to AC transport mode in order to measure the resistivity again while applying frequencies between 1Hz -...
  37. S

    Split Hopkinson pressure bar data

    Currently coding a wave separation script for the split hopkinson pressure bar. Is there a place I can get raw data for the split hopkinson pressure bar to test my code?
  38. E

    Understand Binning Data in Python

    Hello, I was reading an example on binning data, where a continuous variable is transformed into a categorical variable. The dataframe name is df, and the continuous variable's column's name is 'horsepower'. We would like to transform the continuous variable feature into a categorical feature...
  39. F

    Java Java Saving Data In File Internal

    Hello, I am trying to read data from one activity, save it to a file... and when I open another activity I will be able to read that file (by opening it). I searched for tutorials, but they all show it within the same activity. I figured out how the code works.. but I'm not able to understand what will...
  40. Dusty912

    Applying hypothesis test data collected (Statistics)

    Homework Statement So I am doing a project for statistics and wanted to apply a hypothesis test to see if there is a correlation between the number of years spent at my college and the number of services used. The services include library, recreational services, clubs, etc. I sent out a survey...
  41. I

    C/C++ What should I be most familiar w/for C++ Data Structs & Algorithms

    Hey all, This coming Spring semester (starts in 6 weeks), I will be taking C++ Data Structures and Algorithms at my University. I started programming at University, so my experience is very limited (I've taken 3 programming courses, 1 in Python, 2 in C++). Topics we've covered in C++ were...
  42. ohwilleke

    I Is high redshift data a problem for ΛCDM?

    A couple of papers in the last couple of years identify problems with the ΛCDM "standard model of cosmology" based upon high redshift astronomy observations. Have there been adequate responses to these concerns? Charles L. Steinhardt, et al., "The Impossibly Early Galaxy Problem" (June 3...
  43. Hi Im Paul

    Graphing Data in Excel: Error Bar Confusion

    I'm currently writing a paper on an experiment I did for class and I have a very stupid question that I need clarification on before I continue. I am using Excel to graph out data, and I have determined that the error in my Length is .001 m. However, I am graphing the inverse of length to...
  44. It's me

    I How can I access event data for LHCb calorimeters?

    I want to do a project using machine learning on the calorimeter event data of the LHCb. How can I access this data? Is it very difficult to navigate your way through the source code on your own?
  45. E

    Pipeline bend calculation from strain gauge data

    Hi all, I'm doing a stress analysis and I'm considering 3 buried pipelines carrying oil. Vibrating wire strain gauges have been installed in different sections for a total of 468 strain gauges. These have been arranged on the surface of the pipelines 120° apart: These pipelines have...
  46. chrononaut 114

    "From your data, is the bandgap of ZnSe direct or indirect?"

    (urgent) Hi, This question was a part of an assignment sheet that I was given in 'Experimental Physics III' after having completed and obtained data for the practical called 'The Bandgap Energy of Semiconductor ZnSe'. Cheers Below are some screenshots of the (Matlab-processed) data we obtained...
  47. zehkari

    Calculating Drag Coefficient from Wind Tunnel Data

    Homework Statement Using a wind tunnel to measure force and velocity at different drive %, we obtained some data for drag. We used one dimpled sphere and one smooth sphere. There are a couple of questions I am stuck on. "The force you have measured is known as the drag force and can be...
  48. J

    Is there a byte that never occurs in a float?

    A float consists of four bytes. I am using a data stream of floats through radio. This data occasionally is not a float though, and sometimes I'd want to stop the logging of floats of the stream, by inserting a "stop char", which would be checked by the program. Previously I was using simple LF...
  49. kostoglotov

    Which data set should I trust more?

    Homework Statement Control systems subject. Have a DC motor setup on a difference op-amp, then with gain, and the axle moves a wiper on a pot used for feedback. I understand the concepts reasonably well enough to complete the assignment. I'm using MATLAB scripts to approximate parameters...
  50. A

    A Download full data from National Nuclear Data Center

    Dear physicists, is it possible to download the full data of half-lives (or other parameters) from the Chart of Nuclides at http://www.nndc.bnl.gov/chart/ ? I can view the data one by one to extract what I need, but that is too time consuming. So, if there's any way I can download all the information, it...