What is Data: Definition and 998 Discussions

Data are units of information, often numeric, that are collected through observation. In a more technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects, while a datum (singular of data) is a single value of a single variable.Although the terms "data" and "information" are often used interchangeably, these terms have distinct meanings. In some popular publications, data are sometimes said to be transformed into information when they are viewed in context or in post-analysis. However, in academic treatments of the subject data are simply units of information. Data are used in scientific research, businesses management (e.g., sales data, revenue, profits, stock price), finance, governance (e.g., crime rates, unemployment rates, literacy rates), and in virtually every other form of human organizational activity (e.g., censuses of the number of homeless people by non-profit organizations).
Data are measured, collected and reported, and analyzed, and from data visualizations such as graphs, tables or images are produced. Data as a general concept refers to the fact that some existing information or knowledge is represented or coded in some form suitable for better usage or processing. Raw data ("unprocessed data") is a collection of numbers or characters before it has been "cleaned" and corrected by researchers. Raw data needs to be corrected to remove outliers or obvious instrument or data entry errors (e.g., a thermometer reading from an outdoor Arctic location recording a tropical temperature). Data processing commonly occurs by stages, and the "processed data" from one stage may be considered the "raw data" of the next stage. Field data is raw data that is collected in an uncontrolled "in situ" environment. Experimental data is data that is generated within the context of a scientific investigation by observation and recording.
Data has been described as the new oil of the digital economy.

View More On Wikipedia.org
  1. jim mcnamara

    COVID What are the challenges in accurately reporting and interpreting COVID-19 data?

    � Source Deaths Data timestamp https://www]cdc.gov/coronavirus/2019-ncov/cases-updates/cases-in-us.html 175,651 Aug 23 2020 12:15PM EDT https://www]worldometers.info/coronavirus/#countries 180,724 August 24, 2020, 16:47 GMT Johns Hopkins U ARCGIS* 176,901 8/24/2020, 10:27:56 AM...
  2. T

    ADM formulation Initial Value Problem data per spacepoint

    I'm having a bit of trouble getting a clear picture of what is going on here, so if anyone can shed any light, it will be greatly appreciated. 1. I can see how the metric coefficients provide the six numbers per spacepoint, but it can't always be possible to transform the metric into a diagonal...
  3. W

    Python Data Structures: Guessing a Number

    Hi all, Trying to right a program in Python asking user to guess a number ( integer in finite range) until they make a correct guess, though I want to warn user when guess is too high --asking that they choose a lower number ,and same for when the guess is low, asking them to choose a higher...
  4. il postino

    Chemistry Calculate the boiling temperature of methanol from thermodynamic data

    Calculate the boiling temperature of methanol at 60 atm knowing that Tc = 513K, Pc = 78 atm and the acentricity is 0.555. I would like you to help me start the exercise. I thought about using the Pitzer Correlation to be able to calculate the fugacity coefficient, but I don't have the...
  5. M

    I Fitting rovibrational molecular data

    Hello! I have some data for rovibrational transitions between a ##X^2\Sigma^+## and ##A^2\Pi_{1/2}## and I need to extract the molecular parameters (e.g. B, D, ##\gamma## etc) for the 2 levels. I tried pgopher for a while, using Hund case B and A for the 2 states, respectively. However it...
  6. Roger Dodger

    B Choosing the number of trials when the data stream is unlimited

    I am collecting data from a Geiger-Muller radiation detector, which generates clicks that correspond to particles entering the detector. These clicks come in purely at random, so the number of clicks in a given time interval are governed by the Poisson distribution. My job is to find the average...
  7. brainpushups

    Data Plotting Software for HS Students

    I'm designing a course for 9th grade students that focuses on experimental methods in science. One topic that will come at the beginning of the course will be how to display experimental data graphically, including estimates of experimental error. I'm looking for some advice about what...
  8. FRANCVON

    Other Unsure between computational physics and data science

    What would be better to choose as a career COMPUTATIONAL PHYSICS or DATA SCIENCE Is there any pro and con?
  9. W

    Elementary Python Questions: Data Frames, k-nary functions

    Hi All, A couple of questions, please: 1) Say df is a dataframe in Python Pandas, and I select a specific column from df: Y=df[column].values. What kind of data structure is Y? 2) I want to find the sum of two numbers: Def Sum(a=0,b=0): return a+b If I want to find a sum over sum data...
  10. L

    Chemistry Find the formula of a hydrocarbon using combustion data

    number of moles of CO2 =0.089moles (using 2/22.4) number of moles of water = 0.067 (using 1.205/18) I know that all the carbon from the hydrocarbon is in the CO2 and all the hydrogen from the hydrocarbon is in the water and water creates x2 hydrogens so number of moles of C : H = 0.089 ...
  11. G

    Big Data and a Saturation Point

    I ask this with more of a software background than an engineering background, but here I go anyway. Big data is arguably the cultural motif or monograph of the information age. Trends involve immersing ourselves in media of various sorts and processing them at exceptional rates. Of course...
  12. marialovesphysics

    Data Management - Probability of Cards

    Here is my work so far: 52-13=39 There are 39 decks of cards left since the spades were removed. a) Then there 13 hearts therefore, (13/39 ) * (13/39 ) that would be two hearts but I am not sure what to do next. But I am sure that it would be 39 cards and 13 hearts on top (maybe) cus it is...
  13. M

    I Reduced chi square for few data points

    Hello! I need to make a straight line fit to 8 points, with errors on them. The data is like this ##x = [1,2,3,4,5,6,7,8]##, ##y=[377.488 691.191 , 1030.319, 1428.801, 1753.884, 2113.065 , 2398.642, 2797.664]##, ##y_{err}=[97.145, 131.452, 160.492, 188.997, 209.397, 229.840, 244.879...
  14. binbagsss

    MATLAB Matlab help please (generating a plot from this data)

    Hi I have saved data in pdf format, and I wish to generate a plot from this data. The data is iterations of optimising some function. I can't just copy and paste it into the live window obviously and then try to generate a figure, so how can I generate a figure, just as I would have done...
  15. W

    Back Up Phone Data: Solutions & Tips

    Hi, Hoping to back up my phone data. I used to use this free app SMS, which connected to gmail and uploaded the data there . Only now google/gmail has changed or tightened access rules and I get an error message when my phone tries to relay data to my gmail account. I have gone through...
  16. SymNeric

    Analysing a ##C_M## graph (pitching moment data)

    Hi guys, I hope everyone is safe and well. I'm currently nearing the end of my third year dissertation, and I'm looking at analysing pitching moment coefficient (CM) data over a full range of angles of attack for airfoils with different serrations on the trailing edge. What are things to look...
  17. Admiralibr123

    I Looking for Experimental Data on Isotopes (Nuclear Physics and Engineering)

    So, a website in which I just enter an element or an Isotope and it just lists all the relevant experimental data like mass, mass-defect binding energy etc. Also resources for the absorption data, resources to explore Monte-Carlo simulations, and other calculation tools would be awesome. Just a...
  18. P

    B How to get the typical value or typical data from the dataset

    I make a theoretical calculation and then compare the calculation result and the median of the corresonding measured dateset. The difference between them is very slight, so I state that the theoretical model is right and good. However one expert has suspended whether the median is typical...
  19. Another

    I What statistics are used to test data like this?

    I have 100 data. if I want to use data from 10 to 100 or from 20 to 100, which statistic should I use to test whether I can use data from 1 to 100 or 20 to 100 without significance?
  20. C

    I Find Experimental Results for Physics Project

    I'm a physics student in undergrad. For a project in our class in which we propose an experiment (which we will not actually perform and we can use resources we don't have access too for the "experiment") and base it on existing research for that topic. I am searching on ads and arxiv, so far I...
  21. M

    Bash script for moving data from one directory to another

    Hi PF! I'm a new Linux user (please be patient :) ). I would like to read data stored as a .dat file from a different directory so I can reference it; do you know how? Specifically, I want to replace a line from different C files (controlDict and U), located here ./system/controlDict and here...
  22. K

    I Subtracting background from data

    Hello! I have some counts measured at fixed points, both for background only and signal+background. Say that for a given x I have 16 counts with background only and 100 for background+signal. So the background is ##16\pm 4## and for the signal+background ##100\pm 10## (assuming poisson...
  23. E

    C/C++ C++ Program Suddenly Ends after Reading Huge Data

    I have written a C++ code in Visual Studio 2019 that requires an input tab-delimited text file and outputs a text file that is also tab-delimited. The data within the text file are stored in a vector and then it will perform calculations, whose results will then be written in a text file as...
  24. K

    I Linear fit on the difference of data points

    Hello! I have some data points obtained from a measurement and one of them is defined as the reference point. I need to compute the difference between that reference point and all the others (including itself) and plot the difference as a function of another variable (which doesn't have an error...
  25. B

    Thermodynamics: calculate thermodynamic derivative from data?

    I don't understand how to use output from an NPT molecular dynamics simulation to compute a thermodynamic derivative. I need to compute this (where "d" is a partial derivative, "T" is a subscript that means, "at constant temperature," and "E" is internal energy): -(dE/dV)T I have a simulation...
  26. Timboo

    B Tau neutrino flux in IceCube data

    What does this new find signify, i think we may be shortly due for something bad about to happen https://www.universetoday.com/144900/neutrinos-have-been-detected-with-such-high-energy-that-the-standard-model-cant-explain-them/https://arxiv.org/abs/2001.01737...
  27. DaveC426913

    Recovering data from a chip (MicroSD)

    My boy gave me his MicroSD (Kingston 16Gb Class 4) phone chip to see if I can recover the data. His phone apparently told him it was corrupt. He tried it in a new phone and apparently it "sort of" worked. Until an update came along. I'm not sure if this is two distinct issues (the corruption...
  28. arcTomato

    Engineering Fourier transform when the data is lacking datapoints

    I would like to know the equation of Fourier transform when the data has lack. like this sine wave.
  29. Cerenkov

    B Other lines of evidence for Dark Energy? (Besides supernova data)

    Hello. My current understanding (please correct, if wrong) is that the expansion of the universe is observed to be accelerating, rather than coasting or slowing down. The tentative cause of this acceleration has been given the placeholder name of 'Dark Energy'. One line of evidence for this...
  30. O

    I Spatial interpolation before or after data processing

    Let a set of values at several discrete points in 2D or 3D space be given. These values will be processed by an algorithm. At the end, processed values need not be known at the original locations but at grid points. Therefore, spatial interpolation needs to be applied. Is there a general...
  31. K

    Best software to fit molecular spectroscopy data

    Hello! I have some data from a molecular spectroscopy experiment, containing vibrational and rotational spectra, and I want to fit the peaks with Voigt profiles (one for each peak) in order to obtain the centers of the peaks. Do you know any software suitable for this kind of fit? I usually use...
  32. jedishrfu

    Genetic Data Tools Reveal How Pop Music Evolved

    An interesting article from the Physics Archive Blog:
  33. F

    Can I use a new SSD Hard Disk to store data without formatting?

    Can I use a brand new removed SSD HardDisk to store data without doing any thing at the beginning time(e.g without formating or other things)?I will use SSD to store ebooks(PDF,Djvu,epub,Audio files). Can I open books when I attach the removed SSD to laptop?
  34. H

    Rotate IMU data to obtain correct measurement data

    Hi I have collected data from a IMU on a boat. Currently I am using the angular velocity measurement vector ##\omega^b_{imu} = \begin{pmatrix} p\\q\\r\end{pmatrix} ## for use in kalman filter, where superscript ##b## is BODY frame. The BODY frame is given be x-axis pointing forward, y-axis...
  35. K

    I How to Bin Data for Spectrum Fitting with Poisson Errors?

    Hi! I have some measurements of the rate of a physical process versus energy. For each energy I have a number of counts and a measurement time associated to it. However, the step (in energy) at which the measurements are made is very small and also the measurement time is small, hence just...
  36. B

    I SXS Gravitational Wave Data: Initial Conditions Explained

    Hello! I need to do some analysis for a project with the SXS gravitational wave data: https://data.black-holes.org/waveforms/catalog.html but I am a bit confused about the initial conditions of their simulations. I read the paper they published about the data (it can be found at that website)...
  37. PeterDonis

    I Does the statistical weight of data depend on the generating process?

    The specific example I'm going to give is from a discussion I am having elsewhere, but the question itself, as given in the thread title and summary, is a general one. We have two couples, each of which has seven children that, in order, are six boys and one girl (i.e., the girl is the youngest...
  38. parazit

    I Comparing theoretical calculations with experimental data

    Dear users, The situation I have encountered is a simple statistical comparison of the experimental data, which accepted as correct, with the results obtained via six theoretical models. In the experimental data, there exist y values corresponding to x values and also the measurement errors of...
  39. W

    Python 2.7 Pandas BSoup4 Scrape: Outputs Column Names but not the Data

    Hi, trying to scrape a page : https://www.hofstede-insights.com/wp-json/v1/country I get the list of columns I want, but not the data assigned to the columns in the page that is being scraped. from bs4 import BeautifulSoup import pandas as pd import requests url =...
  40. S

    Get data from Simpack Post channels

    Hello. I would like to automate the process of getting result data from Simpack Post. It is easy to extract data from Simpack Post diagrams to txt-files and import to excel. But is there a way to extract data by choosing Simpack result chanels or generally of chanels? For example, when I define...
  41. benorin

    B Does the binomial distribution play a role determining p from data?

    In a game heroes have a maximum dodge rate, from experimental data we have 13 dodges out of 24 attacks (so 11 hits). A fellow on my discord server had immediately solved for the dodge rate as being 13/24. I started to explain it is not so simple as dividing (24-11)/24=13/24 is not the dodge...
  42. M

    I How to handle the infinity when making a least square fit for the first point?

    Hello! I have 5 data points with errors associated to them ##y_i \pm dy_i## and the corresponding ##x_i## values (which don't have uncertainties associated to them). I need to calculate the difference between the first of these points, ##y_1## and the rest, and fit a straight line to it...
  43. P

    Weather data from every planet

    I was curious about how much we could advance planetary science with the amount we are spending (and planning to spend) on the SLS. Specifically, I want us to increase the number of climates we study from basically Earth to every planet in the solar system. It looks like polar orbiting...
  44. M

    I Right way to fit some data

    Hello! I have the to fit a curve to the attached data (I plotted it both with and without error bars), where the error bars are Poisson errors i.e. ##\sqrt{N}##, where ##N## is the number of counts in the given bin. I want to fit 3 Gaussians + background and extract the values (and errors...
  45. K

    I How accurate are the peak values from different binning sizes?

    Hello! I am working on a spectroscopy project in which we adjust the wavelength of a laser and get some counts on the detector from some laser-atom interactions. The data that we have is in the form: ##(\lambda##, ##dt##, ##dN)##, where ##dt## is a time interval, ##\lambda## is the laser...
  46. S

    Is a "USB3 A Female" to "Micro B Male" OTG adapter w/ 5gbps data possible?

    Is it possible for a USB A 3.0 Female to Micro B Male OTG adapter to transfer data from the USB 3.0 source to the Micro B source at 5gbps speeds, and can the Micro B receive and process all of that data from it's Micro B port? Here's a link to the device, and a picture from it that claims this...
  47. K

    I Weighting data based on the errors

    Hello! I have some data (counts) with a Poisson error associated to it and I want to make a fit to the data. I am trying to weight the data inversely proportional to the errors, such that the data points with high errors are less important for the fit. However, using the the error on its own...
  48. anorlunda

    What can we learn from a friend's solar home data in Vermont?

    I visited a friend who has a very nice solar installation. He also has the software to do data collection and presentation. I thought it would be nice to share some of his data here. Perhaps we can link to this post as a reference in future solar discussions. First, some background. The...
  49. K

    I Take errors into account for a data fit

    Hello! I have some data in which the dependent variable ##y## has, for each data point, an error bar associated with it ##\delta y##. The errors are almost identical for each datapoint, so doing a weighted fit in terms of the errors would not change the results significantly. How can I take the...
Back
Top