What is Data: Definition and 998 Discussions

Data are units of information, often numeric, that are collected through observation. In a more technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects, while a datum (singular of data) is a single value of a single variable.Although the terms "data" and "information" are often used interchangeably, these terms have distinct meanings. In some popular publications, data are sometimes said to be transformed into information when they are viewed in context or in post-analysis. However, in academic treatments of the subject data are simply units of information. Data are used in scientific research, businesses management (e.g., sales data, revenue, profits, stock price), finance, governance (e.g., crime rates, unemployment rates, literacy rates), and in virtually every other form of human organizational activity (e.g., censuses of the number of homeless people by non-profit organizations).
Data are measured, collected and reported, and analyzed, and from data visualizations such as graphs, tables or images are produced. Data as a general concept refers to the fact that some existing information or knowledge is represented or coded in some form suitable for better usage or processing. Raw data ("unprocessed data") is a collection of numbers or characters before it has been "cleaned" and corrected by researchers. Raw data needs to be corrected to remove outliers or obvious instrument or data entry errors (e.g., a thermometer reading from an outdoor Arctic location recording a tropical temperature). Data processing commonly occurs by stages, and the "processed data" from one stage may be considered the "raw data" of the next stage. Field data is raw data that is collected in an uncontrolled "in situ" environment. Experimental data is data that is generated within the context of a scientific investigation by observation and recording.
Data has been described as the new oil of the digital economy.

View More On Wikipedia.org
  1. S

    I How do I normalise my data to a maximum of 100?

    I have this set of data (in the attached image) and I'm trying to normalize the counts at each angle to a maximum of 100. I'm not sure how to do it though. Any tips would be much appreciated. I have the mean and std dev for each count at each angle. I don't really know what to do from here...
  2. J

    B Do particles "care" about consciousness or simply the data

    I'm curious if there have been any variations to the double slit that specifically looked at whether the particles behavior was determined by the computer keeping the data or if a consious person was there to see it. For example what I had in mind was having an isolated room where the computer...
  3. G

    Interpolating data of a bandpass filter with Q=10.4

    Hello everyone. I'm trying to interpolate the data taken (frequency in Hz vs A in dB) from a bandpass filter with Q = 10.4. The problem is that I'm not entirely sure about the transfer function that I should use to interpolate it. I'm trying to extrapolate the peak frequency, Q factor and...
  4. S

    Basic kinematic problem, find velocity using certain data

    Hi. Sorry, couldn't make a more specific tittle, I'm not used to solve physics problems. A friend of mine handed me the following problem: Homework Statement You have a graph that denotes the aceleration of an object as a function of the position of the object. This object has a mass of 10 Kg...
  5. Jameson

    MHB What is the Kaggle Data Science Bowl 2017 competition about?

    There is a neat site called Kaggle that is home to lots of data science info and the place of very featured competitions with large cash prizes. The goal of this year's competition was to create a model to detect lung cancer. It just wrapped up last week and the results are being verified right...
  6. Chronos

    I Using big data to identify astronomocal data bias

    I've been following, albeit loosely, the use of big data to refine astronomical data. It has been frequently noted that astronomy is an excellent test ground for big data approaches. I'm led to wonder what kind of results have been achieved to date and how effective are these methods for...
  7. B

    I Data from a delayed choice quantum eraser experiment

    Hello! Does anyone here have raw data from a delayed choice quantum eraser experiment that you wouldn't mind sharing? Ideally, data from all the detectors, not just coincidence counts that the analysis usually focuses on? Or, are you aware of any data repository where data from quantum physics...
  8. K

    Acceleration Data Analysis Guidance for MSc

    Hi I'm currently on the data analysis section of my MSc and I'm after some guidance if possible please. Basically I've managed to figure out many variables from force plate data including displacement/net force/velocity/power etc however I'm trying to do the same with data collected from a...
  9. gelfand

    How Do Mean, Standard Deviation, and IQR Reflect Differences in Data Sets?

    Homework Statement Compare and contrast the given data Homework Equations None needed for this The Attempt at a Solution I'm never too sure what kind of thing I'd be expected to do for something like this. Here's how I would go about it, but would appreciate any pointers / things to...
  10. V

    A How can I resample data with errors linearly in log space?

    I need to resample a set of data and its errors linearly in log space, with the same number of points. I was just going to interpolate between points to get the data - but how do I calculate the errors?
  11. D

    Graphing data from the Compton effect

    Homework Statement Not sure whether I should be posting this here or in the quantum physics thread but I felt this more of a 'homework' question. So basically I have done an experiment in which I measure the energy of light that has been scattered through a steel rod from a radioactive...
  12. P

    Maximise profit knowing manufacturing data

    Homework Statement Hi, this is my data and problem. [PLAIN]http:// Homework EquationsThe Attempt at a Solution So I am thinking that I can use system of equation to get the number of tables and closets for given resources. 0,2x+0,1y=40 0,1x+0,3y=60 1,2x+1,5y=371,4 But this does not include...
  13. Nikhil N

    How to choose an amplifier to couple the noise into data line

    Hii... I have to couple the noise into a communication cable for analyzing the performance of the communication. I am using a function generator which will generate a white noise of 1-2V(rms) and I have a current transformer to couple this to the communication line. The current transformer has...
  14. M

    I Is the F-test Calculation Different for Longitudinal Data in Linear Regression?

    Can i Use a standard F-test on longitudinal data for a linear multiple regression? Mons
  15. FallenApple

    A Should I Ignore Data Driven Models or Use Bayesian Methods for Model Selection?

    So it was mentioned that one should ignore data driven models because this could inflate the type 1 error as a result of overfitting. So the model should be decided even before looking at the data set. But at the same time, other sources say we should look at the scatterplot matrix to determine...
  16. W

    I Data needed: Relative permittivity of air

    Where can I found values of the relative permittivity of air at different temperatures, frequencies, pressure, humidity, etc. or its dependence? I'm particularly interested in data around 1.4 GHz, 25ºC, 1 atm. 50% hum. Thanks in advance.
  17. Arman777

    MATLAB Plotting Data with Matlab - Learn How to Code & Graph

    I have some datas from our physics experimetn and we have to plot them on matlab.Idk how to do them,I don't know how to write codes.Also I have to write of the axises and table names
  18. N

    Why “sudo cat /dev/ttyACM0” run only 1 time? (GPS)

    hi group, I am working on the GPS data collection, I Need help. If anyone who has been working on the u-blox GNSS Evaluation Kit Time EVK-M8T before and have seen it, please help. I do not know why sudo cat /dev/ttyACM0 only 1 time. I have the U-blox EVK-M8T which can read the NMEA messages...
  19. R

    I Experimental Data - Error in slope

    I have conducted a tensile test on five specimens. I intend to do a linear regression for every set of data and get a value for the slope (modulus of elasticity) and its error by finding the standard deviation (using LINEST function on excel) of the slope. I will now end up with 5 slope values...
  20. Hercuflea

    Python Best way to get data from a website that is not obviously tabulated

    Hello, I'm trying to download and analyze the data from this link. I've used Python BS4 to read tabulated data before from a website, however this webpage is more complicated than any I have seen before. It's not set up as a table (at least that I can tell using inspect element). Is there a...
  21. N

    I Bell's inequality experimental data

    Everything I've seen about Bell's inequality has had the setup of 120 degree angles between the axis of measurements. The experiment then proves that the basic hidden variable theory can't be true. But the actual measurement has always been told to me as a 0.5 correlation. 50% of the time the...
  22. N

    Why the data does not print out with eclipse in unbuntu?

    Dear group, I am using Eclipse in the Ubuntu operation system and trying to print out this data from the begin to the string "END OF HEADER" but I do not understand why it did not give me the one as I expected. I also used this file "albe0320_1.17n" to test and it worked, but the file that I...
  23. xjcov

    Analysis of collisions from data

    Good afternoon! I would like to preface by saying, yes, this is for a project. I am only posting here to see if my method of solving is correct before I finish the project incorrectly. Homework Statement I chose two balls, mass A: .553 kg and mass B: .410 kg I recorded their collision and...
  24. fluidistic

    Exploring Data in the CPU Cache

    Hello, I wonder what kind of data usually go into the CPU cache. Generally the size of the cache is in kB or MB in contrast with RAM which is in GB. I understand that the cpu accesses the RAM much slower than the cache so it's better for it to find the data in the cache instead of in the RAM...
  25. E

    Job Skills How can I transition from academia to become a data scientist in the industry?

    Hello all, I'm trying to come up with a detailed plan on how to make a transition from academia to become a data scientist in the industry. My academic background is: B.Sc in Computer Engineering, and M.Sc and PhD (and ~ 2 years postdoc experience) in Electrical Engineering/Wireless...
  26. B

    Do Different Peak Heights in XRD Analysis Indicate a Different Substance?

    The major diffraction peaks of my sample have essentially the same 2θ values as the reference (graphically), but have different heights. Can it still count as conclusive evidence that my sample matches the reference? Or does it suggest that my sample is a different substance? Also, as a side...
  27. omar-us

    TEG&Li+ batteries to power data collection sensors

    Hi all, This is Omar undergraduate electrical engineering student. I am doing my senior project on powering data collection sensors( 3- resistance temperature detectors and 1- pressure transducers) of autoclave by using thermoelectric generator!. the sensors are connected in // to a 24V common...
  28. Immersion

    Physics Better MSc. option to be Data scientist if I'm Physicist?

    Hi everybody, I have a bachelor degree in physics and I want to be Data Scientist, I have good programming skills and background in mathematical statistics, so, between options that I have found of MSc. are applied mathematics, applied statistics, statistics, applied physics, computer science...
  29. GhostLoveScore

    I Why are the bins N/2 - N not a mirror image of bins 0 - N/2 in FFT analysis?

    I am trying to do Fast Fourier Transform on some data recorded from RTL SDR. I managed to write a program that does that, but the problem is this. This is final result as it should look And this is my result It may be hard to understand this, I'll try to explain. My graph is done using 5000...
  30. dkotschessaa

    B How Is Data Specifically Defined in Algebraic Topology?

    Sometimes while poking around for stuff in topology or algebra I find the word "data" used in this context, i.e. in Hatcher: (https://www.math.cornell.edu/~hatcher/AT/AT.pdf) Does it just mean "information" in a general sense or is there some precise algebraic meaning? It's impossible to...
  31. N

    C/C++ How to get data from text file in C++?

    Hi Group, I am trying to get data from text file, I hope someone can suggest me how to do? I also have this code which can read data, but i do not know how to scan every single line to get any information I want. Please help, Thank you very much. Here is my code: #include <iostream>...
  32. EnumaElish

    Physicists set to revolutionize big data, AI

    When I opened up the article https://www.wired.com/2017/01/move-coders-physicists-will-soon-rule-silicon-valley/ I expected to see quantum computing as the next field physicists are to revolutionize. I was surprised to see it was data management and machine learning. I am happy for physicists...
  33. jdawg

    Statistics: Pattern present in dice data?

    Homework Statement I generated data for a dice experiment. For the first case, two dies were rolled and the minimum number and the sum were recorded. For the other cases with three, four, and five dies, the minimum and sum were also recorded. I attached a picture of the tables with my data and...
  34. A

    Job Skills Masters in Statistics vs in data science? Is DS just buzz?

    which do you think is the smarter choice, in terms of employ-ability: ms in statistics or Ms in data science? do you think "data science" is just a buzz words that will die out? is a data scientist someone who can't program as well as the computer engineer, and can't build models as well as a...
  35. Carl Loomis-Anderson

    Fortran Fortran77 data compiler problem

    So I am doing a chemical simulation of titans atmosphere and I have potentially 1000 data files to sort through to retrieve concentration values written in Double format. The issue is that each chemical has its own line with well over 80 columns (1993 currently, though it is subject to...
  36. Killtech

    I Looking for experimental data

    i want to get a better understanding of quantum mechanics (specifically for the multiple particle case) and came to a point where looking deeper into the theory won't yield the answers i seek. so to my discomfort i fear i'd need to look at experimental results to extend my understanding. so my...
  37. U

    Correlating experimental VLE data

    Hello, the system is alcohol and water + pH-buffer that is supposed to alter the volatility of the mixture and I need to correlate the experimental data (x1,y1,T measured at constant P). Question: Can I use binary models like margules, van Laar and NRTL to correlate the data and see if the...
  38. PsychonautQQ

    Math 2 year Data science masters program? Teaching Math abroad?

    Hello PF! This place is so great at helping me with my math homework, maybe ya'll can give me advice on life as well! As the title indicates, I'm looking for any information/opinions what-so-ever on topics of teaching math abroad at a maximum of a high school level and how beneficial would it...
  39. BillTre

    538's Awards for Best and Worst Data Stories of 2016

    Link awards: Statistical Fortitude Best Use of Data to Speak Truth to Power "Word of the Year" of the Year Trudeau Prize for Governance The Barest Minimum of Progress Achieved Boldest Sacking of Experienced Humans in Favor of Untested Algorithm The "Are We Still Doing This for Willful...
  40. Arez

    Fortran Reading data from a table file

    Hello I am very new to programming and fortran. I have a text file formatted the following way: Name Peter John Sally Joseph Luke Vader etc... age XXXX XXXX XXXX XXXX XXXX XXXX XXXX height XXXX XXXX XXXX XXXX XXXX XXXX XXXX weigh...
  41. R

    I A spaceship traveling close to the speed of light sending some data....

    A spaceship traveling at speed of light close to speed of light (wrt inertial reference frame) sending some data every second on their clock to people who are stationary (wrt inertial reference frame). At what time these people would receive this data on their own clock? Let's say for a second...
  42. C

    B Can I find experimental data for the decay of Iodine-131?

    I'm currently undergoing an assignment in maths were it is is necessary to apply mathematical concepts to aspects in reality. Hence, I have chosen to model the radioactive decay of iodine 131, however I am required to plot the data of the radioactive decay of iodine 131, in order to find the...
  43. garrettwittag

    I Is Schrodingers cat real or just a way to put a lack of data

    you put a cat in a box with radiation And a poison vile, and the radiation has a 50% chance of killing the cat. Before you open the box and actually measure the results the cat is both dead and alive simultaneously. This is how subatomic particles work, as discovered from the double slit...
  44. C

    How to obtain MCNP tally data that is *not* normalized by nps

    Dear all, This is my first post in this forum. I would like to know how to obtain the result data of an MCNPX or MCNP6 tally for each simulated history, before the data of different histories is averaged and normalized by the total number of simulated source particles (nps). I'm calculating...
  45. Nikhil N

    How to test CST design studio for analyzing data errors

    Hi.. I want to know how can we specify the data(electrical) passing through cable that we are designing in CST design studio to test the effect of EMI on the data passing?
  46. DoobleD

    B Build a "full" wave function without data in simple problems

    Is it possible to build the full wave function for a simple problem in QM, such as an infinite well, without any experimental data ? I'm learning about QM, and I saw how to compute energy states (the wave function for each allowed energy level) in some usual QM basic problems. But then, I was...
  47. newjerseyrunner

    Where Is the Majority of Climate Data Stored?

    I keep reading news articles about how researches are trying to dump as much climate data as possible off of government computers before the new presidential term. How much data is actually stored on government servers and no where else? Petabytes? Exabytes? Is it the majority of raw data...
  48. J

    A Where to find experimental data?

    Hello folks, When you study a pendulum or the path followed by a ball falling under gravity the first thing you have in mind is the experimental result, because you can actually see it. It is easy to measure or find the position and velocity of those systems. If we talk about a particle with...
  49. Jules Winnfield

    I How do I compare a model to logarithmic data?

    I have a model which is quadratic (e.g. ##y = k x^2##). I'm comparing it against a large set of data (galaxy cluster masses) which spans several Log10 decades (e.g. ##10^{11}## to ##10^{15}## solar masses). What is the right way to say how good the data fits the model? Obviously the errors in...
  50. cynnetje

    A Do outliers exist in categorical data and how can they be detected?

    Hello! I am working on a pre-analysis plan and have to specify what I am going to do with outliers. I have two categorical variables (5 levels and 2 levels) and I will be performing a chi-square test for independence. I thought of using a boxplot to detect outliers, but now I am not sure if...
Back
Top