What is Data: Definition and 998 Discussions

Data are units of information, often numeric, that are collected through observation. In a more technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects, while a datum (singular of data) is a single value of a single variable.Although the terms "data" and "information" are often used interchangeably, these terms have distinct meanings. In some popular publications, data are sometimes said to be transformed into information when they are viewed in context or in post-analysis. However, in academic treatments of the subject data are simply units of information. Data are used in scientific research, businesses management (e.g., sales data, revenue, profits, stock price), finance, governance (e.g., crime rates, unemployment rates, literacy rates), and in virtually every other form of human organizational activity (e.g., censuses of the number of homeless people by non-profit organizations).
Data are measured, collected and reported, and analyzed, and from data visualizations such as graphs, tables or images are produced. Data as a general concept refers to the fact that some existing information or knowledge is represented or coded in some form suitable for better usage or processing. Raw data ("unprocessed data") is a collection of numbers or characters before it has been "cleaned" and corrected by researchers. Raw data needs to be corrected to remove outliers or obvious instrument or data entry errors (e.g., a thermometer reading from an outdoor Arctic location recording a tropical temperature). Data processing commonly occurs by stages, and the "processed data" from one stage may be considered the "raw data" of the next stage. Field data is raw data that is collected in an uncontrolled "in situ" environment. Experimental data is data that is generated within the context of a scientific investigation by observation and recording.
Data has been described as the new oil of the digital economy.

View More On Wikipedia.org
  1. Clever Penguin

    B Cosmological data margins of error

    I am currently creating a database of all the known cosmological objects, and need to know the percentage error in the following data: Diameter of the sun: 1.3914x108 metres Mass of sun: 1.989x1030 kilograms I have the orbital period of the sun to be: 2.75x107± 9.1% years
  2. Planobilly

    Questions about vacuum tube data sheets

    Hi, Here is a typical data sheet for a commonly used 6l6GC tube. http://www.drtube.com/datasheets/6l6gc-jj2003.pdf Under typical characteristics (in this data sheet) a value of Ra and Ra-a are given. I assume Ra-a means Za-a and it could also be written as Zout. Is this a correct assumption...
  3. C

    Finding Spectral Slope in dB/Octave for Hydrophone Data

    Homework Statement I have some registration of sound gathered by hydrophone. Next I have created a power spectral density (dB re 1 Pa^2/Hz) vs frequency plot (semilog in matlab). And now I want to find spectral slope in dB/octave (one octave is log2(f2/f1). I suppose that I should calculate the...
  4. jdawg

    Help with Plotting Data & Function in Matlab

    Homework Statement I'm a little lost on how to plot this data and function. I included the homework question and my attempt at plotting in the attached picture. I'm pretty sure what I have is completely wrong and I honestly don't have much of an understanding of matlab, so the more you dumb it...
  5. sounouhid

    MATLAB Fit Experimental Data with MATLAB: Best Practices

    Hello everyone help please I need to fit experimental data with a theoretical model using the MATLAB software what is the best way to do it? thx for advance
  6. ChrisVer

    I Fit a Poisson on Gaussian distributed data

    Hi, I have a simple/fast question... Can you reliably use a Poisson function to fit on data that seem to be Gaussian distributed (although that is due to the large number of the mean)?
  7. Einstein's Cat

    B Data on time for fading of galaxy?

    Galaxies that are further than c/H metres from us, have recessional velocity exceeding c and thus they begin to fade away. Is there any data for how long it takes galaxies to fade away from the perspective of an observer on Earth where when t is zero, the recessional velocity of the galaxy is...
  8. A

    What sparked my interest in gravity, space, and technology: A personal journey

    Hi i have an interest in gravity, space, and general developments. Background in in Telecoms and IT to management level. I Operate out of own business providing information solutions of various kinds from data collection, processing to visualisation. I always had an interest in electronics as a...
  9. Bill McKeeman

    A Exploring Galactic Rotation Data in 3D: A Search for Peculiar Motion

    Does anyone know of galactic rotation data (any galaxy) of the following form: For simplicity assume an [x y z] coordinate system with the origin in the center of the galaxy and [x y] representing the plane of rotation. At any point [x y z] there is a corresponding velocity vector [vx vy vz]...
  10. 1

    I need a good data logging DMM -- 1.5kV Range

    Hello everybody, I am currently working on a research project and need to record the IV curve of a plasma discharge. Our current setup is pretty cool. We just have a web camera setup and using python+OpenCV we make meter reading inferences based off true/false statements based around what...
  11. L

    Extinction Coefficient from Time series data

    I have some time series data of the absorbance of Br2 formation using UV Vis spectroscopy and I need to figure out the extinction coefficient/ absorptivity. The overall reaction is BrO3-+5Br- +6H+-->3Br2+3H2O which is expcted to go to completion I know that the equation relating absorbance to...
  12. G

    C: Printing unsigned char data type

    Homework Statement Given a matrix n x m with unsigned char data type entries (entries are of size 1 byte, so data type of an entry should be unsigned or signed char, not int or char *). Entries are read in hexadecimal format (0x00,0x11,0xFF,...). Matrix should be allocated dynamically. Print...
  13. M

    MHB Algorithm creates representative set of data

    Hi all, I have algorithm to analyze and make it easier to implement in programming language (Python). We have table with data and we want to select only representative part. It looks like: ID_PRODUCT | CARDINALITY | SET VARIANCE WITH THIS ELEMENT AND ABOVE 10 ---------------- 110...
  14. M

    A Algorithm creates representative set of data

    Hi all, I have algorithm to analyze and make it easier to implement in programming language (Python). We have table with data and we want to select only representative part. It looks like: ID_PRODUCT | CARDINALITY | SET VARIANCE WITH THIS ELEMENT AND ABOVE 10 ---------------- 110...
  15. W

    A Using quantum-secured communication for data transfering

    Hello! I am wondering if it is theoretically possible to allow a means of data transfer (or internet, etc.) by the idea of quantum entanglement. Correct me if I make any errors in understanding. But, by what I understand, in essence you could for instance run a computation on a quantum computer...
  16. D

    I Regression: which parameters to use and how to plot the data

    Hello! I am yet very weak in statistics, but I am learning some basic finance, and this requires to create regression. Please, take a look at attached files - one excel that contains the results of regression and one screen shot of the window of StatPlus that I have to fill in. Before using my...
  17. S

    Need help with accelerometer data processing

    Hi, am working on a vehicle tracking device, i am using LIS3Dh accelerometer to get the acceleration data. i am using TM4c1231e6pz controller. I need to implement a harsh breaking alert, I am having difficulty in finding out which direction the vehicle is moving since the axis are not aligned...
  18. R

    I Extracting characteristics from time series data

    hi I have a random set of time series data that is calculated after applying an algorithm to a main random time serie data, and really need to extract all the possible characteristics from the set. The goal is to measure those characteristics and perform some statistical graphs based on those...
  19. M

    Mathematica How to get data points from plot?

    Hi PF! I used NDSolve to find the solution to a differential equation. I then plotted the solution in mathematica. However, I would like to be able to plot this in LaTex, specifically in TikZ. Can anyone help me here? Thanks so much!
  20. V

    Modifying h(t) to Match New Tidal Data

    Homework Statement The function h(t) = 5 sin (30(t+3)) is used to model the height of tides. On a different day, the maximum height is the minimum height is -8 and high tide occurs at 5:30am. Modify function so it matches new data. Homework EquationsThe Attempt at a Solution Answer: h(t) = 8...
  21. Josh Terrill

    B Linear regression with two data sets?

    I want to try to predict the USA summer highs using a linear regression. I know I can probably take data from the last 10 summers and plug that in, and use that to predict, but I'd like to use two data sources. 1 data source from the historical highs from past summers in the USA, and the 2nd...
  22. V

    What are the Max and Min Values for Tide Height?

    Homework Statement The height, h, in metres, of the tide in a given location on a given day at t hours after midnight can be modeled using the sinusoidal function h(t)= 5 sin 30 (t-5) + 7 (A) find max and min value for depth of water. (B) what time is high tide? What time is low tide? (C) what...
  23. Katejk

    Can Technology Enhance Psychological Monitoring in Therapy Sessions?

    Hi everyone, I am an undergrad student of psychology and I was curious is anyone from you would be interested to cooperate with me to discuss possibility about device which would help psychologists monitor data about patients. From what I know we can measure breath, pulse and even temperature...
  24. Chezz42

    In my experiment on transformers, the data showed

    So I did an experiment a this week on the relationship between two tightly wrapped coils with same length and number of coils in a transformer. This was in a compete circuit using AC currents. I measured the Voltage, keeping the currents the same. One experiment had an iron core between the two...
  25. Greg Bernhardt

    B CERN releases 300TB of LHC data to public

    Ok, who has a spare super computer? http://cms.web.cern.ch/news/cms-releases-new-batch-research-data-lhc What is there motivation for this and realistically what can come from it?
  26. mfb

    I Data Collection Begins: Monitoring the Diphoton Excess

    Data collection can begin! This night the luminosity ("collision rate") was negligible (0.05% of the design value), but it should go up quickly as more and more bunches are filled in for the runs. By August we might know if the diphoton excess is something real or just an extremely weird...
  27. F

    I Question about fitting periodic data

    My experience with data fitting is poor so I am in real need for help. The potential in the following is periodic over [0,Pi] I need to find a fitting function that I can use to perform further mathematics. Fourier series does not work, but a 40-degree polynomial give the following fit...
  28. Z

    A Question about a particular paper on categorical data

    I am not sure this is the right forum for this -- I have a question about a particular paper: http://www-users.cs.umn.edu/~sboriah/PDFs/ChandolaCBK2009.pdf The authors describe 4 heuristics that can be derived from categorical data -- this is in order to map categorical data to numerical...
  29. M

    Can a Node Have Multiple Parents in Trees?

    i know that a node cannot have more than one parent given that these parents have common ancestor (because this is undirected cycle and a tree must have no cycles). but can a node have more than one parent given that these parents don't have common ancestor (which will produce an unrooted tree i...
  30. kubaanglin

    B Why Does My Vacuum Leak Rate Change at 25 Microns of Mercury?

    Hello Physics Forums, After about one year of research and construction, I have nearly finished building a functioning inertial electrostatic confinement fusion reactor. Just to be clear, I do not wish to discuss the dangerous activities that are involved with my project as I know such topics...
  31. K

    I Data Plotting Help: Calculating Error Bars for Gradients and Average Gradient

    I'm doing an experiment at work where I am observing an "event" over time. This event can be anything, but let's assume its a bucket of water being filled to the top, then it gets replaced with another bucket and I watch the whole "event" again. So x-axis will be time, y-axis will be the volume...
  32. S

    Fit blackbody spectrum to data in python

    Hi! I have to fit a blackbody spectrum to some data points. The y-axis is in mJy and the x-axis is in log_10(freq). My code looks like this: from __future__ import division import matplotlib.pyplot as plt import numpy as np from scipy.optimize import curve_fit h = 6.63*10**(-34) c =...
  33. S

    A Where to find galaxy mass function observational data?

    Dear all, I am looking for observational data for the number count of galaxy mass function: \begin{equation} dn/dM\end{equation} in terms of redshift and also mass, to compare with theories. I know that I can use HIPASS data (most probably), but as I am new to the field, I have to idea: 1-...
  34. saybrook1

    Help finding a polynomial function given a set of data

    Homework Statement Hello guys, I have a set of data containing x and y coordinates(width and length) as well as a 'z' coordinate that represents power density at each point of x and y given. I was hoping that someone might be able to help me figure out a way that I can find a function for z in...
  35. phosgene

    Comparing SD of data with RMSE of regression line

    Homework Statement I'm being asked to compare the standard deviation of a data set with the root mean square error of the regression line used to model the data, in order to determine the reliability of the regression line. Homework Equations Mean squared error = variance + bias squared The...
  36. T

    I Calculating noise in a data sample - what region to use

    I have a data set of number of counts vs position where counts were detected. I want to find the noise in the sample. Am I right to think that by 'noise' the requestor wants to know the standard error (SE = stdev/sqrt(N)) where N is the sum of x-axis points. Also if the above it true then is the...
  37. Huyen401

    Fortran Fortran: reading the data from file

    Hello, I have data in file and I want to read the data into variable in fortran to save memory caculation. I want to know: When I open file inputdata, whether fortran have any notices about the way it read data into variables? (like mathematica: each open file, we just read in the order anyway)
  38. E

    Graphing data in a lab- astrophysics

    Homework Statement Hello! So, I recently did an experiment where I altered the eccentricity and distance from the sun of a planet orbiting around the Sun in a simulation and measured how long it took the planet to complete a single orbit. With this data, I compared my experimental data with the...
  39. F

    Non-Ideal Battery Voltage with few data points

    Homework Statement The problem includes a graph. All I have is a current to external resistance graph, with 20 A coming at 10R of external resistance. I am to find the EMF and internal resistance of the battery. Here is the problem and my two attempts. http://1drv.ms/1LKbu5H Homework...
  40. R

    B Cosmic Ray Muons: Finding Experimental Data for Special Relativity

    Hi everyone, I'm current working on a project about special relativity, and i was thinking writing about the cosmic ray muons. But where do i, as a high school student get raw data of cosmic ray muons? I have searched quite a bit, but it doesn't seems like data like that is public and easy...
  41. W

    Standard Data Types for Web Addresses, Phone Numbers?

    Hi All, Just curious: what kind of data types does one usually use for web addresses, for phone numbers? EDIT: I am using MSSQL 2014 . Thanks.
  42. E

    I How to Extract Data from an Integral

    Consider this form: ##A = \int B\left(x\right) C\left(x\right) dx## I have the values for ##A## and ##C\left(x\right)## (a value of ##C## per value of ##x##), is there a way that I can extract ##B\left(x\right)## numerically or analytically? Thank you in advance.
  43. R

    Help interpreting processed data (and their transforms)

    Hi, so I have the spatial distributions of detected hits in figure 1. When plotting fig 1 as a regular scatter plot I thought I could discern some sort of pattern. So I got the idea of taking its Fourier transform and to see the result of the analysis. I am not very well acquainted with the...
  44. M

    Data structure and algorithms time computation?

    Assume the following set of instructions: 1. i = 0 2. if i < n, goto line 6 3. if A [ i ] = = x, goto line 7 4. i++ 5. goto line 2 6. return false 7. return true Assume that line i take Ci time, where Ci is a constant. The worst case total time of running this block of code can be calculated...
  45. websterling

    LIGO GW150914 Data Release & Tutorial

    https://losc.ligo.org/events/GW150914/']The[/PLAIN] LIGO Open Science Center has released data from the gravitational wave detection along with a tutorial going through some typical signal processing tasks on strain time-series data associated with it. GW150914 Data Release From the tutorial...
  46. N

    Convert data from weird to regular data

    Dear Group, I have a 30_sec_data.txt with weird characters in it and data.txt with nice numeric in it, Any one have any ideal to help me convert data from weird to regular data using Matlab. Thank you, Best regard,
  47. F

    SONET protocol for data transmission over fiber optics

    hello forum, I have read about SONET which seems to be a physical layer protocol to transport data over fiber optics. SONET is a TDM (time division multiplexing method). TDM means that that time divided into slots and shared between different users. For example, given three users A, B and C...
  48. W

    Big Data and RDBS (Relational DB). Do They Fit?

    Hi All, I am having trouble seeing how Relational Databases (RDBS) can be used in the world of big data. The inflow of data seems to be way too fast for the database to reflect what is going on at a given moment. I understand this issue is supposed to be addressed by data warehouses. Is...
  49. G

    Discover Visible Wavelengths for Neon Spectrum | Resources & Links Included

    I need a list of neon visible wavelengths. I wonder if any of you know of any good resources. I've tried searching but sometimes I get too many numbers. I also need to link each number with the spectrum colour chart, but so far I can't find anything like that. cos I don't want just want an...
  50. R

    Data from pulsars - light curves?

    Hi everybody, I hope some of you have worked with pulsars before or other x-ray data from NASA Heasarc. I need some data showing a very precise light curve of the crab pulsar and some other pulsar. It should be something like this: http://cdn.eso.org/images/screen/eso9948i.jpg Where the time...
Back
Top