Data Generation with Requirement

  • Thread starter: awaludin
  • Tags: Data Generation
SUMMARY

The discussion focuses on generating data for a linear equation with three independent variables (x0, x1, x2) and one dependent variable (y) while achieving specific R² values (0.8 for x0, 0.9 for x1, and 0.85 for x2). The user proposes creating a linear dataset with added disturbances to manipulate the R² values. However, they seek guidance on how to effectively control and achieve the desired R² values in their generated data.

PREREQUISITES
  • Understanding of linear regression and R² values
  • Familiarity with data generation techniques
  • Knowledge of statistical disturbances and their effects on data
  • Experience with programming tools for data manipulation, such as Python or R
NEXT STEPS
  • Research methods for controlling R² values in generated datasets
  • Explore Python libraries such as NumPy and Pandas for data generation
  • Learn about statistical disturbances and their impact on regression analysis
  • Investigate the use of simulation techniques for data generation
USEFUL FOR

Researchers, data scientists, and statisticians involved in data generation and analysis, particularly those who need synthetic linear-regression data with prescribed R² values.

awaludin
Dear All
I need to generate data for my research. Suppose I have a linear equation with 3 independent variables and 1 dependent variable:
a0x0 + a1x1 + a2x2 = y
I want each of my variables (x0, x1, x2) to have a defined R² (R squared) with y, say 0.8 for x0-y, 0.9 for x1-y, and 0.85 for x2-y. Is there any reference on how to do this? Thank you.
 
What are your thoughts on the matter? You need to show that you've attempted the problem first!
 
My first thought was to create data for a line and add a disturbance to it, using a different disturbance for each variable. For example, line data + random disturbance = x0, and line data + sine disturbance = x1. This gives me different R² values, but I still don't know how to control the value of R².
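One possible way to control R² in the "line + disturbance" idea is to scale the disturbance. If the disturbance is uncorrelated with the line, then R² = var(line) / (var(line) + var(disturbance)), so the disturbance variance needed for a target R² is var(line) * (1/R² - 1). The sketch below is only an illustration under assumptions: it treats the line itself as y, uses Gaussian noise as the disturbance, and the helper names `with_target_r2` and `r_squared` are made up for this example. It controls each pairwise R² between x and y, but it does not try to make a0x0 + a1x1 + a2x2 = y hold exactly, which is an extra constraint.

```python
import numpy as np


def with_target_r2(base, target_r2, rng):
    """Return base + scaled noise whose sample R^2 with base equals target_r2.

    Hypothetical helper for illustration; assumes 0 < target_r2 < 1.
    """
    base = np.asarray(base, dtype=float)
    centered = base - base.mean()

    # Draw a random disturbance and center it.
    noise = rng.standard_normal(base.shape)
    noise -= noise.mean()

    # Remove the part of the noise that lies along the base signal, so the
    # two are exactly uncorrelated in this sample (not just on average).
    noise -= centered * (noise @ centered) / (centered @ centered)

    # R^2 = var(base) / (var(base) + var(noise))
    #   =>  var(noise) = var(base) * (1 / R^2 - 1)
    scale = np.sqrt((centered @ centered) * (1.0 / target_r2 - 1.0) / (noise @ noise))
    return base + scale * noise


def r_squared(u, v):
    """Squared Pearson correlation, i.e. R^2 of a simple linear fit."""
    return np.corrcoef(u, v)[0, 1] ** 2


rng = np.random.default_rng(0)
line = np.linspace(0.0, 10.0, 200)  # plays the role of y here (assumption)
targets = {"x0": 0.80, "x1": 0.90, "x2": 0.85}
xs = {name: with_target_r2(line, r2, rng) for name, r2 in targets.items()}

for name, x in xs.items():
    print(name, r_squared(x, line))
```

Because the noise is made orthogonal to the line before scaling, the sample R² should match the target up to floating-point rounding rather than only approximately. Skipping the orthogonalization step and just scaling random noise would give R² values that scatter around the target.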
 
