Data Generation with Requirement

  • Thread starter Thread starter awaludin
  • Start date Start date
  • Tags Tags
    Data Generation
AI Thread Summary
To generate data for a linear equation with specified R² values for independent variables, the user seeks guidance on achieving defined correlations between the variables and the dependent variable. The proposed method involves creating a linear dataset and adding various disturbances to each independent variable to manipulate their relationships with the dependent variable. However, the challenge remains in controlling the exact R² values for each variable. Suggestions for achieving this include using statistical techniques or simulations to fine-tune the disturbances. The discussion emphasizes the need for a systematic approach to ensure the desired correlation coefficients are met.
awaludin
Messages
2
Reaction score
0
Dear All
I need to generate data for my research. Let I have a linear equation with 3 independent variables and 1 dependent variable
a0x0 + a1x1 + a3x3 = y
I want each of my variable(x0, x1, x2) has a defined R2 (r square) with y. Let x0-y = 0.8, x1-y = 0.9, x2-y = 0.85. Any reference how to do this? Thank you.
 
Physics news on Phys.org
What are your thoughts on the matter? You need to show that you've attempted the problem first!
 
My first thought was to create a data of a line and add some disturbance on it. Different disturbance, different variable. Let say that the line data + random disturbance = x0, the line data + sine disturbance = x1. I can get a different r square, but I still don't know how to control the value of r square.
 
I tried to combine those 2 formulas but it didn't work. I tried using another case where there are 2 red balls and 2 blue balls only so when combining the formula I got ##\frac{(4-1)!}{2!2!}=\frac{3}{2}## which does not make sense. Is there any formula to calculate cyclic permutation of identical objects or I have to do it by listing all the possibilities? Thanks
Since ##px^9+q## is the factor, then ##x^9=\frac{-q}{p}## will be one of the roots. Let ##f(x)=27x^{18}+bx^9+70##, then: $$27\left(\frac{-q}{p}\right)^2+b\left(\frac{-q}{p}\right)+70=0$$ $$b=27 \frac{q}{p}+70 \frac{p}{q}$$ $$b=\frac{27q^2+70p^2}{pq}$$ From this expression, it looks like there is no greatest value of ##b## because increasing the value of ##p## and ##q## will also increase the value of ##b##. How to find the greatest value of ##b##? Thanks
Back
Top