Help with generalized linear model

Click For Summary
SUMMARY

The discussion focuses on modeling the relationship between discrete variable x(t) and continuous variable y(t) using a generalized linear model (GLM). The user seeks to account for the dependence of x(t) on changes in y(t), particularly when y(t) experiences significant fluctuations. Suggestions include normalizing x(t) to the range [0,1] and considering logistic or probit models, although the appropriateness of these models is questioned. The importance of understanding the underlying physical system to inform model selection is emphasized.

PREREQUISITES
  • Understanding of generalized linear models (GLM)
  • Familiarity with logistic and probit regression techniques
  • Knowledge of data normalization methods
  • Basic statistics for analyzing distributions and stochastic processes
NEXT STEPS
  • Research the application of logistic regression for modeling binary outcomes
  • Explore probit models and their suitability for discrete data
  • Learn about data normalization techniques for discrete variables
  • Investigate the role of underlying physical systems in model selection
USEFUL FOR

Data scientists, statisticians, and researchers interested in modeling relationships between discrete and continuous variables using generalized linear models.

Cexy
Messages
70
Reaction score
1
I have a data set {X(t) = (x(t), y(t))}_t=1,...,N and I'm interested in modelling the changes from t to t+1, using some metric d(X(t),X(t+1))

The issue is that x(t) has some dependence on y(t), and I'd like to account for this: if there is a large change in y(t) we expect there to be a corresponding change in x(t), and I'd like the metric to account for this expected change.

The problem is that x(t) takes only discrete values (in fact it is almost always 1 or 2, with a small probability of being any other integer greater than 2) whereas y(t) is a positive real number with an unknown distribution (although of course I can approximate the distribution with a histogram - I have a lot of data available).

What would be an appropriate model for the dependence of x(t) on y(t)?

I've thought about normalizing x(t) to be in the range [0,1] and using a logistic or probit model, but I really have no idea how appropriate this is.

Any ideas?
 
Physics news on Phys.org
Cexy said:
...What would be an appropriate model for the dependence of x(t) on y(t)?...

It's difficult to model a general stochastic process by data exploration alone. If anything is known about the underlying physical system, so that appropriate assumptions can be made, it will help in selecting a range of candidate models (possibly including but not limited to GLM).
 

Similar threads

  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 30 ·
2
Replies
30
Views
5K
  • · Replies 22 ·
Replies
22
Views
4K
  • · Replies 9 ·
Replies
9
Views
2K
  • · Replies 7 ·
Replies
7
Views
2K
Replies
3
Views
3K
  • · Replies 8 ·
Replies
8
Views
3K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 16 ·
Replies
16
Views
2K
  • · Replies 8 ·
Replies
8
Views
3K