Need advice applying a logistic regression to model school data

Click For Summary
SUMMARY

This discussion focuses on applying logistic regression to analyze MEAP data from 2005-2009, specifically to assess trends in student performance meeting or exceeding standards in grades 3-5. The user graphed the data and proposed a logistic growth model, f(t) = 1 / (1 + b*e^(-a*t)), to capture the non-linear progression of student scores. They seek to determine whether the schools are on the concave-up or concave-down portion of the curve and consider incorporating a t^2 term in their regression model to better understand the data's behavior. The user expresses a need for guidance on implementing a non-linear regression approach.

PREREQUISITES
  • Understanding of logistic regression and its applications
  • Familiarity with MEAP data and educational performance metrics
  • Knowledge of non-linear modeling techniques
  • Basic proficiency in statistical analysis and graphing
NEXT STEPS
  • Research how to implement non-linear regression models in Python using libraries like SciPy or StatsModels
  • Explore the implications of adding polynomial terms in regression analysis
  • Learn about hypothesis testing in the context of logistic regression
  • Investigate the application of Student's t-distribution for analyzing educational data
USEFUL FOR

Data analysts, educators, and researchers interested in modeling educational performance trends using statistical methods, particularly those focusing on logistic regression and non-linear analysis.

lcary
Messages
6
Reaction score
0
To determine whether schools are increasing the amount of students meeting or
exceeding standards, I obtained a MEAP database reporting the amount of students that
scored in the met or exceed standards range during years 2005-2009, for grades 3-5. I then
graphed the data for using numbers 1 through 5 representing the years since the data
started in the x-axis and and the number of students that met or exceeded standards in the y axis.

I am trying to look at progress over time not just as a linear increase, but as something that might speed up then slow down. In particular, any measurement like % at-or-above-basic-proficiency is limited to between 0% and 100% seems to apply here. A simple model for how it might grow in time would be

f(t) = 1 / (1 + b*e^(-a*t) )

-I tried graphing this for simple values like b=1, a=1 on the range t=-2 to t=+2)
Notice it starts out concave-up on the left, and then turns to be concave-down for larger values of t.

I am trying to ask are schools on the concave-up portion of the curve, or the concave-down portion?

I think I could get a partial answer by including a t^2 term in a regression model, then look at the sign of its coefficient--though there are some dangers, like if the coefficient is positive we might be on the downslope of the parabola, with scores decreasing in time! By the way, I am using t=5 meaning the year 2005.
Overall, I am trying to use logistic regression but I am lost on how to try to apply a non linear program of some sort to find a good model.



any advice?
 
Physics news on Phys.org
If you drop the linearity requirement, then something must replace it. You could e.g. develop a hypothesis, i.e. a curve which you suspect to be a good model, then test it with some standard test, or simply analyse the mean distances of your data. Student's t-distribution might be a good point to start with.
 

Similar threads

  • · Replies 4 ·
Replies
4
Views
1K
  • · Replies 23 ·
Replies
23
Views
4K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 8 ·
Replies
8
Views
3K
  • · Replies 9 ·
Replies
9
Views
2K
  • · Replies 6 ·
Replies
6
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 2 ·
Replies
2
Views
3K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K