Find Year for Pop of 60.4M with Finite Diff Interp

  • Thread starter Thread starter Shakattack12
  • Start date Start date
  • Tags Tags
    Finite
Click For Summary

Homework Help Overview

The problem involves estimating the year when a country's population reaches 60.4 million, based on historical population data from 1930 to 1970. The original poster has attempted to use Newton Divided Differences Interpolation and provided a polynomial equation for the population.

Discussion Character

  • Exploratory, Assumption checking, Problem interpretation

Approaches and Questions Raised

  • Participants discuss the use of a difference table to extend the population data and question the reliability of extrapolating beyond the given data range. Some suggest that an exponential model may fit the data better than a polynomial.

Discussion Status

The discussion is ongoing, with various participants exploring different methods for estimating the population and questioning the validity of extrapolation. There is recognition of the limitations of the polynomial model and the potential for better fitting with other types of equations.

Contextual Notes

Participants note that extrapolating data outside the known range can lead to unreliable estimates and that confidence intervals for such extrapolations depend on statistical assumptions that may not hold true in this case.

Shakattack12
Messages
24
Reaction score
2

Homework Statement


The problem involves a population in a country:
year 1930 1940 1950 1960 1970
Pop 1.0 1.2 1.6 2.8 5.4
(millions)

Part A involved finding the population in 1920 using Newton Divided Differences Interpolation (SOLVED)
Part B requires finding the year when the population is 60.4 million. I went through the method below but apparently there is any easier more accurate way (not using a calculator). Please help.

Homework Equations


The equation for the population is:
y = 0.0001x^3 - 0.581x^2 + 1125.22x - 726412.4

The Attempt at a Solution


For part B I used Newtons formula and a divided differences table

Substitute in values using interpolation formula

x = g(y0) + (y-y0)g(y0y1) + (y-y0)(y-y1)g(y0y1y2)

Let y = 60.4
x = g(y0) + (60.4-y0)g(y0y1) + (60.4-y0)(60.4-y1)g(y0y1y2)

let y0 = 50.5125, g(y0) = 2015 …x = 2015 + (60.4-50.5125)x0.447928331 + (60.4-50.5125)(60.4-84)x-0.001538931

x = 2019.787993
This is an approximation. The actual year is 2020.
 
Physics news on Phys.org
Why not just write out the difference table, and extend the table to longer times?
 
Shakattack12 said:

Homework Statement


The problem involves a population in a country:
year 1930 1940 1950 1960 1970
Pop 1.0 1.2 1.6 2.8 5.4
(millions)

Part A involved finding the population in 1920 using Newton Divided Differences Interpolation (SOLVED)
Part B requires finding the year when the population is 60.4 million. I went through the method below but apparently there is any easier more accurate way (not using a calculator). Please help.

Homework Equations


The equation for the population is:
y = 0.0001x^3 - 0.581x^2 + 1125.22x - 726412.4

The Attempt at a Solution


For part B I used Newtons formula and a divided differences table

Substitute in values using interpolation formula

x = g(y0) + (y-y0)g(y0y1) + (y-y0)(y-y1)g(y0y1y2)

Let y = 60.4
x = g(y0) + (60.4-y0)g(y0y1) + (60.4-y0)(60.4-y1)g(y0y1y2)

let y0 = 50.5125, g(y0) = 2015 …x = 2015 + (60.4-50.5125)x0.447928331 + (60.4-50.5125)(60.4-84)x-0.001538931

x = 2019.787993
This is an approximation. The actual year is 2020.

No, the actual year is unknown and un-knowable, although 2020 might be a useful approximation. You are extrapolating data way outside its range, so cannot be 100% sure about anything! In regression theory, there are formulas for getting "confidence intervals" for extrapolation outside the input range, but they are dependent on some statistical assumptions about the nature of the data and the model errors.

In this case an exponential formula of the form ##P = a + b e^{c t}## fits much better than any polynomial of the form ##y = a + bt + ct^2 ## or ##y = a + bt + ct^2 + dt^3##, and also makes more sense theoretically. However, finding the optimal parameters ##a,b,c## of the exponential fit requires solving a nonlinear "least-squares" problem, so needs modern software tools. (Also: I am not sure how to get confidence intervals, etc., for nonlinear fit models, because the usual regression equations do not apply.)
 
A difference table, as suggested by Chet, does show that the given data can be represented exactly by a cubic polynomial.

As Ray points out, as a practical matter it is dangerous to extrapolate.
 
SammyS said:
A difference table, as suggested by Chet, does show that the given data can be represented exactly by a cubic polynomial.

As Ray points out, as a practical matter it is dangerous to extrapolate.

OK: maybe I should not have said "fits much better than the cubic", since one can find a cubic going exactly through all the points. However, when you extend the cubic to values below 1930 the plot starts to look ridiculous and goes negative below about 1917.
 
Ray Vickson said:
OK: maybe I should not have said "fits much better than the cubic", since one can find a cubic going exactly through all the points. However, when you extend the cubic to values below 1930 the plot starts to look ridiculous and goes negative below about 1917.
Quite so.
The first requirement of a mathematical model for a physical process is that it makes at least vague theoretical sense. Fitting to the data is secondary.
 
Here are the lines of the simple difference table I was referring to in post #2.

##1.0\ \ \ 1.2\ \ \ 1.6\ \ \ 2.8\ \ \ 5.4\ \ \ 10.0\ \ \ 17.2\ \ \ 27.6\ \ \ 41.8\ \ \ 60.4##
##\ \ \ \ 0.2\ \ \ 0.4\ \ \ 1.2\ \ \ 2.6\ \ \ 4.6\ \ \ \ 7.2\ \ \ \ 10.4\ \ \ 14.2\ \ \ 18.6##
##\ \ \ \ \ \ \ \ 0.2\ \ \ 0.8\ \ \ 1.4\ \ \ 2.0\ \ \ 2.6\ \ \ \ 3.2\ \ \ \ \ 3.8\ \ \ \ \ 4.4##
##\ \ \ \ \ \ \ \ \ \ \ 0.6\ \ \ \ 0.6\ \ \ 0.6\ \ \ 0.6\ \ \ \ 0.6\ \ \ \ 0.6\ \ \ \ \ 0.6##

It took me less than 5 min to do this table.
 

Similar threads

  • · Replies 6 ·
Replies
6
Views
5K