think of a curve. a finite piece of that curve is called deltay. if you think of the curve as made up of an infinite number of infinitely short pieces, each one of which is straight, but infinitely short, like a polygon with an infinite number of super short sides, then one of those tiny sides is called dy.now if y is linear, then not only dy, the infinitely short piece of curve, but also deltay, the finite piece of curve, are both straight. that's why, since x is a linear function of x, that deltax is hard to distinguish from dx.
now since the language above is logically nonsense, to make sense of it we try to associate in some meaningful way, a straight line to each point on a curve, and of course we use the tangent line at that point, since it supposedly has the same direction as an infinitely short piece of our curve.
so at each point of a curve, dy should somehow represent the tangent line to the curve y(x) at that point.
so here we go with the correct, but complicated meaning of dy.
if y(x) is a function of x, then deltay is a function of two variables, whose value at x,h is y(x+h)- y(x).
dy is also a function of two variables, whose value at x,h is y'(x).h.
then at the point x, deltay and dy are both functions of h, and if y is a differentiable function of x, then dy(x,h) is a good approximation to deltay(x,h) when h is small.
since x'(x) = 1, this makes deltax and dx the same function of x and h, namely both equal h for every x.
but there is a more abstract version of dy too. namely, even if we do not have coordinates x chosen in our domain space, e.g if the x-axis has no unit chosen on it, then a function y still has a graph which is a curve in the x,y plane, and this curve can have a tangent line at each point. then dy is the function of x whose value at each point x is the linear function whose graph is the tangent line to the graph of y.so for each function f, df is a family of linear functions, one for each point of the domain of f, and at each such point p, the linear function df(p) is the function whose graph is the tangent to the graph of f at the point (p,f(p)).notice that this makes sense even if there are no units on the y axis, but deltay does not.see how much nicer it was to think dy was just an infinitesimally short, hence straight, piece of the graph of y?