I think it would have helped if they stated that the 2nd term is a total time derivative 'of a function of coordinates and time' ...

df(x,t)/dt = (∂f/∂x) * dx/dt + ∂f/∂t   (total derivative on the left, partial derivatives on the right)

Since f does not depend on the velocities, neither do its partial derivatives ∂f/∂x and ∂f/∂t, so the total derivative df/dt depends on v = dx/dt only linearly: v appears once, to the first power, multiplied by ∂f/∂x.
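
A quick concrete case may make this easier to see. The choice f(x,t) = x^2 t below is only an illustrative assumption on my part, not something taken from the text being discussed:

```latex
\[
f(x,t) = x^{2} t
\quad\Longrightarrow\quad
\frac{df}{dt}
  = \frac{\partial f}{\partial x}\,\frac{dx}{dt} + \frac{\partial f}{\partial t}
  = 2xt\,v + x^{2},
\qquad v = \frac{dx}{dt}.
\]
```

The coefficient 2xt and the leftover term x^2 depend only on x and t, so v shows up to the first power and nowhere else.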

