That's a very tangentially related use of the term. Just use the definitions in the book you are reading.
Or if we are allowed to change definitions mid-sentence, both covariant and contravariant vectors transform covariantly:P
OK, to be more serious, let's imagine we have a 2D surface covered by coordinates (x,y). Imagine that each point has a different temperature f(x,y). A vehicle, carrying a clock which reads time t moves across the surface making a curve (x(t),y(t)). The variation in temperature versus time that the vehicle experiences is df/dt which is just one dimensional calculus. At a point p, df/dt=df/dx.dx/dt+df/dy.dy.dt, a scalar which we can rewrite as a a row vector (df/dx,df/dy) multiplied by a column vector (dx/dt,dy/dt), with all derivatives evaluated at p. The column vector or contravariant vector is something like the velocity, and the row vector or covariant vector is something like the gradient of the temperature.
We don't conceive of the velocity at that point as belonging to only one curve, since many curves can have the same velocity at that point. We also conceive of the velocity of any particular curve being the same under a change of coordinates from (x,y) to (U(x,y),V(x,y)). The same temperature variation is now described by f(U,V), and the same path is now described by (U(t),V(t)). So the new column vector representing the same velocity will be (dU/dt,dV/dt)=(dU/dx.dx/dt+dU/dy.dy/dt,dV/dx.dx.dt+dV/dy.dy/dt), which is how the coordinate representation of a contravariant vector transforms. Similarly, the new row vector representing the same gradient will be (df/dU,df/dV)=(df/dx.dx/dU+df/dy.dy/dU,df/dx.dx/dV+df/dy.dy/dV), which is how the coordinate representation of a covariant vector transforms. df/dt=df/dU.dU/dt+df/dV.dV/dt remains unchanged.
Something to keep in mind for later, when the metric is introduced: at this stage, we can multiply row and column vectors, but we haven't defined what it means to "multiply" column vectors, which is a job the metric can do.