The original theorems 7.5 and 7.7, are almost trivial, as the proof in Apostol shows. I.e. existence, or the fact that e^tA solves u' = Au, is just a matter of knowing how to differentiate the exponential function. the uniqueness is the same argument used in beginning calculus, to show that the only functions with derivative zero are constants. I.e. assuming u' = au, look at h = u/e^at. By the quotient (or product) rule we get h' = [u'e^at - uae^at]/ e^2at = [aue^at-uae^at]/e^2at = 0. Hence h = constant c, so h = c.e^at. and c is determined by the initial condition.
But the question was why they are both stated separately, for which I stick with pedagogy, i.e. learning is facilitated by repetition, in particular it is useful to see how to specialize general statements. The OP is thus to be commended for seeing that the statements are in theory needlessly repetitious.
If you are interested in more insight as to the proofs in general, the usual argument for the existence, is by a sequence of approximations, which also can be used to see uniqueness. I.e. given a (possibly time dependent) vector field f(t,x), (where x is an arbitrary element of some vector space E) and hence differential equation u'(t) = f(t,u(t)) with initial condition u(0) = b, where u is a function from the reals to the vector space E, then u solves the differential equation if and only if it solves the integral equation u(t) = b + integral from 0 to t, of f(s,u(s))ds.
Hence u is a solution if and only if u is a "fixed point" of the operator H taking any function u to the function H(u) = b + integral from 0 to t, of f(s,u(s))ds.
Moreover, with appropriate condition on the domain, and on f, this operator H is a "contraction" (of the space of functions u from some t-interval to E), hence does have a unique fixed point. I.e. the contraction H gradually squeezes the whole space down to that one point, which itself is left fixed, and this fixed point is the solution to both the integral and differential equation. Thus one can begin from any point u0 at all in the function space, and repeated application of the operator H will give a sequence u0, H(u0) = u1, H(u1) = u2,...converging to the fixed point.
In the simple case of f(t,u(t)) = A.u(t), and u(0) = B, we get the operator H taking a function u to the function H(u) = [B + integral from 0 to t, of A.u(s)ds].
Beginning from the constant function u0 = B, and applying the operator H repeatedly, we get the sequence of approximations:
u0 = B,
u1 = H(u0) = H(B) = B + ABt,
u2 = H(u1) = H(B+ABt) = B + ABt + A^2B.t^2/2,
u3 = H(u2) = H(B+ABt+A^2B.t^2/2) = B + ABt + A^2B.t^2/2 + A^3B.t^3/3!,
.......
This is the famous exponential sequence, converging to (e^At).B = (e^tA).B.
Here we can take B in any Banach space E, i.e. E any vector space of any dimension, even infinite, which has a notion of "length" or norm" for which convergence of Cauchy sequences occurs, and A any continuous (i.e. "bounded") linear transformation on E.
Technical note: An interesting subtlety arises here that I got wrong in an earlier comment, (since deleted). Namely in order to guarantee convergence of the sequence {H(uj)} we must let H act on a space of merely continuous functions u from a t-interval to a closed bounded ball in E. I.e. a uniform limit of smooth functions may not be smooth in general. But in this case, even if u is merely assumed continuous, but also a fixed point of H, we have u = H(u). Then since H(u) is an integral of a continuous integrand, H(u) must be smooth by the fundamental theorem of calculus, hence u = H(u) forces the fix point u to also be smooth.
[I have not defined a contraction, but it is an operator H which in particular is continuous, so It follows that if u is the limit of {H(uj)}, then H(u) is also the limit of {H(uj)} hence u = H(u). The definition of contraction moreover forces the sequence {H(uj)} to be Cauchy, hence convergent under our assumptions.
Defn: H is a contraction of the metric space S, (with distance function d), iff H:S-->S and there is a constant c, with 0 < c < 1, such that for every pair of points x,y in S, we have d(H(x),H(y)) ≤ c.d(x,y). e.g this holds if the distance between any pair of points is cut in half by applying H.]