@Dale
As I suggested earlier, this thread would be better suited for the Electrical Engineering forum section. The reason is simple: this circuit is just a parallel RC filter, AC-coupled to a step voltage generator through C₁.
It's a textbook circuit that every EE student solves in their first year. The capacitive divider sets the initial condition at t=0⁺ (V₀/2 for equal capacitors), and then R discharges C₂ with time constant τ = RC₂.
In an EE forum, the answer would simply be "obviously V₀/2."
That said, I appreciate the rigor of your analysis, and I agree with you on the pure mathematics: in the framework of classical real analysis, the derivative of the Heaviside function does not exist at t=0, so the ODE has no classical solution. That is formally correct.
However, circuit theory operates in a different mathematical framework, that of distributions (Schwartz, 1950) where:
u(t) is a well-defined object with Laplace transform 1/s
δ(t) = du/dt is a well-defined object with Laplace transform 1
ODEs with distributional forcing have unique solutions
Your equation from post n.5:
##0=\frac{v_0}{R}+C_2 \frac{dv_0}{dt} + C_1 \frac{d(v_0-v_I)}{dt}##
is perfectly solvable once we recognize that ##\frac{dv_I}{dt} = V_0\,\delta(t)##.
This is standard practice in every circuits textbook (Desoer & Kuh, Nilsson & Riedel, Hayt, Oppenheim & Willsky...).
Taking the Laplace transform (with zero initial conditions):
##0 = \frac{V_0(s)}{R} + (C_1+C_2)\,s\,V_0(s) - C_1\,V_0##
Solving for ##V_0(s)## with ##C_1 = C_2 = C##:
##V_0(s) = \frac{C\,V_0}{\frac{1}{R} + 2Cs} = \frac{V_0}{2} \cdot \frac{1}{s + \frac{1}{2RC}}##
Inverse transform:
##v_0(t) = \frac{V_0}{2}\,e^{-t/(2RC)}\,u(t)##
So ##V_{C1}(0^+) = V_{C2}(0^+) = V_0/2##
The initial value theorem confirms: ##\lim_{s\to\infty} s\,V_0(s) = V_0/2##
I understand that from a standpoint of mathematical rigor, the tools I'm using here (distributions, Laplace transform of the Heaviside function, δ(t) as a derivative) may feel uncomfortable in a Physics forum. But these are exactly the tools that would be used, without any debate, in an Electrical Engineering forum, because they are the standard language of circuit analysis.
I think we are both correct, just operating in different frameworks. Thank you for the insightful discussion, it highlights how the same problem can look very different depending on the mathematical tools one chooses to apply.