They are not the same. There's three common tests for statistical significance which give Z scores, t scores or chi square scores for three distributions: the Gaussian (normal), the T and the chi square. The latter two are more precise for small samples and are sensitive to the number of comparisons or degrees of freedom. (means and SDs are summary statistics). Both the T and the (central)chi square distributions happen to converge to the Gaussian for larger sample sizes or number of comparisons. Generally samples of about 30 or more are adequate for Z score testing when the distribution of values within samples are assumed to be well behaved (few or no outliers).
The p values of each of the three are calculated with respect to their own distributions, but will converge to the Z score calculation with sufficiently large samples. Without knowing the shape of your distributions, I thought the t-statistic was probably a bit better.