I'm not sure where you are going with this.
To see what I mean, let $F(A)$ be the free semigroup on the set $\{A\}$ and $F(B)$ be the free semigroup on the set $\{B\}$.
Clearly both semigroups are sub-semigroups of $F(A,B)$. The problem is that $F(A) \cup F(B)$ is not, and adding the empty word to this union doesn't change this, we still have a problem with the product $AB$, for example. This problem exists even though we can embed both semi-groups in the semi-ring $\Bbb N[F(A,B)]$, where your proposed closure and distributive properties both hold for $G = \Bbb N[F(A)]$ and $H = \Bbb N[F(B)]$.
In general, algebraic operations perform poorly with respect to unions (because of closure issues). The union generally isn't "big enough" to be the join in the lattice of substructures.
If you are already stipulating that $G \cup H$ is already closed and associative under $\cdot$, there is nothing to prove, we can always create a monoid by "adjoining a 0", and defining:
$x \cdot 0 = 0 \cdot x = x$ for all $x \in G \cup H$.
IF the semi-groups $G,H$ are part of a larger structure that already HAS a multiplicative identity, $e$, then we can run into problems:
Consider $\Bbb N \times \Bbb N$ with the operations:
$(a,b) + (a',b') = (a+a',b+b')$
$(a,b) \cdot (a',b') = (aa',bb')$
Now we can form a monoid from the union of $G = \{(k,k):k \in \Bbb N\}$ and $H = \{(k,0): k \in \Bbb N\}$, taking the unit to be that of $G$.
The problem is, $H$ is not a sub-monoid of $G \cup H$, that is, the inclusion mapping is no longer a monoid homomorphism, because the identity of $H$ is $(1,0)$ which does not map to the identity $(1,1)$ of $G \cup H$.
If, in the example above, we instead took $G = \{(0,k): k \in \Bbb N\}$, then we again get that $G \cup H$ is a semigroup, and we also have that $G \cup H \cup (1,1)$ is a monoid, but now neither of $G,H$ is a submonoid of $G \cup H \cup (1,1)$.
That is, even though the semi-groups structures of $G,H$ are compatible with the semi-group structure of $G \cup H$, the monoid structures are not. Depending on your point of view, this is either a minor inconvenience, or a fatal flaw.
I suspect less trivial examples could be formed with $\text{End}_F(F^n)$ under matrix multiplication, but alas I don't have the time at the moment to look for some.