neurocomp2003 said:
but how do you use the terms false and x,X to define it?
Did you see the earlier mentions of theories, structures, and models? I think you'd be interested in this, so we can run through what these things are and how they are related. I'm still piecing some of these things together myself, but I'll try not to stray too far into those areas.
Every formal theory is connected to a language. (By some definitions, a formal theory itself is a language, but we'll stick with the more inclusive usage.) We can speak just as generally about L-theories as we can about formal theories by letting L be an arbitrary formal language. An L-theory is a set of formulas of L. We build L from a set of symbols. The symbols are undefined primitives, whose role loseyourname already described. We form strings out of our symbols by simply stringing some number of symbols together in some order. (Note that, to keep L arbitrary, we'll let strings be of any length, finite or infinite, but strings are usually restricted to finite lengths.) Formulas are special strings that have some form, some pattern or orderliness, that we want to take advantage of. They are the well-formed strings, strings that meet some well-formedness conditions.
What makes an L-theory special as a set of formulas is that it is closed under a special relation. This relation can be any of several related ones, depending on the details of the treatment, and they go by several names: implication, entailment, consequence, deducibility, syntactic entailment, semantic entailment, formal consequence, logical consequence, and so on. What these relations have in common is the idea of one formula following from other formulas. One formula might follow from the others because you can prove it from them or because its truth-value is related in a special way to their truth-values or for some other similar reason.
So an L-theory is a set of formulas of L such that if a formula
f follows from any set of formulas already in the theory, then
f gets put into the theory as well. And it all starts with just a set of symbols:
Set of symbols --> set of strings --> set of formulas --> set of formulas closed under entailment relation.
Is this a comfortable foundation for you? Meaning and truth haven't entered the picture yet. L-theories technically involve only meaningless symbols being manipulated mechanically according to formal rules. It's all just a bunch of operations and relations on a set of symbols, which we are describing and studying from up above in our metalanguage (English or whathaveyou) and our metatheory (first-order logic and set theory or whathaveyou).
I'm not sure how to describe the situation of, say, using set theory as both your metatheory and object theory. You could perhaps recast it as languages talking about themselves. Natural languages are rich enough to talk about themselves, to incorporate their own metalanguage, and you could perhaps look at formal languages as being a kind of refinement within the natural language. Maybe your object language is in fact a model of your metalanguage description of it. It seems a lot of what you're doing is just narrowing down the possible interpretations, making your language more precise, and I guess you might be building some new things as well, but derp, this is an area where I didn't want to stray. So anywho, back to safer ground...
A formal language L that you want to use for set theory, or, rather, an L-theory of sets, will contain some special symbols, variable symbols, which is what
x and
X are functioning as in the example formula that you're asking about.
A structure is the thing that let's you interpret the formulas of an L-theory and assign truth-values to them. It let's you give the formulas meaning. For example, suppose you have an L-theory of equivalence relations, where L is a first-order language with one nonlogical binary predicate symbol, denoted by
P. One axiomatization of this L-theory could be
(A1) \forall x \ [Pxx]
(A2) \forall x, y \ [Pxy \ \rightarrow \ Pyx]
(A3) \forall x, y, z \ [(Pxy \ \wedge \ Pyz) \ \rightarrow \ Pxz]
(An axiomatization of an L-theory consists of your rules of inference and logical theorems, which you'll recall are usually left implied in the background, together with a set of formulas from which all other formulas in that L-theory follow. Also, I'm hoping that you recognize those axioms as saying that
P is reflexive, symmetric, and transitive.) A set
A with the identity relation
R = {(x, x) : x in A} defined on it would be a structure that would let you interpret your L-theory of equivalence relations. It let's you interpret your theory because it has a binary relation,
R, to match up with your binary predicate symbol,
P.
The situation with structures is similar to the one with theories in that we connect structures with a language in order to use them. An L-structure is a structure that can be used to interpret all of the symbols of a language L.
The interpretation and truth-value assignment are done with functions, but you can use different definitions depending on your purposes, and the form will depend on the form of the language that you're interpreting. The most general form of an L-structure that I can think of is an ordered pair (
A,
I), where
A is your underlying set, or domain, which contains the individuals of your structure, and
I is the set of functions that use
A to interpret the symbols of your L-theory and assign truth-values to your formulas. The variations on this (
A,
I) pair would split
I up into different functions or sets of functions. For example, you might separate out the truth-assigning functions (commonly called an L-valuation) or, if L has constant symbols, you could specify that some function maps your constant symbols to individuals in your domain. For simplicity, we'll keep everything together under the umbrella of an L-structure.
If an L-structure assigns a truth-value of
true, or whatever value we have chosen to correspond to truth, to a formula, we say that the structure models that formula or is a model of that formula. Similarly, if a structure interprets every formula in a set of formulas to be true, we say that it models that set of formulas. Recall that a theory is a set of formulas. So, for example, a model of an L-theory of sets is an L-structure that interprets every formula in that L-theory as being true. If we turn our earlier example structure of a set with the identity relation into a suitable L-structure, it is a model of our L-theory of equivalence relations because the identity relation does indeed satisfy the equivalence relation axioms, and due to the properties of and relations among the entailment relations of first-order logic and the axiomatization of our L-theory, any structure that is a model of our axioms is also a model of our entire theory; if it makes the axioms of our theory true, it must make all of the other formulas of our theory true as well.
So if I finally try to answer your question by saying that the empty set is any set that satisfies the Axiom of the Empty Set, I'm saying two main things:
(i) I have an L-theory that contains a formula that I've called 'the Axiom of the Empty Set'. I just wrote down a formula in my formal language L. The axiom has no meaning. It is simply, say, the following string of symbols:
(AES) \exists X \forall x \ [\neg(x \in X)],
(ii) I have an L-structure that is a model of that L-theory, and the domain of my structure contains an individual that, when assigned to the
X symbol in (AES), makes (AES) work out to be true.
For a simplified example, which isn't a model of every axiom of set theory, we could let our structure's domain
A = {a, b, c} and assign to our binary membership symbol, \in, the binary relation
R = {(a, b), (b, c), (c, c)}. We interpret the string x \in y to mean that the ordered pair (x, y) is in
R. An empty set is then any member of
A that doesn't show up as the second argument in any pair in
R. The empty set of our example structure is
a.
Does that answer make sense?
By the bye, (AES) is just another way of saying what matt already said -- we wanted to state it that way to fit in with our setup. Also, I didn't want to distract you by mentioning this eariler, but there is also an empty string, whose length is 0. It is a nice thing to have. For example, it is the identity element for the binary string concatenation operation.