I A confusion about axioms and models

Click For Summary
The discussion revolves around the distinction between axioms and additional claims used to specify a unique model within first-order logic. Participants explore whether these additional claims should be classified as axioms or if they possess different properties that prevent such classification. They highlight the inconsistency in the use of the term "axiom" across various mathematical contexts, particularly in group theory, where defining properties may be termed axioms or simply conditions. The conversation also touches on the implications of the Löwenheim-Skolem theorem, questioning how unique models can be defined within first-order systems. Ultimately, the discourse reveals a broader confusion regarding the terminology and classification of axioms in mathematics.
  • #31
micromass said:
I don`t see why you would say that the real numbers form a model of ZFC?? If that`s what you`re asking, then yes this was definitely wrong.
Additional question: Since real numbers are not a model of ZFC, this means that real numbers do not satisfy some of the axioms of ZFC. Can you say what these un-satisfied axioms are? Obviously, one of them is the existence of the empty set (because empty set is not a real number), but what are the others?
 
Physics news on Phys.org
  • #32
Demystifier said:
Additional question: Since real numbers are not a model of ZFC, this means that real numbers do not satisfy some of the axioms of ZFC. Can you say what these un-satisfied axioms are? Obviously, one of them is the existence of the empty set (because empty set is not a real number), but what are the others?

The entire language doesn`t match. ZFC is a model of first-order logic with two relations: ##in## and ##=##.
The real numbers can be modeled as a logic with as relations ##=##, ##<## and with as operations ##+## and ##\cdot##.
So the two are incomparable.
 
  • Like
Likes fresh_42 and Demystifier
  • #33
@micromass Thank you for the enlightening discussion.
 
  • Like
Likes Greg Bernhardt and micromass
  • #34
fresh_42 said:
@micromass Thank you for the enlightening discussion.
From me too, though the conclusion is rather underwhelming (not that micromass is to blame for that).
 
  • Like
Likes micromass
  • #35
micromass said:
We have the axioms of group theory. these are a first order theory, and there are many models of this. Fine. Now we choose one model at random to work in. Now consider the statement `This group is commutative`. This is an unprovable statement. We choose to work in a specific model which might or might not be commutative. We don`t know. We can however find a submodel of our specific model which is commutative (the trivial group). This shows that commutativity cannot be disproven for groups. This also shows that commutativity is relative, submodels won`t respect commutativity or lack of it. This is an awkward way to look at group theory, since it is so easy to describe models of it.
But from a standard point of view, it seems rather absolute that group SO(2) is commutative and that SO(3) is not. So what exactly is the difference between the standard and this awkward point of view?
 
  • #36
Demystifier said:
But from a standard point of view, it seems rather absolute that group SO(2) is commutative and that SO(3) is not. So what exactly is the difference between the standard and this awkward point of view?
The part of the comparison to group theory was the one I had difficulties with, too.
Then rereading it, I interpreted it this way: the starting point was group theory alone, not group theory within ZFC.
Otherwise I had the same question.
 
  • Like
Likes micromass
  • #37
Demystifier said:
But from a standard point of view, it seems rather absolute that group SO(2) is commutative and that SO(3) is not. So what exactly is the difference between the standard and this awkward point of view?

That`s the difference between set theory and group theory. In group theory, you can exhibit a very specific model. Then you can prove everything you want about that model. In set theory however, we can never describe a model at all. We can only go by the axioms.
Try to do group theory like this: you know you`re working in some group, meaning that the usual axioms are satisfied. But outside of this, you have no clue about the specifics of the group. You cannot construct groups other than the one you have now. That is the point of view with which we do set theory.
 
  • Like
Likes Demystifier
  • #38
fresh_42 said:
The part of the comparison to group theory was the one I had difficulties with, too.
Then rereading it, I interpreted it this way: the starting point was group theory alone, not group theory within ZFC.
Otherwise I had the same question.

I understand it`s difficult. But the analogy between set theory and group theory is a really really good one. It helped me significantly.
Your interpretation is correct by the way. You start only from one single group, for which you know the axioms are valid, and nothing else.
 
  • #39
micromass said:
In set theory however, we can never describe a model at all.
Then why don't they tell us that in the textbooks? :mad:

But here is a more interesting question. What if I remove the axiom which claims existence of an infinite set? Would it be possible to explicitly describe a model for such a reduced ZFC?
Or even more interestingly, is it possible to remove some other axiom (while retaining infinite-set axiom), such that again a model can be described explicitly?
 
  • #40
Demystifier said:
Then why don't they tell us that in the textbooks? :mad:

But here is a more interesting question. What if I remove the axiom which claims existence of an infinite set? Would it be possible to explicitly describe a model for such a reduced ZFC?

Depends on what you mean with describe, but I`d say yes. You can explicitely describe all the sets in such a universe.

Or even more interestingly, is it possible to remove some other axiom (while retaining infinite-set axiom), such that again a model can be described explicitly?

That seems much less likely.
 
  • Like
Likes Demystifier
  • #41
Wiki states that Goedel's completeness theorem says: each syntactically consistent theory (set of closed formulas without contradiction) has a model. Shouldn't this apply to any reduced ZFC system as long as consistency can be proved?
 
  • #42
fresh_42 said:
Wiki states that Goedel's completeness theorem says: each syntactically consistent theory (set of closed formulas without contradiction) has a model. Shouldn't this apply to any reduced ZFC system as long as consistency can be proved?

But consistency can`t be proved. Otherwise, yes.
 
  • #44
I thought it answers your question, at least to some extend as it supplies a sufficient condition.

Demystifier said:
Or even more interestingly, is it possible to remove some other axiom (while retaining infinite-set axiom), such that again a model can be described explicitly?

(To be honest: I haven't thought about whether every reduction of ZFC cannot be proven consistent.)
 
  • #46
Demystifier said:
@micromass, if higher-order theory is necessary for unique definition of mathematical models, then why most textbooks on logic say almost nothing about higher-order logic and claim that first-order logic is sufficient for most of mathematics?

First order logic has the Goedel "Completeness" Theorem, which second order logic does not. The notion of "Completeness" in the "Completeness Theorem" is not the same as that in the "Incompleteness Theorem". Mathematicians have much worse terminology than physicists :P

https://en.wikipedia.org/wiki/Gödel's_completeness_theorem
 
  • #47
atyy said:
Mathematicians have much worse terminology than physicists :P
:oldsmile:

Now that I have seen how mathematicians use the term "axiom", I have (with great sadness) to somewhat agree.
 
  • #48
  • #49
Demystifier said:
Suppose that I have a set of axioms in first-order logic. And suppose that I have several inequivalent models for this set of axioms. And suppose that I want to choose one specific model. To choose it, I need to make some additional claims which specify my model uniquely.

My question is the following: What kind of claims these additional claims are? Are they some additional axioms? Or are they claims which are not classified as axioms? If they are not classified as axioms, what property do they have/lack so that they cannot be classified as axioms?

If the question looks too abstract, let me consider an example. Suppose that I start from axioms of group theory. There are many different groups satisfying these axioms. So I choose some specific group, say SO(3), defined by some claims which define that group. Are these claims also axioms? If not, then what property do they have/lack so that they cannot be called axioms?

Could it be that my confusion stems from the fact that the word "axiom" in logic has more narrow meaning than the word "axiom" in the rest of mathematics?

I spent a fair amount of time pretending to do mathematical logic, and I think I can answer this question.

The difference between a theory and a model is that a theory is a collection of mathematical statements, while a model is a mathematical object. Of course, to reason about a mathematical object, you need a characterization of that object, and that requires statements, too. But usually, what in mathematical logic is considered a "model" is some specification of a mathematical structure in a way that is believed to uniquely characterize that structure (or sometimes, up to isomorphism).

Coming up with a model is often along the lines of a construction--a concrete recipe for building that object. Of course, the recipe often involves steps that cannot be carried out by a human being, but would require some kind of godlike being that can do infinitely many things in a finite amount of time.

Maybe it's best to see an example:

To axiomatize the natural numbers, you might give the following (Peano axioms): (I'll use x' to mean the next number after x)

1. x+0 = x
2. x+y' = (x+y)'
3. x*0 = 0
4. x*y' = (x*y) + x
5. 0 \neq x'
6. x' = y' \Rightarrow x=y

There's actually an induction schema, too, but I'll skip that for now.

I think it's obvious that the interpretation of the domain as the natural numbers 0, 1, 2, ..., and the interpretation of x' as x+1, and the interpretation of "+" as ordinary addition, and "*" as ordinary multiplication works. But the question is, do those axioms uniquely specify a mathematical structure? The answer is no. To see this, you can see that the axioms don't rule out "hyperfinite" objects that are infinitely far removed from 0. More concretely, let's add a new constant symbol, \infty, and add the (infinitely many) axioms:

\infty \neq 0
\infty \neq 0&#039;
\infty \neq 0&#039;&#039;
\infty \neq 0&#039;&#039;&#039;
etc.

(Having an infinite number of axioms might seem like cheating, but when they follow a simple pattern, as they usually do with axiom schema's, an infinite number of axioms is no harder to use in theorem-proving from than a finite number)

It's clear that this new theory is not about the (finite) natural numbers, since it has an object, \infty, that is not anyone of the natural numbers. But the original set of axioms didn't rule out the possibility of such an object, so the original axioms didn't uniquely characterize the natural numbers, either.

It turns out that you can't possibly uniquely characterize the natural numbers using only first-order language. That is, using only statements involving +, *, 0, x', etc., you can't axiomatize the truths about the natural numbers.

But we can (allowing godlike operations) construct a model of the natural numbers. Roughly speaking:

Let A be any set, and let f: A \rightarrow A be any unary function on A that is one-to-one, but not onto. Let Z be any element of A that is not in the image of f. If A&#039; is a subset of A, call it "closed" if Z is in A&#039; and whenever x is in A&#039;, so is f(x). Finally, we define N = the set of all x that are in every closed subset of A.

Then N is a model of the natural numbers. It was definitely not obtained by adding more axioms to the Peano axioms. The language used to "construct" N doesn't even mention the original axioms.

A model is definitely not a collection of axioms. It's a mathematical object.
 
  • Like
Likes fresh_42
  • #50
stevendaryl said:
The difference between a theory and a model is that a theory is a collection of mathematical statements, while a model is a mathematical object.
IMHO, we can also claim : A theory is a set of mathematical formula provable from a set of chosen axioms and the logic used (here first-order logic). Thus It is also a mathematical object. A mathematical syntactic object. A model is use to give a semantic (e.g. an interpretation ) to the theory. The model theory is the semantic of the formal discourse.

For example from the group axioms (syntax) https://proofwiki.org/wiki/Definition:Group_Axioms we can built models (semantic) that we call group which is a "concrete realization" of the axioms. There are many different groups satisfying the group axioms, and they come in all shapes and sizes.

The Gödel's[/PLAIN] completeness theorem give a relationship between theory and model.

Patrick
https://en.wikipedia.org/wiki/First-order_logic
 
Last edited by a moderator:
  • #51
stevendaryl said:
A model is definitely not a collection of axioms. It's a mathematical object.
A model is a semantic object, a theory is syntactic. A model for a theory maps the theory into a world where each of the sentences of the theory are
satisfied according to Tarski's satisfiability criterion.
 
  • #52
I am not sure if the OP's specific question is resolved, but I think I can answer it.

The OP is asking given a theory T, with a collection of nonisomorphic models, how to specify a subset of the models. I am unsure how much model theory does the OP know, and whether they have in mind are additional assumptions I missed. What I know below is from undergraduate model theory, so there may be more to it.

The first thing to check is whether T is complete. A complete theory is one which derives every sentence or it's negation. If T is not a complete theory, then there is some sentence σ of first order logic that is independent of T, which is to say that both T∪{σ} and T∪{~σ} are consistent, and by completeness have models. Then simply adding σ to you set of axioms will remove all models of T∪{~σ}, and you have specified a subset of the models of T.

An example of this is when T is the group axioms and σ is the sentence asserting commutative. This parses down from all groups to abelian groups

It is however, possible for T to be complete, yet have nonisomorphic models. In fact, it is a theorem of model theory that if T has an infnite model, it has models of arbitrarily large infinite cardinalities. An easy way to specify a subset of models would be to specify a cardinality, since models of different sizes can never be isomorphic. This specification will not be first order in general unless you specify a finite cardinality. In the previous example this may mean requiring your groups to have 3 elements, which would give you the cyclic group of order 3, or countable etc.

An example of a complete theory with nonisomorphic models is the theory of dense linear orders without endpoints. It has only one model in countable infinity, i.e. ℚ, and it is a theorem that if a theory has only one isomorphism class in some uncountable cardnality, then it must be complete. Several nonisomorphic dense linear orders without endpoints in cardinality continuum include ℝ and (a copy of ℝ followed by a copy of ℚ)

As an aside, In model theory, theories with only one model in cardinality λ are called λ-categorical. Categoricity is an important concept in model theory, as with completeness. There are other important concepts the OP may be interested in, including "stability" which roughly means that there are not too many elements that could exist but don't. An example of an unstability is in dense linear orders, because in the model ℚ there are only countable many elements, but you can describe uncountably many real numbers by dedekind cuts using only the language of order and parameters from ℚ.

Going back to the OP's question again, in general whatever mathematical property you want to add to a theory T to restrict the models, it will likely be first order, but in the language of sets rather than your original language. For example the property of being noetherian in a ring is not first order in the language of algebra, but you can formula it in terms of sets.

EDIT: Thinking of the OP's example of SO(3), you can never find first order axioms such that the only model is SO(3), because SO(3) is an infinite model, and you can build models of arbitrarily large cardinalities that satisfy the same first order sentences (and your axioms) as SO(3) by ultrapowers, or the compactness theorem etc.

stevendaryl said:
I spent a fair amount of time pretending to do mathematical logic, and I think I can answer this question.
A model is definitely not a collection of axioms. It's a mathematical object.

BTW this is true, but the completeness theorem constructs models by taking equivalence classes of strings of symbols. When I think about the completeness theorem, models in some sense literally become quotients of infinite lists of sentences, and this gives me weird philosophical thoughts about the nature of language, reality, etc.
 
Last edited:
  • Like
Likes atyy
  • #53
stevendaryl said:
an infinite number of axioms is no harder to use in theorem-proving from than a finite number
Perhaps it's not harder for a human, but what if we want a machine to prove the theorems?
 
  • #54
WWGD said:
A model is a semantic object, a theory is syntactic.
Both semantic and syntax are expressed as a collection of claims. Can a machine distinguish between semantics and syntax?
 
  • #55
Demystifier said:
Perhaps it's not harder for a human, but what if we want a machine to prove the theorems?

Well, theorem-proving is a tremendously complicated thing that potentially would require artificial intelligence to do a really good job. But proof-checking is pretty straight-forward: A proof is a sequence of claims, and each claim must either be an axiom, or must follow from previous claims by the rules of inference. Such a sequence is a proof of the last claim. So as long as there is an algorithm to determine whether a claim is an axiom, then you can do proof checking.

If you don't care about efficiency, and have trillions of years to wait, then a proof checker can be turned into a theorem prover, by just generating all possible sequences of claims and checking each sequence to see if it's a proof, and if so, then you've got a proof of the last claim.
 
  • #56
@stevendaryl I was asking about theorems (to be proved by a machine) which follow from an infinite number of axioms. And I am not interested in efficiency, only if it is possible in principle.

I think that machine can make such theorems only if they are reformulated as a finite number of axioms, which may require second-order logic.
 
  • #57
Demystifier said:
@stevendaryl I was asking about theorems (to be proved by a machine) which follow from an infinite number of axioms. And I am not interested in efficiency, only if it is possible in principle.

I think that machine can make such theorems only if they are reformulated as a finite number of axioms, which may require second-order logic.

As I said, the only requirement is that an axiom be recognizable as an axiom. That is, there has to be an algorithm that, given an arbitrary statement in the language, will eventually halt in an accepting state if that statement is an axiom. (It's actually not required that it be able to reject a claim as definitely not an axiom).

I suppose in some sense, it might be true that every theory with an infinite number of axioms can be recast as a theory with a finite number of axioms, if you allow higher-order constructs, but there is no need to do that.
 
  • #58
stevendaryl said:
As I said, the only requirement is that an axiom be recognizable as an axiom. That is, there has to be an algorithm that, given an arbitrary statement in the language, will eventually halt in an accepting state if that statement is an axiom. (It's actually not required that it be able to reject a claim as definitely not an axiom).

I suppose in some sense, it might be true that every theory with an infinite number of axioms can be recast as a theory with a finite number of axioms, if you allow higher-order constructs, but there is no need to do that.

I should point out that proofs are finite objects, so even if there are infinitely many axioms, any given proof only uses finitely many axioms.
 
  • #59
Demystifier said:
Both semantic and syntax are expressed as a collection of claims. Can a machine distinguish between semantics and syntax?

No, it's all syntax. Perhaps this is a way to explain it that might make sense.

I believe that there is a sense in which model theory is equivalent to a relative interpretation. Relative interpretation is maybe easier to explain than model theory.

So you start with some theory that you think you understand--say arithmetic, or set theory. This theory has certain function symbols, relation symbols, and constants. It has certain axioms. Actually, it's simpler to just have relation symbols.

Now, you have a second theory in a different language. It has different relation symbols and different axioms.

So to interpret the second theory in the first is pretty straightforward:
  • For every n-ary relation symbol R in the new theory, you come up with a formula with free variables \Phi(x_1, x_2, ..., x_n) in the old theory.
  • You come up with a unary formula U(x) in the old theory (the interpretation being that x is an element of the "universe" of the new theory)
  • Given those two choices, you can map every statement of the new theory to a statement of the old theory. Just replace the relation symbols by the appropriate formulas, and replace quantifies by relativized quantifiers: \forall x \psi(x) \Rightarrow \forall x (U(x) \rightarrow {\tilde{\psi}(x))} and \exists x \psi(x) \Rightarrow \exists x (U(x) \&amp; \tilde{\psi}(x))
Then you prove the translation of each axiom of the new theory, using the axioms of the old theory. If you can do that, then you've interpreted the new theory in the old theory.

Typically, the base theory that you start with is either set theory, or something less powerful like arithmetic or second-order arithmetic.
 
  • #60
Demystifier said:
Both semantic and syntax are expressed as a collection of claims. Can a machine distinguish between semantics and syntax?
I don't know if this helps, but semantic statements can be assigned truth values, syntactic ones cannot.
 

Similar threads

  • · Replies 34 ·
2
Replies
34
Views
3K
  • · Replies 11 ·
Replies
11
Views
371
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 11 ·
Replies
11
Views
3K
  • · Replies 2 ·
Replies
2
Views
2K
Replies
7
Views
1K
  • · Replies 10 ·
Replies
10
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 19 ·
Replies
19
Views
3K
  • · Replies 10 ·
Replies
10
Views
2K