Let's see how we can define ordinals
in their own terms, before moving on to explaining how positional ordinals work.
Ordinal numbers are meant to represent the concept of order. So let's think of an ordered set. What do we need to define it as ordered? Well, we need to have a binary relation of
precedence. That is, a relation that given two elements of the set will tell us which one goes "before" or "after" which (or in cardinal terms, which one is "less than" or "greater than" which). Next, we have to be careful to distinguish between two kinds of order:
total order and
partial order. An order is total if
all the elements in the set are comparable under the above binary relation. An example of a set with a total order is the set of real numbers, and an example of a set with a partial order but without a total order (or at least not one that is compatible with its arithmetic) is the set of complex numbers.
Total order is also known as
linear order (because the set can be arranged in a one-dimensional, linear fashion under the order relation), and in mathematical terms a set with a linear order can be referred to as a
chain. This is the kind of order that interests us for the purpose of defining ordinal numbers.
So we have a binary relation of precedence between ordinal numbers, and thus we can say that some ordinal goes
before or
after some other. But ordinals tell us more than just this. Two ordinals can be
immediately adjacent to one another, meaning that they have a "point" in common (see below) and there is no other ordinal "between" them (in mathematical terms, given an ordinal A that precedes an ordinal B, ordinal A is said to be immediately adjacent to ordinal B iff there exists no ordinal C such that C is after A and before B
). So, in addition to the binary relation of precedence, we can define a pair of functions for
immediate precedence: a function "next" that tells us which ordinal goes
immediately after another, and a function "previous" that tells us which one goes
immediately before. So we can say that "ordinal C goes before ordinal E and after ordinal A", but also that "the next ordinal to ordinal C is ordinal D and the previous one to it is ordinal B". Thus, we can conceptualize the chain of ordinals as a chain of immediately adjacent segments on a line. Each of those segments has a certain size, and is defined by two points: an initial point and an end point. Its initial point being the one that the given segment has in common with its previous segment (previous ordinal), and its end point the one in common with its next segment (next ordinal). So we can say that ordinals have a "size" (I'll elaborate on this in a later post) and two "sides" (the side "facing" its previous ordinal and the one "facing" its next).
What's more, an ordinal also tells us its
exact relative position in the whole chain of ordinals. And for this we need to define a point of reference: the
starting point of the ordinals. Which leads us to the concept of
first. Using the above conceptualization in terms of segments, we can define
first as "the ordinal whose initial point (or 'previous' side) is at the starting point". Which, for the set of unsigned ordinals, can be rephrased as "the ordinal with no previous ordinal".
Once we have the concept of first and the concept of immediate precedence, we can already define all other ordinals: the
second is "the ordinal whose immediate previous is the first", the
third "the one whose immediate previous is the second", etc. When we introduce the concept of sign and the chain of positive ordinals is mirrored with the starting point as the centre of symmetry, we can then define
negative first as "the one immediately previous to
positive first",
negative second as "the one immediately previous to negative first", etc. That is,
negative first is "the ordinal whose
end point (instead of
initial point) is at the starting point", and the change of sign can be thought of as kind of "flipping" or "reversing" the "sides" of the ordinal (the one "facing" its next becomes the one "facing" its previous and viceversa). Note that, as a different kind of "mirror image" of the concept of
first, we can also define the concept of
last, if instead of (or in addition to) a starting point, our chain has a defined
final point. The
last is thus "the ordinal whose end point is at the final point" (or "the [unsigned] ordinal with no next ordinal"), and similarly we can define all other ordinals in terms of the last instead of in terms of the first:
second to last,
third to last, etc. (and here we can also introduce the concept of sign and define
negative last as "the ordinal whose
initial point is at the final point", and then define the subsequent
negative second to last, etc.).
As a side comment, note the etymology of the English word
first, which is (quite obviously) not
in any way related to the word for cardinal
one. Instead, it (and its Latinate cognates
primary and related words such as
prime) etymologically means "foremost" (PIE root *
per- + a superlative sense through derivation).
First is in fact a cognate of
fore; not surprising since the first is the one
at the fore or
before all others (in the unsigned or positive series). The (Latinate, through French) word
second is also not in any way related to the word for cardinal
two, and instead etymologically means "the one about to follow [the first]" (Latin
secundus, the future participle of deponent verb
sequor "to follow", from PIE root *
sekʷ-). As for the word
last, like
first it is etymologically a superlative (the superlative of comparative
latter, and a cognate of
late, since the
last is "the one coming
latest").
However, even though ordinals and cardinals are distinct concepts and, as we have just seen, ordinals can be defined entirely in their own terms completely independent of the cardinals, there seems to be an apparently innate tendency to associate ordinal number first with cardinal number one, ordinal second with cardinal two, and in fact all other ordinals in English are named after their associated cardinal in this way (third, fourth, etc.). This mental association probably stems from the fact that in order to have a first you need to have
at least one, in order to have a second you need to have
at least two, etc. Note that there is no element to be named ordinally if you have
zero elements, so languages have historically never felt any need to come up with a word for a supposed ordinal
zeroth, even centuries after a specific word for
zero entered the language.
Nevertheless, it is precisely because of this close mental (and linguistic and representational) association between cardinals and ordinals that, once people realized of the nature and importance of number zero for the cardinals, and in fact turned this discovery into a kind of badge of honour of how evolved and mature modern mathematics are in comparison with the "primitive" mathematics of the ancients that lacked the use and concept of 0 as a number, that in recent times certain people have tried once and again to artificially graft this purely cardinal concept (the concept of no quantity) onto the ordinals, absurdly trying to create the (unnecessary and impossible) concept of an ordinal
zeroth. Curiously, people have not been so keen to, viceversa, try to graft the purely ordinal concept of
last onto the cardinals. :rolleyes:
As should be (hopefully) clear by now —once that ordinals have been defined in their own terms, without the need to use, nor room for, an ordinal concept analogous to the cardinal concept of zero—, and as I previously tried to explain with illustrative examples (such as the one of a person walking successive steps from an initial point of rest), the word
zeroth only makes sense in an
indicial sense. That is, meaning not a true
ordinal "zeroth" (an absurdity), but merely acting as a tag, an index that is placed on some element to mean that the tagged element is "in
some way related to
cardinal number zero". For example, because a label with a printed "0" has been physically placed on it, such as on a door or floor (e.g., to indicate
how many floors there are from this floor to the ground—i.e., to indicate a
quantity, not really an
order, although an order can be derived in such cases from the fact that the cardinal numbers are themselves ordered). Or because cardinal number 0 is used as a sequential tag to identify the element in an indexed series because of some practical reason, such as in the case of the indices commonly used to identify array elements in modern computer science. A practice originating from the fact that it eases pointer arithmetic when accessing the memory location of the element, since this way to access element
i you just have to calculate the pointer
m + i, where
m is the pointer to the memory address of the first element of the array. Another reason is the fact that indices starting at zero may use the full range of unsigned integers representable on computers, whereas indices starting at one would "waste" the available index zero. Note, however, that the initial practice in programming languages was to use an "intuitive" ordinal-inspired approach of one-based indexing, and it was only later because of said practical reasons that the usage was majoritarily changed to the current dominant one of zero-based indexing; and then it was not long before too-self-assured computer geeks—which do not seem to realize of the difference between ordinals and indicials—started to believe that "zeroth"
as an ordinal was a sensible concept, since they
think they are using it when they refer indicially to the
first element of an array as its "0-th" because of its index label. :rolleyes: