Logic and Mathematics Logic and MathematicsStephen G. Simpson Department of MathematicsApril 30, 1999 Pennsylvania State UniversityThis article is an overview of logic and the philosophy ofmathematics. It is intended for the general reader. It has appearedin the volume The Examined Life: Readings from Western Philosophy from Plato to Kant, edited by Stanley Rosen, publishedin 2000 by Random House.ContentsLogicAristotelean logicThe predicate calculusFoundations of mathematicsThe geometry of EuclidFormal theories for mathematicsPhilosophy of mathematicsPlato and AristotleThe 20th centuryThe futureBibliographyLogicLogic is the science of formal principles of reasoning orcorrect inference. Historically, logic originated with the ancientGreek philosopher Aristotle. Logic was further developed andsystematized by the Stoics and by the medieval scholasticphilosophers. In the late 19th and 20th centuries, logic sawexplosive growth, which has continued up to the present.One may ask whether logic is part of philosophy or independent of it.According to Bochenski [2, §10B], this issue isnowhere explicitly raised in the writings of Aristotle. However,Aristotle did go to great pains to formulate the basic concepts oflogic (terms, premises, syllogisms, etc.) in a neutral way,independent of any particular philosophical orientation. ThusAristotle seems to have viewed logic not as part of philosophy butrather as a tool or instrument1 to be used by philosophers and scientistsalike. This attitude about logic is in agreement with the modernview, according to which the predicate calculus (see 1.2below) is a general method or framework not only for philosophicalreasoning but also for reasoning about any subject matter whatsoever.Logic is the science of correct reasoning. What then is reasoning?According to Aristotle [13, Topics, 100a25], reasoning isany argument in which certain assumptions or premises are laiddown and then something other than these necessarily follows. Thuslogic is the science of necessary inference. However, when logic isapplied to specific subject matter, it is important to note that notall logical inference constitutes a scientifically validdemonstration. This is because a piece of formally correct reasoningis not scientifically valid unless it is based on a true and primarystarting point. Furthermore, any decisions about what is true andprimary do not pertain to logic but rather to the specific subjectmatter under consideration. In this way we limit the scope of logic,maintaining a sharp distinction between logic and the other sciences.All reasoning, both scientific and non-scientific, must take placewithin the logical framework, but it is only a framework, nothingmore. This is what is meant by saying that logic is a formalscience.For example, consider the following inference:Some real estate will increase in value.Anything that will increase in value is a good investment.Therefore, some real estate is a good investment.This inference is logically correct, because the conclusion ``somereal estate is a good investment'' necessarily follows once we acceptthe premises ``some real estate will increase in value'' and``anything that will increase in value is a good investment''. Yetthis same inference may not be a demonstration of its conclusion,because one or both of the premises may be faulty. Thus logic canhelp us to clarify our reasoning, but it can only go so far. The realissue in this particular inference is ultimately one of finance andeconomics, not logic.We shall now briefly indicate the basics of Aristotelean logic.Aristotelean logicAristotle's collection of logical treatises is known as the Organon.Of these treatises, the Prior Analytics contains the most systematicdiscussion of formal logic. In addition to the Organon, theMetaphysics2also contains relevant material. See Aristotle [13] andRoss [19].Subjects and predicatesAristotelean logic begins with the familiar grammatical distinctionbetween subject and predicate. A subject is typically anindividual entity, for instance a man3 or a house or acity. It may also be a class of entities, for instance all men. Apredicate is a property or attribute or mode of existence whicha given subject may or may not possess. For example, an individualman (the subject) may or may not be skillful (the predicate), and allmen (the subject) may or may not be brothers (the predicate).The fundamental principles of predication are:Identity. Everything is what it is and acts accordingly. In symbols: is . For example, an acorn will grow into an oak tree and nothing else.Non-contradiction. It is impossible for a thing both to be and not to be. A given predicate cannot both belong and not belong to a given subject in a given respect at a given time. Contradictions do not exist. Symbolically: and non- cannot both be the case. For example, an honest man cannot also be a thief.Either-or. Everything must either be or not be. A given predicate either belongs or does not belong to a given subject in a given respect at a given time. Symbolically: Either or non- . For example, a society must be either free or not free.These principles have exercised a powerful influence on subsequentthinkers. For example, the 20th century intellectual Ayn Rand titledthe three main divisions of her best-selling philosophical novelAtlas Shrugged4[18] after the three principles above, in tribute toAristotle.SyllogismsAccording to Aristotelean logic, the basic unit of reasoning is thesyllogism. For example, the real estate inference which waspresented above is a syllogism. It is of the formSome is .All is .Therefore, some is .Here denotes real estate, denotes increase in value, and denotes a good investment. Just as in the case of this example, everysyllogism consists of two premises and one conclusion. Each of thepremises and the conclusion is of one of four types:universal affirmative:All is . universal negative:No is . particular affirmative:Some is . particular negative:Some is not . The letters , , are known as terms. Every syllogismcontains three terms. The two premises always share a common termwhich does not appear in the conclusion. This is known as themiddle term. In our real estate example, the middle term is , i.e., that which increases in value.In order to classify the various types of syllogisms, one must takeaccount of certain symmetries. In particular, ``no is '' and``no is '' are equivalent, as are ``some is '' and``some is ''. Furthermore, the order of the two premises in asyllogism does not matter. Allowing for these symmetries, we canenumerate a total of possible syllogistic forms. Of these , only represent correct inferences. For example, the formall is , all is , therefore all is represents a correct inference, whileall is , all is , therefore some is does not. The classification of syllogisms leads to a rather complex theory.Medieval thinkers perfected it and developed ingenious mnemonics toaid in distinguishing the correct forms from the incorrect ones. Thisculminated in the famous pons asinorum (``bridge of asses''),an intricate diagram which illustrates all of the syllogistic forms bymeans of a contrast between the good and the pleasurable. SeeBochenski [2, §24H, §32F].The predicate calculusIn 1879 the German philosopher Gottlob Frege published a remarkabletreatise, the Begriffsschrift (``concept script'') [22]. Thisbrilliant monograph is the origin of modern logical theory. However,Frege's account was defective in several respects, and notationallyawkward to boot. Instead of Frege's system, we shall present astreamlined system known as first-order logic or the predicate calculus.The predicate calculus dates from the 1910's and 1920's. It is basicfor all subsequent logical research. It is a very general system oflogic which accurately expresses a huge variety of assertions andmodes of reasoning. We shall see that it is much more flexible thanthe Aristotelean syllogistic.Predicates and individualsIn the predicate calculus, the subject/predicate distinction is drawnsomewhat differently from the way it is drawn in Aristotelean logic.The main point here is that, in the predicate calculus, a subject isalways an individual entity, never a class of entities. For example,an individual man can be treated as a subject, but the class of allmen must be treated as a predicate. Since a subject in the predicatecalculus is always an individual entity, it is usual to speak ofindividuals rather than subjects. We shall follow thiscustomary practice.The predicate calculus makes heavy use of symbolic notation.Lower-case letters , , , ..., , , , ... areused to denote individuals. Upper-case letters , , , , , ... are used to denote predicates. Simple assertions may beformed by juxtaposing a predicate with an individual.For example, if is the predicate ``to be a man'' and is theindividual ``Socrates'', then denotes the assertion ``Socrates isa man''. The symbol is called an argument of . Thepredicate may be applied to any individual, and that individual isthen an argument of . If is the individual ``New York'', then asserts, falsely, that New York is a man. In general, if isany individual whatsoever, then is the assertion that is aman. This assertion may or may not be true, depending on what is.The expression is called an atomic formula of thepredicate calculus.Some predicates require more than one argument. For example, if is the predicate ``bigger than'', then denotes the assertion`` is bigger than ''. Thus requires two arguments, and is an atomic formula. If we try to use with only oneargument, we obtain something like , i.e., `` is bigger than''.This is not an atomic formula or any other kind of assertion. It isonly a meaningless combination of symbols. In analogy with Englishgrammar, we could say that is like a grammatically correctsentence, while is merely a sentence fragment. Such fragmentsplay no role in the predicate calculus.Let us now go into more detail about the role of individuals in thepredicate calculus. We have already said that lower-case lettersdenote individuals. We now divide the lower-case letters into twogroups: , , , ... near the beginning of the alphabet, and , , , ... near the end of the alphabet. We insist on animportant grammatical or logical distinction between these two groups.Letters of the first group are known as individual constants orsimply constants. As in the above examples, we think of themas denoting specific individuals, such as Socrates or New York.Letters of the second group are known as individual variablesor simply variables. For example, is a variable. We thinkof as denoting not a specific individual but rather an arbitraryor unspecified individual.5Formulas and logical operatorsWe have already mentioned two kinds of symbols: lower-case letters forindividuals (constants and variables), and upper-case letters forpredicates. In addition to these, the predicate calculus employsseven special symbols known as logical operators6: The names and meanings of the logical operators are given bysymbolnameusagemeaning conjunction ``both ... and ...'' disjunction ``either ... or ... (or both)'' negation ``it is not the case that ...'' implication7 ``if ... then ...'' bi-implication8 ``... if and only if ...'' universal quantifier ``for all , ...'' existential quantifier ``there exists such that ...'' Here is anyvariable.A formula is a meaningful expression built up from atomicformulas by repeated application of the logical operators. In theabove table, an ellipsis mark ... stands for a formula within alarger formula.For example, suppose we have a predicate meaning ``is a man'',another predicate meaning ``is a truck'', and another predicate meaning ``drives''. Here and are predicates which requireonly one argument apiece. The predicate requires two arguments:the driver, and the vehicle being driven. Thus , , and are atomic formulas meaning `` is a man'', `` is a truck'', and`` drives '', respectively. A typical formula built from theseatomic formulas is which we can translate as ``for all , if is a man then thereexists such that is a truck and drives ''. In otherwords,Every man drives at least one truck.Similarly, the formula translates toEvery truck is driven by at least one man.In writing formulas, we often use parentheses as punctuation marks toindicate grouping and thereby remove ambiguity. If parentheses werenot used, one could construe the formula in twologically inequivalent ways: as (`` is not atruck, and drives ''), or as (`` is nota truck that drives''). The parentheses allow us to choose themeaning that we intend.The predicate calculus is very rich in expressive power. For example,the four Aristotelean premise types discussed in 1.1.2 caneasily be rendered as formulas of the predicate calculus. Letting and be predicates which require one argument apiece, we haveuniversal affirmativeall is  universal negativeno is  particular affirmativesome is  particular negativesome is not  In the second line of this table, the universal negative ``no is '' could have been rendered equivalently as , or as .The above table may tend to gloss over a subtle but philosophicallysignificant difference between Aristotelean logic and the predicatecalculus. Namely, where Aristotelean logic views as a subject and as a predicate, the predicate calculus views both and aspredicates. This is typical of the different perspectives involved.Aristotelean logic emphasizes the universal essences of subjects orentities, while the predicate calculus elevates predicates to aposition of supreme importance.Logical validity and logical consequenceA formula of the predicate calculus is said to be logically valid if it is necessarily always true, regardless of the specificpredicates and individuals involved. For example, the threefundamental principles of Aristotelean logic (see 1.1.1above) correspond to formulas as follows:Identity: .Non-contradiction: .Either-or: .These formulas are logically valid, because they are ``necessarily''or ``automatically'' or ``formally'' true, no matter what predicatemay be denoted by the symbol .The predicate calculus concept of logical validity subsumes theAristotelean syllogism. Each syllogism corresponds to a logicallyvalid implication where and are formulas expressing the two premisesand expresses the conclusion. For example, the syllogismsome is , all is , therefore some is has a predicate calculus rendition and this formula is logically valid.More generally, a formula is said to be a logical consequence of a set of formulas , ..., just incase is logically valid. Here , ..., are premises and is a conclusion. This is similar to the Aristoteleansyllogism, but it is of wider applicability, because the premises andthe conclusion can be more complex. As an example, the 19th centurylogician Augustus DeMorgan noted9 that the inferenceall horses are animals,therefore, the head of a horse is the head of an animalis beyond the reach of Aristotelean logic. Yet this same inferencemay be paraphrased as ``if all horses are animals, then for all ,if is the head of some horse then is the head of someanimal'', and this corresponds to a logically valid formula of the predicate calculus. Here , , denote ``is a horse'',``is an animal'', ``is the head of'', respectively. Thus DeMorgan'sconclusion is indeed a logical consequence of his premise.The completeness theoremFormulas of the predicate calculus can be exceedingly complicated.How then can we distinguish the formulas that are logically valid fromthe formulas that are not logically valid? It turns out that there isan algorithm10 for recognizing logicallyvalid formulas. We shall now sketch this algorithm.In order to recognize that a formula is logically valid, itsuffices to construct what is known as a proof tree for ,or equivalently a refutation tree for . This is atree which carries at the root. Each node of the treecarries a formula. The growth of the tree is guided by the meaning ofthe logical operators appearing in . New nodes are added to thetree depending on what nodes have already appeared. For example, if anode carrying has appeared, we create twonew nodes carrying and respectively. Thethought behind these new nodes is that the only way for to be the case is if at least one of or is the case. Similarly, if a nodecarrying has already appeared, we create a newnode carrying , where is the result ofsubstituting a new constant for the variable throughout theformula . The idea here is that the only way for the universalstatement to be false is if is false for someparticular . Since is a new constant, is a formulawhich may be considered as the most general false instance of .Corresponding to each of the seven logical operators, there areprescribed procedures for adding new nodes to the tree. We applythese procedures repeatedly until they cannot be applied any more. Ifexplicit contradictions11 are discoveredalong each and every branch of the tree, then we have a refutationtree for . Thus is seen to be logicallyimpossible. In other words, is logically valid.The adequacy of proof trees for recognizing logically valid formulasis a major insight of 20th century logic. It is a variant of thefamous completeness theorem, first proved in 1930 by the greatlogician Kurt Gödel [5,22].On the other hand, the class of logically valid formulas is known tobe extremely complicated. Indeed, this class is undecidable:there is no algorithm12 which accepts as input an arbitrary formula and outputs ``yes'' if is logically valid and ``no'' if is not logically valid. In this sense, the concept of logicalvalidity is too general and too intractable to be analyzed thoroughly.There will never be a predicate calculus analog of the pons asinorum.Formal theoriesThe predicate calculus is a very general and flexible framework forreasoning. By choosing appropriate predicates, one can reason aboutany subject whatsoever. These considerations lead to the notion of aformal theory.In order to specify a formal theory, one first chooses a smallcollection of predicates which are regarded as basic for a given fieldof study. These predicates are the primitives of the theory.They delimit the scope of the theory. Other predicates must bedefined in terms of the primitives. Using them, one writes downcertain formulas which are regarded as basic or self-evident withinthe given field of study. These formulas are the axioms of thetheory. It is crucial to make all of our underlying assumptionsexplicit as axioms. Once this has been done, a theorem is anyformula which is a logical consequence of the axioms. A formal theory is this structure of primitives, axioms, and theorems.As a frivolous example, we could envision a theory of cars, trucks,and drivers. We would begin with some primitives such as (``is acar''), (``is a truck''), (``drives''), (``is a man''),etc. We could then write down certain obvious or self-evident axiomssuch as (``no man is a car''), (``every driver is a man''), etc.Then, within the constraints imposed by the axioms, we couldinvestigate the logical consequence relationships among variousnon-obvious assertions, such as (``nobody drives both a car and a truck''). Additional predicates (``is a vehicle'') and (``is a driver'') can be defined in termsof the primitives. The defining axioms for and would be and , respectively. In this fashion, we could attempt to codifyall available knowledge about vehicles and drivers.More seriously, one could try to write down formal theoriescorresponding to various scientific disciplines, such as mechanics orstatistics or law. In this way one could hope to analyze the logicalstructure of the respective disciplines.The process of codifying a scientific discipline by means ofprimitives and axioms in the predicate calculus is known asformalization. The key issue here is the choice of primitivesand axioms. They cannot be chosen arbitrarily. The scientist whochooses them must exercise a certain aesthetic touch. They must besmall in number; they must be basic and self-evident; and they mustaccount for the largest possible number of other concepts and facts.To date, this kind of formal theory-building has been convincinglycarried out in only a few cases. A survey is in Tarski [21].The most notable successes have been in mathematics.Foundations of mathematicsMathematics is the science of quantity. Traditionally therewere two branches of mathematics, arithmetic and geometry, dealingwith two kinds of quantities: numbers and shapes. Modern mathematicsis richer and deals with a wider variety of objects, but arithmeticand geometry are still of central importance.Foundations of mathematics is the study of the most basicconcepts and logical structure of mathematics, with an eye to theunity of human knowledge. Among the most basic mathematical conceptsare: number, shape, set, function, algorithm, mathematical axiom,mathematical definition, mathematical proof.The reader may reasonably ask why mathematics appears at all in thisvolume. Isn't mathematics too narrow a subject? Isn't the philosophyof mathematics of rather specialized interest, all the more so incomparison to the broad humanistic issues of philosophy proper, issuessuch as the good, the true, and the beautiful?There are three reasons for discussing mathematics in a volume ongeneral philosophy:Mathematics has always played a special role in scientific thought. The abstract nature of mathematical objects presents philosophical challenges that are unusual and unique.Foundations of mathematics is a subject that has always exhibited an unusually high level of technical sophistication. For this reason, many thinkers have conjectured that foundations of mathematics can serve as a model or pattern for foundations of other sciences.The philosophy of mathematics has served as a highly articulated test-bed where mathematicians and philosophers alike can explore how various general philosophical doctrines play out13 in a specific scientific context.The purpose of this section is to indicate the role of logic in thefoundations of mathematics. We begin with a few remarks on thegeometry of Euclid. We then describe some modern formal theories formathematics.The geometry of EuclidAbove the gateway to Plato's academy appeared a famous inscription:Let no one who is ignorant of geometry enter here.In this way Plato indicated his high opinion of geometry. Accordingto Heath [9, page 284], Plato regarded geometry as``the first essential in the training of philosophers'', because ofits abstract character. See also Plato [17, Republic, 527B].In the Posterior Analytics [13], Aristotle laid down thebasics of the scientific method.14 The essence of themethod is to organize a field of knowledge logically by means ofprimitive concepts, axioms, postulates, definitions, and theorems.The majority of Aristotle's examples of this method are drawn fromarithmetic and geometry[1,7,9].The methodological ideas of Aristotle decisively influenced thestructure and organization of Euclid's monumental treatise ongeometry, the Elements [8]. Euclid begins with 21definitions, five postulates, and five common notions. After that,the rest of the Elements are an elaborate deductive structureconsisting of hundreds of propositions. Each proposition is justifiedby its own demonstration. The demonstrations are in the form ofchains of syllogisms. In each syllogism, the premises are identifiedas coming from among the definitions, postulates, common notions, andpreviously demonstrated propositions. For example, in Book I of theElements, the demonstration of Proposition 16 (``in any triangle, ifone of the sides be produced, the exterior angle is greater thaneither of the interior and opposite angles'') is a chain of syllogismswith Postulate 2, Common Notion 5, and Propositions 3, 4 and 15 (``iftwo straight lines cut one another, they make the vertical anglesequal to one another'') occurring as premises. It is true that thesyllogisms of Euclid do not always conform strictly to Aristoteleantemplates. However, the standards of rigor are very high, andAristotle's influence is readily apparent.The logic of Aristotle and the geometry of Euclid are universallyrecognized as towering scientific achievements of ancient Greece.Formal theories for mathematicsA formal theory for geometryWith the advent of calculus in the 17th and 18th centuries,mathematics developed very rapidly and with little attention tological foundations. Euclid's geometry was still regarded as a modelof logical rigor, a shining example of what a well-organizedscientific discipline ideally ought to look like. But the prolificEnlightenment mathematicians such as Leonhard Euler showed almost nointerest in trying to place calculus on a similarly firm foundation.Only in the last half of the 19th century did scientists begin to dealwith this foundational problem in earnest. The resulting crisis hadfar-reaching consequences. Even Euclid's geometry itself came undercritical scrutiny. Geometers such as Moritz Pasch discovered whatthey regarded as gaps or inaccuracies in the Elements. Greatmathematicians such as David Hilbert entered the fray.An outcome of all this foundational activity was a thorough reworkingof geometry, this time as a collection of formal theories within thepredicate calculus. Decisive insights were obtained by Alfred Tarski.We shall sketch Tarski's formal theory for Euclidean15 planegeometry.16As his primitive predicates, Tarski takes (``point''), (``between''), (``distance''), (``identity''). The atomicformulas , , , and mean `` is a point'',`` lies between and '', ``the distance from to isequal to the distance from to '', and `` is identical to '', respectively. Geometrical objects other than points, such asline segments, angles, triangles, circles, etc., are handled by meansof the primitives. For example, the circle with center and radius consists of all points such that holds.In geometry, two points and are considered identical if thedistance between them is zero. Tarski expresses this by means of anaxiom .Another axiom expresses the fact that, given any four points, if the second isbetween the first and the third, and if the third is between the firstand the fourth, then the third is between the second and the fourth.A noteworthy axiom is which says: any three points equidistant from two distinctpoints must be collinear. This axiom is typical oftwo-dimensional (i.e., plane) geometry and does not apply togeometries of dimension greater than two.Altogether Tarski presents twelve axioms, plus an additionalcollection of axioms expressing the idea that a line is continuous.The full statement of Tarski's axioms for Euclidean plane geometry isgiven at [10, pages 19-20]. Let be the formal theorybased on Tarski's axioms.Remarkably, Tarski has demonstrated that is complete.This means that, for any purely geometrical17 statement , either or is a theorem of . Thus wesee that the axioms of suffice to answer all yes/no questions ofEuclidean plane geometry. Combining this with the completenesstheorem of Gödel, we find that is decidable: there isan algorithm18 which accepts as input an arbitrarystatement of plane Euclidean geometry, and outputs ``true'' if thestatement is true, and ``false'' if it is false. This is a triumph ofmodern foundational research.A formal theory for arithmeticBy arithmetic we mean elementary school arithmetic, i.e., thestudy of the positive whole numbers , , , ... along withthe familiar operations of addition ( ) and multiplication( ). This part of mathematics is obviously fundamental, yet itturns out to be surprisingly complicated. Below we write down some ofthe axioms which go into a formal theory of arithmetic.19Our primitive predicates for arithmetic are (``number''), (``addition''), (``multiplication''), (``identity''). Theatomic formulas , , , mean `` is a number'',`` '', `` '', `` '', respectively. Our axiomswill use the predicates , , , to assert that for anygiven numbers and , the numbers and alwaysexist and are unique. We shall also have axioms expressing some wellknown arithmetical laws:substitution laws:if and is a number then is a number, etc.commutative laws: and .associative laws: and .distributive law: .comparison law: if and only if, for some , or .unit law: . Our formal axioms for arithmetic are as follows.substitution laws:   existence and uniqueness of :  existence and uniqueness of :  commutative laws: associative laws: distributive law: comparison law: unit law: Let be the formal theory specified by the above primitives andaxioms.It is known that suffices to derive many familiar arithmeticalfacts. For example, may be expressed, awkwardly20 to be sure, as or 21and this formula is indeed a theorem of , i.e., a logicalconsequence of the axioms of . Another theorem of is expressing a familiar cancellation law: if either or , then .On the other hand, the axioms of are by no means exhaustive.They can be supplemented with other axioms expressing the so-calledmathematical induction or least number principle: ifthere exists a number having some well-defined property, then amongall numbers having the property there is a smallest one. Theresulting formal theory is remarkably powerful, in the sense that itstheorems include virtually all known arithmetical facts. But it isnot so powerful as one might wish. Indeed, any formal theory whichincludes is necessarily either inconsistent22 or incomplete. Thus there is no hope of writing downenough axioms or developing an algorithm to decide all arithmeticalfacts. This is a variant of the famous 1931 incompleteness theorem of Gödel [5,22]. There are severalmethods of coping with the incompleteness phenomenon, and thisconstitutes a currently active area of research in foundations ofmathematics.The contrast between the completeness of formal geometry and theincompleteness of formal arithmetic is striking. Both sides of thisdichotomy are of evident philosophical interest.A formal theory of setsOne of the aims of modern logical research is to devise a singleformal theory which will unify all of mathematics. Such a theory willnecessarily be subject to the Gödel incompleteness phenomenon,because it will incorporate not only but also .One approach to a unified mathematics is to straightforwardly embedarithmetic into geometry, by identifying whole numbers with evenlyspaced points on a line. This idea was familiar to the ancientGreeks. Another approach is to explain geometry in terms ofarithmetic and algebra, by means of coordinate systems, like latitudeand longitude on a map. This idea goes back to the 17th centurymathematician and philosopher René Descartes and the 19th centurymathematician Karl Weierstrass. Both approaches give rise toessentially the same formal theory, known as second-order arithmetic.23 This theory includes both and and is adequate for the bulk of modern mathematics. Thus the decisionabout whether to make geometry more fundamental than arithmetic orvice versa seems to be mostly a matter of taste.A very different approach to a unified mathematics is via set theory. This is a peculiarly 20th century approach. It is basedon one very simple-looking concept: sets. Remarkably, this oneconcept leads directly to a vast structure which encompasses all ofmodern mathematics.A set is a collection of objects called the elements ofthe set. We sometimes use informal notations such as to indicate that is a set consisting of elements , , .... The number of elements in a set can be arbitrarilylarge or even infinite. A basic principle of set theory is that a setis determined by its elements. Thus two sets are identical if andonly if they have the same elements. This principle is known asextensionality. For example, the set is consideredto be the same set as because the elements are the same,even though written in a different order.Much of the complexity of set theory arises from the fact that setsmay be elements of other sets. For instance, the set is anelement of the set and this is distinct from the set .For a formal theory of sets, we use three primitives: (``set''), (``identity''), (``element''). The atomic formulas , , mean `` is a set'', `` is identical to '', `` is an element of '', respectively. One of the ground rules of settheory is that only sets may have elements. This is expressed as anaxiom . In addition there is anaxiom of extensionality and an axiom expressing theexistence of the empty set, i.e. a set having noelements. A list of all the axioms of set theory may be found intextbooks [11,15]. Let be the formal theory ofsets based on these axioms.The set theory approach to arithmetic is in terms of the non-negativewhole numbers , , , , .... These numbers are identifiedwith specific sets. Namely, we identify with the empty set , with , with , with , etc. In general,we identify the number with the set of smaller numbers . Among the axioms of is an axiom of infinity asserting the existence of the infinite set . One can use the set to show that includes a theory equivalent to . After that, one canfollow the ideas of Descartes and Weierstrass to see that alsoincludes a theory equivalent to . It turns out that the rest ofmodern mathematics can also be emulated within . This includesan elaborate theory of infinite sets which are much larger than .The set-theoretical approach to arithmetic and geometry is admittedlysomewhat artificial. However, the idea of basing all of mathematicson one simple concept, sets, has exerted a powerfulattraction.24 The implications of thisidea are not yet fully understood and are a topic of current research.Philosophy of mathematicsIn this section we indicate some issues and trends in the philosophyof mathematics.Plato and AristotleThe objects that are studied in mathematics tend to be somewhatabstract and remote from everyday perceptual experience. Therefore,the existence and nature of mathematical objects present specialphilosophical challenges. For example, is a geometrical squaredifferent from a square floor tile? If so, then where is thegeometrical square? Is it on the floor, in our minds, or somewhereelse? And what about sets? Is a set of cards something otherthan the cards themselves?The ancient Greek philosophers took such questions very seriously.Indeed, many of their general philosophical discussions were carriedon with extensive reference to geometry and arithmetic. Plato seemedto insist that mathematical objects, like the Platonic forms oressences, must be perfectly abstract and have a separate, non-materialkind of existence. Aristotle[1,7,13,19] dissectedand refuted this view in books and of the Metaphysics.According to Aristotle, the geometrical square is a significant aspectof the square floor tile, but it can only be understood by discardingother irrelevant aspects such as the exact measurements, the tilingmaterial, etc. Clearly these questions provide much food forphilosophical analysis and debate.The 20th centuryIn the 20th century, the advent of the predicate calculus and thedigital computer profoundly affected our view of mathematics. Thediscovery that all of mathematics can be codified in formal theoriescreated a huge stir. One expression of this excitement was the riseof an extreme philosophical doctrine known asformalism.25According to formalism, mathematics is only a formal game, concernedsolely with algorithmic manipulation of symbols. Under this view, thesymbols of the predicate calculus do not denote predicates or anythingelse. They are merely marks on paper, or bits and bytes in the memoryof a computer. Therefore, mathematics cannot claim to be any sort ofknowledge of mathematical objects. Indeed, mathematical objects donot exist at all, and the profound questions debated by Plato andAristotle become moot. Mathematics is nothing but a kind of blindcalculation.The formalist doctrine fits well with certain modern trends incomputer science, e.g., artificial intelligence. However, formalismhas proved inadequate as an integrated philosophy of mathematics,because it fails to account for human mathematical understanding, notto mention the spectacular applications of mathematics in fields suchas physics and engineering.By way of reaction against formalism, several alternative doctrineshave been advocated. One of these is constructivism, the ideathat mathematical knowledge can be obtained by means of a series ofpurely mental constructions. Under this view, mathematical objectsexist solely in the mind of the mathematician, so mathematicalknowledge is absolutely certain. However, the status of mathematicsvis a vis the external world becomes doubtful. An extreme version ofconstructivism is so solipsistic that it does not even allow for thepossibility of mathematical communication from one mind to another.An additional disturbing feature of constructivism is that it entailsrejection of the basic laws of logic. To see how this comes about,consider some specific mathematical problem or question26 of a yes/no nature, for which theanswer is currently unknown. (Mathematics abounds with suchquestions, and the Gödel incompleteness phenomenon suggests thatsuch questions will always exist.) Express the ``yes'' answer as aformula and the ``no'' answer as the negated formula . Since the answer is unknown, neither nor is in the mind of the mathematician. Therefore, accordingto the constructivists, the disjunction is not alegitimate mathematical assumption. Thus Aristotle's either-orprinciple (see 1.1.1 and 1.2.3 above) must beabandoned.27Constructivism has the merit of allowing human beings to possessmathematical knowledge. However, the constructivist rejection of theexternal world and of Aristotelean logic are highly unpalatable tomost mathematicians and mathematically oriented scientists. For thisreason, constructivism remains a fringe movement on the 20th centurymathematical landscape.Another 20th century philosophical doctrine has arisen fromset-theoretical foundations. The reliance on infinite sets suggestsmany perplexing questions. What do such sets correspond to inreality? Where are they, and how can the human mind grasp them? Inorder to boldly answer these questions, and as a reaction againstformalism, many researchers in axiomatic set theory have subscribed towhat is known as set-theoretical Platonism. According to thisvariant of the Platonic doctrine, infinite sets exist in anon-material, purely mathematical realm. By extending our intuitiveunderstanding of this realm, we will be able to cope with chaosissuing from the Gödel incompleteness phenomenon. The mostprominent and frequently cited authority for this kind of Platonism isGödel himself [5].There is a good fit between set-theoretical Platonism and certainaspects of 20th century mathematical practice. However, as aphilosophical doctrine, set-theoretical Platonism leaves much to bedesired. Many of Aristotle's objections to the Platonic forms arestill cogent. There are serious questions about how a theory ofinfinite sets can be applicable to a finite world.We have mentioned three competing 20th century doctrines: formalism,constructivism, set-theoretical Platonism. None of these doctrinesare philosophically satisfactory, and they do not provide muchguidance for mathematically oriented scientists and other users ofmathematics. As a result, late 20th century mathematicians havedeveloped a split view, a kind of Kantian schizophrenia, which isusually described as ``Platonism on weekdays, formalism on weekends''.In other words, they accept the existence of infinite sets as aworking hypothesis in their mathematical research, but when it comesto philosophical speculation, they retreat to a formalist stance.Thus they have given up hope of an integrated view which accounts forboth mathematical knowledge and the applicability of mathematics tophysical reality. In this respect, the philosophy of mathematics isin a sorry state.The futureFrom the Renaissance through the 20th century, Aristotle's ideas aboutthe nature of mathematical objects have been neglected and ignored.Now the time seems ripe for a renovation of the philosophy ofmathematics, based on Aristotelean and neo-Aristotelean [16]ideas and bolstered by the techniques of modern logic, including thepredicate calculus.The great mathematician David Hilbert anticipated such a renovation inhis 1925 essay, On the Infinite [22]. Hilbert was aware that,according to modern physics, the physical universe is finite. Yetinfinite sets were playing an increasingly large role in themathematics of the day. Hilbert therefore recognized that the mostvulnerable chink in the armor of mathematics was the infinite. Inorder to defend what he called ``the honor of human understanding'',Hilbert proposed to develop a new foundation of mathematics, in whichformal theories of infinite sets, such as , would be rigorouslyjustified by reference to the finite. This is Hilbert's program offinitistic reductionism.28Although Hilbert did not cite Aristotle, we can imagine that Hilbertwould have profited from an examination of Aristotle's distinctionbetween actual and potential infinity. An actual infinity issomething like an infinite set regarded as a completed totality. Apotential infinity is more like a finite but indefinitely long,unending series of events. According to Aristotle, actual infinitiescannot exist, but potential infinities exist in nature and aremanifested to us in various ways, for instance the indefinite cycle ofthe seasons, or the indefinite divisibility of a piece of gold.In any case, it turned out that Hilbert had stated his program in toosweeping a fashion. The wholesale finitistic reduction which Hilbertdesired cannot be carried out. This follows from Gödel'sincompleteness theorem [5,22]. The remarkableresults obtained by Gödel in 1931 caused the philosophical ideasof Hilbert's 1925 essay to fall into disrepute. Hilbert's grandfoundational program appeared to be dead, broken beyond hope ofrepair.The last 20 years have seen a revival of Hilbert's program. Recentfoundational research [20] has revealed that, although is not finitistically reducible, there are other formal theories whichare finitistically reducible, in the precise sense envisionedby Hilbert. Moreover, these other formal theories turn out to beadequate for a very large portion of mathematics. They do notencompass actual infinities such as , but they do include themain results of arithmetic and geometry and allied disciplines.This new research has not yet had an impact on the philosophy ofmathematics or on mathematical practice. Philosophers andmathematicians are free to choose which directions to pursue and whichtechniques to emphasize. Only time will reveal the future evolutionof the philosophy of mathematics.Bibliography1Hippocrates G. Apostle.Aristotle's Philosophy of Mathematics.University of Chicago Press, 1952.X + 228 pages.2I. M. Bochenski.A History of Formal Logic.Chelsea Publishing Company, New York, 2nd edition, 1970.Translated and edited by Ivo Thomas, XXII + 567 pages.3Haskell B. Curry.Outlines of a Formalist Philosophy of Mathematics.Studies in Logic and the Foundations of Mathematics. North-Holland, 1951.VII + 75 pages.4Melvin Fitting.First-Order Logic and Automated Theorem Proving.Graduate Texts in Computer Science. Springer-Verlag New York Inc., 2nd edition, 1996.XVI + 326 pages.5Kurt Gödel.Collected Works.Oxford University Press, 1986-1995.Volume I, XVIII + 474 pages, 1986, Volume II, XVI + 407 pages, 1990, Volume III, XX + 532 pages, 1995.6Petr Hájek and Pavel Pudlák.Metamathematics of First-Order Arithmetic.Perspectives in Mathematical Logic. Springer-Verlag, 1993.XIV + 460 pages.7Thomas Heath.Mathematics in Aristotle.Clarendon Press, Oxford, 1949.XIV + 291 pages.8Thomas Heath.The Thirteen Books of Euclid's Elements.Dover Publications, New York, 2nd revised edition, 1956.Three volumes.9Thomas Heath.A History of Greek Mathematics.Dover Publications, New York, 1981.Volume I, XV + 446 pages, Volume II, XI + 586 pages.10Leon Henkin, Patrick Suppes, and Alfred Tarski, editors.The Axiomatic Method, With Special Reference to Geometry and Physics.Studies in Logic and the Foundations of Mathematics. North-Holland, 1959.XI + 488 pages.11Thomas Jech.Set Theory.Academic Press, 1978.XI + 621 pages.12Morris Kline.Why Johnny Can't Add.St. Martin's Press, 1973.173 pages.13Richard McKeon, editor.The Basic Works of Aristotle.Random House, 1941.XXXIX + 1487 pages.14Elliott Mendelson.Boolean Algebra and Switching Circuits.Schaum's Outline Series. 1970.213 pages.15Elliott Mendelson.Introduction to Mathematical Logic.Wadsworth, 3rd edition, 1987.IX + 341 pages.16Leonard Peikoff.Objectivism: The Philosophy of Ayn Rand.Dutton, New York, 1991.XV + 493 pages.17Plato.Dialogues.Oxford, Clarendon Press, 4th edition, 1964.Translated by Benjamin Jowett.18Ayn Rand.Atlas Shrugged.Random House, New York, 1957.V + 1168 pages.19David Ross.Aristotle.Barnes and Noble, 5th revised edition, 1964.XIV + 300 pages.20Stephen G. Simpson.Subsystems of Second Order Arithmetic.Perspectives in Mathematical Logic. Springer-Verlag, 1999.XIV + 445 pages.21Alfred Tarski.Introduction to Logic and to the Methodology of Deductive Sciences.Oxford University Press, 4th edition, 1994.XXII + 229 pages.22J. van Heijenoort, editor.From Frege to Gödel: A Source Book in Mathematical Logic, 1879-1931.Harvard University Press, 1971.Second Printing. XII + 660 pages.About this document ... Logic and MathematicsThis document was generated using theLaTeX2HTML translator Version 2002-2-1 (1.71)Copyright © 1993, 1994, 1995, 1996,Nikos Drakos, Computer Based Learning Unit, University of Leeds.Copyright © 1997, 1998, 1999,Ross Moore, Mathematics Department, Macquarie University, Sydney.The command line arguments were: latex2html -split 0 philmathThe translation was initiated by Stephen G Simpson on 2007-09-28Footnotes... instrument1The Greek word for instrument is organon. The collection of Aristotle's logical writings is known as the Organon....Metaphysics2The Metaphysics is Aristotle's treatise on the science of existence, i.e., being as such. It includes a detailed analysis of the various ways in which a thing can be said to be.... man3We use man in the traditional sense, equivalent to ``human being''. There is no intention to exclude persons of the female gender.... Shrugged4A survey conducted for the Book-of-the-Month Club and the Library of Congress in 1991 found that Atlas Shrugged is the most influential book in the United States of America, second only to the Bible. See http://www.lcweb.loc.gov/loc/cfbook/bklists.html.... individual.5The idea of using letters such as and as variables is of great value. Historically, the creators of the predicate calculus borrowed this idea from the mathematical discipline known as algebra. Recall that algebra is a kind of generalized arithmetic. In algebra there are constants, i.e., specific quantities such as , the square root of , etc., but there are also variables such as , , etc. The key idea of algebra is that a variable represents an unspecified or unknown quantity. It always stands for some quantity, but it may stand for any quantity. The use of variables makes algebra much more powerful than arithmetic. Variables help us to express and solve equations such as involving one or more unknown quantities. Variables can also be used to express arithmetical laws such as .... operators6The first five logical operators ( , , , , ) are equivalent to so-called ``Boolean logic gates'' of electrical engineering. Formulas built from them may be viewed as representations of the binary switching circuits that control the operation of modern digital computers. See Mendelson [14,15].... implication7This is the so-called ``material implication'': is equivalent to .... bi-implication8This is called bi-implication because is equivalent to .... noted9See however Bochenski [2, §16E].... algorithm10The details of this algorithm are explained in modern logic textbooks. Variants of it have been programmed to run on digital computers. They form the basis of a system of computer logic. See Fitting [4].... contradictions11An explicit contradiction is a pair of formulas of the form , .... algorithm12The algorithms in question may be implemented as Turing machine programs. This undecidability result is known as Church's theorem. See Mendelson [15].... out13For example, philosophical intrinsicism may play out as mathematical Platonism. Philosophical subjectivism may play out as mathematical constructivism. Nominalism may play out as formalism.... method.14Our modern notion of a formal theory (see 1.2.5 above) is a variant of Aristotle's concept of scientific method.... Euclidean15Here ``Euclidean geometry'' refers to the familiar geometry in which the angles of a triangle sum to degrees, as distinct from the ``non-Euclidean'' (i.e., hyperbolic) geometry developed by Bolyai and Lobachevsky in the 19th century....geometry.16Tarski also showed how to handle non-Euclidean plane geometry, as well as Euclidean and non-Euclidean geometries of higher dimension, in a similar fashion.... geometrical17This means that all occurrences of variables within the formula are within subformulas of the form or . Thus we are restricting attention to the realm of geometry and excluding everything else.... algorithm18Such algorithms have been implemented as computer programs. They are useful in robotics and other artificial intelligence applications.... arithmetic.19Two recent studies of formal arithmetic are Hájek/Pudlák [6] and Simpson [20].... awkwardly20This kind of awkwardness can be alleviated by means of various devices. In particular, standard mathematical notation such as can be incorporated into the predicate calculus.... 21In this formula, the variables , , , play the role of , , , respectively.... inconsistent22A formal theory is said to be inconsistent if its axioms logically imply an explicit contradiction. Such theories are of no scientific value.... arithmetic.23A recent study of second-order arithmetic is Simpson [20]....attraction.24The idea of set-theoretical foundations gave rise to the ``new math'' pedagogy of the 1960's. For a lively discussion, see Kline [12]....formalism.25See for example Curry [3].... question26For example, we could consider the following difficult question of Goldbach. Can every even number greater than be expressed as the sum of two prime numbers?...abandoned.27See the essays of Brouwer and Kolmogorov in [22].... reductionism.28Hilbert is often inaccurately described as a formalist. The details of Hilbert's program will not be presented here, but see [20,22]. Roughly speaking, a formal theory is said to be finitistically reducible if it can be embedded into some very restricted formal theory such as , which is physically meaningful and makes absolutely no reference to actual infinity. Stephen G Simpson2007-09-28 |
|