Mathematical Wit and Mathematical Cognition

Andrew Aberdein

Mathematical wit and mathematical cognition

Topics in Cognitive Science

The published works of scientists often conceal the cognitive processes that led to their results. Scholars of mathematical practice must therefore seek out less obvious sources. This paper analyzes a widely circulated mathematical joke, comprising a list of spurious proof types. An account is proposed in terms of argumentation schemes: stereotypical patterns of reasoning, which may be accompanied by critical questions itemizing possible lines of defeat. It is argued that humour is associated with risky forms of inference which are essential to creative mathematics. The components of the joke are explicated by argumentation schemes devised for application to topic neutral reasoning. These in turn are classified under seven headings: retroduction; citation; intuition; meta-argument; closure; generalization; and definition. Finally, the wider significance of this account for the cognitive science of mathematics is discussed.

MATHEMATICAL WIT AND MATHEMATICAL COGNITION Abstract. The published works of scientists often conceal the cognitive processes that led to their results. Scholars of mathematical practice must therefore seek out less obvious sources. This paper analyzes a widely circulated mathematical joke, comprising a list of spurious proof types. An account is proposed in terms of argumentation schemes: stereotypical patterns of reasoning, which may be accompanied by critical questions itemizing possible lines of defeat. It is argued that humour is associated with risky forms of inference which are essential to creative mathematics. The components of the joke are explicated by argumentation schemes devised for application to topic neutral reasoning. These in turn are classified under seven headings: retroduction; citation; intuition; meta-argument; closure; generalization; and definition. Finally, the wider significance of this account for the cognitive science of mathematics is discussed. 1. Introduction 1.1. The Joke in Question. The role of humour in mathematical cognition has seldom been addressed. This paper will focus on a single mathematical joke,1 albeit one with multiple parts. It is a list of spurious proof methods which has circulated on mathematical bulletin boards, real and virtual, since the 1980s. It deserves attention for several reasons. Firstly, it is well known among mathematicians. Secondly, the joke is concerned with a central aspect of mathematical practice: proof, and especially borderline cases of proof. As Paul Renteln and Alan Dundes observe, the list hints at the anxieties felt by many mathematicians regarding the degree to which mathematical truth is dependent upon the trustworthiness of previous results. This anxiety is exacerbated by the fact that some mathematicians have a less rigorous proof style than other mathematicians (Renteln and Dundes, 2005, p. 28). Key words. argumentation schemes; citation; generalization; humour; intuition; mathematical practice; mathematical reasoning; retroduction. 1 It lacks some of the structural features common to many jokes, but it does meet an informal definition: ‘a relatively short text which, for a given cultural group, is recognizable as having, as its primary purpose, the production of an amused reaction in its reader/hearer, and which is typically repeatable in a wide range of contexts’ (Ritchie, 2004, p. 15). However, nothing that follows rests on whether the text in question is a joke or some other sort of wit. 1 Thirdly, its dissemination is sufficiently documented to observe that the list has remained fairly stable over time. Although some items incorporate older material, 24 of them were compiled as (Angluin, 1983). A rearranged version, with three additions and no attribution, may be found in (Cherkaev and Cherkaev, nd).2 This website, which contains many other mathematical jokes, is acknowledged as a source by Renteln and Dundes (2005, p. 24), who reproduce the list in the Cherkaevs’ rearrangement, omitting their most recent addition. Although the list is obviously not exhaustive of problematic forms of proof, the survival of most of the items through these changes and the comparative unoriginality of the new items suggest that it is broadly comprehensive. Lastly, as Homer Simpson would say, ‘It’s funny because it’s true.’ Many of the items are only lightly exaggerated, and many similar examples of actual mathematical practice could be given, although for reasons of space only a few are documented here. 1.2. Humour Theory. Some surveys distinguish as many as eight conflicting categories of humour theory; it seems correct to say that each category succeeds in accounting for an important feature, but typically at the expense of other important features (Hurley et al., 2011, p. 37). Amongst the most influential are superiority, release, and incongruity theories, each of which has distinguished champions. On the superiority account, associated with Hobbes, laughter proceeds from the recognition of the joke teller’s superiority over the butt of the joke. On the release account, associated with Freud, laughter is a release of tension resulting from the overcoming of an inhibition. On the incongruity account, associated with Kant, laughter results from the resolution of an apparent absurdity. Humour has been the subject of empirical inquiry in all the major domains of psychological research (Martin, 2007, p. 29). In social psychology, it has been shown how humour can be used to socialize members of a group into the acceptance of norms important to the group without the hostility that may result from more direct criticism (Martin, 2007, p. 120). As Gary Fine and Michaela De Soucey conclude from their studies of mushroom collectors and meteorologists, ‘Joking themes permit the smoothing of conversation, sharing traditions in the name of cohesion, separating the group from others, and securing the conformity of members’ (2005, p. 17). All of these activities are just as important to the acculturation of student 2 The Cherkaevs assert, with more candour than legal acumen, that ‘To our mind, a joke goes to “public domain” immediately after being created or modified and there should be no authorship in it.’ A search on archive.org reveals the list was on their site by 2000, and was added to in 2003 and 2004. 2 mathematicians, a central activity of the communities in which Angluin’s list has circulated. So it seems reasonable to conjecture that the items on the list reflect norms that mathematicians consider important. Humour is more intimately related to cognition: jokes may be seen as byproducts of defeasible cognitive heuristics. For example, Marvin Minsky has argued that ‘Common sense logic is too unreliable for practical use. It cannot be repaired, so we must learn to avoid its most common malfunctions. Humor plays a special role in learning and communicating about such matters. . . . Its forms include graciously disarming ways to instruct others about inappropriate behavior and faulty reasoning’ (1983, pp. 172 f.). A more sophisticated articulation of this insight characterizes humour as a reward mechanism for ‘maintaining epistemic dataintegrity’ (Hurley et al., 2011, p. 127). On this account, humour has arisen through natural selection and cultural evolution as compensation for the slapdash nature of much cognition. Our minds prefer fast and unreliable heuristics over reliable but resource-intensive alternatives. Left unchecked, this process would leave us with increasingly contradictory belief sets, leading to erratic and risky behaviour. Hence, there is selective pressure for a mechanism which will incentivize us to reconcile these incongruities. We experience the incentive as mirth. 1.3. Fallacy Theory. Some of the items on Angluin’s list resemble fallacies, so it might be supposed that the analysis of the list could be subsumed under a general account of (mathematical) fallacy. However, fallacy theory is a much less developed field than humour theory. After centuries of stagnation, there has been significant progress in the study of informal fallacies (beginning with Hamblin, 1970), but this work has only begun to be applied to mathematical fallacies (see, for example, Krabbe, 2008; Aberdein, 2010). Most treatments of mathematical fallacies exhibit three significant shortcomings. Firstly, they emphasize what one may, following Bacon, call ‘gross fallacies’—cases where it is obvious that something is wrong and the problem is (only) to determine what—rather than ‘subtle fallacies’ which superficially appear sound (for this distinction, see Aberdein, 2010, p. 273). Secondly, surveys of mathematical fallacy are seldom more than lists, perhaps subdivided by the field or difficulty of the mathematics involved (for example, Maxwell, 1959; Barbeau, 2000). One exception does offer a ‘classification of exercises in refuting false mathematical arguments’ (Bradis et al., 1959, pp. 6 ff.). Some of the entries, such as ‘Extension to exceptional cases’ correspond to aspects of reasoning discussed in this paper, but many of them are by definition always 3 erroneous. This leads to the third shortcoming of such surveys, that the emphasis is placed exclusively on irredeemably bad arguments, without exploring their interdependence with good (but not fully rigorous) reasoning. 1.4. Argumentation Schemes. The account proposed in this paper utilizes argumentation schemes, stereotypical patterns of reasoning. They have an ancient origin, but have attracted renewed interest in recent years as a device for the analysis and evaluation not just of fallacies but of non-deductive argumentation in general. Deductive inference rules may be thought of as special cases of argumentation schemes, but the novelty of the approach rests in the application of defeasible schemes to informal reasoning. The widest survey to date distinguishes more than 90 schemes (Walton et al., 2008, pp. 308 ff.). Many of these are specialized instances of a general scheme, Defeasible Modus Ponens: As a rule, if P , then Q. P. It is not the case that there is an exception to the rule that if P , then Q. Therefore, Q. (Walton et al., 2008, p. 366) The defeasible nature of the reasoning is captured by an additional device, critical questions, which itemize possible lines of response. For the use of a scheme to be evaluated positively, its proponent must make adequate replies to such of these questions as may be raised by a respondent. We may distinguish different degrees of success, that is degrees of confidence with which the conclusion should be accepted, by requiring more or less extensive answers. Although the argumentation scheme methodology was developed for non-mathematical contexts, it is applicable to mathematical reasoning in two ways: firstly, by using topic neutral schemes; and secondly, by developing specifically mathematical schemes. This paper pursues the first approach.3 The classification of argumentation schemes is an open problem, complicated, perhaps intractably, by the multiple dimensions of similarity between schemes. Walton proposes a threefold division between reasoning, source-based arguments, and the application of rules (Walton et al., 2008, pp. 348 f.). A more fine-grained version of this classification is employed below to highlight problematic aspects of mathematical argumentation. 3For more on the first approach, see (Aberdein, 2010; Dove, 2009); for the second approach, see (Pease and Aberdein, 2011). 4 2. Retroduction Much mathematical reasoning is sound deductive inference, secure from the anxieties that mathematical jokes reflect. But not all of it. One of the most ancient techniques of mathematical problem-solving is retroduction, or working backwards. In one of its aspects this was known to ancient mathematicians as analysis, from ‘ἀνάπαλιν λύσιν’ or ‘solution backwards’ (Heath, 1908, p. xlvii). (The term has, of course, since acquired other senses.) The steps which the analysis traces should be deductively sound, and once the process is complete the order of the steps may be reversed to yield a conventional deductive proof. This is familiar in the strategy of reducing an open problem to one that has already been solved. Of course, the target must be chosen carefully, as one of Angluin’s spurious proofs indicates: Proof by reduction to the wrong problem: ‘To see that infinite-dimensional coloured cycle stripping is decidable, we reduce it to the halting problem.’4 (Angluin, 1983, p. 17) Multi-step argumentation is characteristic of, but not unique to mathematical reasoning. It may be understood in terms of the following scheme: 2.1. Argument from Gradualism. Premise: Proposition A is true (acceptable to the respondent). Premise: There is an intervening sequence of propositions, B1 , B2 , . . . Bn−1 , Bn , C such that the following conditionals are true: If A then B1 ; if B1 then B2 ; . . . ; if Bn−1 then Bn ; if Bn then C. Premise: The conditional ‘If A then C’ is not, by itself, acceptable to the respondent, nor are shorter sequences from A to C (than the one specified in the second premise) acceptable to the respondent. Conclusion: Therefore, the proposition C is true (acceptable to the respondent). (Walton et al., 2008, p. 339) When a problem is to be reduced to a previously proved result, the task of proving C becomes that of finding a suitable theorem A, and intervening sequence of propositions, B1 , . . . , Bn . Walton does not state critical questions for this scheme. By comparison with similar schemes, we may work out what questions are required: 4The unsolvability of the halting problem is a famous result in computability theory. The quotation marks in this and all other spurious proofs are original. They may be understood as ascribing direct speech to the proponent. 5 (1) What evidence is there that proposition A is true (acceptable to the respondent)? (2) What is the evidence showing each of the conditionals to be true? (3) Are there independent reasons for rejecting that the proposition C is true (acceptable to the respondent). In Angluin’s example, A is undecidable, so the first critical question cannot be answered affirmatively. Not all backward reasoning in mathematics is deductive: Proof by importance: A large body of useful consequences all follow from the proposition in question. (Angluin, 1983, p. 17) This piece of reasoning is interestingly ambiguous. On the one hand, it might be perceived as consequentialist practical reasoning: 2.2. Argument from Positive Consequences. Premise: If A is brought about, then good consequences will plausibly occur. Conclusion: Therefore, A should be brought about. Critical Questions: (1) How strong is the likelihood that the cited consequences will (may, must) occur? (2) What evidence supports the claim that the cited consequences will (may, must) occur, and is it sufficient to support the strength of the claim adequately? (3) Are there other opposite consequences (bad as opposed to good, for example) that should be taken into account? (Walton et al., 2008, pp. 332 f.) Such reasoning is used to defend contested axioms or principles, such as the axiom of choice or the law of excluded middle, because as axioms they defy proof. ‘Proof by importance’ works by suggesting that an unproven conjecture must be true, because of its extensive coherence with other results. This is spurious as proof, but is the sort of hunch that, for example, was offered in support of the Taniyama-Shimura hypothesis prior to its confirmation by Andrew Wiles.5 5The hypothesis relates modular forms to elliptical curves, two seemingly remote categories of mathematical object. Its proof by Wiles was the decisive step in the proof of Fermat’s Last Theorem (see Singh, 1997, for a popular account). 6 Reasoning backwards from the consequences of a conjecture may also be understood as abductive reasoning. George Pólya discusses the importance of abduction for Euler in establishing his novel proof that P∞ n=1 1/n2 = π 2 /6 through representation of sin x as an infinite product: ‘Euler does not reexamine the grounds for his conjecture . . . he examines only its consequences. He regards the verification of any such consequence as an argument in favor of his conjecture . . . In scientific research as in ordinary life, we believe, or ought to believe, a conjecture more or less according as its observable consequences agree with the facts. In short, Euler seems to think the same way as reasonable people, scientists or non-scientists, usually think’ (Pólya, 1954, vol. 1, p. 22). Of course, despite the importance of this informal procedure for Euler, he never mistook it for proof. Indeed he kept worrying away at the problem until he found an independent proof in more conventional terms. Euler’s inference, and some cases of Angluin’s ‘proof by importance’, might thus be characterized by the positive form of the following abductive argumentation scheme: 2.3. Argument from Evidence to a Hypothesis. Premise: If A (a hypothesis) is true, then B (a proposition reporting an event) will be observed to be true. Premise: B has been observed to be true [false], in a given instance. Conclusion: Therefore, A is true [false]. Critical Questions: (1) Is it the case that if A is true, then B is true? (2) Has B been observed to be true (false)? (3) Could there be some reason why B is true, other than its being because of A being true? (Walton et al., 2008, pp. 331 f.) The falsification variant is modus tollens, and thus uncontroversial. From a strict falsificationist perspective, the best that can be said of any hypothesis is that it has not (yet) been falsified. The defeasible acceptance of a hypothesis on the grounds of non-falsification is another application of the positive variant of this scheme. For empirical science, this may come close enough to confirmation for practical cases. In mathematics the effect is less convincing: 7 Proof by accumulated evidence: Long and diligent search has not revealed a counterexample. (Angluin, 1983, p. 17) Of course this is no sort of ‘proof’. If the scope of the search is explicitly stated, a more limited result could be confirmed (see §6). But counterexamples revealed only by extraordinarily long and diligent searches are common enough to undercut any hope that such reasoning may hold in general. Because of such counterexamples, which have few empirical analogues, the third critical question can only be fully answered in mathematical cases by proving the hypothesis. 3. Citation Source-based arguments are a rich source of mathematical anxiety and humour. Yet comparatively little attention has been paid to the role of citation in mathematical practice (an important exception is Geist et al., 2010). The most numerous and diverse cases on Angluin’s list are subvarieties of deviant citation. Firstly, citations can reference things which do not exist, or which do not say what they are supposed to: Proof by wishful citation: The author cites the negation, converse, or generalization of a theorem from the literature to support his claim. Proof by ghost reference: Nothing even remotely resembling the cited theorem appears in the reference given. (Angluin, 1983, pp. 16 f.) More subtly, the citation may not adequately reflect the source’s level of confidence. Angluin’s juxtaposition of the next two examples illustrates an anxiety about the credibility arbitrage behind such references: Proof by eminent authority: ‘I saw Karp in the elevator and he said it was probably NP-complete.’ Proof by personal communication: ‘Eight-dimensional coloured cycle stripping is NP-complete [Karp, personal communication].’ (Angluin, 1983, p. 16) Other citations send the diligent on a wild goose chase which is practically (or even logically) impossible to complete: Proof by reference to inaccessible literature: The author cites a simple corollary of a theorem to be found in a privately circulated memoir of the Slovenian Philological Society, 1883. 8 Proof by mutual reference: In reference A, Theorem 5 is said to follow from Theorem 3 in reference B, which is shown to follow from Corollary 6.2 in reference C, which is an easy consequence of Theorem 5 in reference A. (Angluin, 1983, p. 17) Most transparently, the citation may refer to future activity which may never happen: Proof by forward reference: Reference is usually to a forthcoming paper by the author, which is often not as forthcoming as at first. (Angluin, 1983, p. 17) Proof by deferral: ‘We’ll prove this later in the course.’ (Renteln and Dundes, 2005, p. 28) All of these different cases may be understood as ineffective applications of the same argumentation scheme, that for Appeal to Expert Opinion. 3.1. Appeal to Expert Opinion. Major Premise: Source E is an expert in subject domain S containing proposition A. Minor Premise: E asserts that proposition A is true (false). Conclusion: A is true (false). Critical Questions: (1) Expertise Question: How credible is E as an expert source? (2) Field Question: Is E an expert in the field that A is in? (3) Opinion Question: What did E assert that implies A? (4) Trustworthiness Question: Is E personally reliable as a source? (5) Consistency Question: Is A consistent with what other experts assert? (6) Backup Evidence Question: Is E’s assertion based on evidence? (Walton et al., 2008, p. 310) Citation is indispensable in serious mathematical practice, but must be conducted with care. Walton’s list of critical questions, although designed primarily for a forensic context, provides a good starting point for an analysis of what comprises such care. Good citation requires a positive answer to all six questions. All of the above examples block a satisfactory answer to one or more of these questions. For most of them, the Opinion Question could not be fully answered. (Mutual reference is an exception, but it fails the Backup Evidence Question, as do several of the others.) 9 All of these cases exhibit defective argumentation by the prover, whose use of the scheme is exposed as unwarranted by his inability to answer the critical questions. But the prover must also have a respondent, the audience for his proof. Pseudo-proofs can involve the argumentational shortcomings of the respondent too. For example, the respondent may be too ready to defer to the proponent’s authority: Proof by vehement assertion: It is useful to have some kind of authority relation to the audience. (Angluin, 1983, p. 17) Proof by adverb: ‘As is quite clear, the elementary aforementioned statement is obviously valid.’ (Cherkaev and Cherkaev, nd) The latter case exemplifies the former; in both the prover leads his audience into employing Appeal to Expert Opinion, with him as the expert. In effect, he is appealing to his own authority. If the respondent lets him get away with it, she is neglecting her duty to challenge the assertions of the prover by asking him the appropriate critical questions. Something similar is happening in the next case: Proof by vigorous handwaving: Works well in a classroom or seminar setting. (Angluin, 1983, p. 16) In each of these cases the audience is browbeaten into acquiescence, something they share with another familiar scenario: Proof by intimidation: ‘Trivial’ (Angluin, 1983, p. 16) In cases where the result is not trivial, and perhaps not correct, this may be understood as inducing the respondent to employ a different scheme, for Argument from Danger. 3.2. Argument from Danger. Premise: If you (the respondent) bring about A, then B will occur. Premise: B is a danger to you. Conclusion: Therefore (on balance) you should not bring about A. (Walton et al., 2008, p. 334) The argument induced by the dogmatic declaration ‘Trivial!’ may be unpacked as an instance of this scheme: ‘If you ask for clarification, you will appear to be a fool. Appearing a fool is a danger to you. Therefore (on balance) you should not ask.’ (Those in no danger of appearing a fool can ask devastatingly simple questions 10 which might, from a humbler source, provoke mere condescension.) However, where the result seems trivial to the audience too, there can be no intimidation in drawing attention to its triviality; rather, this may be understood as based on a different source: (shared) intuition. 4. Intuition Like citation, intuition is ubiquitous, but problematic. As Reuben Hersh observes, in ‘mathematical practice, the intuitive is everywhere. . . . [It] carries a heavy load of mystery and ambiguity. Sometimes it’s a dangerous illegitimate substitute for rigorous proof. Sometimes it’s a flash of insight that tells the happy few what others learn with great effort’ (Hersh, 1997, p. 61). This ambiguity manifests in jokes: Proof by appeal to intuition: Cloud-shaped drawings frequently help here. (Angluin, 1983, p. 17) Proof by seduction: ‘Convince yourself that this is true!’ (Renteln and Dundes, 2005, p. 28) The second example may be found, almost verbatim, even in so august an author as G. H. Hardy: ‘The reader will convince himself of the truth of the following assertion’ (Hardy, 1967, p. 4). Intuition can be private, but if it is intended to convince others it must be shared; that is, it must be obvious. Hardy (crediting J. E. Littlewood) offers a partial resolution of the underlying ambiguity of intuitive obviousness: ‘One may mean “it is difficult to doubt the truth of the theorem,” . . . But sometimes we mean . . . “a moment’s reflection should not only convince the reader of the truth of what is stated, but should also suggest to him the general lines of a rigorous proof”’ (Hardy, 1967, p. 130).6 Thus intuitive obviousness is a knowledge claim resting on an equivocal basis. Both of Hardy’s senses may be characterized by the same scheme: 4.1. Argument from Position to Know. Major Premise: Source a is in position to know about things in a certain subject domain S containing proposition A. Minor Premise: a asserts that A is true (false). Conclusion: A is true (false). Critical Questions: 6Hersh also analyses the ambiguity of intuition, identifying six different senses (Hersh, 1997, pp. 61 f.). Although Hersh ranges wider, Hardy is more incisive: he subdivides the third of Hersh’s senses. 11 (1) Is a in position to know whether A is true (false)? (2) Is a an honest (trustworthy, reliable) source? (3) Did a assert that A is true (false)? (Walton et al., 2008, p. 309) In the second sense, the prover and (so he assumes) the respondent are in a position not only to know but to prove the ‘obvious’ result, and the critical questions may be easily answered in the affirmative. In the first sense, an adequate basis for the prover’s knowledge claim is more fugitive. It may be, as Hersh suggests, ‘reliable mathematical belief without the slightest dream of being formalized’ (Hersh, 1997, p. 61). Since he views this faculty as reliable, Hersh would still answer the critical questions affirmatively. Sceptics about such reliability would demur from one or both of the first two questions. 5. Meta-Argument A prominent feature of mathematics is the reification of arguments as objects of mathematical inquiry. However, arguments which bear upon the status of other arguments comprise a variety of source-based argumentation which occurs in non-mathematical contexts. Angluin’s list contains two cases which exploit this practice in different ways. Proof by metaproof: A method is given to construct the desired proof. The correctness of the method is proved by any of these techniques. (Angluin, 1983, p. 17) Here the source from which the argument at issue arises is itself an argument, the presumed virtues of which would indeed transfer to the object argument, but of course, only if the method is secure. Proof by funding: How could three different government agencies be wrong? (Angluin, 1983, p. 16) The existence of funding, like the eminence of the author’s institution, and other ‘esteem indicators’, might be taken as indirect evidence for the soundness of his results. While this evidence is pretty weak, similar evidence can be persuasive. One empirical study presented samples of undergraduates and research mathematicians with a heuristic argument for the presence of one million consecutive sevens in the decimal expansion of π. Correctly attributing its authorship to Field’s Medallist Tim Gowers significantly increased the positive appraisal of the argument in both groups (Inglis and Mejı́a-Ramos, 2009). As one research subject commented, ‘We are told the argument is made by a reputable mathematician, so we implicitly 12 assume that he would tell us if he knew of any evidence or convincing arguments to the contrary’ (Inglis and Mejı́a-Ramos, 2009, p. 42). This demonstrates the perceived relevance of the author’s ethos to the acceptance of his arguments, an application of the following scheme: 5.1. Ethotic Argument. Premise: If x is a person of good (bad) moral character, then what x says should be accepted as more plausible (rejected as less plausible). Premise: a is a person of good (bad) moral character. Conclusion: Therefore, what a says should be accepted as more plausible (rejected as less plausible). Critical Questions: (1) Is a a person of good (bad) moral character? (2) Is character relevant in the dialogue? (3) Is the weight of presumption claimed strongly enough warranted by the evidence given? (Walton et al., 2008, p. 336) Walton’s version is not quite suitable for our purposes: the ethos of a must encompass epistemic instead of (or as well as) moral virtue (as Walton explores elsewhere: cf. Walton et al., 2008, p. 359). It is not Tim Gowers’s blameless personal life which leads us to trust his arguments, but rather his demonstrably high standards as a working mathematician. Notoriously, academics can have deplorable ethical standards, while still producing work of genuine value. This may be because, unlike politicians, say, whose weak morals can prove more problematic, they back up their claims with independently verifiable argumentation. Does this obviate the need for consideration of their ethos, even restricted to its epistemic aspects? Only if the argument is fully worked out: if we are invited to accept aspects of the reasoning on trust, it is not unreasonable to ask whether the reasoner is trustworthy. Ethotic Argument is closely related to Appeal to Expert Opinion. Indeed, the research subject’s comments quoted above could also be construed as positive answers to some of the critical questions for the latter scheme. As the commenter is inferring the absence of counter-arguments from the absence of evidence for them, he might also be taken as employing a different form of source-based argument, an Argument from Ignorance. 13 6. Closure Arguments from Ignorance are typically classified as fallacious, but they can be convincing in epistemically closed domains, that is domains in which all true propositions are known.7 Mathematics, however, is not epistemically closed, so legitimate mathematical applications of the following scheme are scarce. 6.1. Argument from Ignorance. Major Premise: If A were true, then A would be known to be true. Minor Premise: It is not the case that A is known to be true. Conclusion: Therefore A is not true. Critical Questions: (1) How far along has the search for evidence progressed? (2) Which side has the burden of proof in the dialogue as a whole? In other words, what is the ultimate probandum [thing to be proved] and who is supposed to prove it? (3) How strong does the proof need to be in order for this party to be successful in fulfilling the burden? (Walton et al., 2008, p. 327) This scheme is an instance of modus tollens, so its plausibility just turns on the acceptability of its premises. The major premise states the epistemic closure principle, which can be analyzed as a conjunction of two conditionals: (i) If A were true, then A would be known to be true by someone; (ii) If A were known to be true by someone, then A would be known to be true (by me). In much everyday social knowledge this is unproblematic: (i) is trivially true and (ii) is a straightforward claim about the speaker’s position within an epistemic community. In mathematical cases, (i) is characteristically false: there are few unknown A of which it can be said that if A were true, then A would be known to be true by someone. A notable exception is the use of an exhaustive search, whether by hand or with computer assistance. If a computer is known to have calculated the logical closure of a set of axioms, or determined which of a wide range of cases have a particular property, then an unsuccessful search through the resultant data set is conclusive: the truth of the closure principle 7‘Epistemic closure’ is also used in another sense, to specify that knowledge of a proposition implies knowledge of everything that proposition entails (or is known to entail). 14 permits the deduction of the conclusion. Even when the closure principle is not known with certainty, it may often be practically assumed. As one science writer remarks of Mathematica, ‘You can enter a function that you want to integrate, and out comes the correct expression. And if it does not, you may be reasonably sure that a solution to your question simply does not exist’ (Szpiro, 2003, p. 198). However, (ii) is often plausible in mathematics. Experts may confidently assert of many propositions in their field, that if a proof (or counterexample) were known, it would be known by them. Even non-experts can be confident about some propositions. A counter-example to Goldbach’s conjecture would be headline news. So I may safely infer from my ignorance that no such number has been found, but not of course that it does not exist. Hence the modus tollens stops at (ii), and the safe conclusion is not that A is not true, but that nobody knows whether A is true. Angluin offers an interesting variant of this argument: Proof by cosmology: The negation of the proposition is unimaginable or meaningless. Popular for proofs of the existence of God. (Angluin, 1983, p. 17) This ‘proof’ employs a weaker closure principle: ‘If ¬A were true, then ¬A would be imaginable.’ The absence of an objective measure of imaginability makes this principle implausible. Arguments from Ignorance can also be induced in the respondent: Proof by cumbersome notation: Best done with access to at least four alphabets and special symbols. Proof by exhaustion: An issue or two of a journal devoted to your proof is useful. Proof by obfuscation: A long plotless sequence of true and/or meaningless syntactically related statements. (Angluin, 1983, p. 16) These three cases encourage the respondent to pass the buck, rather than tackle a purported proof of conspicuous repugnance. The Argument from Ignorance would run ‘If there was something wrong with this proof, someone else would have noticed. Nobody seems to have noticed anything. So, it must be OK.’ 15 7. Generalization Much mathematical practice concerns the correct application of established rules. A notable example is the move from particular to general results. However, generalization lends itself to abuse when applied carelessly: Proof by example: The author gives only the case n = 2 and suggests that it contains most of the ideas of the general proof. Proof by omission: ‘The reader may easily supply the details.’ ‘The other 253 cases are analogous.’ ‘. . . ’ (Angluin, 1983, p. 16) These patterns of reasoning occur in both historical and contemporary mathematics. John Wallis, for instance, routinely argued from specific examples to the conclusion that a procedure may be applied indefinitely. He ‘called [this] “modus inductionis”: later it was termed “incomplete induction.”’ (Pedersen, 1980, p. 38). Standards of rigour have changed profoundly since Wallis’s day. But instances of proofs by example or omission still arise in ostensibly rigorous contexts, although often not without reproach. Take the dispute over Wu-Yi Hsiang’s alleged proof of Kepler’s conjecture, that the maximum density of a packing of congruent √ spheres in three dimensions is π/ 18. In his review, Gábor Fejes Tóth complains that ‘we are given arguments such as “the most critical case is. . . ” followed by a statement that “the same method will imply the general case.” The problem with arguments of this kind is not only that they require the reader to redo some pages of calculations, but, notoriously, that they occur at places where we expect difficulties and most frequently it is impossible to see how the same method works in the general case’ (Fejes Tóth, 1995). Both cases exemplify the following scheme. 7.1. Argument from Example. Premise: In this particular case, the individual a has property F and also property G. Conclusion: Therefore, generally, if x has property F , then it also has property G. Critical Questions: (1) Is the proposition claimed in the premise in fact true? 16 (2) Does the example cited support the generalization it is supposed to be an instance of? (3) Is the example typical of the kinds of cases the generalization covers? (4) How strong is the generalization? (5) Do special circumstances of the example impair its generalizability? (Walton et al., 2008, p. 314) This scheme underpins incomplete or enumerative induction. This is, of course, invalid, but still has a place in informal mathematical reasoning, for example in hypothesis formation: a is F and also G; I can’t think of anything which is F and not G; Are all F s G? The concern with enumerative induction is that it may be mistaken for proof. Some empirical research is worrisome: one study concluded that over 50% of trainee elementary mathematics teachers considered arguments based on specific instances to be mathematical proofs (Martin and Harel, 1989; but see Weber, 2010, for more reassuring results). But enumerative induction is not the only application of this scheme. While Wallis might be described as employing induction, Hsiang is attempting something else, but is still employing Argument from Example. His generalizing moves are contentious because his examples are said to provide insufficient support, and to be poorly chosen, that is to fail the second and third critical questions, not because generalization is in principle non-rigorous. Where the examples are genuinely typical, and the method transparent, application of this scheme is consistent with mathematical rigour. A special case of generalization arises with diagrammatic proof. Angluin’s list contains one example that articulates the conservative attitude: Proof by picture: A more convincing form of proof by example. Combines well with proof by omission. (Angluin, 1983, p. 17) Defenders of the legitimacy of diagrammatic proof acknowledge the centrality of the ‘Generalization Problem’: ‘After we have proved a theorem using a diagram, how can we legitimately generalize the configuration of this diagram to a wide (usually infinite) class of configurations, and to what class exactly?’ (Kulpa, 2009, p. 81). However, they point out that similar issues arise in non-diagrammatic inference, and may be resolved in a similar fashion in the diagrammatic case, by the painstaking assurance that the particular case really does generalize, that is by answering the critical questions of the scheme for Argument from Example. 17 8. Definition A less contentious application of a rule is the employment of a definition. Well-chosen definitions can be pivotal to mathematical progress. At a superficial level, this choice can appear arbitrary: mathematicians have more flexibility in the concepts they take as primitive than scientists in any other domain. On the other hand, appeals to the naturalness or otherwise of definitions are frequent, and a change of definitions can revolutionize the understanding of a hitherto intractable problem. Of course, this strategy can be abused. Proof by semantic shift: Some of the standard but inconvenient definitions are changed for the statement of the result. (Angluin, 1983, p. 17) Instead of defining the same problem with different terms, a different (typically much easier) problem is described by the same terms. Such sharp practice is a straightforward misapplication of the scheme for Argument from Definition. 8.1. Argument from Definition to Verbal Classification. Individual Premise: a fits definition D. Classification Premise: For all x, if x has property D, then x can be classified as having property G. Conclusion: a has property G. Critical Questions: (1) What evidence is there that D is an adequate definition, in light of other possible alternative definitions that might exclude a’s having G? (2) Is the verbal classification in the classification premise based merely on a stipulative or biased definition that is subject to doubt? (Walton et al., 2008, p. 319) If the non-standard definition D is poorly motivated, the example will fail to answer the first question; if the non-standard definition is smuggled past a classification premise only known to hold for the standard definition, the example would (also) fail to answer the second question. 18 9. Conclusion & Future Work We have seen how a piece of mathematical folk humour draws attention to aspects of informal mathematical practice where the argumentation carries the greatest risk. We have surveyed the borderline between admissible and inadmissible instances of such risky argumentation by means of a diverse selection of argumentation schemes originally developed for non-mathematical applications (summarized in Fig. 1). This is further corroboration that ‘mathematical reasoning is already in accord with principles and techniques from informal logic—even if this is unnoticed by the practitioners’ (Dove, 2009, p. 150). • Reasoning – Retroduction ∗ Argument from Gradualism ∗ Argument from Positive Consequences ∗ Argument from Evidence to a Hypothesis • Source Based – Citation ∗ Appeal to Expert Opinion ∗ Argument from Danger – Intuition ∗ Argument from Position to Know – Meta-Argument ∗ Ethotic Argument – Closure ∗ Argument from Ignorance • Rule Based – Generalization ∗ Argument from Example – Definition ∗ Argument from Definition to Verbal Classification Figure 1. Summary of reasoning types and schemes discussed The application of schemes to mathematical argumentation is still in its infancy, but the comparative maturity of the methodology in other domains offers inviting prospects for future research. In recent years, argumentation schemes have attracted much attention from researchers in artificial intelligence. Schemes have been implemented within systems of defeasible argumentation, and as strategies in dialogue games (Walton et al., 2008, pp. 370; 383). Progress has also been made with automated argument mining tools using schemes to find arguments, primarily in legal texts (Walton, 2011, p. 57). This approach should also lend itself to informal mathematical argumentation. 19 A more pessimistic consequence for the cognitive science of mathematics follows if we take seriously the suggestion that humour plays an integral role in informal reasoning. This paper has argued that informal mathematical reasoning is continuous with informal reasoning in other domains, and thereby that humour plays the same role in both. This suggests that inventive mathematics requires a good sense of humour.8 Some cognitive scientists argue that humour is ‘AI-complete’: that it cannot be simulated by an artificial intelligence less sophisticated than that required to simulate general human cognition (Hurley et al., 2011, p. 298). If both claims are correct, then creative mathematics, at least the way humans do it, is also AI-complete. Of course, this would not rule out the possibility of an AI-incomplete creative artificial mathematician that reasoned by methods fundamentally different from those used by humans, but if anything that may pose an even harder design challenge. Acknowledgements I would like to thank Matthew Inglis and Simon Colton for their detailed comments on an earlier draft of this paper. Even earlier versions were delivered at conferences in Leicester, UK and Treasure Island, FL. I am grateful to the organizers and to the audiences at both conferences. References Aberdein, A. (2010). Observations on sick mathematics. In Van Kerkhove, B., Van Bendegem, J. P., and De Vuyst, J., edd., Philosophical Perspectives on Mathematical Practice, pp. 269–300. College Publications, London. Angluin, D. (1983). How to prove it. ACM SIGACT News, 15:16–17. Barbeau, E. J. (2000). Mathematical Fallacies, Flaws, and Flimflam. Mathematical Association of America, Washington, DC. Bradis, V. M., Minkovskii, V. L., and Kharcheva, A. K. (1999 [1959]). Lapses in Mathematical Reasoning. Dover, Mineola, NY. Cherkaev, A. and Cherkaev, E. (n.d.). Mathematical humor. http://www.math.utah.edu/~cherk/ mathjokes.html. 8More precisely, a sense of humour of a characteristic type, something some mathematicians claim to have observed (Paulos, 1991, p. 112). 20 Dove, I. J. (2009). Towards a theory of mathematical argument. Foundations of Science, 14:137–152. Fejes Tóth, G. (1995). Review of Hsiang, Wu-Yi, ‘On the sphere packing problem and the proof of Kepler’s conjecture.’ Internat. J. Math. 4 (1993), no. 5, 739–831. Mathematical Reviews, 95g(52032). Fine, G. A. and De Soucey, M. (2005). Joking cultures: Humor themes as social regulation in group life. Humor, 18:1–22. Geist, C., Löwe, B., and Van Kerkhove, B. (2010). Peer review and knowledge by testimony in mathematics. In Löwe, B. and Müller, T., edd., PhiMSAMP, pp. 155–178. College Publications, London. Hamblin, C. L. (1970). Fallacies. Methuen, London. Hardy, G. H. (1967). A Course of Pure Mathematics. Cambridge University Press, Cambridge, tenth edition. Heath, T. L. (2006 [1908]). The Thirteen Books of Euclid’s Elements. Barnes & Noble, New York, NY. Hersh, R. (1997). What is Mathematics, Really? Jonathan Cape, London. Hurley, M. M., Dennett, D. C., and Adams, R. B. (2011). Inside Jokes. MIT Press, Cambridge, MA. Inglis, M. and Mejı́a-Ramos, J. P. (2009). The effect of authority on the persuasiveness of mathematical arguments. Cognition and Instruction, 27:25–50. Krabbe, E. C. W. (2008). Strategic maneuvering in mathematical proofs. Argumentation, 22:453–468. Kulpa, Z. (2009). Main problems of diagrammatic reasoning. Part I: The generalization problem. Foundations of Science, 14:75–96. Martin, R. A. (2007). The Psychology of Humor. Elsevier Academic Press, Burlington, MA. Martin, W. G. and Harel, G. (1989). Proof frames of preservice elementary teachers. Journal for Research in Mathematics Education, 20:41–51. Maxwell, E. A. (1959). Fallacies in Mathematics. Cambridge University Press, Cambridge. Minsky, M. (1983). Jokes and their relation to the cognitive unconscious. In Groner, R., Groner, M., and Bischof, W. F., edd., Methods of Heuristics, pp. 171–193. Lawrence Erlbaum Associates, Hillsdale, NJ. Paulos, J. A. (1991). Beyond Numeracy. Knopf, New York, NY. Pease, A. and Aberdein, A. (2011). Five theories of reasoning: Interconnections and applications to mathematics. Logic and Logical Philosophy, 20:7–57. 21 Pedersen, K. M. (2000 [1980]). Techniques of the calculus, 1630–1660. In Grattan-Guinness, I., ed., From the Calculus to Set Theory, 1630–1910, pp. 10–48. Princeton University Press, Princeton, NJ. Pólya, G. (1954). Mathematics and Plausible Reasoning. Two Volumes. Princeton University Press, Princeton, N.J. Renteln, P. and Dundes, A. (2005). Foolproof: A sampling of mathematical folk humor. Notices of the AMS, 52:24–34. Ritchie, G. (2004). The Linguistic Analysis of Jokes. Routledge, London. Singh, S. (1997). Fermat’s Last Theorem. Fourth Estate, London. Szpiro, G. G. (2003). Kepler’s Conjecture. Wiley, Hoboken, NJ. Walton, D. N. (2011). Argument mining by applying argumentation schemes. Studies in Logic, 4:38–64. Walton, D., Reed, C., and Macagno, F. (2008). Argumentation Schemes. Cambridge University Press, Cambridge. Weber, K. (2010). Mathematics majors’ perceptions of conviction, validity, and proof. Mathematical Thinking and Learning, 12:306–336. 22

Log In

Mathematical wit and mathematical cognition

Related papers

Related papers

Related topics