Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

Calculations and Canonical Elements

Calculations and Canonical Elements Ian Stewart and David Tall University of Warwick Part 1 1. Introduction Equivalence relations are the basis of modern approaches to many topics in school mathematics, from the first ideas of cardinal number (through matching activities and correspondence between sets) through definitions of negative numbers (using ordered pairs of natural numbers), to equivalence of fractions, modular arithmetic, vectors, and many more advanced topics. We contend that these approaches to the subject have been based on an inadequate theoretical framework, causing an unnecessary schism between traditional mathematics and “modern” approaches. The missing link is the concept of a canonical element. Reintroducing this idea gives a much more coherent relationship between the structural elegance of equivalence relations in modern mathematics and the traditional aspect of computation. We tackle this in Part 1 of this paper, which follows. This in turn gives a clearer insight, as we shall see in Part 2, into certain technical and educational problems. 2. Equivalence Relations We shall assume that the reader is familiar with the notion of an equivalence relation ~ on a set S. Being a relation means that for each ordered pair of elements a, b ∈ S, we either have a ~ b (a, b are related), or a ~/ b (a, b are not related). An equivalence relation satisfies the further properties: (E1) a ~ a for every a ∈ S (E2) if a ~ b, then b ~ a (E3) if a ~ b, b ~ c, then a ~ c. This partitions the set S into equivalence classes, where for any x ∈ S we denote the equivalence class containing x by Ex = {y ∈ S | x ~ y}. We find that Ex = Ey if and only if x ~ y , and if x ~/ y then Ex∩Ey is empty, so Ex, Ey are disjoint. An alternative notation for Ex which we shall often use is [x], so [x] = {y ∈ S | x ~ y}. Further discussion of equivalence relations may be found in [9] or [11, p. 74 ff.]. Here we content ourselves with some typical examples met in teaching, which we shall explore further as this article develops. Example 1: Cardinal Number Two sets are equivalent if there is a bijection (or one-to-one correspondence) between them. Some modern syllabuses in primary schools, inspired by Piaget (for instance [1], [5], [7]), begin by laying emphasis on matching activities as a preliminary basis for cardinal number. In other quarters (for example [6, p. 192]), this approach is vigorously rejected. We shall consider this conflict later on. Example 2: Egyptian Number Symbols (An interesting historical example which could be used with advantage in primary schools.) The ancient Egyptians used stroke | to denote a unit, a hoop ∩ to denote 10 and a scroll to denote 100, with other symbols for higher powers of I0. Two collections of such symbols are equivalent if they represent the same number, for instance 12 strokes | | | | | | | | | | | | are equivalent to one hoop and two strokes ∩ | |. Example 3: Integers (Positive and Negative) using Ordered Pairs of Natural Numbers Let N denote the set of natural numbers 0, 1, 2, 3, . . . If wish to “construct” the integers Z (including negative integers) then one method is to consider ordered pairs (m, n) which m, n ∈ N and to define an equivalence relation. (m1, n1) ~ (m2, n2) if and only if m1 + n2= m2 + n1 Thus, for example, (1, 0) ~ (2, 1) ~ (3, 2) ~ … ~ (n+1, n) ~ … , (2, 0) ~ (3, 1) ~ (4, 2) ~ … ~ (n + 2, n) ~ …. This cunning device yields a “definition” of the integers (positive and negative) as the equivalence classes, namely + 1 = {(1, 0), (2, 1), . . . (n+1, n), …}, + 2 = {(2, 0), (3, 1), . . . (n+2, n), …}, … also 0 = {(0, 0), (1, 1), …, (n, n), . . .} – 1 = {(0, 1), (1, 2), …, (n, n+1), . . .}} – 2= {(0, 2), (1, 3), …, (n, n+2), . . .} In essence the integer +k or –k denotes the amount by which the first element in an ordered pair exceeds the second, as can be seen by looking at the above instances. This method (which we do not advocate for teaching purposes) has been called (rather affectedly) the “Doncaster method” in [1], though it is based on a well-known mathematical construct and has been used before by other mathematical educationists in other forms (such as [3] or the Nuffield Primary Mathematics Project). Example 4: Fractions Two fractions p/q, m/n (where p q, m, n are non-zero natural numbers) are equivalent (and we write simply p/q = m/n), if pn = mq. (This can also be treated using ordered pairs in a manner analogous to example 3 by writing (p, q) ~ (m, n) if pn = mq, but we won’t dwell on that here. For details, see [11, Chapter 10], or [1].) Example 5: Modular (or “Clock”) Arithmetic We shall work modulo 3 for simplicity (the same phenomenon, of course, works with 3 replaced by any other positive integer). Two integers are “equivalent modulo 3” if their difference is divisible by 3. For instance, 1 is equivalent to 4, 2 is equivalent to 368, –1 is equivalent to 2. The equivalent classes are: E0 = {…, –6, –3, 0, 3, 6, 9, …} = E3 = E6 = … , E1 = {…, –5, –2, 1, 4, 7, 10, …} = E4 = E7 = … , E2 = {…, –4, –1, 2, 5, 8, 11, …} = E5 = E8 = … . of course we also have E0=E–3 = E–6 =… , and so on. These examples will be sufficient for our purposes, but many more occur in mathematics, see, for instance [9]. We shall consider other examples in the second article. 3. Canonical Elements Wherever there is an equivalence relation, we can select canonical elements. Quite simply, we do this by choosing a single element from each class. Then that element is called the canonical element (or representative) for that class. As an example we can take the set of children in a classroom and use the equivalence relation “is the same sex as”, which divides the children into two equivalence classes: the boys and the girls. (For this example we shall assume the children are mixed to prevent the discussion being trivial.) A canonical choice of elements is then one boy (a particular one, say, Joe), and one girl (say, Ann). It doesn’t matter which boy or girl we select as long as we have in mind precisely one of each. We don’t have to pick the best, or the biggest, or “the most typical”, as long as we have just one from each equivalence class. There is no coloration in the meaning of a “canonical element” beyond that (arbitrary) choice. In the mathematical examples we considered in the last section, however, there is in each case a “natural” choice of canonical element which is usually made by mathematicians. In fact these canonical elements were the stuff of mathematics before modern set theory was a twinkle in Cantor’s eye. We shall see that it is the manipulation of these canonical elements that constitutes the traditional art of calculation in mathematics. Example 1: Cardinal Number In school we are concerned only with finite sets. Counting a finite set of elements is performed by pointing to each element in turn and saying “one”, “two”, “three”, … until we have pointed at each elements precisely once. The last number that we recite is the number of elements in the set. In practice this means that a set with (say) four elements is put into one-one correspondence with the set (one, two, three, four). All sets equivalent to the given set can also be put into one-one correspondence with (one, two, three, four), so the latter is a natural choice of canonical element. Of course, we don’t have to wrap it up in set theoretic rigmarole. We shall return to this point in Section 6. Example 2: Egyptian Number Symbols The canonical choice (for numbers less than 1,000) is the representation which has less than 10 of each symbol. This corresponds to our decimal notation in the obvious way, for instance in the equivalence class containing | | | | | | | | | | | | (12 strokes), the canonical element is one hoop and two strokes, ∩ | |. Example 3: Integers using ordered pairs of natural numbers. If m ≥ n, a sensible choice of canonical element in E(m,n) (the equivalence class containing (m, n)) is (m–n, 0). For m < n we choose (0, n–m). In this way every ordered pair is equivalent to a canonical element of the form (k, 0) for k ≥ 0, or (0, k) for k<0. Of course this makes (k, 0) the canonical choice in the class +k and (0, k) the choice for –k. As we shall see later, there isn’t much in a name. Example 4: Fractions The natural choice of canonical element for equivalent fractions is “the one in lowest terms”. Example 5: Modular Arithmetic Working modulo 3, the natural choice of canonical elements in the classes E0, E1, E2 are 0, 1, 2 respectively. 4. Calculations All the examples we have mentioned above were specifically selected because we do calculations with them. In all cases we have an operation of addition and also multiplication. For the moment, let us concentrate on the former. The primitive addition of cardinal numbers (say three plus five) may be achieved by choosing a set of three objects (in one-one correspondence with lone, two, three)) then a disjoint set of five objects, putting the two sets together and counting them: “one, two, …, eight” to find “three plus five is eight”. There is a subtlety here in the choice of disjoint sets which is not found in adding two Egyptian numbers, where one simply takes the two collections of symbols all together. We can symbolise this using a (we hope) forgivable mixture of ancient and modem notation by writing 26+15 as: ∩∩| | | | | | + ∩| | | | | = ∩∩| | | | | | ∩| | | | | This combination of symbols is not in canonical form, but we can complete the calculation by replacing the answer by the canonical element in its class (effected by replacing ten strokes by an equivalent hoop), thus: ∩∩| | | | | | ∩| | | | | = ∩∩∩| | | | | | | | | | | = ∩ ∩ ∩ ∩ |. This, in essence, is the procedure that first school children use when they perform addition on an abacus (see, for example, [2, book 1, p. 9]). Here “twenty-six plus fifteen” is represented as: tens units oo oooooo o ooooo which becomes, on exchanging 10 units for one ten: tens units oo o oo which is 41. In example 3, addition of ordered pairs of natural numbers may be defined by adding the components: (m, n)+(p, q)=(m+p, n+q). This also allows us to add equivalence classes, for instance, to add +2 and –3, we select any elements we wish from the appropriate equivalence classes, say (2, 0) and (0, 3) to get (2, 0) + (0, 3) = (2, 3). Now (2, 3) is equivalent to (0, 1) and both are in the equivalence class –1, hence + 2 + –3 = –1. In example 4, when adding fractions m/n and plq, we put them over a “common denominator”, say nq, to get: m/n + p/q = (mq + np)/nq. We then proceed to reduce the answer to lowest terms. In practice we may develop more refined techniques; we do not need to use the common denominator nq because the lowest common multiple of n and q will do, thus: 1/2 + 1/6 = 3/6 + 1/6 = 4/6 = 2/3 (in lowest terms). Finally, example 5, modular arithmetic, involves similar principles. For instance, to calculate 2 + 2 (modulo 3) we first compute 2 + 2 = 4, then find the canonical element equivalent to 4, namely 1, so 2 + 2 = 1 (modulo 3) Some mathematicians prefer to write this as 2 + 2 ~ 1 (modulo 3), yet others prefer to use equivalence class notation: E2 + E2 = E2+2 = E4, and, of course, E4 = E1 so we get E2 + E2 = E1. As in this last case, all the examples can be interpreted using either equivalence classes or canonical elements. The modern approach favours equivalence classes, the traditional computational one favours canonical elements. In each case we have a set S on which an operation of addition is defined. (It could be any other binary operation, such as multiplication, but for simplicity of explanation we shall use addition.) Also S has on it an equivalence relation. The case of addition of integers + 2 + –3 = –1 is typical, in that we can extend the addition of elements (in this case addition of ordered pairs (m, n)) to the equivalence classes themselves. The method is this: To add equivalence classes Ex, Ey form the equivalence class Ex+y containing x+y, and this is the answer. Thus + 2 + –3 = E(2,0) + E(0,3) = E(2,3) = –1 This is the modern formulation using only equivalence classes. A traditional formulation, using only elements, follows the pattern of adding fractions: 1/2 + 1/6 = 3/6 + 1/6 = 4/6 = 2/3. The link between these two types of formulation is the notion of a canonical element, as we see in Figure 1. Figure 1 The upward vertical arrows correspond to finding the equivalence class to which an element belongs (usually easy), the horizontal arrow is the operation of addition on classes, the downward arrow is picking out the canonical element in a given class (usually harder and more algorithmically contrived). The modern formulation usually involves only the top of this diagram (see Fig. 2). Figure 2 It is just the definition Ex + Ey = Ex+y . The traditional formulation usually avoids all reference to equivalence classes and works on the element level, as in Figure 3. Figure 3 In this diagram we start with elements x, y, then operate to get x + y, then find the equivalent canonical element k. The difference between Figures 2 and 3 represents the schism between modern and traditional approaches. The unification in a single framework is given by Figure 1. It is in analysing the total structure that we shall obtain a blend of the old and new. But before we do this we must digress for a while to consider an important technicality. 5. A Technical Problem In carrying over a binary operation from elements to equivalence classes there is a technical problem which proves difficult for beginners to grasp yet which forms an essential ingredient in the whole picture. In defining the sum of Ex and Ey as Ex+y, we have skated over the problem of the name of the equivalence class Ex+y. For instance if x′ is another element equivalent to x, then Ex′ and Ex are one and the same. The symbols Ex′ and Ex, are just different names for the same equivalence class. Suppose that we calculate Ex′ + Ey instead of Ex + Ey, then we would get the answer: Ex′ + Ey = Ex′+y. The central problem is: can we be sure that the equivalence class Ex′+y is the same as Ex+y? Worse still, suppose we compound the problem by considering another y´ such that y ~ y´ (in which case Ey = E y′). Would we find that Ex′ + Ey′ = Ex′+y′. is the same equivalence class as Ex+y? If we can’t, then the whole process of defining the sum of Ex and Ey breaks down because we may get different answers Ex+y and Ex′+y′ depending on which elements we select from the equivalence classes to perform the computations. For the sum to carry over from elements to equivalence classes we require the condition if x ~ x´ and y ~ y´, then we must also have x+y ~ x´+y´ As an example, consider addition modulo 3. Here, by computation, we have: E1 + E2 = E3, but E1 = E4, E2 = E8, and E4 + E8 = E12. Fortunately, we have E12 = E3, (because 12 – 3 is divisible by 3), so we encounter no problem in this case. In fact we encounter no problem in any of the examples given, because in all cases, x ~ x´, y ~ y´, implies x+y ~ x´+y´ as the reader may verify. But what of a general operation ° on a set S with equivalence relation ~ ? We might wish to carry over the operation o to equivalence classes by defining Ex ° Ey = Ex°y. To do this requires the general property x ~ x´, y ~ y´ implies x°y ~ x´°y´. If this does not hold, the whole thing breaks down. As an example (taken from [11, p. 77]), consider the operation on integers x °y = xy. Denoting the equivalence class of n modulo 3 by [n], we might attempt to define the taking of powers for numbers modulo 3 by: [x]°[y]= [x°y] or, in other words, [x][y] = [x y]. This does not work. For instance, if x = 2, y = 2, we get [2][2] = [2 2] = [4] = [1]. But [2] = [5] and [2][5] = [2 5] = [32] = [2]. The classes [1] and [2] are different. This is because x ~ x´, y ~ y´ does not imply x y = ( x ! )y ! in general. As a counter example we have 2~2, 2~5, but 2 2 ~/ 2 5 . This warns us that, in general, just blindly pressing on without checking that x ~ x´, y ~ y´ implies x°y ~ x´°y´. (*) can sometimes lead to nonsense in handling computations with equivalence relations. The astute reader may notice that we do not get any such problems in defining operations on the corresponding canonical elements, at least as far as the definition itself is concerned. We can always define an operation on canonical elements alone by starting with canonical elements x, y, then forming the composite x°y; this of course need not be a canonical element, but it is equivalent to a unique canonical element k. We therefore define a new operation on canonical elements associated with ° by defining x, y to be the unique canonical element k which is equivalent to x°y. For instance, if x°y = xy, we can define a new operation on the canonical elements 0, 1, 2 (modulo 3) in this fashion. To compute 22 (where 2 is now thought of as a canonical element), we first compute 22 (as an integer) to get 4, then take the canonical element equivalent to 4 modulo 3, namely 1. In this way we are able to compute all powers xy where x, y run through the values 0, 1, 2 (considered as canonical elements modulo 3). Although we have been able to make such a definition, we haven’t really gained anything, because the simple rules of powers break down. For example we can compute 21 = 2, 22 = 1 (as canonical elements), but (22)2 = 12 = 1 22×2 = 21 = 2 (22)2 ≠ 22×2. Hence in general we may have (xm)n ≠ xmn when computing with canonical elements modulo 3. It is all a matter of swings and roundabouts: the difficulty simply pops up somewhere else. Returning to the case where the fundamental property (*) does hold, we find genuine differences between the equivalence class approach and that using only canonical elements. If the operation ° on the elements has basic algebraic properties, for instance, associativity or commutativity, then these can be seen to carry over easily to the induced operation on equivalence classes. For instance, if we suppose that x°y = y°x then we find Ex ° Ey =Ex°y (by definition) =Ey°x (since x°y = y°x) = Ey ° Ex, (by definition again). Thus a commutative operation on elements induces a commutative operation on equivalence classes; the analogous statement for associativity follows just as easily. However, at the canonical element level there is more difficulty with the associativity proof. We compute (x°y)°z, first by finding x°y equivalent to a canonical element k, then computing k°z and finding the canonical element equivalent to this. On the other hand, in finding x°(y°z), we first find the canonical element c which is equivalent to y°z, then, when we compute the canonical element equivalent to x°c, it is by no means clear we end up with the same element as before. In fact to show this is so we have to appeal to the full force of condition (*). We know x°y ~ k, y°z ~ c hence, using (*), we get k°z ~ (x°y)°z = x°(y°z) ~ x°c. Because ~ is an equivalence relation, we deduce k°z ~ x°c, and so the two elements k°z, x°c are equivalent to the same canonical element. Thus the proof of associativity for the product of canonical elements has to use (*) for elements such as x°y, y°z, which may not be canonical, and we are forced to see the canonical elements within a broader framework. This demonstrates that traditional computations with canonical elements are better viewed within the context of equivalence relations. On the other hand, mathematicians must be able to compute, so they need in the end to be able to handle the manipulation of canonical elements. Hence the best solution is to gain a global view of the whole picture, placing canonical elements within the framework of the theory of equivalence classes. This we shall do in Part 2 of this article, with special reference to mathematical concepts taught in school. References 1. 2. 3. 4. 5. 6. 7. Auckland, K. et al. (1977). Primary Mathematics, Globe Education. Cordin, P. W. (1970). Number in Mathematics, Macmillan. Dienes, Z. P. (1960). Building Up Mathematics, Hutchinson. Donaldson, M. (1978). Children’s Minds, Fontana. Fletcher, H. (1970). Mathematics for Schools, Addison Wesley. Freudenthal, H. (1973). Mathematics as an Educational Task, Reidel, Holland. Nuffield Foundation (1970). Mathematics: the first 3 years, W. and R. Chambers, John Murray. 8. Piaget, J. (1952). The Child’s Conception of Number, Routledge and Kegan Paul. 9. Quadling, D. (1969). The Same but Different, George Bell. 10. Schwarzenberger, R. L. E. and Tall, D. O. (1978). “Conflicts in the Learning of Real Numbers & Limits”, Mathematics Teaching, No. 82. 11. Stewart, I. N. and Tall, D. O. (1977). Foundations of Mathematics, OUP. 12. Tall, D. O. (1977). “A Long Term Learning Schema for Calculus & Analysis”, Mathematical Education for Teaching, Vol. 2, No. 2. 13. Tall, D. O. Historical, Logical and individual development of mathematical ideas and the relevance in teaching. (In preparation.)