Equality (mathematics)

Basic notion of sameness in mathematics From Wikipedia, the free encyclopedia

Equality (mathematics)

In mathematics, equality is a relationship between two quantities or expressions, stating that they have the same value, or represent the same mathematical object.[1][2] Equality between A and B is written A = B, and pronounced "A equals B". In this equality, A and B are distinguished by calling them left-hand side (LHS), and right-hand side (RHS).[3] Two objects that are not equal are said to be distinct.[4]

Thumb
The equals sign, used to represent equality symbolically in an equation

Equality is often considered a kind of primitive notion, meaning, it is not formally defined, but rather informally said to be "a relation each thing bears to itself and nothing else". This characterization is notably circular ("nothing else"). This makes equality a somewhat slippery idea to pin down. Basic properties about equality like reflexivity, symmetry, and transitivity have been understood intuitively since at least the ancient Greeks, but were not symbolically stated as general properties of relations until the late 19th century by Giuseppe Peano. Other properties like substitution and function application weren't formally stated until the development of symbolic logic.

There are generally two ways that equality is formalized in mathematics: through logic or through set theory. In logic, equality is a primitive predicate (a statement that may have free variables) with the reflexive property (called the law of identity), and the substitution property. From those, one can derive the rest of the properties usually needed for equality. After the foundational crisis in mathematics at the turn of the 20th century, set theory (specifically Zermelo–Fraenkel set theory) became the most common foundation of mathematics. In set theory, any two sets are defined to be equal if they have all the same members. This is called the axiom of extensionality.

Etymology

Summarize
Perspective
Thumb
The first use of an equals sign, equivalent to in modern notation. From The Whetstone of Witte (1557) by Robert Recorde.
Thumb
Recorde's introduction of =. "And to avoid the tedious repetition of these words: 'is equal to' I will set as I do often in work use, a pair of parallels, or twin lines of one [the same] length, thus: ==, because no 2 things can be more equal."[5]

In English, the word equal is derived from the Latin aequālis ('like', 'comparable', 'similar'), which itself stems from aequus ('level', 'just').[6] The word entered Middle English around the 14th century, borrowed from Old French equalité (modern égalité).[7] More generally, the interlingual synonyms of equal have been used more broadly throughout history (see § Geometry).

Before the 16th century, there was no common symbol for equality, and equality was usually expressed with a word, such as aequales, aequantur, esgale, faciunt, ghelijck, or gleich, and sometimes by the abbreviated form aeq, or simply æ and œ.[8] Diophantus's use of ἴσ, short for ἴσος (ísos 'equals'), in Arithmetica (c.250 AD) is considered one of the first uses of an equals sign.[9]

The sign =, now universally accepted in mathematics for equality, was first recorded by Welsh mathematician Robert Recorde in The Whetstone of Witte (1557). The original form of the symbol was much wider than the present form. In his book, Recorde explains his symbol as "Gemowe lines", from the Latin gemellus ('twin'), using two parallel lines to represent equality because he believed that "no two things could be more equal."[5][8]

Recorde's symbol was not immediately popular. After its introduction, it wasn't used again in print until 1618 (61 years later), in an anonymous Appendix in Edward Wright's English translation of Descriptio, by John Napier. It wasn't until 1631 that it received more than general recognition in England, being adopted as the symbol for equality in a few influential works. Later used by several influential mathematicians, most notably, both Isaac Newton and Gottfried Leibniz, and due to the prevalence of calculus at the time, it quickly spread throughout the rest of Europe.[8]

Basic properties

Summarize
Perspective
Reflexivity
For every a, one has a = a.[10][11]
Symmetry
For every a and b, if a = b, then b = a.[10][11]
Transitivity
For every a, b, and c, if a = b and b = c, then a = c.[10][11]
Substitution
Informally, this just means that if a = b, then a can replace b in any mathematical expression or formula without changing its meaning.[10][12][13] (For a formal explanation, see § Axioms) For example:
  • Given real numbers a and b, if a = b, then implies .
Operation application
For every a and b, with some operation , if a = b, then [14][13][a] For example:
  • Given integers a and b, if a = b, then (Here, a unary operation.)
  • Given natural numbers a, b, c, and d, if and then (Here, , a binary operation.)
  • Given real functions and over some variable a, if for all a, then for all a. (Here, An operation over functions (i.e. an operator), called the derivative).[b]

The first three properties are generally attributed to Giuseppe Peano for being the first to explicitly state these as fundamental properties of equality in his Arithmetices principia (1889).[15][16] However, the basic notions have always existed; for example, in Euclid's Elements (c.300 BC), he includes 'common notions': "Things that are equal to the same thing are also equal to one another" (transitivity), "Things that coincide with one another are equal to one another" (reflexivity), along with some operation-application properties for addition and subtraction.[17] The operation-application property was also stated in Peano's Arithmetices principia,[15] however, it had been common practice in algebra since at least Diophantus (c.250 AD).[18] The substitution property is generally attributed to Gottfried Leibniz (c.1686), and often called Leibniz's Law.[12][19]

Equations

Summarize
Perspective
Thumb
Balance scales, used in algebra education to help students visualize how equations can be transformed to determine unknown values.

An equation is a symbolic equality of two mathematical expressions connected with an equals sign (=).[20] Algebra is the branch of mathematics concerned with equation solving: the problem of finding values of some variable, called unknown, for which the specified equality is true. Each value of the unknown for which the equation holds is called a solution of the given equation; also stated as satisfying the equation. For example, the equation has the values and as its only solutions. The terminology is used similarly for equations with several unknowns.[21] The set of solutions to an equation or system of equations is called its solution set.[22]

In mathematics education, students are taught to rely on concrete models and visualizations of equations, including geometric analogies, manipulatives including sticks or cups, and "function machines" representing equations as flow diagrams. One method uses balance scales as a pictorial approach to help students grasp basic problems of algebra. The mass of some objects on the scale is unknown and represents variables. Solving an equation corresponds to adding and removing objects on both sides in such a way that the sides stay in balance until the only object remaining on one side is the object of unknown mass.[23]

Often, equations are considered to be a statement, or relation, which can be true or false. For example, is true, and is false. Equations with unknowns are cosidered conditionally true; for example, is true when or , and false otherwise.[24] There are several different terminologies for this. In mathematical logic, an equation is a binary predicate (i.e. a logical statement, that can have free variables) which satisfies certain properties.[25] In computer science, an equation is defined as a boolean-valued expression, or relational operator, which returns 1 and 0 for true and false respectively.[26]

Identities

An identity is an equality that is true for all values of its variables in a given domain.[27][28] An "equation" may sometimes mean an identity, but more often than not, it specifies a subset of the variable space to be the subset where the equation is true. An example is , which is true for each real number . There is no standard notation that distinguishes an equation from an identity, or other use of the equality relation: one has to guess an appropriate interpretation from the semantics of expressions and the context.[29] Sometimes, but not always, an identity is written with a triple bar: [30] This notation was introduced by Bernhard Riemann in his 1857 Elliptische Funktionen lectures (published in 1899).[31][32][33]

Alternatively, identities may be viewed as an equality of functions, where instead of writing , one may simply write .[34][35] This is called the extensionality of functions.[36][37] In this sense, the operation-application property refers to operators, operations on a function space (functions mapping between functions) like composition[38] or the derivative, commonly used in operational calculus.[39] An identity can contain an functions as "unknowns", which can be solved for similarly to a regular equation, called a functional equation.[40] A functional equation involving derivatives is called a differential equation.[41]

Definitions

Equations are often used to introduce new terms or symbols for constants, assert equalities, and introduce shorthand for complex expressions, which is called "equal by definition", and often denoted with ().[42] It is similar to the concept of assignment of a variable in computer science. For example, defines Euler's number,[43] and is the defining property of the imaginary number .[44]

In mathematical logic, this is called an extension by definition (by equality) which is a conservative extension to a formal system.[45] This is done by taking the equation defining the new constant symbol as a new axiom of the theory. The first recorded symbolic use of "Equal by definition" appeared in Logica Matematica (1894) by Cesare Burali-Forti, an Italian mathematician. Burali-Forti, in his book, used the notation ().[46][47]

In logic

Summarize
Perspective

History

Thumb
Roman copy in marble of a Greek bronze bust of Aristotle by Lysippos, c. 330 BC, with modern alabaster mantle

Equality is often considered a primitive notion, informally said to be "a relation each thing bears to itself and to no other thing".[48] This tradition can be traced back to at least 350 BC by Aristotle: in his Categories, he defines the notion of quantity in terms of a more primitive equality (distinct from identity or similarity), stating:[49]

The most distinctive mark of quantity is that equality and inequality are predicated of it. Each of the aforesaid quantities is said to be equal or unequal. For instance, one solid is said to be equal or unequal to another; number, too, and time can have these terms applied to them, indeed can all those kinds of quantity that have been mentioned.

That which is not a quantity can by no means, it would seem, be termed equal or unequal to anything else. One particular disposition or one particular quality, such as whiteness, is by no means compared with another in terms of equality and inequality but rather in terms of similarity. Thus it is the distinctive mark of quantity that it can be called equal and unequal. ―(translated by E. M. Edghill)

Aristotle had separate categories for quantities (number, length, volume) and qualities (temperature, density, pressure), now called intensive and extensive properties. The Scholastics, particularly Richard Swineshead and other Oxford Calculators in the 14th century, began seriously thinking about kinematics and quantitative treatment of qualities. For example, two flames have the same heat-intensity if they produce the same effect on water (e.g, warming vs boiling). Since two intensities could be shown to be equal, and equality was considered the defining feature of quantities, it meant those intensities were quantifiable.[50][51]

Around the 19th century, with the growth of modern logic, it became necessary to have a more concrete description of equality. With the rise of predicate logic due to the work of Gottlob Frege, logic shifted from being focused on classes of objects to being property-based. This was followed by a movement for describing mathematics in logical foundations, called logicism. This trend lead to the axiomatization of equality through the law of identity and the substitution property especially in mathematical logic[12][25] and analytic philosophy.[52]

The precursor to the substitution property of equality was first formulated by Gottfried Leibniz in his Discourse on Metaphysics (1686), stating, roughly, that "No two distinct things can have all properties in common." This has since broken into two principles, the substitution property (if , then any property of is a property of ), and its converse, the identity of indiscernibles (if and have all properties in common, then ).[53] Its introduction to logic, and first symbolic formulation is due to Bertrand Russell and Alfred Whitehead in their Principia Mathematica (1910), who claim it follows from their axiom of reducibility, but credit Leibniz for the idea.[54]

Axioms

Thumb
Gottfried Leibniz, a major contributor to 17th-century mathematics and philosophy of mathematics, and whom the Substitution property of equality is named after.

Law of identity: Stating that each thing is identical with itself, without restriction. That is, for every , . It is the first of the traditional three laws of thought.[55] Stated symbolically as:

Substitution property: Sometimes referred to as Leibniz's law,[56] generally states that if two things are equal, then any property of one must be a property of the other. It can be stated formally as: for every a and b, and any formula (with a free variable x), if , then implies . Stated symbolically as:

Function application is also sometimes included in the axioms of equality,[14] but isn't necessary as it can be deduced from the other two axioms, and similarly for symmetry and transitivity. (See § Derivations of basic properties) In first-order logic, these are axiom schemas (usually, see below), each of which specify an infinite set of axioms.[57] If a theory has a predicate that satisfies the Law of Identity and Substitution property, it is common to say that it "has equality", or is "a theory with equality".[58]

The use of "equality" here somewhat of a misnomer in that any system with equality can be modeled by a theory without standard identity, and with indiscernibles.[59][57] Those two axioms are strong enough, however, to be isomorphic to a model with idenitity; that is, if a system has a predicate staisfying those axioms without standard equality, there is a model of that system with standard equality.[57] This can be done by defining a new domain whose objects are the equivalence classes of the original "equality".[60] If the relation is interpreted as equality, then those properties are enough, since if has all the same properties as , and has the property of being equal to , then has the property of being equal to .[54][61]

As axioms, one can deduce from the first using universal instantiation, and the from second, given and , by using modus ponens twice. Alternatively, each of these may be included in logic as rules of inference.[57] The first called "equality introduction", and the second "equality elimination"[62] (also called paramodulation), used by some theoretical computer scientists like John Alan Robinson in their work on resolution and automated theorem proving.[63]

Derivations of basic properties

  • Reflexivity of Equality: Given any expression , by the Law of Identity, .[64]
  • Symmetry of Equality: Given , take the formula . So we have . Since by assumption, and by Reflexivity, we have that .[64]
  • Transitivity of Equality: Given and , take the formula . So we have . Since by symmetry, and by assumption, we have that .[64]
  • Function application: Given some function and expressions a and b, such that a = b, then take the formula .[64] So we have:

    Since by assumption, and by reflexivity, we have that .

In set theory

Summarize
Perspective
Thumb
Thumb
Two sets of polygons in Euler diagrams. These sets are equal since both have the same elements, even though the arrangement differs.

Set theory is the branch of mathematics that studies sets, which can be informally described as "collections of objects".[65] Although objects of any kind can be collected into a set, set theory  as a branch of mathematics  is mostly concerned with those that are relevant to mathematics as a whole. Sets are uniquely characterized by their elements; this means that two sets that have precisely the same elements are equal (they are the same set).[66] In a formalized set theory, this is usually defined by an axiom called the Axiom of extensionality.[67]

For example, using set builder notation,

Which states that "The set of all integers greater than 0 but not more than 3 is equal to the set containing only 1, 2, and 3", despite the differences in notation.

José Ferreirós credits Richard Dedekind for being the first to explicitly state the principle, although he does not assert it as a definition:[68]

It very frequently happens that different things a, b, c... considered for any reason under a common point of view, are collected together in the mind, and one then says that they form a system S; one calls the things a, b, c... the elements of the system S, they are contained in S; conversely, S consists of these elements. Such a system S (or a collection, a manifold, a totality), as an object of our thought, is likewise a thing; it is completely determined when, for every thing, it is determined whether it is an element of S or not.

Richard Dedekind, 1888 (translated by José Ferreirós)

Background

Thumb
Ernst Zermelo, a contributor to modern Set theory, was the first to explicitly formalize set equality in his Zermelo set theory (now obsolete), by his Axiom der Bestimmtheit.[69]

Around the turn of the 20th century, mathematics faced several paradoxes and counter-intuitive results. For example, Russell's paradox showed a contradiction of naive set theory, it was shown that the parallel postulate cannot be proved, the existence of mathematical objects that cannot be computed or explicitly described, and the existence of theorems of arithmetic that cannot be proved with Peano arithmetic. The result was a foundational crisis of mathematics.[70]

The resolution of this crisis involved the rise of a new mathematical discipline called mathematical logic, which studies formal logic within mathematics. Subsequent discoveries in the 20th century then stabilized the foundations of mathematics into a coherent framework valid for all mathematics. This framework is based on a systematic use of axiomatic method and on set theory, specifically Zermelo–Fraenkel set theory, developed by Ernst Zermelo and Abraham Fraenkel. This set theory (and set theory in general) is now considered the most common foundation of mathematics.[71]

Set equality based on first-order logic with equality

In first-order logic with equality (See § Axioms), the axiom of extensionality states that two sets that contain the same elements are the same set.[72]

  • Logic axiom:
  • Logic axiom:
  • Set theory axiom:

The first two are given by the substitution property of equality from first-order logic; the last is a new axiom of the theory. Incorporating half of the work into the first-order logic may be regarded as a mere matter of convenience, as noted by Azriel Lévy.

"The reason why we take up first-order predicate calculus with equality is a matter of convenience; by this, we save the labor of defining equality and proving all its properties; this burden is now assumed by the logic."[73]

Set equality based on first-order logic without equality

In first-order logic without equality, two sets are defined to be equal if they contain the same elements. Then the axiom of extensionality states that two equal sets are contained in the same sets.[74]

  • Set theory definition:
  • Set theory axiom:

Or, equivalently, one may choose to define equality in a way that mimics, the substitution property explicitly, as the conjunction of all atomic formuals:[75]

  • Set theory definition:
  • Set theory axiom:

In either case, the Axiom of Extensionality based on first-order logic without equality states:

Proof of basic properties

  • Reflexivity: Given a set , assume , it follows trivially that , and the same follows in reverse, therefore , thus .[76]
  • Symmetry: Given sets , such that , then , which implies , thus .[76]
  • Transitivity: Given sets , such that (1) and (2) , assume , then by (1), which implies by (2), and similarly for the reverse, therefore , thus .[76]
  • Substitution: (See: Substitution (logic) § Proof of substitution in ZFC.)
  • Function application: Given and , then . Since and , then . This is the defining property of an ordered pair.[77] Since by the Axiom of Extensionality, they must belong to the same sets, so, since we have or Thus

Similar relations

Summarize
Perspective

Approximate equality

Thumb
The sequence given by the perimeters of regular n-sided polygons that circumscribe the unit circle approximates

Numerical analysis is the study of constructive methods and algorithms to find numerical approximations (as opposed to symbolic manipulations) of solutions to problems in mathematical analysis. Especially those which cannot be solved analytically.[78]

Calculations are likely to involve rounding errors and other approximation errors. Log tables, slide rules, and calculators produce approximate answers to all but the simplest calculations. The results of computer calculations are normally an approximation, expressed in a limited number of significant digits, although they can be programmed to produce more precise results.[79]

If viewed as a binary relation, (denoted by the symbol ) between real numbers or other things, if precisely defined, is not an equivalence relation since it's not transitive, even if modeled as a fuzzy relation.[80]

In computer science, equality is given by some relational operator. Real numbers are often approximated by floating-point numbers (A sequence of some fixed number of digits of a given base, scaled by an integer exponent of that base),[81] thus it is common to store an expression that denotes the real number as to not lose precision. However, the equality of two real numbers given by an expression is known to be undecidable (specifically, real numbers defined by expressions involving the integers, the basic arithmetic operations, the logarithm and the exponential function). In other words, there cannot exist any algorithm for deciding such an equality (see Richardson's theorem).[82]

Equivalence relation

Thumb
Graph of an example equivalence with 7 classes

An equivalence relation is a mathematical relation that generalizes the idea of similarity or sameness. It is defined on a set as a binary relation that satisfies the three properties: reflexivity, symmetry, and transitivity. Reflexivity means that every element in is equivalent to itself ( for all ). Symmetry requires that if one element is equivalent to another, the reverse also holds (). Transitivity ensures that if one element is equivalent to a second, and the second to a third, then the first is equivalent to the third ( and ).[83] These properties are enough to partition a set into disjoint equivalence classes. Conversely, every partition defines an equivalence class.[84]

The equivalence relation of equality is a special case, as, if restricted to a given set , it is the strictest possible equivalence relation on ; specifically, equality partitions a set into equivalence classes consisting of all singleton sets.[84] Other equivalence relations, since they're less restrictive, generalize equality by identifying elements based on shared properties or transformations, such as congruence in modular arithmetic or similarity in geometry.[85][86]

Congruence relation

In abstract algebra, a congruence relation extends the idea of an equivalence relation to include the operation-application property. That is, given a set , and a set of operations on , then a congruence relation has the property that for all operations (here, written as unary to avoid cumbersome notation, but may be of any arity). A congruence relation on an algebraic structure such as a group, ring, or module is an equivalence relation that respects the operations defined on that structure.[87]

Isomorphism

In mathematics, especially in abstract algebra and category theory, it is common to deal with objects that already have some internal structure. An isomorphism describes a kind of structure-preserving correspondence between two objects, establishing them as essentially identical in their structure or properties.[88][89]

More formally, an isomorphism is a bijective mapping (or morphism) between two sets or structures and such that and its inverse preserve the operations, relations, or functions defined on those structures.[88] This means that any operation or relation valid in corresponds precisely to the operation or relation in under the mapping. For example, in group theory, a group isomorphism satisfies for all elements , where denotes the group operation.[90]

When two objects or systems are isomorphic, they are considered indistinguishable in terms of their internal structure, even though their elements or representations may differ. For instance, all cyclic groups of order are isomorphic to the integers, , with addition.[91] Similarly, in linear algebra, two vector spaces are isomorphic if they have the same dimension, as there exists a linear bijection between their elements.[92]

The concept of isomorphism extends to numerous branches of mathematics, including graph theory (graph isomorphism), topology (homeomorphism), and algebra (group and ring isomorpisms), among others. Isomorphisms facilitate the classification of mathematical entities and enable the transfer of results and techniques between similar systems. Bridging the gap between isomorphism and equality was one motivation for the development of category theory, as well as for homotopy type theory and univalent foundations.[93][94][95]

Geometry

Thumb
The two triangles on the left are congruent. The third is similar to them. The last triangle is neither congruent nor similar to any of the others.

In geometry, formally, two figures are equal if they contain exactly the same points. However, historically, geometric-equality has always been taken to be much broader. Euclid and Archimedes used "equal" (ἴσος isos) often referring to figures with the same area or those that could be cut and rearranged to form one another. For example, Euclid stated the Pythagorean theorem as “the square on the hypotenuse is equal to the squares on the sides, taken together”; and Archimedes said that “a circle is equal to the rectangle whose sides are the radius and half the circumference.”[96]

This notion persisted until Adrien-Marie Legendre, who introduced the term "equivalent" to describe figures of equal area and restricted "equal" to what we now call "congruent"—the same shape and size, or if one has the same shape and size as the mirror image of the other.[97][98] Euclid's terminology continued in the work of David Hilbert in his Grundlagen der Geometrie, who further refined Euclid's ideas by introducing the notions of polygons being "divisibly equal" (zerlegungsgleich) if they can be cut into finitely many triangles which are congruent, and "equal in content" (inhaltsgleichheit) if one can add finitely many divisibly equal polygons to each such that the resulting polygons are divisibly equal.[99]

After the rise of set theory, around the 1960s, there was a push for a reform in mathematics education called New Math, following Andrey Kolmogorov, who, in an effort to restructure Russian geometry courses, proposed presenting geometry through the lens of transformations and set theory. Since a figure was seen as a set of points, it could only be equal to itself, as a result of Kolmogorov, the term "congruent" became standard in schools for figures that were previously called "equal", which popularized the term.[100]

While Euclid addressed proportionality and figures of the same shape, it was not until the 17th century that the concept of similarity was formalized in the modern sense. Similar figures are those that have the same shape but can differ in size; they can be transformed into one another by scaling and congruence.[101] Later a concept of equality of directed line segments, equipollence, was advanced by Giusto Bellavitis in 1835.[102]

See also

Notes

  1. can have any arity, but is written as unary to avoid cumbersome notation.
  2. Assuming and are differentiable.

References

Loading related searches...

Wikiwand - on

Seamless Wikipedia browsing. On steroids.