Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Computerizing Mathematical Text with MathLang

Published: 01 April 2008 Publication History

Abstract

Mathematical texts can be computerized in many ways that capture differing amounts of the mathematical meaning. At one end, there is document imaging, which captures the arrangement of black marks on paper, while at the other end there are proof assistants (e.g., Mizar, Isabelle, Coq, etc.), which capture the full mathematical meaning and have proofs expressed in a formal foundation of mathematics. In between, there are computer typesetting systems (e.g., and Presentation MathML) and semantically oriented systems (e.g., Content MathML, OpenMath, OMDoc, etc.). The MathLang project was initiated in 2000 by Fairouz Kamareddine and Joe Wells with the aim of developing an approach for computerizing mathematical texts and knowledge which is flexible enough to connect the different approaches to computerization, which allows various degrees of formalization, and which is compatible with different logical frameworks (e.g., set theory, category theory, type theory, etc.) and proof systems. The approach is embodied in a computer representation, which we call MathLang, and associated software tools, which are being developed by ongoing work. Three Ph.D. students (Manuel Maarek (2002/2007), Krzysztof Retel (since 2004), and Robert Lamar (since 2006)) and over a dozen master's degree and undergraduate students have worked on MathLang. The project's progress and design choices are driven by the needs for computerizing real representative mathematical texts chosen from various branches of mathematics. Currently, MathLang supports entry of mathematical text either in an XML format or using the editor. Methods are provided for adding, checking, and displaying various information aspects. One aspect is a kind of weak type system that assigns categories (term, statement, noun (class), adjective (class modifier), etc.) to parts of the text, deals with binding names to meanings, and checks that a kind of grammatical sense is maintained. Another aspect allows weaving together mathematical meaning and visual presentation and can associate natural language text with its mathematical meaning. Another aspect allows identifying chunks of text, marking their roles (theorem, definition, explanation, example, section, etc.), and indicating relationships between the chunks (A uses B, A contradicts B, A follows from B, etc.). Software tool support can use this aspect to check and explain the overall logical structure of a text. Further aspects are being designed to allow adding additional formality to a text such as proof structure and details of how a human-readable proof is encoded into a fully formalized version (so far this has only been done for Mizar and started for Isabelle). A number of mathematical texts have been computerized, helping with the development of these aspects, and indicating what additional work is needed for the future. This paper surveys the past and future work of the MathLang project.

References

[1]
J. Abbott, A. van Leeuwen, and A. Strotmann. Objectives of openmath. Technical Report 12, RIACA (Research Institute for Applications of Computer Algebra), 1996. The TR archives of RIACA are incomplete. Earlier versions of this paper can be found at the “old OpenMath Home Pages” archived at the Uni. Köln
[2]
Barendregt, Henk, Towards an interactive mathematical proof mode. In: Kamareddine, Fairouz (Ed.), Kluwer Applied Logic Series, volume 28. Kluwer Academic Publishers. pp. 25-36.
[3]
Cantor, Georg, Beiträge zur Begründung der transfiniten Mengenlehre (part 1). Mathematische Annalen. v46. 481-512.
[4]
Cantor, Georg, Beiträge zur Begründung der transfiniten Mengenlehre. Mathematische Annalen. v49. 207-246.
[5]
Cauchy, Augustin-Louis, Cours d'Analyse de l'École Royale Polytechnique. 1821. Debure, Paris.
[6]
W3C (World Wide Web Consortium). Mathematical markup language (MathML) version 2.0. W3C Recommendation, October 2003
[7]
W3C (World Wide Web Consortium). RDF Primer. W3C Recommendation, February 2004
[8]
W3C (World Wide Web Consortium). XQuery 1.0 and XPath 2.0 data model (XDM). W3C Recommendation, 2007
[9]
Coquand, Thierry and Huet, Gérard P., The calculus of constructions. Inform. & Comput. v76 i2/3. 95-120.
[10]
N.G. de Bruijn. The mathematical vernacular, a language for mathematics with typed sets. In Workshop on Programming Logic, 1987. Reprinted in {32, F.3}
[11]
Dedekind, Richard, Stetigkeit und irrationale Zahlen. 1872. Vieweg & Sohn, Braunschweig.
[12]
Can be found on pp. 1--82 in {38}
[13]
Frege, Gottlob, Grundgesetze der Arithmetik, volume 1. 1893. Hermann Pohle, Jena.
[14]
Frege, Gottlob, Grundgesetze der Arithmetik, volume 2. 1903. Hermann Pohle, Jena.
[15]
Gierz, G., Hofmann, K.H., Keimel, K., Lawson, J.D., Mislove, M.W. and Scott, D.S., A Compendium of Continuous Lattices. 1980. Springer.
[16]
Heath, Thomas L., The 13 Books of Euclid's Elements. 1956. Dover.
[17]
Kamareddine, Fairouz, Laan, Twan and Nederpelt, Rob, A Modern Perspective on Type Theory from Its Origins Until Today. May 2004. Kluwer Applied Logic Series, May 2004.Kluwer Academic Publishers.
[18]
Fairouz Kamareddine, Robert Lamar, Manuel Maarek, and J. B. Wells. Restoring natural language as a computerised mathematics input method. In MKM'07 {31}, pages 280--295
[19]
Kamareddine, Fairouz, Maarek, Manuel, Retel, Krzysztof and Wells, J.B., Digitised mathematics: Computerisation vs. formalisation. Review of the National Center for Digitization. v10. 1-8.
[20]
Kamareddine, Fairouz, Maarek, Manuel, Retel, Krzysztof and Wells, J.B., Gradual computerisation/formalisation of mathematical texts into Mizar. Studies in Logic, Grammar and Rhetoric. v10 i23. 95-120.
[21]
Fairouz Kamareddine, Manuel Maarek, Krzysztof Retel, and J.B. Wells. Narrative structure of mathematical texts. In MKM'07 {31}, pages 296--311
[22]
Kamareddine, Fairouz, Maarek, Manuel and Wells, J.B., Flexible encoding of mathematics on the computer. In: Lecture Notes in Computer Science, volume 3119. Springer. pp. 160-174.
[23]
Kamareddine, Fairouz, Maarek, Manuel and Wells, J.B., Mathlang: Experience-driven development of a new mathematical language. ENTCS. v93. 138-160.
[24]
Kamareddine, Fairouz, Maarek, Manuel and Wells, J.B., Toward an object-oriented structure for mathematical text. In: Lecture Notes in Artificial Intelligence, volume 3863. Springer. pp. 217-233.
[25]
Kamareddine, Fairouz and Nederpelt, Rob, A refinement of de Bruijn's formal language of mathematics. J. Logic Lang. Inform. v13 i3. 287-340.
[26]
Kanahori, Toshihiro, Sexton, Alan, Sorge, Volker and Suzuki, Masakazu, Capturing abstract matrices from paper. In: Lecture Notes in Computer Science, volume 4108. Springer Berlin/Heidelberg. pp. 124-138.
[27]
Kohlhase, Michael, . In: Lecture Notes in Artificial Intelligence, volume 4180. Springer.
[28]
Landau, Edmund, Grundlagen der Analysis. 1930. Chelsea.
[29]
Translation of {28} by F. Steinhardt
[30]
Manuel Maarek. Mathematical Documents Faithfully Computerised: the Grammatical and Text & Symbol Aspects of the MathLang Framework. PhD thesis, Heriot-Watt University, Edinburgh, Scotland, June 2007
[31]
In: Lecture Notes in Artificial Intelligence, volume 4573. Springer Berlin/Heidelberg.
[32]
Nederpelt, Rob, Geuvers, J.H. and de Vrijer, Roel C., . In: Studies in Logic and the Foundations of Mathematics, volume 133. North-Holland, Amsterdam.
[33]
An English translation can be found on pp. 83--97 in {38}
[34]
P. Rudnicki. An overview of the Mizar project. In Proceedings of the 1992 Workshop on Types for Proofs and Programs, 1992
[35]
Alan Sexton and Volker Sorge. The ellipsis in mathematical documents. Talk overhead images presented at the IMA (Institute for Mathematics and its Applications, University of Minnesota) “Hot Topic” Workshop The Evolution of Mathematical Communication in the Age of Digital Libraries held on 2006-12-08/---09
[36]
Lambert S. van Benthem Jutting. Checking Landau's “Grundlagen” in the AUTOMATH System. PhD thesis, Eindhoven, 1977. Partially reprinted in {32, B.5,D.2,D.3,D.5,E.2}
[37]
van der Hoeven, Joris, GNU TeXmacs. SIGSAM Bulletin. v38 i1. 24-25.
[38]
van Heijenoort, J., From Frege to Gödel: A Source Book in Mathematical Logic, 1879--1931. 1967. Harvard University Press.
[39]
Whitehead, Alfred North and Russel, Bertrand, Principia Mathematica. 1910. Cambridge University Press.
[40]
Wiedijkm, F., Formal proof sketches. In: LNCS, volume 3085. Springer-Verlag. pp. 378-393.
[41]
An English translation can be found on pp. 199--215 in {38}

Cited By

View all
  • (2014)Methods for analyzing semantic data of electronic collections in mathematicsAutomatic Documentation and Mathematical Linguistics10.3103/S000510551402006X48:2(81-85)Online publication date: 1-Mar-2014
  • (2013)Ontologies and languages for representing mathematical knowledge on the Semantic WebSemantic Web10.5555/2590215.25902174:2(119-158)Online publication date: 1-Apr-2013
  • (2013)Bringing Math to LODProceedings of the 12th International Semantic Web Conference - Part I10.1007/978-3-642-41335-3_24(379-394)Online publication date: 21-Oct-2013
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

Publisher

Elsevier Science Publishers B. V.

Netherlands

Publication History

Published: 01 April 2008

Author Tags

  1. logical foundations of mathematics
  2. mathematical knowledge management
  3. mathematical typesetting
  4. mathematical vernacular
  5. proof assistants
  6. proof checkers
  7. theorem provers

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 21 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2014)Methods for analyzing semantic data of electronic collections in mathematicsAutomatic Documentation and Mathematical Linguistics10.3103/S000510551402006X48:2(81-85)Online publication date: 1-Mar-2014
  • (2013)Ontologies and languages for representing mathematical knowledge on the Semantic WebSemantic Web10.5555/2590215.25902174:2(119-158)Online publication date: 1-Apr-2013
  • (2013)Bringing Math to LODProceedings of the 12th International Semantic Web Conference - Part I10.1007/978-3-642-41335-3_24(379-394)Online publication date: 21-Oct-2013
  • (2012)A User-friendly Interface for a Lightweight Verification SystemElectronic Notes in Theoretical Computer Science (ENTCS)10.1016/j.entcs.2012.06.004285:C(29-41)Online publication date: 19-Sep-2012
  • (2011)Translating between language and logicProceedings of the 23rd international conference on Automated deduction10.5555/2032266.2032269(5-25)Online publication date: 31-Jul-2011
  • (2011)Logical structure analysis of scientific publications in mathematicsProceedings of the International Conference on Web Intelligence, Mining and Semantics10.1145/1988688.1988713(1-9)Online publication date: 25-May-2011
  • (2010)Dimensions of formalityProceedings of the 10th ASIC and 9th MKM international conference, and 17th Calculemus conference on Intelligent computer mathematics10.5555/1894483.1894517(355-369)Online publication date: 5-Jul-2010
  • (2010)MathAbsProceedings of the 8th International Conference on Frontiers of Information Technology10.1145/1943628.1943665(1-7)Online publication date: 21-Dec-2010
  • (2009)MathLang Translation to Isabelle SyntaxProceedings of the 16th Symposium, 8th International Conference. Held as Part of CICM '09 on Intelligent Computer Mathematics10.1007/978-3-642-02614-0_30(373-388)Online publication date: 3-Jul-2009
  • (2009)Spreadsheet Interaction with FramesProceedings of the 16th Symposium, 8th International Conference. Held as Part of CICM '09 on Intelligent Computer Mathematics10.1007/978-3-642-02614-0_28(341-356)Online publication date: 3-Jul-2009

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media