Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

The Ancient Secrets of Information Management

This paper presents some ancient secrets (and modern pitfalls) of information management. Users of primitive relational database systems recognized hundreds of years ago that logic can be used to organize information, and that the structure of relational information should be dictated by its owners, not database technicians. These facts are clearly indicated by the early practice of double-entry bookkeeping. This ancient knowledge has significant positive implication for modern organizations that struggle to effectively implement relational databases. Historical and cultural factors have undermined recognition and observance of important principles....Read more
Issues in Information Systems Volume 20, Issue 4, pp. 45-55, 2019 45 THE ANCIENT SECRETS OF INFORMATION MANAGEMENT Ronald Fuller, The Institute for Logic and the Public Interest, rgfuller@logicandbusiness.org Matthew North, Utah Valley University, mnorth@uvu.edu Peter Cardon, University of Southern California, cardon@marshall.usc.edu ABSTRACT The lost art of classical logic is the key to overcoming the most difficult and costly challenges of modern organizations. This becomes easy to understand when we recognize the central role of logic in both modern and legacy systems. We theorize that the study and implementation of foundational principles of logic will improve more than just data storage and retrieval—it will fundamentally alter for the better the management of unprecedented amounts of information found in modern organizations. In this paper, we layout the theoretical underpinnings for adding classical logic to information systems design and development, without dictating any specific implementation methodology. We show that people have used classical logic to organize information for more than 700 years, and that the logical structure of information should be determined by its owner, not by a toolmaker or a technical specialist. Based on our findings from historical uses of classical logic to guide the development of non-computerized business systems, we argue that a renewed emphasis on logic education and logic literacy will bring significant benefits to modern organizations that struggle to manage massive amounts of information effectively. Keywords: Logic, Information Management, Data Governance, Data Quality, Integration, Next Generation Systems INTRODUCTION People who manage information have known for decades that classical logic can be used to query databases, but the use of logic to organize information within those databases is not well understood in the field of information science. Fricke (2012) states, “The monumental and authoritative Encyclopedia of Library and Information Sciences, Third Edition, 2009, does not have an entry for logic in its 6,856 pages.” We will show that medieval business practitioners used classical logic to organize information, and that they understood that the structure of information should be determined by its owner, not by a toolmaker or a technical specialist. These facts are supported by the system of interconnected books used in manual accounting systems such as double-entry bookkeeping since the 13th century. Primitive business institutions did not have the benefit of computers, but they enjoyed a particular advantage over their modern counterparts: a logic literate merchant class. Based on our findings, we argue that a renewed emphasis on logic education and logic literacy will bring significant benefits to modern organizations that struggle to manage information effectively. The greatest challenges to information management from a business perspective include integration, data quality, and adapting to changing requirements with reasonable cost and effort. Difficulties in these areas are typically attributed to misalignment between business units and IT departments. From a technical perspective, the same challenges are often attributed to deficiencies of the SQL language, its implementation by vendors, and limitations of the relational model. This paper presents an alternative view that addresses both perspectives: The greatest challenges of effective information management are not a result of departmental misalignment or technical issues; rather, they are a result of the decline of classical logic education during the 20 th century, and mistaken assumptions about who should decide how information is organized. Logic literacy should not be seen as equivalent to general or natural intelligence. Classical logic is based on a set of laws and principles that can be taught, studied and learned. In some ways learning classical logic is like learning to https://doi.org/10.48009/4_iis_2019_45-55
Issues in Information Systems Volume 20, Issue 4, pp. 45-55, 2019 46 read and write – both involve a set of rules, a set of symbols (or concepts that can be represented by symbols) and learning to recognize and construct patterns based on the rules and symbols (or concepts). For much of modern history classical logic was taught to young children, and was considered to be a necessary foundation for all further learning (Joseph and McGlinn). Logic was eliminated as a required subject in schools during the progressive education reforms beginning around the turn of the 20 th century. This was a political decision (Prawat). Logic literacy declined during the first half of the 20th century and computers were developed during the second half. Ironically, computers are based on classical logic. Computers do not decrease the value of logic literacy, they increase it significantly, just as the Guttenberg press increased the value of standard literacy. There is a movement afoot today to restore classical logic as an important part of basic education (Genesereth and Chaudhri). We wholeheartedly endorse this movement and believe that the business education community will be among its greatest beneficiaries. Logic and Information Systems When the relational model was introduced in the 1970’s it was falsely seen as a new computer-based method of organizing information, rather than a new computer-based way to automate old logic-based methods that had been used with success since Medieval times (Fuller, 2018). Consequently, the responsibility to decide how information is organized shifted to computer experts and architects and away from domain experts. Regarding the way information is organized, our focus here is on the initial organized state and the final states that can be derived from them. Intermediate states are important too but are not discussed here. The promise of the relational model from the beginning was that information could be stored in a way that allows it to be reorganized for various purposes while guaranteeing that outputs are consistent with the inputs. This ideal is rarely achieved in current practice, and it cannot be accomplished through technology alone. It is possible only when information is deliberately organized in a way that logically entails the range of desired outputs. Both logic literacy and domain expertise are necessary to organize information in this way (Fricke, 2012). Structure is Significant Codd wrote that the relational model “provides a means of describing data with its natural structure only” (Codd, 1970). The word ‘only’, Codd explained, means “without superimposing any additional structure for machine representation purposes.” He did not explain the meaning of “natural structure,” but the implication is a structure consistent with classical first-order logic, which is the theoretical foundation of the relational model. Perhaps he chose the word natural because classical logic is the basis of natural deduction and has some characteristics of natural language which allow information owners to precisely express their intended meaning. Structure is significant because the way information is organized determines how it can be used. Structure encodes meaning and intent. The initial state of relational information determines the full range of all possible subsequent states that can be derived from it. Terms and Concepts We will use the following terms and concepts to illustrate the applicability of classical logic to information management: Information resource: An information resource is information organized for some purpose. It can be any kind of information organized for any purpose, from a memorized phone number to the entire internet. Relational information: Relational information is information organized in a manner consistent with classical first-order logic, such that it can be manipulated using the standard operators and the classical rules of inference. The structure of relational information can be described in terms of a logical signature or vocabulary (defined below). Relational information can be organized in many ways to serve many purposes. Note: when logicians speak of relational logic or a relational signature, it means that no functions are used (and therefore much of the expressive power of classical logic is not used). But the relational model almost always implies the use of functions. In terms of logic, a relation can take one or many inputs and produce one or many outputs, and a function produces only a single output for any given input. All functions are relations, but not vice versa. Codd’s choice of the term relational model to describe a theory of information management based on the full expressive capacity of classical logic is an unfortunate accident of history and it can lead to confusion when discussing it in terms of logic. The term relational information can likewise be misunderstood, but it is familiar and we use it here to emphasize its connection with the relational model.
https://doi.org/10.48009/4_iis_2019_45-55 Issues in Information Systems Volume 20, Issue 4, pp. 45-55, 2019 THE ANCIENT SECRETS OF INFORMATION MANAGEMENT Ronald Fuller, The Institute for Logic and the Public Interest, rgfuller@logicandbusiness.org Matthew North, Utah Valley University, mnorth@uvu.edu Peter Cardon, University of Southern California, cardon@marshall.usc.edu ABSTRACT The lost art of classical logic is the key to overcoming the most difficult and costly challenges of modern organizations. This becomes easy to understand when we recognize the central role of logic in both modern and legacy systems. We theorize that the study and implementation of foundational principles of logic will improve more than just data storage and retrieval—it will fundamentally alter for the better the management of unprecedented amounts of information found in modern organizations. In this paper, we layout the theoretical underpinnings for adding classical logic to information systems design and development, without dictating any specific implementation methodology. We show that people have used classical logic to organize information for more than 700 years, and that the logical structure of information should be determined by its owner, not by a toolmaker or a technical specialist. Based on our findings from historical uses of classical logic to guide the development of non-computerized business systems, we argue that a renewed emphasis on logic education and logic literacy will bring significant benefits to modern organizations that struggle to manage massive amounts of information effectively. Keywords: Logic, Information Management, Data Governance, Data Quality, Integration, Next Generation Systems INTRODUCTION People who manage information have known for decades that classical logic can be used to query databases, but the use of logic to organize information within those databases is not well understood in the field of information science. Fricke (2012) states, “The monumental and authoritative Encyclopedia of Library and Information Sciences, Third Edition, 2009, does not have an entry for logic in its 6,856 pages.” We will show that medieval business practitioners used classical logic to organize information, and that they understood that the structure of information should be determined by its owner, not by a toolmaker or a technical specialist. These facts are supported by the system of interconnected books used in manual accounting systems such as double-entry bookkeeping since the 13th century. Primitive business institutions did not have the benefit of computers, but they enjoyed a particular advantage over their modern counterparts: a logic literate merchant class. Based on our findings, we argue that a renewed emphasis on logic education and logic literacy will bring significant benefits to modern organizations that struggle to manage information effectively. The greatest challenges to information management from a business perspective include integration, data quality, and adapting to changing requirements with reasonable cost and effort. Difficulties in these areas are typically attributed to misalignment between business units and IT departments. From a technical perspective, the same challenges are often attributed to deficiencies of the SQL language, its implementation by vendors, and limitations of the relational model. This paper presents an alternative view that addresses both perspectives: The greatest challenges of effective information management are not a result of departmental misalignment or technical issues; rather, they are a result of the decline of classical logic education during the 20th century, and mistaken assumptions about who should decide how information is organized. Logic literacy should not be seen as equivalent to general or natural intelligence. Classical logic is based on a set of laws and principles that can be taught, studied and learned. In some ways learning classical logic is like learning to 45 Issues in Information Systems Volume 20, Issue 4, pp. 45-55, 2019 read and write – both involve a set of rules, a set of symbols (or concepts that can be represented by symbols) and learning to recognize and construct patterns based on the rules and symbols (or concepts). For much of modern history classical logic was taught to young children, and was considered to be a necessary foundation for all further learning (Joseph and McGlinn). Logic was eliminated as a required subject in schools during the progressive education reforms beginning around the turn of the 20th century. This was a political decision (Prawat). Logic literacy declined during the first half of the 20th century and computers were developed during the second half. Ironically, computers are based on classical logic. Computers do not decrease the value of logic literacy, they increase it significantly, just as the Guttenberg press increased the value of standard literacy. There is a movement afoot today to restore classical logic as an important part of basic education (Genesereth and Chaudhri). We wholeheartedly endorse this movement and believe that the business education community will be among its greatest beneficiaries. Logic and Information Systems When the relational model was introduced in the 1970’s it was falsely seen as a new computer-based method of organizing information, rather than a new computer-based way to automate old logic-based methods that had been used with success since Medieval times (Fuller, 2018). Consequently, the responsibility to decide how information is organized shifted to computer experts and architects and away from domain experts. Regarding the way information is organized, our focus here is on the initial organized state and the final states that can be derived from them. Intermediate states are important too but are not discussed here. The promise of the relational model from the beginning was that information could be stored in a way that allows it to be reorganized for various purposes while guaranteeing that outputs are consistent with the inputs. This ideal is rarely achieved in current practice, and it cannot be accomplished through technology alone. It is possible only when information is deliberately organized in a way that logically entails the range of desired outputs. Both logic literacy and domain expertise are necessary to organize information in this way (Fricke, 2012). Structure is Significant Codd wrote that the relational model “provides a means of describing data with its natural structure only” (Codd, 1970). The word ‘only’, Codd explained, means “without superimposing any additional structure for machine representation purposes.” He did not explain the meaning of “natural structure,” but the implication is a structure consistent with classical first-order logic, which is the theoretical foundation of the relational model. Perhaps he chose the word natural because classical logic is the basis of natural deduction and has some characteristics of natural language which allow information owners to precisely express their intended meaning. Structure is significant because the way information is organized determines how it can be used. Structure encodes meaning and intent. The initial state of relational information determines the full range of all possible subsequent states that can be derived from it. Terms and Concepts We will use the following terms and concepts to illustrate the applicability of classical logic to information management: • • Information resource: An information resource is information organized for some purpose. It can be any kind of information organized for any purpose, from a memorized phone number to the entire internet. Relational information: Relational information is information organized in a manner consistent with classical first-order logic, such that it can be manipulated using the standard operators and the classical rules of inference. The structure of relational information can be described in terms of a logical signature or vocabulary (defined below). Relational information can be organized in many ways to serve many purposes. Note: when logicians speak of relational logic or a relational signature, it means that no functions are used (and therefore much of the expressive power of classical logic is not used). But the relational model almost always implies the use of functions. In terms of logic, a relation can take one or many inputs and produce one or many outputs, and a function produces only a single output for any given input. All functions are relations, but not vice versa. Codd’s choice of the term relational model to describe a theory of information management based on the full expressive capacity of classical logic is an unfortunate accident of history and it can lead to confusion when discussing it in terms of logic. The term relational information can likewise be misunderstood, but it is familiar and we use it here to emphasize its connection with the relational model. 46 Issues in Information Systems Volume 20, Issue 4, pp. 45-55, 2019 • Account: An account is a kind of information resource. It is a descriptive record of the resources and transactions of an enterprise which is organized for the purpose of management and analysis. Many accounts are relational. Many others are intended to be relational but are not because they do not conform to the rules and principles of the relational model. The key to understanding an information resource is to recognize that the way information is organized determines how it can be used. Deciding how information should be organized requires a degree of subject matter expertise specific to the kind of information resource at hand. The concept of sophotaxis will add clarity to further discussion (Fuller & Cardon, 2016). Sophotaxis Sophotaxis (Greek: wisdom + order) is essentially the process of organizing information in a way that allows it to be used according to the intent of its owner. The degree of subject matter expertise required to achieve this condition can be described in terms of sophotaxy. The following usage examples illustrate these terms in contexts of business, information science, and logic: • • • In business, the process of sorting documents in a file cabinet does not require any understanding about the files or their purpose, only an awareness of the sorting key; this is a non-sophotaxical task. Deciding whether the cabinet should be organized by last name or customer ID requires some knowledge of how the files will be used, but not any understanding of their content; this is a semi-sophotaxical decision. Determining what kind of information will be recorded on the forms, how they will be arranged, and the number of carbon copy sheets to be attached to each form requires a detailed understanding of the business process involved, the purpose of the forms, and how they will be used. Such business forms are highly sophotaxical information resources. In library and information science, classification schemes are defined by subject matter experts. After a scheme has been defined, a librarian can use it to catalog an information item with bibliographic references. After an item is cataloged it can be placed on a shelf by a volunteer or a machine according to the catalog identifier. These are highly, semi, and non-sophotaxical processes, respectively. In logic, sophotaxis combines the notion of entailment with that of intent. If a logical signature can be used to express an intended set of statements and their antecedents, then it is sophotaxically complete; if it cannot, then it is sophotaxically deranged or incomplete (Fuller and Cardon). The sopho in sophotaxis implies a certain meaning for wisdom: Unlike knowledge, wisdom cannot be readily transferred from one person to another. Wisdom combines knowledge with perspective, experience, and exposure to consequences – all of which are needed when determining a logical vocabulary for an enterprise organization. Therefore, logical vocabularies should be created not by engineers or architects, but by subject matter experts who possess wisdom that can only come from significant firsthand experience in the organizations that own, produce, and consume the information. Logical Vocabulary In terms of logic, a logical vocabulary describes the properties of a given system and determines the range of statements that can be made about the objects in discussion, and the range of conclusions that can be inferred from those statements. A logical vocabulary consists of four parts (Genesereth & Kao, 2013): • a nonempty set of object constants • a possibly empty set of function constants • a nonempty set of relation constants • an assignment of arities for each of the function constants and relation constants The Fundamental Four In terms of information management, the same notion of a logical vocabulary is established by answering the following questions: • What tables will be created? 47 Issues in Information Systems Volume 20, Issue 4, pp. 45-55, 2019 • • • What columns will each table have? What rules will apply to the values in each column? What relationships will exist between and within the tables and columns? These are the fundamental four: tables, columns, rules, and relationships. Objects are expressed as tables. Their attributes are expressed as columns. Relationships to other objects are expressed as dependencies. Types and integrity constraints are expressed as rules and relationships, as are identity management and classification schemes. In common practice, these critical owner-management decisions are delegated to technical specialists such as architects, database administrators, or programmers. This is a mistake which assures bad outcomes from the beginning. It comes from the belief that the fundamental four represent some kind of abstraction or model of the owner’s requirement. Not so – the fundamental four represent the minimal and essential elements of any requirement expressed for the purpose of defining a relational information resource. If the owner does not provide them, then no requirement has been expressed – at least not for a relational information resource. The requirements that organizations typically do express when they want an information resource are instead those for automated systems, which are not the same as information resources. They are, rather, tools used to access and maintain information resources, as well as automate various processes and workflows. Information resources and automated systems are closely related, but the distinction is important. Quantification Quantification means expressing information in its smallest meaningful components so it can be analyzed and transformed, using the classical rules of inference, into new expressions which can reveal new information while preserving the original values and meaning. Quantification is at the same time both highly subjective and perfectly objective in different respects. It is subjective in the sense that what constitutes the smallest meaningful component of something can vary between people, circumstances, and requirements. It is objective in the sense that it creates precise expressions that can be consumed by logical operations or computer programs. Figure 1 illustrates the subjective nature of quantification: Figure 1. Subjective Quantification The table on the left expresses telephone numbers as single values. For some organizations this is a perfectly reasonable way to represent phone numbers. When the only purpose of the numbers is to make phone calls, the numbers here are already expressed in their smallest meaningful form and there is no need to break them down into smaller parts. For other organizations, however, the individual components of each number might be meaningful. Users may want to count or group numbers by area code, for example. The table on the right represents the smallest meaningful components in this case. Breaking them down even further, perhaps into 11 columns with only a single digit each, not only would serve no purpose, but would add unnecessary complexity and conceal valuable information. Knowing how to express information in its smallest meaningful form is highly dependent on specialized subject matter expertise and a thorough understanding of how the information will be used. Quantification in this sense is the essence of sophotaxis. Data Literacy vs Information Literacy Euclid defined data as that which is given, and which can be determined from what is given. If we were to receive a business requirement that specifies “A implies B”, we can derive from this the logically equivalent term “If not B, then not A.” The ability to interpret and manipulate data in this way is what comes to mind when we consider the term data literacy. Information literacy adds to this the recognition that the structure and content of such statements each express different kinds of information (see Quantification). Logic tells us that the 2 statements above are equivalent, but logic cannot tell us if A really does imply B in any given situation. Domain expertise and classical 48 Issues in Information Systems Volume 20, Issue 4, pp. 45-55, 2019 logic are both necessary. Research and experience show that most people can gain a practical level of logic literacy with 2 to 3 weeks of study (Newcombe et al., 2015; Attridge et al., 2016). An Account, Not a Model Hugh Darwen has said “A database is an account of some enterprise, not a model of it” (2018). This distinction is critical to understanding the difference between an information resource and an automated system. An account is a kind of information resource, while an automated system is a tool which follows a model of various processes and workflows. When information is organized to reflect the owner’s intent, use, and valuation, it reflects logical creation of information resources rather than dogmatic adherence to a specific systems design standard. If an automated tool will be used to access and maintain an account, the account should be defined first. When the owner of an account only expresses a requirement for a tool, and allows the tool designers to define the account, the account becomes a subcomponent of the system. Problems with data quality, integration, and adaptation to change are inevitable because those objectives are highly reliant on a sophotaxically correct relational vocabulary. When accounts are sophotaxically correct they are easy to adapt to new requirements and easy to modify when requirements change. Subsequent logical derivations (in the form of queries, ETL jobs, reports, etc.) can be made with no reliance whatsoever on domain expertise. In contrast, when an account is sophotaxically deranged, domain expertise must be reintroduced at every point of change to ensure (or attempt to ensure) that information content and meaning are preserved. Working with sophotaxically deranged information is very much like playing a game of charades while blindfolded. No More ‘Modeling’ With an understanding of classical logic and the principle of sophotaxis, the notion of data modeling has no further value. All the various modeling techniques developed over the previous decades such as object role modeling, entity relation modeling, etc., are entirely subsumed and surpassed. Sophotaxis is not a model or methodology, it is a set of principles that can be applied to many aspects of information management. In terms of logic, a model is an interpretation. In other words, a model is an assignment of values to variables. In terms of IS, a model in this sense corresponds to the values of a populated database, in contrast to the structure of an empty database (which corresponds to a logical signature). Logical vocabularies can be expressed in many ways, including with text (such as SQL), symbolic (such as standard logic notation), and graphical (such as conceptual graphs), all of which are fully substitutable without any loss of information. A signature expressed using one notation can be translated to any other with perfect precision and accuracy. The ability to express information in many different ways using different systems of notation while preserving its meaning is a powerful aspect of classical logic. The Relational Approach Among the choices organizations have when deciding on an approach to organize information, the relational approach is the most highly sophotaxical of all. That means direct, firsthand subject matter expertise is required more urgently when organizing relational information than any other kind. The reason is simple: Relational information represents the logical vocabulary for an enterprise; and classical first-order logic is the most expressive form of knowledge representation that can be computationally evaluated by any person or machine. It provides a means of encoding wisdom and intent that is more reliable and precise than any other mode of communication. Properly derived outputs are guaranteed to be logically consistent with the inputs (Sowa, 2000). Experts Only Subject matter experts are native speakers of their own wisdom and intent, with their attendant perspective, experience, and sensitivity to consequences. Expertise in a given subject can take many years of firsthand experience to acquire. Such expertise comprises the vast majority of the total understanding necessary to define and create a sophotaxically correct relational vocabulary – in other words, to decide the fundamental four. In contrast, the rules of logic are only a small portion of the total understanding needed. They can be learned in a few days and mastered in a few weeks. 49 Issues in Information Systems Volume 20, Issue 4, pp. 45-55, 2019 The common practice of delegating responsibility to define the fundamental four to technical specialists – relying on interviews with experts, or requirements expressed for automated systems – leads to predictably bad results for the following reasons: • • • Subject matter expertise changes from organization to organization, job to job, and project to project. It can take years to acquire and it comprises the vast majority of the total understanding needed to define a vocabulary The rules of logic are static. They do not change from organization to organization, job to job, or project to project. They are easily acquired and represent a tiny fraction of the understanding needed to define a vocabulary The rules of logic must be applied according to the needs of the owner and subject matter experts For these reasons it is not a good idea for technical specialists to decide how information is organized, unless it is information relating directly to their own field of expertise Programmers have often imagined that classical logic, with its great expressive power, is too complex to be of much use to programmers, to say nothing of business professionals. But this perceived complexity actually comes from the difficulty of expressing, in great detail, complex information relating to areas they are unfamiliar with. This difficulty carries the illusion of technical complexity, when it is really nothing more than unfamiliarity with a given field of expertise such as the complex rules and processes of a large business organization. Chris Date (2011) explains this well in the following passage. We use the term classical logic in parentheses below to replace the term predicate calculus in the original text. Predicate calculus is a common notation for classical logic, which can be substituted for many other notations, including SQL, as noted above. “Codd’s 1970 paper had proposed, among other things, that (classical logic) might reasonably be used as a basis for a data language (or data sublanguage, as Codd called it), and it gave examples of what expressions in such a language might look like. Unfortunately, it did so using the formal notation of (classical logic), notation that many people in the computing world at the time were quite unfamiliar with. So the idea arose that (classical logic) was just too difficult for ordinary mortals… It is true that the syntax of (classical logic) might look a little strange and complicated; but the concepts expressed by that notation… are actually quite simple, and can be explained quite simply too. In other words, it is not difficult to wrap some nice syntactic sugar around those concepts and make them very palatable indeed. Do you happen to be familiar with Query-By-Example (QBE)? QBE is exactly (classical logic)! In my opinion, it’s a very user friendly syntactic sugar coating of the semantic ideas embodied in (classical logic).” (Date, 2011) The principles of logic that make QBE an intuitive way to query information are the same principles that will allow logic-aware managers and logic-literate business professionals to solve their most difficult challenges managing information. Plausible Desirability A data architect with a small army of business analysts can spend hundreds of hours playing 20,000 questions with business experts to determine what they consider to be a plausibly desirable range of outcomes, and still not come up with a complete list. Scope creep is unavoidable with this approach. But when business experts define their own vocabularies the problem is significantly reduced or even completely eliminated. What is plausibly desirable to an owner or expert may change over time, but it will always change far more slowly than the perceived requirements when IT professionals must play the guessing game. Even when a vocabulary is defined by a 3rd party, a logic-literate expert can readily determine if it will meet their needs. For this reason, a junior manager or analyst can take on the legwork of creating a vocabulary on behalf of an information owner, and the owner can then validate it when complete. This is the way financial budgets are managed: they are prepared by subordinates, and then approved by an accountable manager. Budgets are generally not accepted 50 Issues in Information Systems Volume 20, Issue 4, pp. 45-55, 2019 without careful review and firsthand approval from the responsible leader. It should be likewise with information resources. But this can only become possible with logic-literate business professionals and logic-aware management. When integration is required, resources must be defined in collaboration with professionals within each of the related processes and subject matter areas. Deciding the fundamental four requires no knowledge whatsoever of any technical machine or software (except for advice from specialists on cost benefit issues, which are discussed in “Cost-Benefit Concerns” below). Sophotaxical Derangement Technical expertise can actively and severely impair one’s ability to decide the fundamental four, aside from and exacerbating the fact of not having firsthand domain expertise, regardless of how well one may understand database software, which is irrelevant here. The following example shows how eminently qualified technical experts at a world leading company, who are also themselves in this case the domain experts, are blinded by their technical perspective from obvious and severe problems with the fundamental four. They are the database architects of one of the world’s largest technical content publishing systems. The system is built around two types of content items: articles and tables of contents (TOC’s), each of which can participate in many-to-many relationships with the other. However, in this system all articles and TOC definitions are stored as XML-typed data in a single base table. The rationale for this decision is that “it’s all XML data” therefore, it is all of the same type and therefore the same domain (no, it’s not). As a result, the leaders of the organization are frequently unable, despite tremendous expense and effort, to answer simple questions about their content, such as “Which authors have written articles about a given topic?” Even the seemingly simple task of building a path of breadcrumb links for topical navigation requires an enormously complex stored procedure. The joke goes (even within this team) that the easiest way to find something within in this system is to Google it. As a result, they may end up just creating their own web-crawled key-value indexes like a search engine and forfeit any hope of achieving the richness, power, and flexibility that is only possible with truly relational information. The problem here is that programming requires a mode of thinking that is quite different from the sophotaxical processes necessary to organize information. They can be difficult to clearly separate when working with both simultaneously – even in rare cases when the engineers are also the subject matter experts. This dilemma is further exacerbated because programmers and database engineers are just as unlikely to have learned classical logic as business professionals and other experts. GIGO is Overrated ‘Garbage in, garbage out (GIGO)” is a common explanation for poor data quality. But a far more frequent problem exists when information within a system is known to be complete and correct but cannot be used in a desired way, or formed into desired outputs, because it is not organized in a way that allows it. In other words, when it is sophotaxically deranged. This is also a primary root cause of integration failure. Information systems cannot be integrated unless the combined logical signatures are sophotaxically correct. That is possible only when the individual component systems are themselves sophotaxically correct and compatible with one another. Integration can only be achieved through the coordinated efforts of logic-aware management. No amount of effort by architects, engineers or consultants can succeed without this. Moving data from an old platform to a new one is not the problem here. The real challenge is reorganizing information that is sophotaxically deranged, which has nothing to do with technology, or the age of any system or programming language. It has only to do with the way the information has been organized. The U.S. Department of Defense spent 10 years and more than a billion dollars trying to integrate its payroll systems, and the project ended in complete failure with no change in capability or any useful outcome (Reuters). Sophotaxical derangement may also provide cover for fraud and abuse. When a system is unable to detect or verify errors in legitimate transactions, it seems unlikely that it can be used to detect fraud. Without logic-aware management it is nearly impossible to integrate information systems, even for highly funded government agencies and world leading technology companies. Logic-literate business professionals and logicaware management are necessary to create vocabularies to describe the accounts of any large enterprise. 51 Issues in Information Systems Volume 20, Issue 4, pp. 45-55, 2019 Sophotaxical derangement can contribute to other factors relating to poor data quality as well. For example, the problems identified by Francis and Prevosto in Data and Disaster: The Role of Data in the Financial Crisis were cases of failure to obtain or analyze data. Sophotaxical derangement can be a significant factor in such failures. A 2010 Thomson Reuters and Lepus report claims that fragmented data platforms contributed to the 2008 financial crisis (FinExtra Research). Likewise, disjointed systems prevented the intelligence community from connecting the dots prior to the 911 terror attacks. Fragmented systems that cannot be integrated, but should be, are common evidence of sophotaxical derangement and present enormous opportunities for logic education in business. Cost-benefit Concerns In the process of defining a logical vocabulary, engineers and architects can provide valuable input on cost-benefit decisions involving issues like performance, storage, and scale. For example, an owner might define the fundamental four in a way that accomplishes the critical business process of identity management using alphanumeric codes rather than sequential integers. An experienced architect might advise that implementing this requirement could cause problems with performance because of the way some database software works. The owner then has a decision to make. Options might include: 1) accept the performance consequences, which could be very significant; 2) change their approach to identity management and use sequential integers instead of the codes; or 3) add extra columns with integers (surrogate keys) to make the software work faster, and accept the additional complexity and long-term management overhead of maintaining two (or more) separate identity management systems. Whatever the decision, it should be informed by specialists, but decided by information owners in view of the overall leadership objectives and priorities. The need for cost-benefit analysis and related tradeoffs when defining the fundamental four is not new. Don Chamberlin, co-inventor of the SQL language, explained that the original IBM System R team designed SQL to give customers the option to enforce certain rules only when they choose to. One reason he cited was that enforcing the rules takes additional processing and storage, which in the mid 1970’s was quite expensive. He explained, “When the original SQL designers decided to allow users the options of handling nulls and duplicates, they viewed these features as minor conveniences, not as major departures from orthodoxy, taken at the risk of excommunication” (Chamberlain, 1996). But that decision did not force anyone to do or ignore anything. Rather, it gave users the freedom to make their own cost-benefit decisions. Unfortunately, to this day those kinds of decisions do not often fall to the people who bear the cost of poor choices and are properly positioned to make good ones. So, not only are highly sophotaxical decisions being made by non-subject matter experts, but – adding insult to injury – they are often implemented in a defective manner that makes the problems even worse. Sophotaxical derangement is a pervasive aspect of production information systems everywhere, and presents a vast frontier of research opportunity for logic oriented IS scholars. The first such efforts should focus on characterizing and cataloging the various modes of failure. Triumph of ‘the Systems of Men’ Thomas Haigh (2001) explains why information management is treated as a technical discipline rather than a management one: “The systems men were the members of the Systems and Procedures Association (SPA) during the 1950’s and 60’s. They offered corporate executives an implicit bargain: ‘You put us in charge and we’ll deliver to you more power over your firms than you’ve ever dreamed of”. The papers presented at their International Systems meetings…exhibit a fixation on questions of status and power. But executives for the most part were not convinced that technical skills should translate into management authority. The systems men claimed to possess a body of objective knowledge and techniques qualifying them to make superior decisions within a particular technical domain. But their task of legitimation was uniquely difficult because their claimed domain was management itself. Only during the 1980s did the term MIS become so tainted by failure, reflecting the persistent reality of computer work’s low status in the eyes of management that academics and management writers flocked to alternatives rather than to redefinition.” By the 1970’s the SPA was defunct; but the systems men had left a stubborn cultural legacy that persists still today: the idea that managing information is a job for technicians. Haigh continues: 52 Issues in Information Systems Volume 20, Issue 4, pp. 45-55, 2019 “For better or worse, to speak of something as an information system continues to imply that it should be engineered by an information specialist and built using information technology. It seems unlikely that the idea of information can ever truly be separated from these roots: it is just too historically and culturally charged.” To overcome this illusion, we must look to the past. Sophotaxis in the Middle Ages Double entry bookkeeping (DEB) has been used in commerce since around 1296. It has two defining characteristics: 1. 2. A system of interconnected books which are cross-referenced and correlated. We’ll refer to this as relational bookkeeping for reasons explained below A process of dual entry where every transaction is recorded twice, once as a debit and once as a credit, for the purposes of checking math. This aspect is not significant for our analysis here Hoskin, Ma, and Macve (2015) explain that these two characteristics were used separately at different times and places through history. Accounting historians disagree on which aspect is more significant. However, the use of both together is always indicated by the term double entry bookkeeping. This unfortunate fact inevitably clouds the debate. We propose the term relational bookkeeping to specify the system of interconnected books to eliminate this ambiguity. Relational bookkeeping (i.e., the system of interconnected books), even as practiced in the Middle Ages, is a practical application of classical first-order logic, and every object and process can be described in terms of modern relational databases. Below are some DEB terms with their corresponding modern counterparts: • • • • • • • • • Journals and inventories are relations (transaction tables) The repertory or finding key is a relation (reference table) Book entries are n-tuples (rows) Columns are domains (attributes) Ledger accounts are filtered views of journals, and therefore they are also relations Ledger accounts refer to journal entries by way of page reference numbers, which are essentially foreign key constraints for each account Balance sheets and other financial statements are filtered or aggregated views of the ledger, capital and cash accounts The rules of DEB provide consistency and referential integrity A Chart of accounts is a system catalog, or metadata dictionary Every aspect of the relational model is satisfied. With this it becomes clear that medieval merchants in northern Italy invented fully functional, manually operated relational database management systems, almost 700 years before the theoretical foundations of those systems were to be formally described and understood. Their leather-bound books and manual techniques served the same purposes and accomplished the same kinds of outcomes as modern databases and query languages – with perhaps even more reliable results due to higher rates of logic literacy and observance of sophotaxical principles. The same paper-and-ink techniques could have been used to manage other kinds of information as well, but manual accounting systems are painstaking and time consuming, so it is easy to understand why their use were limited to managing the kind of information of greatest concern to early merchants. Implications These facts stand as evidence that computers and technology have not fundamentally changed the underlying nature of information management, as has been commonly assumed since the 1970’s. They reestablish the previously longunderstood fact that managing information is a management discipline, not a technology or even a hybrid discipline. They further establish that the discipline of information management is based on fundamental principles. And not just any ordinary fundamental principles, but the principles of classical logic, which many mathematicians, philosophers, 53 Issues in Information Systems Volume 20, Issue 4, pp. 45-55, 2019 and scholars consider to be the most fundamental of all principles. Kurt Gödel, the best friend of Albert Einstein (and whom their colleagues considered his intellectual equal) described logic as “a science prior to all others, which contains the ideas and principles underlying all sciences.” Compare his use of words to the following statements by economist Werner Sombart in 1919: “Double-entry bookkeeping came from the same spirit which produced the systems of Galileo and Newton, and the subject matter of modern physics and chemistry. . . “Double-entry bookkeeping is based on ... the basic principle of quantification which has delivered up to us all the wonders of nature, and which appeared here for the first time in human history in all its clarity.” (Most, 1970) Sombart and others were roundly criticized for supposedly overstating the nature and significance of double-entry bookkeeping; but now, with recognition of its congruence with classical first-order logic, those criticisms should be reconsidered. CONCLUSION The ancient secrets of information management consist of two important lessons: Ancient Secret #1 Classical logic can be used to organize information. The modern field of library and information science, for which the organization of information is a primary concern, has only recently begun to investigate this useful fact. Fricke argues, “It should be possible to give an adequate account of classification using only First Order Logic.” And a review of Fricke supports our claim that the task of using logic to organize information should fall to subject matter experts rather than information specialists (Rasmussen , 2013). Practitioners of commerce have relied on the expressiveness and computational power of classical logic to organize information for many centuries. But logic is not enough. A vocabulary is also needed, which must come from logicliterate business professionals and managers, as expounded in secret #2. Ancient Secret #2 Decisions about organizing information should be made by logic-literate subject matter experts. The medieval merchants who practiced relational bookkeeping certainly did not look to tool makers to define their ledgers and accounts. But that is what modern organizations routinely do. It makes no difference that the old tools were made from parchment, feathers, and dye, and the new ones from computers, software, and networks. The old tools served precisely the same purpose as the new with respect to the definition and management of information resources. The new tools serve an additional purpose of automating processes and workflows, but that is no reason to believe that the engineers who create the tools should also be the ones to define the accounts, and there are important reasons to understand why they should not. REFERENCES Attridge, N., Aberdein, A. & Inglis, M. (2016). Does Studying Logic Improve Logical Reasoning? In: Proceedings of the 40th Conference of the International Group for the Psychology of Mathematics Education, Szeged, Hungary, 3-7. Chamberlin, D. (1996). Using the new DB2. San Francisco: Morgan Kaufmann. Codd, E. F. (1970). A relational model of data for large shared data banks. Communications of the ACM, 13(6), 377–387. Darwen, H. (2006, January). The Askew Wall. Retrieved from www.dcs.warwick.ac.uk/~hugh/TTM/TTMTheAskewWall-printable.pdf 54 Issues in Information Systems Volume 20, Issue 4, pp. 45-55, 2019 Date, C. J. (2011). SQL and relational theory. Sebastopol: O'Reilly Media. FinExtra Research. “After Financial Crisis, Firms Look to Improve Data Quality - Thomson Reuters Report.” Finextra Research, 2 Mar. 2010 Fricke, M. H. (2012). Logic and the organization of information. New York: Springer. Fuller, Ronald (2018). “First-Order Logic In13th-Century Accounting Systems.” The Bulletin of Symbolic Logic, vol. 24, 2018, p. 209, doi:10.1017/bsl.2018.12. Fuller, R., & Cardon, P. (2016). Sophotaxis. The Bulletin of Symbolic Logic, 23(1), 128. Geijsbeek, J. (1914). Ancient doubleentry bookkeeping. Geijsbeek, Denver. Includes translation of Pacioli, L. 1494. Summa de arithmetica, geometrica. Proportioni et proportionalita. Paganini, Venice. Genesereth, M., & Kao, E. (2013). Introduction to Logic (2nd ed.). Williston, VT: Morgan & Claypool. Genesereth, M. & Chaudhri, V. Logic in Secondary Education. http://intrologic.stanford.edu/miscellaneous/intro.html. Accessed 28 January 2019. Haigh, T. (2001). Inventing information systems: The systems men and the computer, 1950–1968. Business History Review, 75, 1561. Hoskin, K., Ma, D., & Macve, R. (2015). A genealogy of myths about the rationality of accounting in the West and in the East (London School of Economics working paper). Cited with permission. Joseph, Miriam, and Marguerite McGlinn. The Trivium: The Liberal Arts of Logic, Grammar, and Rhetoric : The Understanding the Nature and Function of Language. Paul Dry Books, 2002. Most, K.S. (1970). The role of accounting in the economic development of the modern state. Ph.D. dissertation, University of Florida. Includes translation from Sombart, W. 1919. Der Moderne Kapitalismus. 3d ed. Duncker and Hurablot, Munich and Leipzig. Newcombe, C., Rath, T. Zhang, F., Bogdan, M., Brooker, M. & Deardeuff, M. (2015). How Amazon Web Services Uses Formal Methods. Communications of the ACM, 58(4), 66–73. Prawat, R. S. (2003). The Nominalism Versus Realism Debate: Toward a Philosophical Rather Than a Political Resolution. Educational Theory, 53(3), 275–311. Rasmussen, D. (2013). Review of Logic and the Organization of Information by Martin Frické. Canadian Journal of Information and Library Science, 37(1). Reuters. (2013). Special Report: How the Pentagon’s Payroll Quagmire Traps America’s... Reuters, 9 July 2013. www.reuters.com, https://www.reuters.com/article/us-usa-pentagon-payerrors-special-reportidUSBRE96818I20130709. Sowa, J. (2019). Notes on the History of Logic. http://www.jfsowa.com/peirce/hist_log.htm. Accessed 7 March 2019. 55