A Novel Categorical Approach to Semantics of Relational First-Order Logic

Schreiner, Wolfgang; Steingartner, William; Novitzká, Valerie

doi:10.3390/sym12101584

Open AccessArticle

A Novel Categorical Approach to Semantics of Relational First-Order Logic

by

Wolfgang Schreiner

¹

,

William Steingartner

^2,*

and

Valerie Novitzká

²

¹

Research Institute for Symbolic Computation (RISC), Johannes Kepler University, Altenbergerstraße 69, A-4040 Linz, Austria

²

Faculty of Electrical Engineering and Informatics, Technical University of Košice, Letná 9, 042 00 Košice, Slovakia

^*

Author to whom correspondence should be addressed.

Symmetry 2020, 12(10), 1584; https://doi.org/10.3390/sym12101584

Submission received: 30 August 2020 / Revised: 16 September 2020 / Accepted: 18 September 2020 / Published: 23 September 2020

(This article belongs to the Section Computer)

Download

Browse Figures

Versions Notes

Abstract

:

We present a categorical formalization of a variant of first-order logic. Unlike other texts on this topic, the goal of this paper is to give a very transparent and self-contained account without requiring more background than basic logic and set theory. Our focus is to show how the semantics of first-order formulas can be derived from their usual deduction rules. For understanding the core ideas, it is not necessary to investigate the internal term structure of atomic formulas, thus we abstract atomic formulas to (syntactically opaque) relations; in this sense, our variant of first-order logic is “relational”. While the derived semantics is based on categorical principles (even the duality that arises from a symmetry between two ways of looking at something where there is no reason to choose one over the other), it is nevertheless “constructive” in that it describes explicit computations of the truth values of formulas. We demonstrate this by modeling the categorical semantics in the RISCAL (RISC Algorithm Language) system which allows us to validate the core propositions by automatically checking them in finite models.

Keywords:

category; functor; RISCAL; relation; relational first-order logic; semantics

1. Introduction

Most introductions to first-order logic first define the syntax of formulas, then formalize their meaning in the form established in the 1930s by Tarski [1] (essentially what we call today in programming language theory a “denotational semantics” [2]), then introduce a deduction calculus, and finally show the soundness and completeness of this calculus concerning the semantics: if a formula can be derived in the calculus, it is true according to the semantics, and vice versa. These relationships between truth and derivability have to be established because there is no self-evident link between the semantics of a formula and the deduction rules associated with it. Historically, deduction came first; the soundness of a deduction calculus was established by showing that it could not lead to apparent inconsistencies, i.e. that both a formula and its negation could not be derived in a deduction system. It was Tarski who first gave meaning to formulas that was independent of deduction.

However, as it did since the 1940s to many other mathematical areas, category theory [3,4,5,6,7], the general theory of mathematical structures, can bring and provide an alternative light also to first-order logic. It does so by considering logical notions as special instances of “universal” constructions, where a value of interest is determined

first, by depicting the core property that the value shall satisfy; and
second, by giving a criterion how to choose a canonical value from all values that satisfy the property.

It was eventually recognized that by such universal constructions the semantics of the connectives of propositional logic could be determined directly from their associated introduction and elimination rules. However, it took until the late 1960s until Lawvere gained the fundamental insight that this idea could be also applied to the quantifiers of first-order logic [8], thus establishing a direct relationship between its semantics and its proof calculus.

However, this insight has not yet obtained a foothold in basic texts on logic and its basic education. The main reason may be that the corresponding material is found mostly in texts on category theory and its applications where it is dispersed among examples of the application of categorical notions without a clear central presentation. Furthermore, the general treatment of first-order logic with terms and variables requires a complex mathematical apparatus [9] which is much beyond the scope of basic introductions. Reasonably compact introductions can be found, e.g., in Section 2.1.10 of [10], in [11], in Section 1.6 of [12] (however, in the context of type theory rather than classical first-order logic), in Section 9.5 of [3] (the treatment of quantifiers only), and in Section 7.1.12 of [7] (again only the treatment of quantifiers).

The goal of this paper is to give a compact introduction to a categorical version of first-order logic that is fully self-contained, only introduces the categorical notions relevant for the stated purpose, and presents them from the point of view of the intended application. For this purpose, it elaborates a simple but completely formalized syntactic and semantic framework of first-order logic that represents the background of the discussion, without gaps and inconsistencies. As a deliberate decision, this framework does not address the syntax and semantics of terms but abstracts atomic formulas to opaque relations; this allows for focusing the discussion on the essentials. However, to describe a reasonably close relative of first-order logic, this framework is (in contrast to other presentations) not based on relations of fixed arity, i.e., with a fixed number of variables; instead, we consider relations of infinite arity, i.e., with infinitely many variables. However, only finitely many variables may influence the truth value of the relation, which represents the effect that a classical atomic formula can only reference a finite number of variables. The overall result is a slick and elegant presentation. Because the duality has many manifestations in logic and it is agreed by all hands that a duality is like a “giant symmetry”—a symmetry between theories, we focus on this concept in our approach. For the implementation, we use the RISCAL—the RISC Algorithm Language [13], which is a specification language with an associated software system for describing mathematical algorithms, formally specifying their behavior based on mathematical theories, and validating the correctness of algorithms, specifications, and theories by the execution/evaluation of their formal semantics. The term “algorithm language” indicates that RISCAL is intended to model, rather than low-level code, algorithms (as can be found in textbooks on discrete mathematics) in a high-level language, and specifying the behavior of these algorithms by formal contacts. RISCAL has been developed to validate the correctness of mathematical theories, specifications, and programs, by checking instances of these artifacts on finite domains; applications of RISCAL are for instance in discrete mathematics, number theory, and computer algebra. Software based on formal logic plays an ever-increasing role in areas where a mathematically precise understanding of a subject domain and sound rules for reasoning about the properties of this domain are essential. A prime example is the formal modeling, specification, and verification of computer programs and computing systems, but there are many other applications in areas such as knowledge-based systems, computer mathematics, or the semantic web [14]. Furthermore, the intent of all our projects (namely LogTechEdu, SemTech [15], and others listed in Funding section) is to further advance education in computer science and related topics. In academical courses for computer science and mathematics, by utilizing the power of modern software based on formal logic and semantics, students shall engage with the material they encounter by actively producing the problem solutions rather than just passively taking them from the lecturer.

The remainder of this paper is structured as follows: in Section 2, we define a term-free variant of first-order logic and give it a semantics in the usual style based on set-theoretic notions. In Section 3, we introduce those categorical notions that are necessary for understanding the following elaboration and discuss their relationships. The core of this paper is Section 4 where we elaborate the categorical formulation of the semantics of our variant of first-order logic. In Section 5, we demonstrate that these semantics are constructive by modeling it in the RISCAL system [13], which allows us to automatically check the core propositions in particular finite models. Section 6 concludes our presentation and gives an outlook on our future work.

2. A Relational First-Order Logic

In this section, we introduce a simplified variant of first-order logic that abstracts from the syntactic structure of atomic formulas and thus copes without the concept of terms, constants, function symbols, predicate symbols, and all of the associated semantic apparatus. Towards this goal, atomic formulas are replaced by relations over assignments (maps of variables to values) that are constrained to only depend on a finite number of variables; we will call such relations “predicates”. Consequently, the semantics of every non-atomic formula is also a relation (i.e., a predicate, as mentioned).

We begin with some standard notions. First, we specify variables and the values that variables hold.

Axiom 1

(Variables and Values). Let

Var

denote an arbitrary infinite and enumerable set; we call the elements of this set variables. Furthermore, let

Val

denote an arbitrary non-empty set; we call the elements of these set values.

Next, we define assignments.

Definition 1

(Assignments). We define

Ass : = Var \to Val

as the set of all mappings of variables to values (a function space); we call the elements of this set assignments. Thus, for every assignment

a \in Ass

and every variable

x \in Var

, we have

a (x) \in Val

.

We note that assignment is similar to a concept of state in theory of formal semantics of programming languages (see, e.g., [16]) where the state is a function from variables to values: to each variable, the state associates its current value.

Definition 2

(Updates). Let

a \in Ass

be an assignment,

x \in Var

a variable, and

v \in Val

a value. We define the update assignment

a [x \mapsto v] \in Ass

as follows:

\begin{matrix} a [x \mapsto v] (y) & : = \{\begin{matrix} v, & if x = y; \\ a [x], & otherwise . \end{matrix} \end{matrix}

Consequently,

a [x \mapsto v]

is identical to a except that it maps variable x to value v.

Based on this, we can formulate the following updating properties.

Proposition 1

(Update Properties). Let

a \in Ass

be an assignment,

x, y \in Var

variables, and

v, v_{1}, v_{2} \in Val

values. Then, we have the following properties:

\begin{matrix} a [x \mapsto a (x)] & = a, \\ a [x \mapsto v_{1}] [x \mapsto v_{2}] & = a [x \mapsto v_{2}], \\ x \neq y \Rightarrow a [x \mapsto v_{1}] [y \mapsto v_{2}] & = a [y \mapsto v_{2}] [x \mapsto v_{1}] . \end{matrix}

Proof.

Directly from the definitions. □

The properties of assignments listed above (and only these) will be of importance in the subsequent proofs.

Now, we turn to the fundamental semantic notions.

Definition 3

(Relations). We define

Rel : = P (Ass)

as the set of all sets of assignments; we call the elements of this set “relations”. Consequently, a relation is a set of assignments.

Definition 4

(Variable Independence). We state that relation

R \in Rel

is independent of variable

x \in Var

, written as

R ⫫ x

, if and only if the following holds:

\forall a \in Ass, v_{1}, v_{2} \in Val . a [x \mapsto v_{1}] \in R \Leftrightarrow a [x \mapsto v_{2}] \in R .

Consequently, if

R ⫫ x

, the value of x in any assignment a does not influence whether a is in R. We say that R depends on x if

R ⫫ x

does not hold.

We transfer the central syntactic property of atomic formulas (they can only refer to finitely many variables) to its semantic counterpart.

Definition 5

(Predicates). A relation

R \in Rel

is a predicate, if it only depends on finitely many variables. We denote by

Pred

the set of all predicates and by

{Pred}_{x} : = {P \in Pred | P ⫫ x}

the subset of all predicates that are independent of x.

Now, we are ready to introduce the central entities of our paper. First, we give a definition of the abstract syntax of formulas.

Definition 6

(Abstract Syntax of Formulas). We define

F o r

as that smallest set of abstract syntax trees in which every element

F \in For

is generated by an application of a rule of the following context-free grammar (where

P \in Pred

denotes an arbitrary predicate and

x \in Var

denotes an arbitrary variable):

\begin{matrix} F & : : = P | ⊤ | ⊥ \\ | \neg F | F_{1} \land F_{2} | F_{1} \lor F_{2} | F_{1} \to F_{2} | F_{1} \leftrightarrow F_{2} \\ | \forall x . F | \exists x . F \end{matrix}

We call the elements of this set formulas.

In this definition, the role of a classic atomic predicate

p (t_{1}, \dots, t_{n})

with argument terms

t_{1}, \dots, t_{k}

in which n variables

x_{1}, \dots, x_{n}

occur freely is abstracted to a predicate P that depends on variables

x_{1}, \dots, x_{n}

.

Now, we establish the relationship between the syntax and semantics of formulas.

Definition 7

(Semantics of Formulas). Let

F \in For

be a formula. We define the relation

〚 F 〛 \in Rel

, called the semantics of F, by induction on the structure of F:

\begin{matrix} 〚 P 〛 & : = {a \in Ass | a \in P} \\ 〚 ⊤ 〛 & : = {a \in Ass | true} \\ 〚 ⊥ 〛 & : = {a \in Ass | false} \\ 〚 \neg F 〛 & : = {a \in Ass | a \notin 〚 F 〛} \\ 〚 F_{1} \land F_{2} 〛 & : = {a \in Ass | a \in 〚 F_{1} 〛 and a \in 〚 F_{2} 〛} \\ 〚 F_{1} \lor F_{2} 〛 & : = {a \in Ass | a \in 〚 F_{1} 〛 or a \in 〚 F_{2} 〛} \\ 〚 F_{1} \to F_{2} 〛 & : = {a \in Asss | a \notin 〚 F_{1} 〛 or a \in 〚 F_{2} 〛} \\ 〚 F_{1} \leftrightarrow F_{2} 〛 & : = {a \in Ass | (a \in 〚 F_{1} 〛 and a \in 〚 F_{2} 〛) or (a \notin 〚 F_{1} 〛 and a \notin 〚 F_{2} 〛)} \\ 〚 \forall x . F 〛 & : = {a \in Ass | a [x \mapsto v] \in 〚 F 〛, for all v \in Val} \\ 〚 \exists x . F 〛 & : = {a \in Ass | a [x \mapsto v] \in 〚 F 〛, for some v \in Val} \end{matrix}

The above definition is well-defined in that every formula denotes a relation. To show that formulas indeed denote predicates, some more work is required.

Proposition 2

(Quantified Formulas and Variable Independence). For every variable

x \in Var

and formula

F \in For

, we have

〚 \forall x . F 〛 ⫫ x

and

〚 \exists x . F 〛 ⫫ x

, i.e., the semantics of quantified formulas do not depend on x.

Proof.

We prove this proposition by reductio ad absurdum.

First, assume that

〚 \forall x . F 〛

depends on x. Then, we have some assignment

a \in 〚 \forall x . F 〛

and some values

v_{1}, v_{2} \in Val

such that

a [x \mapsto v_{1}] \in 〚 \forall x . F 〛

and

a [x \mapsto v_{2}] \notin 〚 \forall x . F 〛

. From

a [x \mapsto v_{2}] \notin 〚 \forall x . F 〛

we have some

v \in Val

with

a [x \mapsto v_{2}] [x \mapsto v] \notin 〚 F 〛

and thus

a [x \mapsto v] \notin 〚 F 〛

. However,

a [x \mapsto v_{1}] \in 〚 \forall x . F 〛

implies

a [x \mapsto v_{1}] [x \mapsto v] \in 〚 F 〛

and thus

a [x \mapsto v] \in 〚 F 〛

, which represents a contradiction.

Now, assume that

〚 \exists x . F 〛

depends on x. Then, we have some assignment

a \in 〚 \exists x . F 〛

and some values

v_{1}, v_{2} \in Val

such that

a [x \mapsto v_{1}] \in 〚 \exists x . F 〛

and

a [x \mapsto v_{2}] \notin 〚 \exists x . F 〛

. From

a [x \mapsto v_{1}] \in 〚 \exists x . F 〛

, we have some

v \in Val

with

a [x \mapsto v_{1}] [x \mapsto v] \in 〚 F 〛

and thus

a [x \mapsto v] \in 〚 F 〛

. However,

a [x \mapsto v_{2}] \notin 〚 \exists x . F 〛

implies

a [x \mapsto v_{2}] [x \mapsto v] \notin 〚 F 〛

and thus

a [x \mapsto v] \notin 〚 F 〛

, which represents a contradiction. □

Proposition 3

(Formula Semantics and Predicates). For every formula

F \in For

, we have

〚 F 〛 \in Pred

, i.e., the semantics of F is a predicate.

Proof.

The proof proceeds by induction over the structure of F.

If $F = P$ , we have $〚 F 〛 = {a \in Ass | a \in P} = P \in Pred$ .
If $F \in {⊤, ⊥}$ , there are no $x, a, v_{1}, v_{2}$ such that $a [x \mapsto v_{1}] \in 〚 F 〛$ and $a [x \mapsto v_{2}] \notin 〚 F 〛$ because, for $F = ⊤$ , the second condition must be false and for $F = ⊥$ the first one; thus, F does not depend on any variable.
If $F = \neg F_{1}$ , by the induction hypothesis, we may assume that $〚 F_{1} 〛$ depends only on the variables in some finite variable set X. From the definition of $〚 F 〛$ , it is then easy to show that $〚 \neg F_{1} 〛$ also depends only on the variables in X.
If $F \in {F_{1} \land F_{2}, F_{1} \lor F_{2}, F_{1} \to F_{2}, F_{1} \leftrightarrow F_{2}}$ , we may assume by the induction hypothesis that $〚 F_{1} 〛$ depends only on the variables in some finite set $X_{1}$ while $P_{2}$ only depends on the variables in some finite set $X_{2}$ . From the definition of $〚 F 〛$ , it is then easy to show that $〚 F 〛$ depends only on the variables in the finite set $X_{1} \cup X_{2}$ .
If $F \in {\forall x . F_{1}, \exists x . F_{1}}$ , we may assume by the induction hypothesis that $〚 F_{1} 〛$ only depends on the variables in some finite variable set X. We are now going to show that $〚 F 〛$ only depends on the variables in the finite set $X \ {x}$ . Actually, we assume that this is not the case and show a contradiction. From this assumption and Proposition 2, we have a variable $y \neq x \land y \notin X$ on which F depends; thus, we have an assignment a and values $v_{1}, v_{2}$ such that $a [y \mapsto v_{1}] \in 〚 F 〛$ and $a [y \mapsto v_{2}] \notin 〚 F 〛$ .
If $F = \forall x . F_{1}$ , from $a [y \mapsto v_{2}] \notin 〚 F 〛$ , we have a $v \in Val$ with $a [y \mapsto v_{2}] [x \mapsto v] \notin 〚 F_{1} 〛$ and thus (since $y \neq x$ ) $a [x \mapsto v] [y \mapsto v_{2}] \notin 〚 F_{1} 〛$ . From $a [y \mapsto v_{1}] \in 〚 F 〛$ , we know $a [y \mapsto v_{1}] [x \mapsto v] \in 〚 F_{1} 〛$ and thus $a [x \mapsto v] [y \mapsto v_{1}] \in 〚 F_{1} 〛$ . Thus, $〚 F_{1} 〛$ depends on a variable $y \notin X$ which contradicts the induction assumption.
If $F = \exists x . F_{1}$ , from $a [y \mapsto v_{1}] \in 〚 F 〛$ , we have a $v \in Val$ with $a [y \mapsto v_{1}] [x \mapsto v] \in 〚 F_{1} 〛$ and thus (since $y \neq x$ ) $a [x \mapsto v] [y \mapsto v_{1}] \in 〚 F_{1} 〛$ . From $a [y \mapsto v_{2}] \notin 〚 F 〛$ we know $a [y \mapsto v_{2}] [x \mapsto v] \notin 〚 F_{1} 〛$ and thus $a [x \mapsto v] [y \mapsto v_{2}] \notin 〚 F_{1} 〛$ . Thus, $〚 F_{1} 〛$ depends on a variable $y \notin X$ which contradicts the induction assumption.

This completes our proof. □

In the following, we transfer the classical model-theoretic notions to our framework.

Definition 8

(Satisfaction). Let

a \in Ass

be an assignment and

F \in For

be a formula. We define

a ⊧ F

(read: a satisfies F) as follows:

a ⊧ F : \Leftrightarrow a \in 〚 F 〛 .

Definition 9

(Validity). Let

F \in For

be a formula. We define

⊧ F

(read: F is valid) as follows:

⊧ F : \Leftrightarrow \forall a \in Ass . a ⊧ F .

Definition 10

(Logical Consequence). Let

F, G \in For

be formulas. We define

F ⊧ G

(read: G is a logical consequence of F) as follows:

F ⊧ G : \Leftrightarrow \forall a \in Ass . a ⊧ F \Rightarrow a ⊧ G .

Definition 11

(Logical Equivalence). Let

F, G \in For

be formulas. We define

F \equiv G

(read: F and G are logically equivalent) as follows:

F \equiv G : \Leftrightarrow \forall a \in Ass . a ⊧ F \Leftrightarrow a ⊧ G .

Proposition 4

(Logical Consequence and Logical Equivalence). Let

F, G \in For

be formulas. Then, we have the following equivalences:

$(F ⊧ G) \Leftrightarrow (⊧ F \to G)$
$(F ⊧ G) \Leftrightarrow (〚 F 〛 \subseteq 〚 G 〛)$
$(F \equiv G) \Leftrightarrow (⊧ F \leftrightarrow G)$
$(F \equiv G) \Leftrightarrow (〚 F 〛 = 〚 G 〛)$

Proof.

Directly from the definitions. □

Thus, a logical consequence on the meta-level coincides with an implication on the formula level and with the subset relation on the semantic level. Furthermore, logical equivalence on the meta-level coincides with equivalence on the formula level and with the equality relation on the semantic level.

In the following, we establish a set-theoretic interpretation of the logical operations of our formula language.

Definition 12

(Complement). We define the complement

\bar{R} \in Rel

of relation

R \in Rel

as the relation

\bar{R} : = Ass \ R

. Consequently, an assignment is in

\bar{R}

if and only if it is not in R.

Proposition 5

(Propositional Semantics as Set Operations). Let

F, F_{1}, F_{2} \in For

be formulas. We then have the following equalities:

\begin{matrix} 〚 P 〛 & = P \\ 〚 ⊤ 〛 & = Ass \\ 〚 ⊥ 〛 & = \emptyset \\ 〚 \neg F 〛 & = \bar{〚 F 〛} \\ 〚 F_{1} \land F_{2} 〛 & = 〚 F_{1} 〛 \cap 〚 F_{2} 〛 \\ 〚 F_{1} \lor F_{2} 〛 & = 〚 F_{1} 〛 \cup 〚 F_{2} 〛 \\ 〚 F_{1} \to F_{2} 〛 & = \bar{〚 F_{1} 〛} \cup 〚 F_{2} 〛 \\ 〚 F_{1} \leftrightarrow F_{2} 〛 & = (〚 F_{1} 〛 \cap 〚 F_{2} 〛) \cup (\bar{〚 F_{1} 〛} \cap \bar{〚 F_{2} 〛}) \end{matrix}

Proof.

Directly from the definition of the semantics. □

While the above results are quite intuitive, a corresponding set-theoretic interpretation of quantified formulas is not. In the following, we only state the plain result without indication of how it can be intuitively understood; we will delegate this explanation to Section 4, where the categorical framework will provide us with adequate insight.

Proposition 6

(Quantifier Semantics as Set Operations). Let

F \in For

be a formula. We then have the following equalities:

\begin{matrix} 〚 \forall x . F 〛 & = ⋃ {P \in Pred | P ⫫ x \land P \subseteq 〚 F 〛}, \\ 〚 \exists x . F 〛 & = ⋂ {P \in Pred | P ⫫ x \land 〚 F 〛 \subseteq P} . \end{matrix}

In other words,

〚 \forall x . F 〛

is the weakest predicate P (“weakest” in the sense of the largest set) that is independent from x and that satisfies the property

P \subseteq 〚 F 〛

while

〚 \exists x . F 〛

is the strongest predicate P (“strongest” in the sense of the smallest set) that is independent of x and that satisfies the property

〚 F 〛 \subseteq P

.

Proof.

The proof is in two stages. First, we take an arbitrary assignment

a \in Ass

and show

a \in 〚 \forall x . F 〛 \Leftrightarrow (\exists P \in Pred . P ⫫ x \land P \subseteq 〚 F 〛 \land a \in P)

⇒: We assume

a \in 〚 \forall x . F 〛

and prove for

P : = 〚 \forall x . F 〛

\begin{matrix} P \in Pred \end{matrix}

(1)

\begin{matrix} P ⫫ x \end{matrix}

(2)

\begin{matrix} P \subseteq 〚 F 〛 \end{matrix}

(3)

\begin{matrix} a \in P \end{matrix}

(4)

From Proposition 3, we have (1). From Proposition 2, we have (2). From

a \in 〚 \forall x . F 〛

, we have (4). To show (3), we take arbitrary assignment

a_{0} \in P

and show

a_{0} \in 〚 F 〛

. From

a_{0} \in P

, we know

a_{0} [x \mapsto v] \in 〚 F 〛

for

v : = a_{0} (x)

. Since

a_{0} [x \mapsto a_{0} (x)] = a_{0}

, we thus know

a_{0} \in 〚 F 〛

.

⇐: We assume for some

P \in Pred

\begin{matrix} P ⫫ x \end{matrix}

(5)

\begin{matrix} P \subseteq 〚 F 〛 \end{matrix}

(6)

\begin{matrix} a \in P \end{matrix}

(7)

and prove

a \in 〚 \forall x . F 〛

. For this, we take arbitrary

v \in Val

and prove

a [x \mapsto v] \in 〚 F 〛

. From (6), it suffices to show

a [x \mapsto v] \in P

. From (5), we know

\begin{matrix} \forall v_{1}, v_{2} \in Val . a [x \mapsto v_{1}] \in P \Leftrightarrow a [x \mapsto v_{2}] \in P \end{matrix}

(8)

From (7) and

a = a [x \mapsto a (x)]

, we know

a [x \mapsto v_{0}] \in P

for

v_{0} : = a (x)

. Thus, with (8), we know

a [x \mapsto v] \in P

.

Now, we take arbitrary

a \in Ass

and show

a \in 〚 \exists x . F 〛 \Leftrightarrow (\forall P \in Pred . P ⫫ x \land 〚 F 〛 \subseteq P \Rightarrow a \in P)

⇒: We assume

a \in 〚 \exists x . F 〛

and take arbitrary but fixed

P \in Pred

for which we assume

\begin{matrix} P ⫫ x \end{matrix}

(9)

\begin{matrix} 〚 F 〛 \subseteq P \end{matrix}

(10)

Our goal is to show

a \in P

. From

a \in 〚 \exists x . F 〛

, we know

a [x \mapsto v] \in 〚 F 〛

for some

v \in Val

. From (10), we thus know

a [x \mapsto v] \in P

. From (9), we thus know

a [x \mapsto a (x)] \in P

. Since

a [x \mapsto a (x)] = a

, we thus know

a \in P

.

⇐: We assume

\begin{matrix} \forall P \in Pred . P ⫫ x \land 〚 F 〛 \subseteq P \Rightarrow a \in P \end{matrix}

(11)

and prove

a \in 〚 \exists x . F 〛

. From (11) instantiated with

P : = 〚 \exists x . F 〛

and Propositions 3 and 2, it suffices to prove

〚 F 〛 \subseteq 〚 \exists x . F 〛

. Take arbitrary assignment

a_{0} \in 〚 F 〛

. Since

a_{0} [x \mapsto a (x)] = a

, we thus have

a_{0} [x \mapsto v] \in 〚 F 〛

for

v : = a (x)

and thus

a_{0} \in 〚 \exists x . F 〛

. □

3. Category Theory

In this section, we discuss those aspects of category theory that are relevant for the subsequent categorical formulation of our relational first-order logic.

3.1. Basic Notions

We begin with the basic notions of category theory.

Definition 13

(Category). A category C is a triple

〈 O, A, \circ 〉

of the following components:

A class O of elements called C-objects or just objects.
A class A of elements called C-arrows or just arrows. Each arrow has a source object and a target object from O; we write $f : a \to b$ to indicate that f is an arrow with source a and target b. We write $C (a, b)$ to denote the class of all arrows of A with source a and target b (called the hom-class of all arrows from a to b). For every object x in O, A contains an arrow ${id}_{x} : x \to x$ called the identity arrow for x.
A composition—binary operation ∘ defined on arrows. For all arrows $f : a \to b$ and $g : b \to c$ , we have $(g \circ f) : a \to c$ . Furthermore, the composition satisfies the following axioms:
-
Associativity: $(h \circ g) \circ f = h \circ (g \circ f)$ , for all arrows $f : a \to b$ , $g : b \to c$ , $h : c \to d$ .
-
Identity: ${id}_{b} \circ f = f = f \circ {id}_{a}$ , for all arrows $f : a \to b$ .

Definition 14

(Isomorphism). Let C be a category and

a, b

be C-objects

a, b

. Then, we have

a ≃ b

(read: a and b are isomorphic) if there are C-arrows

f : a \to b

and

g : b \to a

, called isomorphisms, such that

g \circ f = {id}_{a}

and

f \circ g = {id}_{b}

.

Definition 15

(Subcategory). A category C is a subcategory of category

𝒟

if every C-object is also a

𝒟

-object, every C-arrow is also a

𝒟

-arrow, every identity arrow in C is also an identity arrow in

𝒟

, and

g \circ_{C} f = g \circ_{D} f

for all C-arrows

f : a \to b

and

g : b \to c

, where

\circ_{C}

denotes the composition in C and

\circ_{D}

denotes the composition in

𝒟

.

3.2. Object Constructions

We are now introducing constructions of categorical objects that will subsequently play an important role in the categorical formulation of relational first-order logic.

Definition 16

(Initial and Final Objects). Let C be a category. A C-object 0 is initial if for every C-object a there exists exactly one arrow

0_{a} : 0 \to a

. A C-object 1 is final if for every C-object a there exists exactly one arrow

1_{a} : a \to 1

.

The following diagram illustrates the arrows of an initial object 0 and a final object 1 with respect to an arbitrary object a:

This construction of initial/final objects is “universal” in the sense that it describes a class of entities (objects and accompanying arrows) that share a common property and picks from this class an entity whose characterizing property is the existence of exactly one arrow from/to every entity of this class. This defines the entity uniquely up to isomorphism. Further instances of such constructions will be given later.

Definition 17

(Product and Coproduct). Let C be a category. Then, the triple

〈 a \times b, π_{1}, π_{2} 〉

is a product of C-objects a and b if

a \times b

is a C-object, the product object, with arrows

π_{1} : a \times b \to a

and

π_{2} : a \times b \to b

, the projections, such that for every triple

〈 c, f, g 〉

with C-object c and arrows

f : c \to a

and

g : c \to b

there exists exactly one arrow

〈 f, g 〉 : c \to a \times b

such that the following diagram commutes:

Dually, the triple

(a + b, ι_{1}, ι_{2})

is a coproduct of C-objects a and b if

a + b

is a C-object, the coproduct object, with arrows

ι_{1} : a \to a + b

and

ι_{2} : b \to a + b

, the injections, such that, for every triple

〈 c, f, g 〉

with C-object c and arrows

f : a \to c

and

g : b \to c

, there exists exactly one arrow

[f, g] : a + b \to c

such that the following diagram commutes:

The product and the coproduct are thus defined by universal constructions analogous to those of the final and the initial element, respectively; thus, products and coproducts are also uniquely defined up to isomorphism.

Definition 18

(Product Arrow). Let C be a category with products

〈 a_{1} \times a_{2}, π_{1}, π_{2} 〉

and

〈 b_{1} \times b_{2}, π_{1}^{'}, π_{2}^{'} 〉

and arrows

f : a_{1} \to b_{1}

and

g : a_{2} \to b_{2}

, respectively. Then, the product arrow

f \times g : a_{1} \times a_{2} \to b_{1} \times b_{2}

is the arrow

〈 f \circ π_{1}, g \circ π_{2} 〉

.

Definition 19

(Exponential). Let C be a category in which, for all C-objects, there exists a product object. Then, the tuple

〈 b^{a}, {eval}_{a, b} 〉

is an exponential of C-objects a and b if

b^{a}

is a C-object, the exponential object, with arrow

{eval}_{a, b} : b^{a} \times a \to b

, the evaluation arrow, such that for every C-object c with arrow

f : c \times a \to b

there exists exactly one arrow

{curry}_{f} : c \to b^{a}

, the currying arrow, such that the following diagram commutes:

Since the exponential is also defined by a universal construction, it is uniquely defined up to isomorphism.

3.3. Functors and Adjunction

Moving on from individual categories, we will now discuss some concepts that address relationships between categories.

Definition 20

(Functor). Let C and

𝒟

be categories. A functor

F : C \to 𝒟

is a map that takes every C-object a to a

𝒟

-object

F (a)

and every C-arrow

f : a \to b

to a

𝒟

-arrow

F (f) : F (a) \to F (b)

such that

$F ({id}_{a}) = {id}_{F (a)}$ for every C-object a, and
$F (g \circ_{C} f) = F (g) \circ_{D} F (f)$ for all C-arrows $f : a \to b$ and $g : b \to c$ .

Definition 21

(Adjunction, Left, and Right Adjoint). Let C and

𝒟

be categories with functors

F : C \to 𝒟

and

G : 𝒟 \to C

. Then, we have

F ⊣ G

(read:

〈 F, G 〉

is an adjunction, F is a left adjoint of G, G is a right adjoint of F) if for every C-object a and

𝒟

-object b the arrow classes

𝒟 (F (a), b)

and

C (a, G (b))

are isomorphic, i.e., there exists a bijection between them. This is equivalent to saying that, for every C-object a and

𝒟

-object b, there exist two surjective mappings

s_{1} : 𝒟 (F (a), b) \to C (a, G (b))

and

s_{2} : C (a, G (b)) \to 𝒟 (F (a), b)

, i.e.,

for every $𝒟$ -arrow $g : F (a) \to b$ we have a C-arrow $f : a \to G (b)$ with $s_{2} (f) = g$ and
for every C-arrow $f : a \to G (b)$ , we have a $𝒟$ -arrow $g : F (a) \to b$ with $s_{1} (g) = f$ .

Note.

This equivalence is a consequence of the Cantor–Schröder–Bernstein theorem which states that there exists a bijective function between sets A and B if there exist injective functions

f : A \to B

and

g : B \to A

. This implies that such a bijective function also exists if there exist surjective functions

f^{'} : A \to B

and

g^{'} : B \to A

because, from these, we can define the injective functions

f (a) : = such b . g^{'} (b) = a

and

g (b) : = such a . f^{'} (a) = b

. While the theorem has been formulated for sets, it can also be generalized to classes.

The above formulation will become handy in proving that two functors represent an adjunction.

Proposition 7

(Equivalence of Adjunctions and Universals). Let C and

𝒟

be categories with functors

F : C \to 𝒟

and

G : 𝒟 \to C

. Then, the condition

F ⊣ G

is equivalent to each of the following two conditions:

1.: For every C-object a, there is a C-arrow $u : a \to G (F (a))$ , the “universal arrow”, such that, for every $𝒟$ -object b and C-arrow $f : a \to G (b)$ , there exists a $𝒟$ -arrow $g_{b, f} : F (a) \to b$ :
2.: For every $𝒟$ -object b, there is a C-arrow $v : F (G (b)) \to b$ , the “couniversal arrow”, such that, for every C-object a and $𝒟$ -arrow $g : F (a) \to b$ , there is a C-arrow $f_{a, g} : a \to G (b)$ :

Proof.

See the proof of Propositions 6 and 7 in [12]. □

3.4. Object Constructions by Adjunction

We conclude this section by demonstrating that the previously described object conjunctions can be also considered as applications of functors that are determined as left respectively right adjoints to certain basic functors.

Proposition 8

(Initial and Final Object by Adjunction). Let

1

be the “singleton” category with a single object ∗ (and consequently a single arrow

{id}_{*} : * \to *

); this category is uniquely defined up to isomorphism. Let C be a category with the constant functor

C : C \to 1

; in addition, this functor is uniquely defined up to isomorphism. Then, the following holds:

Let C-object 0 be initial and the “initial object functor” $I_{0} : 1 \to C$ be defined by $I_{0} (*) : = 0$ and $I_{0} ({id}_{*}) : = {id}_{0}$ . Then, we have $I_{0} ⊣ C$ , i.e., the initial object functor is a left adjoint of the constant functor.
Let C-object 1 be final and the “final object functor” $F_{1} : 1 \to C$ be defined by $F_{1} (*) : = 1$ and $F_{1} ({id}_{*}) : = {id}_{1}$ . Then, we have $C ⊣ F_{1}$ , i.e., the final object functor is a right adjoint of the constant functor.

Proof.

For showing the first statement, we take the initial object 0 with initial object functor

I_{0}

. We show

I_{0} ⊣ C

, i.e., that

C (I_{0} (*), a)

and

1 (*, C (a))

are isomorphic, for arbitrary C-object a. This follows from

1 (*, C (a)) = 1 (*, *)

,

C (I_{0} (*), a) = C (0, a)

, and the fact that there exists exactly one

1

-arrow

{id}_{*} : * \to *

and, since 0 is initial, exactly one C-arrow

f : 0 \to a

.

For showing the second statement, we take the final object 1 with final functor

F_{1}

. We prove

C ⊣ F_{1}

, i.e., that

1 (C (a), *)

and

C (a, F_{1} (*))

are isomorphic, for arbitrary C-object a. This follows from

1 (C (a), *) = 1 (*, *)

,

C (a, F_{1} (*)) = C (a, 1)

, and the fact that there exists exactly one

1

-arrow

{id}_{*} : * \to *

and, since 1 is final, exactly one C-arrow

f : a \to 1

. □

Proposition 9

(Product and Coproduct by Adjunction). Let C be a category. Let the “product category”

C \times C

be the category whose objects

(a, b)

are pairs of C-objects a and b, whose arrows

(f, g) : (a, c) \to (b, d)

are pairs of C-arrows

f : a \to b

and

g : c \to d

, where the identity arrows are pairs of identity arrows, and where composition is component-wise composition. Let the “diagonal functor”

Δ : C \to C \times C

be defined by

Δ (a) = (a, a)

for every C-object a and

Δ (f) = (f, f)

for every C-arrow

f : a \to b

. Then, the following holds:

Assume that every pair of C-objects a and b has a product $a \times b$ and let the “product functor” $P : C \times C \to C$ be defined by $P (a, b) : = a \times b$ . Then, we have $Δ ⊣ P$ , i.e., the product functor is a right adjoint of the diagonal functor.
Assume that every pair of C-objects a and b has a coproduct $a + b$ and let the “coproduct functor” $C : C \to C \times C$ be defined by $C (a, b) : = a + b$ . Then, we have $C ⊣ Δ$ , i.e., the coproduct functor is a left adjoint of the diagonal functor.

Proof.

For showing the first statement, we take arbitrary category C and functor P satisfying the stated assumption. We show

Δ ⊣ P

, i.e., that, for arbitrary C-objects

p, a, b

, the arrow classes

(C \times C) (Δ (p), (a, b))

and

C (p, P (a, b))

are isomorphic. Since

Δ (p) = (p, p)

and

P (a, b) = a \times b

, it suffices to find surjections

s_{1} : (C \times C) ((p, p), (a, b)) \to C (p, a \times b)

and

s_{2} : C (p, a \times b) \to (C \times C) ((p, p), (a, b))

. First, we define

s_{1} (f, g) : = 〈 f, g 〉

where

〈 f, g 〉 : p \to a \times b

is the unique C-arrow given to us by Definition 17 with property

f = π_{1} \circ 〈 f, g 〉

and

g = π_{2} \circ 〈 f, g 〉

. Now, we show that, for every C-arrow

h : p \to a \times b

, there exist some C-arrows

f : p \to a

and

g : p \to b

with

s_{1} (f, g) = h

. We take

f : = π_{1} \circ h

and

g : = π_{2} \circ h

. Due to the uniqueness of

〈 f, g 〉

, the equalities

f = π_{1} \circ h

and

g = π_{2} \circ h

imply

h = 〈 f, g 〉

and thus

s_{1} (f, g) = h

. Second, we define

s_{2} (h) : = (π_{1} \circ h, π_{2} \circ h)

. Now, we show that, for every

(C \times C)

-arrow

(f, g) : (p, p) \to (a, b)

, i.e., for all C-arrows

f : p \to a

and

g : p \to b

, there exists some C-arrow

h : p \to a \times b

with

s_{2} (h) = (f, g)

, i.e.,

π_{1} \circ h = f

and

π_{2} \circ h = g

. Definition 17 can be used to define h.

For showing the second statement, we take arbitrary category C and functor C satisfying the stated assumption. We prove

C ⊣ Δ

, i.e., that, for arbitrary C-objects

a, b, c

, the arrow classes

C (C (a, b), c)

and

(C \times C) ((a, b), Δ (c))

are isomorphic. Since

C (a, b) = a + b

and

Δ (c) = (c, c)

, it suffices to find surjections

s_{1} : C (a + b, c) \to (C \times C) ((a, b), (c, c))

and

s_{2} : (C \times C) ((a, b), (c, c)) \to C (a + b, c)

. First, we define

s_{1} (h) : = (h \circ ι_{1}, h \circ ι_{2})

. Now, we prove that for every

(C \times C)

-arrow

(f, g) : (a, b) \to (c, c)

, i.e., for all C-arrows

f : a \to c

and

g : b \to c

, there exists some C-arrow

h : a + b \to c

with

s_{1} (h) = (f, g)

, i.e.,

h \circ ι_{1} = f

and

h \circ ι_{2} = g

. Definition 17 can be used to define h. Second, we define

s_{2} (f, g) : = [f, g]

where

[f, g] : a + b \to c

is the unique C-arrow given to us by Definition 17 with property

f = [f, g] \circ ι_{1}

and

g = [f, g] \circ ι_{2}

. Now, we show that, for every C-arrow

h : a + b \to c

, there exist some C-arrows

f : a \to c

and

g : b \to c

with

s_{2} (f, g) = h

. We take

f : = h \circ ι_{1}

and

g : = h \circ ι_{2}

. Due to the uniqueness of

[f, g]

, the equalities

f = h \circ ι_{1}

and

g = h \circ ι_{2}

imply

h = [f, g]

and thus

s_{2} (f, g) = h

. □

Proposition 10

(Exponential by Adjunction). Let C be a category in which, for every pair of C-objects a and b, there exists a product object

b \times a

and an exponential object

b^{a}

. For every C-object a, let the “(unary) product functor”

P_{a} : C \to C

be defined by

P_{a} (b) : = b \times a

and the “(unary) exponential functor”

E_{a} : C \to C

be defined by

E_{a} (b) : = b^{a}

. Then, we have

P_{a} ⊣ E_{a}

, i.e., the exponential functor is a right adjoint of the product functor.

Proof.

We take arbitrary category C, C-object a, and functors

P_{a}

and

E_{a}

satisfying the assumption. We show

P_{a} ⊣ E_{a}

, i.e., that, for arbitrary C-objects

b, c

, the arrow classes

C (P_{a} (c), b)

and

C (c, E_{a} (b))

are isomorphic. Since

P_{a} (b) = b \times a

and

E_{a} (b) = b^{a}

, it suffices to find surjections

s_{1} : C (c \times a, b) \to C (c, b^{a})

and

s_{2} : C (c, b^{a}) \to C (c \times a, b)

. First, we define

s_{1} (f) : = {curry}_{f}

. Now, we show that, for every C-arrow

g : c \to b^{a}

, there exists some C-arrow

f : c \times a \to b

with

s_{1} (f) = g

, i.e.,

{curry}_{f} = g

. We define

f : = {eval}_{a, b} \circ (g \times {id}_{a})

and show

{curry}_{f} = g

. From the definition of f, we know that the C-arrow

g : c \to b^{a}

satisfies the equality

f = {eval}_{a, b} \circ (g \times {id}_{a})

. However, Definition 19 implies that the only such C-arrow is

{curry}_{f}

; thus,

{curry}_{f} = g

. Second, we define

s_{2} (g) : = {eval}_{a, b} \circ (g \times {id}_{a})

. Now, we show that, for every C-arrow

f : c \times a \to b

, there exists some C-arrow

g : c \to b^{a}

with

s_{2} (g) = f

, i.e.,

{eval}_{a, b} \circ (g \times {id}_{a}) = f

. We define

g : = {curry}_{f}

from which Definition 19 proves the goal. □

We are now ready to discuss the central aspects of categorical logic.

4. A Categorical Semantics

Based on the concepts introduced in the previous sections, this section elaborates a categorical semantics of our relational version of first-order logic. We advise the reader to consult Figure 1 to grasp the overall framework and the relationship between its various categories and functors.

4.1. Syntactic Category and Formula Functors

We start by introducing the “syntactic category”

S Y N = 〈 For, A, \circ 〉

as follows:

The objects of this category are the formulas in the set $For$ which was introduced in Definition 6.
The arrow class A consists of all pairs $〈 F_{1}, F_{2} 〉$ of formulas $F_{1}, F_{2}$ for which $F_{1} ⊧ F_{2}$ holds, i.e., for which $F_{2}$ is a logical consequence of $F_{1}$ , as described in Definition 8. The source object of such an arrow is $F_{1}$ , and its target object is $F_{2}$ . The existence of an arrow $f : F_{1} \to F_{2}$ thus indicates $F_{1} ⊧ F_{2}$ . The identity ${id}_{F} : F \to F$ indicates the fact $F ⊧ F$ .
The composition ∘ denotes relational composition: for all arrows $f : F_{1} \to F_{2}$ and $g : F_{2} \to F_{3}$ , and the existence of the arrow $(g \circ f) : F_{1} \to F_{3}$ indicates the transitivity of the relation ⊧.

For every variable x,

{S Y N}_{x}

is that subcategory of

S Y N

whose objects are formulas whose semantics are independent of x (see Definition 4).

For reasons explained below, we will exclude from the syntactic category negations and equivalences, i.e., formulas of form

(\neg F)

and

(F_{1} \leftrightarrow F_{2})

. We may do so by considering them as the following syntactic shortcuts:

\begin{matrix} (\neg F) & \equiv (F \to ⊥), \\ (F_{1} \leftrightarrow F_{2}) & \equiv (F_{1} \to F_{2}) \land (F_{2} \to F_{1}) . \end{matrix}

The validity of these shortcuts can be easily shown by proving the corresponding logical equivalences. Consequently, negations and equivalences need subsequently not be considered any more and their semantics need not be explicitly defined.

For the other kinds of formulas, we introduce the following (families of) “formula functors” where

1

is the “singleton” category with a single object ∗ (see Proposition 8):

\begin{matrix} true & : 1 \to S Y N \\ false & : 1 \to S Y N \\ and & : S Y N \times S Y N \to S Y N \\ or & : S Y N \times S Y N \to S Y N \\ {imp}_{F \in For} & : S Y N \to S Y N \\ {forall}_{x \in Var} & : S Y N \to {S Y N}_{x} \\ {exists}_{x \in Var} & : S Y N \to {S Y N}_{x} \end{matrix}

These functors map formulas to formulas, and logical consequences to logical consequences. The formula mappings are naturally defined as follows:

\begin{matrix} true (*) & : = ⊤ \\ false (*) & : = ⊥ \\ and (F_{1}, F_{2}) & : = F_{1} \land F_{2} \\ or (F_{1}, F_{2}) & : = F_{1} \lor F_{2} \\ {imp}_{F_{1}} (F_{2}) & : = F_{1} \to F_{2} \\ {forall}_{x} (F) & : = \forall x . F \\ {exists}_{x} (F) & : = \exists x . F \end{matrix}

As for the mapping of consequences, we notice that all functors are covariant in their

S Y N

arguments. It is exactly for this reason that negation and equivalence (which do not allow covariance in their arguments) are not modeled as formula functors and that implication (which is only covariant in its second argument) is not modeled by a binary functor but by a family of unary functors.

Thus, we have for all formulas

F, F_{1}, F_{2}, G, G_{1}, G_{2}

and every variable x the following (easy to prove) properties:

\begin{matrix} (F_{1} ⊧ G_{1}) \land (F_{2} ⊧ G_{2}) \Rightarrow and (F_{1}, F_{2}) ⊧ and (G_{1}, G_{2}) \\ (F_{1} ⊧ G_{1}) \land (F_{2} ⊧ G_{2}) \Rightarrow or (F_{1}, F_{2}) ⊧ or (G_{1}, G_{2}) \\ (F_{2} ⊧ G_{2}) \Rightarrow {imp}_{F_{1}} (F_{2}) ⊧ {imp}_{F_{1}} (G_{2}) \\ (F ⊧ G) \Rightarrow {forall}_{x} (F) ⊧ {forall}_{x} (G) \\ (F ⊧ G) \Rightarrow {exists}_{x} (F) ⊧ {exists}_{x} (G) \end{matrix}

Therefore, the object maps of these functors naturally induce the necessary logical consequences.

4.2. Semantic Category and Predicate Functors

Next, we introduce the “semantic category”

S E M = 〈 Pred, B, \circ 〉

as follows:

The objects of this category are the predicates in the set $Pred$ which was introduced in Definition 5 (thus $S E M$ -objects are relations, i.e., sets).
The arrow class B consists of all pairs $〈 P_{1}, P_{2} 〉$ of predicates $P_{1}, P_{2}$ for which $P_{1} \subseteq P_{2}$ holds, i.e., for which $P_{1}$ is a subset of $P_{2}$ . The source object of such an arrow is $P_{1}$ , its target object is $P_{2}$ . The existence of an arrow $f : P_{1} \to P_{2}$ thus indicates $P_{1} \subseteq P_{2}$ . The identity ${id}_{P} : P \to P$ indicates the fact $P \subseteq P$ .
The composition ∘ denotes relational composition: for all arrows $f : P_{1} \to P_{2}$ and $g : P_{2} \to P_{3}$ , the existence of the arrow $(g \circ f) : P_{1} \to P_{3}$ indicates the transitivity of the relation ⊆.

For every variable x,

{S E M}_{x}

is the subcategory of

S E M

whose objects are predicates that are independent of x (see Definition 4).

Corresponding to the various kinds of formula constructions, we will have the following “predicate functors” (respectively families of functors):

\begin{matrix} TRUE & : 1 \to S E M \\ FALSE & : 1 \to S E M \\ AND & : S E M \times S E M \to S E M \\ OR & : S E M \times S E M \to S E M \\ {IMP}_{P \in Pred} & : S E M \to S E M \\ {FORALL}_{x \in Var} & : S E M \to {S E M}_{x} \\ {EXISTS}_{x \in Var} & : S E M \to {S E M}_{x} \end{matrix}

These functors map predicates to predicates and subset relations to subset relations (their detailed definitions will be given later). As we will see, these functors are covariant in their

S E M

-arguments, i.e., we have for all predicates

P, P_{1}, P_{2}, Q, Q_{1}, Q_{2}

and every variable x the following properties:

\begin{matrix} (P_{1} \subseteq Q_{1}) \land (P_{2} \subseteq Q_{2}) \Rightarrow AND (P_{1}, P_{2}) \subseteq AND (Q_{1}, Q_{2}) \\ (P_{1} \subseteq Q_{1}) \land (P_{2} \subseteq Q_{2}) \Rightarrow OR (P_{1}, P_{2}) \subseteq OR (Q_{1}, Q_{2}) \\ (P_{2} \subseteq Q_{2}) \Rightarrow {IMP}_{P_{1}} (P_{2}) \subseteq {IMP}_{P_{1}} (Q_{2}) \\ (P \subseteq Q) \Rightarrow {FORALL}_{x} (P) \subseteq {FORALL}_{x} (Q) \\ (P \subseteq Q) \Rightarrow {EXISTS}_{x} (P) \subseteq {EXISTS}_{x} (Q) \end{matrix}

Therefore, the object maps of these functors (defined by the respective predicate operations) naturally induce appropriate arrow maps (the corresponding subset relations).

4.3. The Semantic Functor

Now, we introduce the “semantic functor”

〚 〛 : S Y N \to S E M

defined as follows:

For every $S Y N$ -object F, i.e., formula F, $〚 F 〛$ denotes the semantics of F as defined in Definition 7, which according to Proposition 3 is a predicate, i.e., indeed a $S E M$ -object.
For every $S Y N$ -arrow $f : F_{1} \to F_{2}$ , i.e., every pair of formulas $F_{1}$ and $F_{2}$ with $F_{1} ⊧ F_{2}$ , we have the $S E M$ -arrow $〚 f 〛 : 〚 F_{1} 〛 \to 〚 F_{2} 〛$ , i.e., the fact $〚 F_{1} 〛 \subseteq 〚 F_{2} 〛$ , which is a direct consequence of Definition 10 which introduces the ⊧ relation.

This semantic functor establishes the relationship between the previously introduced formula functors and predicate functors by the following identities on

S E M

-objects, i.e., predicate identities that will hold for all formulas

F, F_{1}, F_{2}

and every variable x:

\begin{matrix} 〚 true (*) 〛 = TRUE (*) \\ 〚 false (*) 〛 = FALSE (*) \\ 〚 and (F_{1}, F_{2}) 〛 = AND (〚 F_{1} 〛, 〚 F_{2} 〛) \\ 〚 or (F_{1}, F_{2}) 〛 = OR (〚 F_{1} 〛, 〚 F_{2} 〛) \\ 〚 {imp}_{F_{1}} (F_{2}) 〛 = {IMP}_{〚 F_{1} 〛} (〚 F_{2} 〛) \\ 〚 {forall}_{x} (F) 〛 = {FORALL}_{x} (〚 F 〛) \\ 〚 {exists}_{x} (F) 〛 = {EXISTS}_{x} (〚 F 〛) \end{matrix}

4.4. Categorical Semantics of First-Order Relational Logic

We are now going to elaborate in detail the semantic functors from which all of the above can be shown; this elaboration is inspired from and indeed directly derived from the well-known logical inference rules of first-order logic. The resulting definitions are based on the categorical notions introduced in Section 3, i.e., final and initial objects, products and coproducts, exponentials, and left and right adjoints, respectively. This gives us for every logical operation a “universal” definition of its semantics. Nevertheless, this semantics is also “constructive” in the sense that it is explicitly defined from well-known set-theoretic operations.

4.4.1. Logical Constants

The role of the logical constants in reasoning is exhibited by the following two “rules” which follow directly from Definition 8 (these rules are propositions that are valid for every formula F; they mimic the corresponding inference rules of first-order logic):

\begin{matrix} F ⊧ ⊤ ⊥ ⊧ F \end{matrix}

In other words, ⊤ is a logical consequence of every formula F, i.e., ⊤ is the “weakest” formula. Dually, every formula F is a logical consequence of ⊥, i.e., ⊥ is the “strongest” formula. This implies that

true (*) = ⊤

is the final object of category

S Y N

and

false (*) = ⊥

is its initial one (see Definition 16). Then, Proposition 8 implies

C_{S Y N} ⊣ true

and

false ⊣ C_{S Y N}

, i.e., functor

true

is the right adjoint of the constant functor

C_{S Y N} : S Y N \to 1

while functor

false

is its left one.

Correspondingly,

TRUE (*)

is the final object of category

S E M

(the “weakest” predicate, i.e., the predicate which is a superset of every predicate) and

FALSE (*)

is its initial object (the “strongest” predicate, i.e., the predicate which is a subset of every predicate). By Proposition 8, we then have

C_{S E M} ⊣ TRUE

and

FALSE ⊣ C_{S E M}

, i.e., functor

TRUE

is the right adjoint of the constant functor

C_{S E M} : S E M \to 1

while functor

FALSE

is its left one.

Therefore, corresponding to the above rules for formulas, we have the following rules for every predicate P:

\begin{matrix} P \subseteq TRUE (*) FALSE (*) \subseteq P \end{matrix}

Since final and initial objects are unique, these rules actually represent implicit but unique definitions of

TRUE (*)

and

FALSE (*)

which can be explicitly written as

\begin{matrix} TRUE (*) : = ⋃ {P | P \in Pred} FALSE (*) : = ⋂ {P | P \in Pred} \end{matrix}

i.e.,

TRUE (*)

is the union of all predicates and

FALSE (*)

is their intersection. Thus, we have derived alternative characterizations

〚 ⊤ 〛 = TRUE (*)

and

〚 ⊥ 〛 = FALSE (*)

that are both constructive and universal (Proposition 5 gives us

〚 ⊤ 〛 = Ass

and

〚 ⊥ 〛 = \emptyset

from which it is easy to verify these equalities).

4.4.2. Conjunction and Disjunction

The role of conjunction in reasoning is exhibited by the following rules for arbitrary formulas

F_{1}, F_{2}, F

(the first two ones mimic the logical inference rules of “elimination”, and the last one mimics the inference rule of “introduction”):

\begin{matrix} F_{1} \land F_{2} ⊧ F_{1} \\ F_{1} \land F_{2} ⊧ F_{2} \\ (F ⊧ F_{1}) \land (F ⊧ F_{2}) \Rightarrow (F ⊧ F_{1} \land F_{2}) \end{matrix}

Dually, we have the following rules for disjunction:

\begin{matrix} F_{1} ⊧ F_{1} \lor F_{2} \\ F_{2} ⊧ F_{1} \lor F_{2} \\ (F_{1} ⊧ F) \land (F_{2} ⊧ F) \Rightarrow (F_{1} \lor F_{2} ⊧ F) \end{matrix}

These rules (whose soundness can be established with the help of Definition 8) state that

(F_{1} \land F_{2})

is the “weakest” formula F for which both

(F ⊧ F_{1})

and

(F ⊧ F_{2})

hold and that

(F_{1} \lor F_{2})

is the “strongest” formula F for which both

(F_{1} ⊧ F)

and

(F_{2} ⊧ F)

hold. Thus,

and (F_{1}, F_{2}) = (F_{1} \land F_{2})

is the product of the

S Y N

-objects

F_{1}

and

F_{2}

and

or (F_{1}, F_{2}) = (F_{1} \lor F_{2})

is their coproduct (see Definition 17). Furthermore, by Proposition 9, we have

Δ_{S Y N} ⊣ and

and

or ⊣ Δ_{S Y N}

i.e., functor

and

is the right adjoint of the diagonal functor

Δ_{S Y N} : S Y N \to S Y N \times S Y N

while functor

or

is its left one.

Correspondingly

AND (P_{1}, P_{2})

is the product of the

S E M

-objects

P_{1}

and

P_{2}

(the “weakest” predicate P for which both

(P \subseteq P_{1})

and

(P \subseteq P_{2})

hold) and

OR (P_{1}, P_{2})

is their coproduct (the “strongest” predicate P for which

(P_{1} \subseteq P)

and

(P_{2} \subseteq P)

hold). By Proposition 9, we then have

Δ_{S E M} ⊣ AND

and

OR ⊣ Δ_{S E M}

i.e., functor

AND

is the right adjoint of the diagonal functor

Δ_{S E M} : S E M \to S E M \times S E M

, while functor

OR

is its left one.

Thus, we have, corresponding to the rules for formulas, the following rules for all predicates

P_{1}, P_{2}, P

:

\begin{matrix} AND (P_{1}, P_{2}) \subseteq P_{1} \\ AND (P_{1}, P_{2}) \subseteq P_{2} \\ (P \subseteq P_{1}) \land (P \subseteq P_{2}) \Rightarrow (P \subseteq AND (P_{1}, P_{2})) \end{matrix}

Dually, we have

\begin{matrix} P_{1} \subseteq OR (P_{1}, P_{2}) \\ P_{2} \subseteq OR (P_{1}, P_{2}) \\ (P_{1} \subseteq P) \land (P_{2} \subseteq P) \Rightarrow (OR (P_{1}, P_{2}) \subseteq P) \end{matrix}

Since products and coproducts are uniquely defined, these rules actually represent implicit but unique definitions of

AND (P_{1}, P_{2})

and

OR (P_{1}, P_{2})

which can be explicitly written as follows:

\begin{matrix} AND (P_{1}, P_{2}) & : = ⋃ {P \in Pred | P \subseteq P_{1} \land P \subseteq P_{2}} \\ OR (P_{1}, P_{2}) & : = ⋂ {P \in Pred | P_{1} \subseteq P \land P_{2} \subseteq P} \end{matrix}

This gives us alternative characterizations

〚 F_{1} \land F_{2} 〛 = 〚 F_{1} 〛 \cup 〚 F_{2} 〛 = AND (〚 F_{1} 〛, 〚 F_{2} 〛)

and

〚 F_{1} \lor F_{2} 〛 = 〚 F_{1} 〛 \cup 〚 F_{2} 〛 = OR (〚 F_{1} 〛, 〚 F_{2} 〛)

that are both constructive and universal (Proposition 5 implies

〚 F_{1} \land F_{2} 〛 = 〚 F_{1} 〛 \cap 〚 F_{2} 〛

and

〚 F_{1} \lor F_{2} 〛 = 〚 F_{1} 〛 \cup 〚 F_{2} 〛

from which it is not difficult to verify these equalities).

4.4.3. Implication

The role of implication in reasoning is exhibited by the following rules for arbitrary formulas

F_{1}, F_{2}, F

(the first rule mimics the logical inference rules of “implication elimination” or “modus ponens”, the last one mimics the inference rule of “implication introduction”):

\begin{matrix} (F_{1} \to F_{2}) \land F_{1} ⊧ F_{2} \\ (F \land F_{1} ⊧ F_{2}) \Rightarrow (F ⊧ F_{1} \to F_{2}) \end{matrix}

These rules (whose soundness can be established with the help of Definition 8) state that

(F_{1} \to F_{2})

is the “weakest” formula F for which

(F \land F_{1} ⊧ F_{2})

holds. Thus,

{imp}_{F_{1}} (F_{2}) = (F_{1} \to F_{2})

is the exponential of the

S Y N

-objects

F_{1}

and

F_{2}

(see Definition 19). Proposition 10 then gives us

{and}_{F_{1}} ⊣ {imp}_{F_{1}}

, i.e., functor

{imp}_{F_{1}}

is the right adjoint of the unary conjunction functor

{and}_{F_{1}} : S Y N \to S Y N \times S Y N

with object map

{and}_{F_{1}} (F_{2}) : = and (F_{1}, F_{2}) = F_{1} \land F_{2}

.

Correspondingly,

{IMP}_{P_{1}} (P_{2})

is the product of the

S E M

-objects

P_{1}

and

P_{2}

(the “weakest” predicate P for which

(P \cap P_{1} \subseteq P_{2})

holds; Proposition 10 then gives us

{AND}_{P_{1}} ⊣ {IMP}_{P_{1}}

, i.e., functor

{IMP}_{P_{1}}

is the right adjoint of the unary functor

{AND}_{P_{1}} : S E M \to S E M \times S E M

with object map

{AND}_{P_{1}} (P_{2}) : = AND (P_{1}, P_{2}) = P_{1} \cup P_{2}

.

Thus, corresponding to above rules for formulas, we have the following rules for all predicates

P_{1}, P_{2}, P

:

\begin{matrix} {IMP}_{P_{1}} (P_{2}) \cap P_{1} \subseteq P_{2} \\ (P \cap P_{1} \subseteq P_{2}) \Rightarrow (P \subseteq {IMP}_{P_{1}} (P_{2})) \end{matrix}

Since exponentials are uniquely defined, these rules represent an implicit but unique definition of

{IMP}_{P_{1}} (P_{2})

which can be explicitly written as follows:

\begin{matrix} {IMP}_{P_{1}} (P_{2}) & : = ⋃ {P \in Pred | P \cap P_{1} \subseteq P_{2}} \end{matrix}

This gives us an alternative characterization

〚 F_{1} \to F_{2} 〛 = {IMP}_{〚 F_{1} 〛} (〚 F_{2} 〛)

that is both constructive and universal (Proposition 5 implies

〚 F_{1} \land F_{2} 〛 = \bar{〚 F_{1} 〛} \cup 〚 F_{2} 〛

from which it is possible to verify this equality).

4.4.4. Universal and Existential Quantification

The role of universal quantification in reasoning is exhibited by the following rules for arbitrary formulas

F, G

provided that the semantics

〚 G 〛

of G do not depend on x (see Definition 4):

\begin{matrix} \forall x . F ⊧ F \\ (G ⊧ F) \Rightarrow (G ⊧ \forall x . F) \end{matrix}

The first rule mimics the logical inference rule of “universal elimination”, the second one mimics the inference rule of “universal introduction” (except that our version of first-order logic does not involve terms and variables and thus copes without variable substitutions). This pair of rules in a nutshell yields that

(\forall x . F)

is the “weakest” formula G from which F is a logical consequence and whose semantics do not depend on x. Dually, we have for existential quantification the following pair of rules:

\begin{matrix} F ⊧ \exists x . F \\ (F ⊧ G) \Rightarrow (\exists x . F ⊧ G) \end{matrix}

These rules state that

(\exists x . F)

is the “strongest” formula G that is a logical consequence of F and whose semantics

〚 G 〛

does not depend on x.

We are now going to derive appropriate categorical characterizations of the corresponding functors

{forall}_{x \in Var} : S Y N \to {S Y N}_{x}

and

{exists}_{x \in Var} : S Y N \to {S Y N}_{x}

from the category

S Y N

of all formulas to the subcategory

{S Y N}_{x}

of all those formulas whose semantics do not depend on x. For this, we may notice that, from above rules, the relations

(G ⊧ F)

and

(F ⊧ G)

involve two kinds of relations, a more general relation F that may depend on x and a more special relation G that is independent of x. In order to bring all relations to the “same level”, we introduce a syntactic “injection” functor

I_{x} : {S Y N}_{x} \to S Y N

whose maps are just identities, i.e.,

I_{x} (G) = G

and

I_{x} (f : F \to G) = f : F \to G

. This allows us to express above rules as

\begin{matrix} I_{x} ({forall}_{x} (F)) ⊧ F \\ (I_{x} (G) ⊧ F) \Rightarrow (G ⊧ {forall}_{x} (F)) \end{matrix}

and dually

\begin{matrix} F ⊧ I_{x} ({exists}_{x} (F)) \\ (F ⊧ I_{x} (G)) \Rightarrow ({exists}_{x} (F) ⊧ G) \end{matrix}

Now, the first set of rules matches the assumptions of the second part of Proposition 7 for

F : = I_{x}

and

G : = {forall}_{x}

(considering that the satisfaction relation ⊧ denotes the existence of an arrow in categories

S Y N

, respectively

{S Y N}_{x}

); thus, we have

I_{x} ⊣ {forall}_{x}

. Likewise, the second set of rules matches the assumptions of the first part of that proposition for

F : = {exists}_{x}

and

G : = I_{x}

; thus, we have

{exists}_{x} ⊣ I_{x}

. Summarizing, the universal functor

{forall}_{x}

is the right adjoint of the injection functor

I_{x}

while the existential functor

{exists}_{x}

is its left adjoint.

These considerations can be easily transferred to categorical characterizations of the corresponding functors

{FORALL}_{x \in Var} : S E M \to {S E M}_{x}

and

{EXISTS}_{x \in Var} : S E M \to {S E M}_{x}

from the category

S E M

of all predicates to the subcategory

{S E M}_{x}

of all those predicates that do not depend on x with the semantic “injection” functor

J_{x} : {S E M}_{x} \to S E M

whose maps are just identities, i.e.,

J_{x} (Q) = Q

and

J_{x} (f : P \to Q) = f : P \to Q

. We then have

\begin{matrix} J_{x} ({FORALL}_{x} (P)) \subseteq P \\ (J_{x} (Q) \subseteq P) \Rightarrow (Q \subseteq {FORALL}_{x} (P)) \end{matrix}

and dually

\begin{matrix} P \subseteq J_{x} ({EXISTS}_{x} (P)) \\ (P \subseteq J_{x} (Q)) \Rightarrow ({EXISTS}_{x} (P) \subseteq Q) \end{matrix}

Now, the first set of rules matches the assumptions of the second part of Proposition 7 for

F : = J_{x}

and

G : = {FORALL}_{x}

(considering that the subset relation ⊆ denotes the existence of an arrow in categories

S E M

, respectively

{S E M}_{x}

); thus, we have

J_{x} ⊣ {FORALL}_{x}

. Likewise, the second set of rules matches the assumptions of the first part of that proposition for

F : = {EXISTS}_{x}

and

G : = J_{x}

; thus, we have

{EXISTS}_{x} ⊣ J_{x}

. Summarizing, the universal functor

{FORALL}_{x}

is the right adjoint of the injection functor

J_{x}

while the existential functor

{EXISTS}_{x}

is its left adjoint.

The above rules say that

{FORALL}_{x} (P)

is the weakest predicate Q that does not depend on x for which

(J_{x} (Q) \subseteq Q)

holds while

{EXISTS}_{x} (P)

is the strongest predicate Q that does not depend on x for which

(Q \subseteq J_{x} (Q))

holds. Since left and right adjoints are uniquely defined, these rules represent implicit but unique definitions of

{FORALL}_{x} (P)

and

{EXISTS}_{x} (P)

which can be explicitly written as follows:

\begin{matrix} {FORALL}_{x} (P) & : = ⋃ {Q \in {Pred}_{x} | J_{x} (Q) \subseteq P} \\ {EXISTS}_{x} (P) & : = ⋂ {Q \in {Pred}_{x} | P \subseteq J_{x} (Q)} \end{matrix}

From

J_{x} (Q) = Q

and

Q \in {Pred}_{x} \Leftrightarrow Q \in Pred \land Q ⫫ x

(see Definition 5), this can also be written as follows:

\begin{matrix} {FORALL}_{x} (P) & : = ⋃ {Q \in Pred | Q ⫫ x \land Q \subseteq P} \\ {EXISTS}_{x} (P) & : = ⋂ {Q \in Pred | Q ⫫ x \land P \subseteq Q} \end{matrix}

Thus, we have derived alternative characterizations

〚 \forall x . F 〛 = {FORALL}_{x} (〚 F 〛)

and

〚 \exists x . F 〛 = {EXISTS}_{x} (〚 F 〛)

that are both constructive and universal. This is exactly the characterization whose correctness we have proved in Proposition 6.

5. An Implementation of the Categorical Semantics

In this section, we describe how the constructions that we have theoretically modeled in Section 2 can be actually implemented. For this purpose, we use RISCAL (RISCAL is developed at JKU, Linz, Austria, https://www3.risc.jku.at/research/formal/software/RISCAL/, see [13]), the RISC Algorithm Language [13,17], a specification language, and an associated software system for modeling mathematical theories and algorithms in a specification language based on first-order logic and set theory. The language is based on a type system where all types have finite sizes (specified by the user); this allows for fully automatically deciding formulas and verifying the correctness of algorithms for all possible inputs. To this end, the system translates every syntactic phrase into an executable form of its denotational semantics; the RISCAL model checker evaluates these semantics to determine the results of algorithms and the truth values of formulas such as the postconditions of algorithms. Since the domains of RISCAL models have (parameterized but) finite size, the validity of all theorems and the correctness of all algorithms can be fully automatically checked; the system has been mainly employed in educational scenarios [18,19]. Figure 2 gives a screenshot of the software with the RISCAL model that is going to be discussed below.

Figure 3 and Figure 4 list a RISCAL model of the categorical semantics over a domain of

N + 1

variables (identified with the natural numbers

0, \dots, N

) with

M + 1

values, for arbitrary model parameters

N, M \in N_{0}

; all theorems over these domains are decidable and can be checked by RISCAL. The RISCAL definition of domains, functions, and predicates closely correspond to those given in this paper; in particular, we have a domain Pred of predicates (since the number of variables is finite, by definition all relations are predicates) and predicate functions TRUE, FALSE, AND, OR, IMP, FORALL, EXISTS. Different from the categorical formulation, IMP is a binary function, not a family of unary functions; likewise, FORALL and EXISTS are binary functions whose first argument is a variable. Furthermore, we introduce functions NOT and EQUIV for the semantics of negation and conjunction and show by theorems Not and Equiv that they can be reduced to the other functions.

All other logical operations are first defined in their usual set-theoretic form. Subsequently, we describe their categorical semantics by a pair of theorems: the first theorem claims that the set-theoretic semantics is equivalent to an implicit definition of the categorical semantics while the second theorem claims equivalence to the corresponding constructive definition. Choosing small parameter values

N = 2

and

M = 1

(i.e., relations with variables

x_{0}, x_{1}, x_{2}

and values

0, 1

), RISCAL can easily check the validity of all claims, as demonstrated by the following output:

RISC Algorithm Language 2.6.4 (10 December 2018)

http://www.risc.jku.at/research/formal/software/RISCAL

This is free software distributed under the terms of the GNU GPL.

Execute "RISCAL -h" to see the available command line options.

-----------------------------------------------------------------

Reading file /usr2/schreine/papers/CategoricalLogic2019/catlogic.txt

Using N=2.

Using M=1.

Computing the value of Ass...

Computing the value of TRUE...

Computing the value of FALSE...

Type checking and translation completed.

Executing True1().

Execution completed (3 ms).

Executing True2().

Execution completed (1 ms).

Executing False1().

Execution completed (0 ms).

Executing False2().

Execution completed (1 ms).

Executing And1(Set[Array[ℤ]],Set[Array[ℤ]]) with all 65536 inputs.