research-article

Open access

Contextual Linear Types for Differential Privacy

Authors:

Éric TanterAuthors Info & Claims

ACM Transactions on Programming Languages and Systems, Volume 45, Issue 2

Article No.: 8, Pages 1 - 69

https://doi.org/10.1145/3589207

Published: 17 May 2023 Publication History

PDF eReader

Abstract

Language support for differentially private programming is both crucial and delicate. While elaborate program logics can be very expressive, type-system-based approaches using linear types tend to be more lightweight and amenable to automatic checking and inference, and in particular in the presence of higher-order programming. Since the seminal design of Fuzz, which is restricted to ϵ-differential privacy in its original design, significant progress has been made to support more advanced variants of differential privacy, like (ϵ, δ)-differential privacy. However, supporting these advanced privacy variants while also supporting higher-order programming in full has proven to be challenging. We present Jazz, a language and type system that uses linear types and latent contextual effects to support both advanced variants of differential privacy and higher-order programming. Latent contextual effects allow delaying the payment of effects for connectives such as products, sums, and functions, yielding advantages in terms of precision of the analysis and annotation burden upon elimination, as well as modularity. We formalize the core of Jazz, prove it sound for privacy via a logical relation for metric preservation, and illustrate its expressive power through a number of case studies drawn from the recent differential privacy literature.

1 Introduction

Note. This article uses colorblind-friendly colors in notation to convey information and is best consumed using an electronic device or color printer.

Over the past decade, differential privacy [32] has become the de facto gold standard in protecting the privacy of individuals when processing sensitive data. In contrast to traditional approaches like de-identification, differential privacy provides a formal, composable privacy guarantee. Differentially private algorithms typically protect privacy by selecting from a handful of basic mechanisms to perturb their outputs. For example, the Laplace mechanism can be used to add noise to the population count of a city to prevent an adversary from successfully guessing whether or not a particular individual lives in that city. Most programming-language-based approaches to differential privacy are applied to verifying either the implementation of a mechanism, such as the Exponential mechanism, or the composition of multiple uses of mechanisms, such as computing a histogram using the Laplace mechanism (multiple times) as a primitive.

There are two challenges when writing differentially private programs. First, noise must be added to the right values in the program to achieve some guarantee of privacy; this includes the final output of the program, as well as many intermediate program values. Second, the correct amount of noise must be added in those places to achieve the desired amount of privacy. In the differential privacy framework, privacy is a quantitative feature—more noise gives more privacy. Adding too little noise is as ineffective as adding no noise at all, and adding too much noise renders the result of the computation useless. Programmers must therefore ensure they have added enough noise, in the right places, and that the noise is minimal—a daunting task.

Since differential privacy is a probabilistic, multi-run (hyper [25]) property, it is not straightforward to develop test cases for differentially private algorithms. Consequently, differentially private algorithms are usually developed by experts in the field, and these experts produce manual proofs of privacy for each new algorithm. This reliance on experts is limiting. First, there is a practical need for developing privacy-preserving applications without access to an expert in differential privacy. Even still, experts are not perfect: For example, several incorrect versions of the Sparse Vector Technique [31, 32] have appeared in published papers [41], despite being authored and peer-reviewed by experts in differential privacy.

Due to these challenges, verifying differential privacy in programs via typechecking has received considerable attention. The first such approach, Fuzz [49], uses linear types to verify pure ϵ-differential privacy. Fuzz and its successor DFuzz [34] have a number of attractive properties, including support for automation and higher-order programming. Fuzz was the first to use linear types to bound function sensitivity: how much a function’s output changes given a change to its input. Sensitivity is then used to determine the (minimal) amount of noise required to achieve privacy. Fuzz uses the same sensitivity type system to also track privacy, which is advantageous due to its simplicity, but as a consequence is unable to support advanced variants of differential privacy, like (ϵ, δ). A recent approach [29] extends the terminating fragment of Fuzz using graded comonadic liftings to support advanced variants such as (ϵ, δ)-differential privacy. In the following, we call this extended language Fuzz^ϵδ. Another approach, HOARe², uses relational refinement types to encode differential privacy [14] and improves on Fuzz-like systems in its ability to support advanced variants. In general, type-based approaches such as Fuzz and HOARe² are used to verify programs that compose mechanisms, and not the implementations of mechanisms.

An alternative set of approaches use program logics [15, 16, 17, 50] to verify both the implementations of mechanisms and simple forms of composing mechanisms, while also supporting advanced variants such as (ϵ, δ)-differential privacy [32], zero-concentrated differential privacy [21], and Rényi differential privacy [44]. However, these benefits come at the expense of automation and support for higher-order programming.

The Duet language [46] and type system strikes a new balance in this space by building on the designs of Fuzz and DFuzz. Like Fuzz, Duet supports automation and higher-order programming, and like Fuzz^ϵδ, HOARe², and recently developed program logics, Duet supports advanced variants of differential privacy. Like all type-based approaches, Duet cannot be used to verify implementations of mechanisms, however, even when verifying programs that compose mechanisms there is still room to improve: Duet is not expressive enough to support higher-order programming in full generality—something Fuzz, DFuzz, Fuzz^ϵδ, and HOARe² are each able to achieve.

This article presents Jazz, the successor to Duet that significantly improves upon its design. Jazz is a linear type system with support for latent contextual effects for function sensitivity and differential privacy; this combination supports advanced privacy variants (such as Duet, Fuzz^ϵδ, and HOARe²), automation (such as Duet and DFuzz), and fully general higher-order programming (such as Fuzz/DFuzz, Fuzz^ϵδ, and HOARe²). Like Duet, the Jazz language is built from two mutually embedded sublanguages—one for sensitivity and one for privacy—which allows it to support advanced variants of differential privacy automatically through typechecking. Also like Duet (and Fuzz/DFuzz, Fuzz^ϵδ, and HOARe²), Jazz is designed for verifying the composition of mechanisms and not their direct implementation.

The key insight of Jazz is the incorporation of latent contextual effects into a linear type system. A latent effect is one that is deferred or delayed; rather than accounting for the effect immediately, it is tracked and accounted for later. A contextual effect is one that tracks effect information for each variable in the context, including closure variables used in higher-order function bodies. Technically, this is similar to the open closure types introduced by Scherer and Hoffmann [51], specialized to the tracking of sensitivity and privacy and generalized to positive type constructors such as sums and products. In addition to supporting higher-order programming in the presence of advanced privacy variants, these latent contextual effects also can yield advantages in terms of precision of the analysis, annotation burden, and modularity.

The challenge of higher-order programming. Consider the n-iteration loop combinator in Fuzz, loop_n, which has type \(\color{blue}{\tau \rightarrow (\tau \rightarrow \bigcirc \tau)\multimap_{n} \bigcirc \tau}\). This type describes a two-argument function that takes some value of type τ as the first argument, a function as second argument (which accepts and returns values of type τ), and returns a final value of type τ. The modality \(\color{blue}{\bigcirc }\) in the return type for the function argument and final return type indicates that the function is probabilistic (due to the use of differential privacy mechanisms) and when appearing in the codomain of a linear arrow ⊸ indicates that the function satisfies differential privacy.

Both function sensitivity and differential privacy are two-run (hyper)properties of a function output w.r.t. some particular input. For example, a function of body 2x + 3y is 2-sensitive in x and 3-sensitive in y, meaning that if e.g., input x varies by at most d and y is held constant, then the function output varies at most by 2d. When a closure is created, the closure captures sensitivities as well as values, so the sensitivity of the closure \(\color {blue}{\lambda x.} \color {blue}{2x + 3y}\) would be “3 in y.” The situation is analogous when tracking privacy and creating closures that capture privacy costs. Looking back to the type of loop_n in Fuzz, the second argument will be a closure whose captured environment tracks a privacy cost for each closure variable. The interpretation of the linear function type ⊸_n is to scale the privacy effects in the closure environment of the looping function of type \(\color {blue}{\tau \rightarrow {\scriptstyle \bigcirc} \tau }\) by n. We call this scaling implicit and pervasive in Fuzz, because it occurs at every let-binding and function call. In the original Fuzz language, such scaling is only sound and precise for pure ϵ-differential privacy, and as a consequence of this pervasive scaling, Fuzz could not be instantiated to advanced differential privacy variants, until recently, where Fuzz^ϵδ now supports advanced variants such as (ϵ, δ)-differential privacy through a path metric construction.

The Duet language prohibits this pervasive scaling in its type system to support advanced differential privacy variants, but as a consequence it cannot initially derive a type for loop_n, and instead it must define a custom typing rule for loop_n. The issue is that Duet prohibits all scaling of privacy quantities. However, scaling is allowable (i.e., sound) in special restricted instances when using advanced variants. The challenge is then to disallow implicit pervasive scaling while allowing explicit restricted scaling. Because no type can be written for loop_n in Duet, it (and many other higher-order functions) must be given explicit typing rules. This poses a significant restriction on higher-order programming, for instance, loop_n cannot be lambda-abstracted in Duet.

Jazz directly solves the challenge of encoding the explicit, restricted scaling that is required to support both advanced privacy variants and higher-order programming. In Jazz, the type of the n-iteration construct is:

. In this type, the privacy effect on the closure is given an explicit representation notated

, which means “(ϵ,δ)-privacy for variables in the closure environment Σ.” This effect is latent, because the effect is not “paid for” until (and each time) the function is called, and it is contextual because it includes a privacy effect (which may be “zero”) for each free variable in the context. This effect is then explicitly scaled by n, the number of loop iterations, in the final effect of applying the function.¹ More powerful looping combinators such as advanced composition can also be encoded with these latent contextual effects; such combinators cannot be described in any prior linear type system—including Fuzz and DFuzz.

Contributions. Jazzsupports writing higher-order programs, and automaticallyverifying that such programs satisfy advanced variantsof differential privacy. The novel features of Jazz—linear types with latent contextual effects—are crucial for practical differentially private programming. We illustrate this expressive power by showing how to encode numerous mechanisms and tools for differential privacy as Jazz primitives, including the Laplace, Gaussian, and Exponential mechanisms, advanced composition, and privacy amplification by subsampling. We also demonstrate the use of Jazz to verify larger algorithms in two case studies: the MWEM algorithm [38] and a recently proposed differentially private machine learning algorithm based on gradient descent with adaptive gradient clipping [53]. Note that these examples are expressible in Duet only by adding new core typing rules for each primitive used, which strictly speaking requires re-proving the metatheory of the extended language. In contrast, Jazz subsumes Duet and supports all these examples without having to add new typing rules, and with a much smaller core language. Finally, Jazz is amenable to reasonably efficient automated typechecking; we have implemented a typechecker for the language that can verify privacy costs for our case studies in milliseconds.

We prove the type soundness of Jazz using a step-indexed logical relation over a mixed big-step/denotational semantics with embedded discrete probability distributions as probability mass functions (PMFs).²

In summary, the contributions of this article are:

•

Jazz, a practical, higher-order, general purpose programming language for writing differentially private programs, which supports advanced variants of differential privacy.

•

A novel linear type system for Jazz that includes latent contextual effects, allowing to delay the payment of effects of connectives such as product, sums, and functions, until actually eliminated; e.g., if the second element of a pair is never used, then it does not contribute to the effect of the program.

•

A formalization and proof of type soundness of \(\lambda _{\text{{J}}}\), the core language of Jazz, based on a proof technique with step-indexed logical relations.

•

A prototype implementation of the Jazz typechecker, together with a library of primitives for differential privacy, and case studies that demonstrate the expressive power and practicality of Jazz.

We first briefly introduce some key concepts of differential privacy (Section 2) and then give an overview of key design choices and benefits of contextual linear types in Jazz (Section 3). Jazz is a two-language design, and what follows is a presentation of each sub-language in two multi-section arcs. First, we present the sensitivity-only language design (Section 4) and metatheory (Section 5). This language does not include differential privacy operations in the language or privacy quantities in types. Next, building on this sensitivity core, we present the full privacy language design and metatheory (Sections 6 and 7). Finally, we discuss implementation details including gaps between the actual implementation of Jazz and its formal model (Section 8), present a few case studies in Jazz (Section 9), discuss related work (Section 10), and conclude (Section 11). An extended version with proofs of all the results stated in the article can be found in the companion technical report [54].

2 A Differential Privacy Primer

Differential privacy is a mathematical definition of what it means for a computation over sensitive data to preserve privacy [32]. It interprets privacy as a form of plausible deniability and relies on the use of randomization to achieve it. Informally, a randomized algorithm is differentially private if the probability that it outputs a particular value remains almost the same with or without a single individual’s data used as part of the input. Formally, the definition is parameterized by two privacy parameters ϵ and δ that specify to what extent two probabilities are “almost the same,” and by a distance metric over the algorithm’s (sensitive) input whose role we discuss shortly.

Definition 2.1 (Differential Privacy).

Given a randomized algorithm (or mechanism) \(\color{blue}{ {\mathcal {M}} \in A \rightarrow B}\) and a distance metric \(\color{blue}{ {\mathfrak {D}} _{A} \in A \times A \rightarrow {\mathbb {R}}}\), the algorithm \(\color{blue}{ {\mathcal {M}}}\) satisfies (ϵ, δ)-differential privacy if for all x, x′ ∈ A such that \(\color{blue}{ {\mathfrak {D}} _{A}(x, x^{\prime }) \le 1}\) and all possible sets S⊆B of outcomes, \(\color{blue}{ {\text{Pr}}[{\mathcal {M}}(x) \in S] \le e^{\epsilon } {\text{Pr}}[{\mathcal {M}}(x^{\prime }) \in S] + \delta}\).

The parameter ϵ quantifies the adversary ability to distinguish two neighboring inputs upon observing the corresponding algorithm outputs. It represents the privacy guarantee provided by the algorithm—the smaller, the less information is leaked about its input. However, the parameter δ represents a failure probability: With probability at most δ, the algorithm is allowed to violate privacy altogether. In combination, ϵ and δ are typically understood as the “privacy cost” incurred by publicly releasing the algorithm output, associated to a given sensitive input. The case where δ = 0 is called pure (or pure ϵ-) differential privacy, and the case where δ > 0 is called approximate differential privacy. Several other recently proposed variants of the definition build on the advantages of (ϵ, δ)-differential privacy while eliminating the potential for failure; these include Rényi differential privacy (RDP) [44], zero-concentrated differential privacy (zCDP) [21], and truncated concentrated differential privacy (tCDP) [20].

Two algorithm inputs are said to be neighbors if the distance between them is bounded by 1 (i.e., \(\color{blue}{ {\mathfrak {D}} (x,x^{\prime }) \le 1}\)). For the formal definition to match our informal statement, the distance metric \(\color{blue}{ {\mathfrak {D}} _{A}}\) should ensure that neighboring inputs differ by at most one individual’s data. Formalizing this notion depends heavily on the domain, so different definitions of \(\color{blue}{ {\mathfrak {D}}}\) are used in different domains. When considering a relational database table represented as a bag of tuples, one commonly used definition for \(\color{blue}{ {\mathfrak {D}} _{{\text{DB}}}}\) is symmetric difference [42]: \(\color{blue}{ {\mathfrak {D}} _{{\text{DB}}}(x, x^{\prime }) = |(x - x^{\prime }) \cup (x^{\prime } - x)|}\). Under this definition, \(\color{blue}{ {\mathfrak {D}} _{{\text{DB}}}(x, x^{\prime }) = 1}\) for tables that differ in one row; if the data contributed by each individual is bounded to a single row, then this is a good approximation of neighboring inputs.

The definition of differential privacy implies two key properties: post-processing and composition. Post-processing means that the output of a differentially private mechanism stays differentially private, no matter what additional processing is applied. Composition allows bounding the privacy cost of multiple computations over the same underlaying data: Running an (ϵ₁, δ₁)-differentially private mechanism followed by an (ϵ₂, δ₂)-differentially private mechanism satisfies (ϵ₁ + ϵ₂, δ₁ + δ₂)-differential privacy. The privacy parameters ϵ and δ are often called the privacy cost because of the additive nature of composition.

Basic mechanisms, composition, and scaling. Differential privacy mechanisms typically add noise to the output of a deterministic function scaled to the function’s sensitivity [32]. A function f ∈ A → B with distance metrics \(\color{blue}{ {\mathfrak {D}} _{A}}\) and \(\color{blue}{ {\mathfrak {D}} _{B}}\) is called s-sensitive if \(\color{blue}{ {\mathfrak {D}} _{A}(x,y) \le d \Rightarrow {\mathfrak {D}} _{B}(f(x),f(y)) \le sd}\) for every \(\color{blue}{ d \in {\mathbb {R}}}\) and every x, y ∈ A. Two commonly used mechanisms are the Laplace [32] and the Gaussian [7, 32] mechanisms. Given an s-sensitive function \(\color{blue}{ f \in A \rightarrow {\mathbb {R}}}\), the Laplace mechanism releases \(\color{blue}{ f(x) + {\text{Lap}}(\tfrac{s}{\epsilon })}\), where Lap(b) denotes a random sample from the Laplace distribution centered at 0 with scale b; it satisfies ϵ-differential privacy. The Gaussian mechanism releases \(\color{blue}{ f(x) + {\mathcal {N}}(\tfrac{2s ^{2}\ln (1.25/\delta)}{\epsilon ^{2}})}\), where \(\color{blue}{ {\mathcal {N}}(\sigma ^{2})}\) denotes a random sample from the Gaussian distribution centered at 0 with variance σ² and ϵ, δ ∈ (0, 1); it satisfies (ϵ, δ)-differential privacy. While the original Gaussian mechanism requires ϵ < 1, Balle and Wang [7] introduce a variant—called analytic Gaussian mechanism—that drops the requirement that ϵ < 1.

For implementation purposes, naive floating-point truncations of the real-valued Laplace distribution lead to fatal privacy breaches [43]. Canonne et al. [22] have shown that the Laplacian and Gaussian mechanisms can both be discretized while still providing formal privacy guarantees.

Advanced composition [32] yields tighter bounds on privacy cost for many iterative algorithms but requires (ϵ, δ)-differential privacy. For ϵ, δ, δ′ ≥ 0, the class of (ϵ, δ)-differentially private mechanisms satisfies (ϵ′, kδ + δ′)-differential privacy under k-fold adaptive composition (e.g., a loop with k iterations) where \(\color{blue}{ \epsilon ^{\prime } = k\epsilon (e^\epsilon -1) + \epsilon \sqrt {2 k \ln (1/\delta ^{\prime })}}\). Advanced composition is especially useful for iterative algorithms that perform many differentially private steps in sequence (e.g., iterative machine learning algorithms).

Differential privacy is stated in terms of neighboring inputs, i.e., inputs x and x′ such that \(\color{blue}{ {\mathfrak {D}} _{A}(x, x^{\prime }) \le 1}\). When \(\color{blue}{ {\mathfrak {D}} _{A}(x, x^{\prime }) \gt 1}\), an ϵ-differentially private mechanism provides \(\color{blue}{ {\mathfrak {D}} _{A}(x, x^{\prime })\mathord {\cdotp }\epsilon}\)-differential privacy. Distances larger than one are typically interpreted as groups of individuals, e.g., \(\color{blue}{ {\mathfrak {D}} _{A}(x,x^{\prime }) = k}\) represents a change to k individual’s input data. Therefore, an ϵ-differentially private mechanism provides kϵ-differential group privacy [32] for groups of size k. A similar property holds for (ϵ, δ)-differential privacy and the more recently developed advanced variants, but the scaling of privacy cost is nonlinear for all of these variants . For example, on inputs at distance k, an algorithm satisfying (ϵ, δ)-differential privacy yields outputs that are only (kϵ,kδe^{(k − 1)ϵ})-close–instead of (kϵ,kδ)-close. This nonlinearity makes it difficult to apply techniques based on linear types (which generally internalize linear scaling of costs [34, 49]) for these variants of differential privacy.

Verification Techniques for Differential Privacy. A number of techniques have been proposed for verifying that a program satisfies differential privacy, including approaches based on linear logic [29, 34, 46, 49, 60], couplings and program logics [5, 12, 13, 15, 16, 17, 50], and randomness alignments [55, 59]. Our work is most closely related to Fuzz [49] and its descendants DFuzz [34], Duet [46], and Fuzz^ϵδ [29], which are based on linear type systems. In particular, these approaches focus heavily on fully automated verification of differential privacy properties through typechecking and are typically less expressive than program logics, which by contrast support significantly less (or no) automation. We defer a more complete discussion of related work to Section 10.

3 Overview of Jazz

Jazz builds on the linear type system of Fuzz [49] and the two-language design of Duet [46] by introducing latent contextual effects. This section introduces and motivates the design of Jazz’s two languages—one for sensitivity and one for privacy—using simple examples. The design of each language is described in Sections 4 and 6, respectively, and each metatheory is described in Sections 5 and 7.

3.1 A Two-language Design

Jazz follows Duet in being structured as two mutually embedded sublanguages, one for sensitivity and one for privacy. In a nutshell, this is because supporting scaling of both sensitivity and privacy in a uniform and tight fashion is sound only for ϵ-differential privacy, and not for (ϵ, δ)-differential privacy, which has nonlinear group privacy, as discussed in Section 2.

Let us elaborate on this. Recall that the framework of differential privacy builds on randomization to achieve privacy, and the randomization is typically calibrated according to the sensitivity of the function whose output one wants to protect. Therefore, in a language to describe differentially private computations, we can mostly distinguish two classes of functions: On the one hand, randomized (effectful) functions that are annotated with privacy information, and on the other hand, pure functions that are annotated with sensitivity information. However, when composing functions, the way their information is combined highly depends on the class of the composed functions.

For example, when composing two functions from the sensitivity fragment, their information is naturally combined via scaling. Consider, for instance, a 3-sensitive function f; it is not hard to see that the expression

is (2 · 3)-sensitive in x. The same scaling pattern remains valid when composing a function from the sensitivity fragment with a function from the privacy fragment, provided the latter satisfies pure differential privacy: If g is an, e.g., ϵ-differentially private function, then the computation

is (2 · ϵ)-differentially private in x. This uniform scaling behavior, which besides being sound is also tight, lies at the heart of Fuzz/DFuzz design. In fact, in Fuzz/DFuzz, both classes of functions live within the same space, following the same typing rules. To enable this uniform treatment, the languages rely on the two fundamental ingredients: (i) the presence of a monad/modality to encode randomization and (ii) the association of a metric space to each type, which in the case of monadic types, is tailored to encode differential privacy. Said otherwise, in Fuzz/DFuzz, differential privacy is encoded as a sensitivity claim about functions (with a monadic return type).

Unfortunately, this linear scaling—pervasive in Fuzz/DFuzz—is no longer sound when composing a function from the sensitivity fragment with a function from the privacy fragment that satisfies approximate—rather than pure—differential privacy. Returning to the previous example, if g is instead (ϵ, δ)-differentially private, then g(x + x) does not satisfy (2ϵ, 2δ)-differential privacy in x, but only (2ϵ, 2δe^ϵ)-differential privacy as dictated by the group privacy property for approximate differential privacy (see Section 2 and Dwork and Roth [32, Section 2.3]). In effect, this is why approximate differential privacy lies out of the scope of Fuzz/DFuzz “uniform” design—although recently, de Amorim et al. [29] have shown that using different metrics allows approximate differential privacy functions to be linear, extending a terminating subset of Fuzz to support (ϵ, δ)-differentially privacy via a path metric construction.

In view of this, Duet introduced a two-language design, separating a sensitivity sublanguage in which scaling remains implicit (internalized by the typing rules) and pervasive (modeling function composition) and a privacy sublanguage in which scaling is explicit and restricted (ad hoc typing rules are needed, e.g., to model some privacy combinators). Jazz builds upon this approach and significantly improves both sublanguages, thanks to latent contextual effects, as we illustrate next.

Coloring convention. As noted in the introduction, this article uses colorblind-friendly colors in notation to convey information and is best consumed using an electronic device or color printer. Jazz consists of two mutually embedded sublanguages, and each language is given its own color. Furthermore, these two languages share the same language of types, so we use a third color for the shared fragment. Consequently, we use three color schemes throughout the article: (1) blue for general math notation and the type system shared between languages; (2) green for the sensitivity language; and (3) red for the privacy language. We have carefully chosen the schemes to be distinguishable (as much as possible) for persons with various forms of color blindness.³

3.2 Sensitivity

In the sensitivity sublanguage of Jazz, the identity function is written \(\color{green}{ \lambda ^{\rm s }}\color{blue}{x.x}\). We write sensitivity lambdas as \(\color{green}{ \lambda ^{\rm s }}\) to more easily distinguish them from privacy lambdas, written \(\color{red}{ \lambda ^{\rm p }}\) (described later). The identity function is 1-sensitive in its argument x: If x changes by \(\color{green}{d}\), then the function’s output also changes by \(\color{green}{d}\). Similarly to the identity function, the doubling function \(\color{green}{ \lambda ^{\rm s }} \color{blue}{x.x+x}\) is 2-sensitive in x: If x changes by \(\color{green}{d}\), then the function’s output changes by \(\color{blue}{2}\color{green}{d}\).

Fuzz extends the notion of sensitivity to multi-argument functions by assigning a sensitivity to each argument. For example, the curried function \(\color{green}{\lambda^{\mathrm{s}}} \color{blue}{x.}\color{green}{\lambda^{\mathrm{s}} \color{blue}{y.x+x+y}}\) is 2-sensitive in x and 1-sensitive in y. If x changes by \(\color{green}{d_{x}}\) and y changes by \(\color{green}{d_{y}}\), then the function’s output changes by \(\color{blue}{ 2}\color {green}{d_{x}} +\color{blue}{1}\color {green}{d_{y}}\).

In general, the sensitivity of a function can be written as a linear combination of the changes in its inputs. In Jazz, we express function sensitivities as linear formulas over the function’s input variables, using the variable name itself as a placeholder for the change in that input. Jazz’s type system gives the following types for the three examples we have seen so far:

The linear formulas written above function arrows represent the sensitivity effect of the corresponding function. The general form of sensitivity function types is \(\color{blue}{x:\tau_{1}}\ \color{green}{\xrightarrow\Sigma}\ \color{blue}{\tau_{2}}\), where Σ is the sensitivity effect of the function, expressed as a linear formula. Note that x is in scope for Σ and τ₂. Importantly, occurrences of x in Σ and τ₂ represent the sensitivity of the variable x, rather than its value, so Jazz supports sensitivity-dependent types. Also, we usually drop “null” effects over function arrows such as

above and instead just write \(\color{green}{\rightarrow}\).

As usual, Jazz accommodates higher-order functions by scaling sensitivities. For example, applying a 2-sensitive function twice yields a 4-sensitive function:

In addition to function types, other type connectives in Jazz such as sums and products also carry sensitivity effects, such as \(\color{blue}{\tau _{1}}\ {}^{\color {green}{\Sigma_{1}}}\color{blue}{\oplus} ^{{{{\color {green}{\Sigma_{2}}}}}} {\color {blue}{\tau _{2}}}\) for sums, \(\color{blue}{\tau _{1}}\ {}^{\color {green}{\Sigma_{1}}}\color{blue}{\otimes} ^{{{{\color {green}{\Sigma_{2}}}}}} {\color {blue}{\tau _{2}}}\) for multiplicative products, and \(\color{blue}{\tau _{1}}\ {}^{\color {green}{\Sigma_{1}}}\color{blue}{\&} ^{{{{\color {green}{\Sigma_{2}}}}}}\ {\color {blue}{\tau _{2}}}\) for additive products. These connectives are extensions of the linear type connectives \(\color{blue}{\tau _{1} \oplus \tau _{2}}\), \(\color{blue}{\tau _{1} \otimes \tau _{2}}\) and \(\color{blue}{\tau _{1} \& \tau _{2}}\) from Fuzz, augmented with latent sensitivity effects.

We say that sensitivity effects in Jazz are latent, because they only contribute to the sensitivity of an expression when the type connective is actually eliminated. For instance, in the third example above—a curried function—the sensitivity effect on the first argument is delayed until the second argument is received. If a second argument is never received, then the sensitivity effect on the first argument can be ignored. Likewise, the annotations Σ₁ and Σ₂ in the type \(\color{blue}{\tau _{1}}\ {}^{\color {green}{\Sigma_{1}}}\color{blue}{\oplus} ^{{{{\color {green}{\Sigma_{2}}}}}}\ {\color {blue}{\tau _{2}}}\) encode the latent sensitivity cost for each component of the connective: for τ₁ (the left) and τ₂ (the right), respectively. In contrast to Fuzz, creating a pair in Jazz can have no immediate sensitivity cost: Only projecting out of a pair has a cost in sensitivity, depending on which component is projected. Additionally, we say that sensitivity effects are contextual because Σ can refer to variables in scope.

3.3 Privacy

To encode differential privacy, Jazz makes use of privacy functions (notated \(\color{red}{\twoheadrightarrow}\)) rather than of sensitivity functions (notated \(\color{green}{\rightarrow }\)) as in the previous examples. As such, privacy functions are annotated with privacy—rather than sensitivity—effects. The type of a function from τ₁ to τ₂ that is (ϵ,δ)-differentially private in its argument is as follows:

Semantically, this type describes functions f, where if \(\color{blue}{ {\mathfrak {D}} _{\tau _{1}}(x,x^{\prime }) \le {{{\color {green}d,}}}}\) then f(x) and f(x′) yield distributions that are “(ϵ,δ)-close” according to the definition of (ϵ, δ)-differential privacy.

The annotation \(\color{green}{d}\) is necessary to support (and unique to) advanced variants of differential privacy like (ϵ, δ)-differential privacy. In the pure ϵ-differential privacy framework, it is common to first establish the property for \(\color{green}{d} \color{blue}{= 1}\), that is, if \(\color{blue}{ {\mathfrak {D}} _{\tau _{1}}(x,x^{\prime }) \le 1}\) then f(x) and f(x′) are ϵ-close. Once established, this property then implies that if \(\color{blue}{ {\mathfrak {D}} _{\tau _{1}}(x,x^{\prime }) \le {{{\color {green}d,}}}}\) then f(x) and f(x′) are \(\color {green}{d}\color{blue}{\epsilon}\)-close, for any \({ {{{\color {green}d}}}}\). However, this linear scaling does not carry over to advanced variants like (ϵ, δ)-differential privacy. As a consequence, \({ {{{\color {green}d}}}}\) must be specified directly as a parameter and cannot be recovered by scaling the property instantiated to \({ {{{\color {green}d}}} = 1}\). We refer to this distance—\({ {{{\color {green}d}}}}\)—as the relational distance, since it pertains to the (two-run) relational property of differential privacy, and specifically, the distance between inputs x and x′ for each of the two executions f(x) and f(x′). We also use this terminology in the context of sensitivity, e.g., an s-sensitive function is one that upon inputs within relational distance \({ {{{\color {green}d}}}}\) returns outputs within relational distance \(\color{blue}{ s{{{\color {green}d}}}}\).

As explained in Section 2, differential privacy is usually achieved by the use of mechanisms such as the Laplace (for ϵ-differential privacy) or the Gaussian mechanism (for (ϵ, δ)-differential privacy). In Jazz, the primitive function implementing the Laplace mechanism has the following type:

There are three logical parameters to the laplace function: d is the relational distance (explained above) used in the statement of privacy satisfied by the function, ϵ is the privacy level we want to enforce, and x is the value we (want to protect and) are adding noise to. When executing laplace, the amount of noise added is \(\color{blue}{ {\text{Lap}}(\frac{d}{\epsilon })}\), which depends on both d and ϵ, so they must be runtime values. Also, when typechecking laplace, the amount of privacy obtained depends on ϵ, and the distance d must also be tracked to enforce that the computation feeding laplace with its argument x produces values within relational distance no larger than d. Because the values of d and ϵ are required for both runtime execution and typechecking, we require a form of dependent types.

To support dependent types, we use a singletons approach—a technique initially developed for dependently typed programming in Haskell [33, 39] and which we borrow directly from DFuzz in the context of supporting parameterized differentially private functions [34]. In this approach, each dependent argument has two representations—one for the type and term level, respectively. In the type of laplace, \(\color{blue}{ \hat{d}}\) is the type-level representation of term-level variable d, and likewise for \(\color{blue}{ \hat{\epsilon }}\) and ϵ. (We further discuss singletons and their implementation in Section 8.)

The final argument \(\color{blue}{ x \mathrel {:} {\mathbb {R}}\mathord {\cdotp }\hat{d}}\) will have Laplace noise added to it and then returned as the result of the laplace function. The annotation “\(\color{blue}{ \mathord {\cdotp }\hat{d}}\)” in the type of x places a precondition on the computation used to supply the value to protect: Its output must have relational distance no larger than \(\color{blue}{ \hat{{{\color {green}d}}}}\). After all, the noise added is only guaranteed to give ϵ-differential privacy for values that result from computations with relational distance \(\color{blue}{ \hat{{{\color {green}d}}}}\).

The final privacy effect for the function is \(\color{blue}{ \infty (d + \epsilon) + (\hat{\epsilon },0)x}\), indicating that no privacy promises are made for the values d and ϵ and that privacy is promised for input x with cost \(\color{blue}{ (\hat{\epsilon },0)}\); we write ∞(d + ϵ) as shorthand for ∞d + ∞ϵ.

Like Fuzz, DFuzz, and Duet, Jazz extends the notion of differential privacy from single-argument to multi-argument functions, assigning each argument a privacy cost (e.g., the privacy effect \(\color{blue}{ \infty (d + \epsilon) + (\hat{\epsilon },0)x}\) for the laplace function describes privacy costs for d, ϵ, and x). This approach is formalized in Section 7.3. By convention, most differentially private programs expect a single input to contain the sensitive data, and the privacy cost assigned to this argument is most important in ensuring privacy. The costs associated with the other arguments are typically infinite, indicating that the program does not preserve privacy for these inputs.

We can give a similar type to the gauss function, which provides (ϵ, δ)-differential privacy by adding Gaussian noise drawn from \(\color{blue}{ {\mathcal {N}}(\tfrac{2{{{\color {green}d}}} ^{2}\ln (1.25/\delta)}{\epsilon ^{2}})}\):

In Jazz, privacy primitives are used in the privacy sublanguage. For example, the following privacy-sublanguage expression partially applies the Gaussian mechanism to values for \({ {{{\color {green}d}}}}\), ϵ and δ, resulting in a privacy function that satisfies (1.5, 10^{− 5})-differential privacy for any input at relational distance 4:

Note that we omit the instantiation of forall-quantified type variables \(\color{blue}{ \hat{{{\color {green}d}}}}\), \(\color{blue}{ \hat{\epsilon }}\), and \(\color{blue}{ \hat{\delta }}\) to type-level constants 4, 1.5, and 10^{− 5}, as they can be inferred from the value-level arguments \({ {{{\color {green}d}}}}\), ϵ, and δ.

The privacy sublanguage also contains monadic bind (notated ←) and return constructs for composing differentially private computations. Privacy functions \({\color {red}{\lambda ^{{\mathrm{p}}}}}\color{blue}{(x}\mathord {\cdot }{\color {green}d}). {{\color {red}e}}\) are created in the sensitivity sublanguage (because function creation is “pure”), although the function body e lives in the privacy sublanguage. The annotation \({ {{{\color {green}d}}}}\) is the relational distance explained previously for privacy function types. For example, the following function computes two differentially private results and adds them together:

The bind operator encodes the sequential composition property of differential privacy (Section 2), adding up the ϵ and δ values of subcomputations. The return operator encodes the post-processing property of differential privacy. The relational distance parameter of 1 is in general inferrable during typechecking; we include it as a visible term-level parameter for presentation purposes.

Beyond Duet. The privacy sublanguage of Jazz briefly introduced here lifts a number of important limitations of the privacy sublanguage of Duet. We sketch two of these here and postpone further comparison to later sections.

First, to avoid scaling in the privacy sublanguage, Duet requires the arguments to privacy functions to have a maximum sensitivity or relational distance of 1. This limitation makes it impossible to give general types to the gauss and laplace functions as we have just shown in Jazz. As a result, Duet includes a dedicated type rule for each basic differential privacy mechanism, where each rule is parametric in the sensitivity or relational distance of the argument. Jazz’s addition of relational distance annotation “\(\color{blue}{ \mathord {\cdotp }{{\color {blue}{{{\color {green}d}}}}}}\)” in the types of function arguments eliminates the need for special type rules, and mechanisms can instead be encoded as primitives with an axiomatized type. The primary benefit of this is that the metatheory need not be extended each time a new mechanism is considered.

Second, while pervasive scaling is generally undesirable for privacy costs, some constructs such as advanced composition rely on the ability to scale privacy costs in controlled ways that are supported by theorems specific to that privacy model. Because Duet’s privacy language disallows scaling entirely, these constructs are impossible to encode as functions and must also be given special typing rules. The latent privacy effects in Jazz allow constructs like advanced composition to be given regular function types. Overall, the Jazz design makes differential privacy by typing in the presence of higher-order programming possible for advanced differential privacy variants. The following sections dive into these benefits by focusing first on the sensitivity sublanguage (Section 4) and then the privacy sublanguage (Section 6). Sections 5 and 7 develop the metatheory of each respective sublanguage.

4 Design of Jazz’S Sensitivity Type System

Jazz builds upon prior approaches to encoding differential privacy using linear types. In this section, we first overview some limitations of these approaches related to the tracking of sensitivities and then discuss how they can be addressed by Jazz. In this section, we color expressions and metavariables green as they pertain to the sensitivity fragment of Jazz.

4.1 Linear Products and Sums

Existing approaches based on linear types [34, 46, 49] provide elementary datatype abstractions to programmers such as pairs (products) and tagged unions (sums). However, some of the sensitivity analysis they implement for these datatypes can lead to overly imprecise—or even unsound—approximations in some circumstances.

We now briefly overview these datatype abstractions; a summary is provided in Table 1.

Table 1.

Linear products.

Because existing systems are based on intuitionistic linear logic, two product types emerge: multiplicative products ⊗ and additive products \(\color{blue}{ \mathrel {\&}}\). Multiplicative pairs encode two resources, both of which can be used. Additive pairs encode two resources, but in contrast to multiplicative pairs, only one of them can be used at a time—a computation may use either their left or right component, but not both. This constraint is reflected on the management of type environments in their typing rules. Consider, for instance, the multiplicative product ⟨x, x⟩ and the additive product (x, x). Fuzz generates the following type derivations for the pairs:

where judgment \(\color{blue}{ x :_{{{\color {green}s}}} \tau \vdash e : \tau ^{\prime }}\) denotes that expression e is an \({ {{{\color {green}s}}}}\)-sensitive computation on x (and has type τ′ assuming that x has type τ). The type derivation on the left (for multiplicative products) adds (variablewise) the environments of both components (\(\color{blue}{ x:_{2}{\mathbb {R}} = x:_{1}{\mathbb {R}} + x:_{1}{\mathbb {R}}}\)), reporting a sensitivity of 2 in x. However, the type derivation on the right (for additive products) calculates the maximum (variablewise) between the environments of both components (\(\color{blue}{ x:_{1}{\mathbb {R}} = \mathit {max}(x:_{1}{\mathbb {R}}, x:_{1}{\mathbb {R}})}\)), reporting a sensitivity of 1 in x. The elimination rules also follow these principles: While a multiplicative pair is destructed via pattern matching giving access to both its components, an additive product is destructed via projection operators that give access to a single component.

When applied to sensitivity analysis, these type connectives no longer encode accessibility of a pair of resources; rather, they encode an abstraction of the sensitivities of each component of the pair. The sensitivity for the whole pair is coarse and either tracks the sum of sensitivities of each component (in the case of multiplicative products) or their maximum (in the case of additive products), as reflected in the last column of Table 1.

Linear sums.

Rather than a simultaneous occurrence of resources, sums encode an alternative occurrence of resources. Sums are introduced via inl and inr constructors and destructed via a case expression with one branch for each of the constructors.

In the context of sensitivity analysis, the sensitivity of a sum in(l/r) encodes both the sensitivities of the contained expression e, as well as the sensitivities for the direction of the injection (left or right). For example, inl \(\color{blue}{(x + x)}\) is 2-sensitive in x, however, if \(\color{blue}{y\le 10}\) then inl \(\color{blue}{x}\)else inr \(\color{blue}{x}\) is ∞-sensitive in y because a change in y could change the direction of the injection.

As usual, these systems leverage sums to encode Boolean values, e.g., the Boolean type is encoded as \(\color{blue}{\mathbb {B} \triangleq {\texttt {unit}} \oplus {\texttt {unit}}}\), where unit represents the unit type, inhabited by unit value t t. Under this encoding, an \(\color {green}{\texttt {if}}\color{blue}{-}\color {green}{\texttt {then}}\color{blue}{-}\color {green}{\texttt {else}}\) expression becomes syntactic sugar for a case expression. Also note that Boolean values \(\color{blue}{ {\text{true} \triangleq} \color {green}{\mathtt {inl\ tt}}}\) and \(\color{blue}{ \text{false} \triangleq} \color {green}{\mathtt {inr\ tt}}\) are at distance ∞ from each other in this encoding. This observation will be particularly relevant in some of the forthcoming examples.

4.2 Limitations of Prior Sensitivity Linear Type Systems

Fuzz [49] is the first work to leverage linear (or affine) types for reasoning about program sensitivity. Since its introduction, other systems based on linear types were developed to address different limitations of Fuzz. These primarily comprise DFuzz [34], which allows value-dependent sensitivities and privacy costs, and Duet [46], which allows advanced variants of differential privacy.

Being based on the same underlying sensitivity analysis, all these systems suffer from common limitations related to the sensitivity tracking for products and sums. Through a series of minimal—yet instructive—examples, we now discuss the limitations we have identified.

Limitations related to linear products.

In Fuzz-like systems, each product and sum type introduces an approximation for the sensitivity analysis they underpin. When using pair types, this approximation forces the programmer to predict how each pair will be used in later parts of the program and select the right one to achieve precision: If only one component of the pair is used, then the additive product will give perfect precision; conversely, if both components of the pair are used with the same sensitivity, then the multiplicative product will give perfect precision. This is limiting for abstraction, e.g., a library author must commit to one product type, and clients of the library may turn out to require the other.

Imprecision issues remain even if functions can be inlined: (1) the optimal product choice may be influenced by the dynamic control flow of the program, which cannot be predicted statically in general; and (2) for multiplicative products in particular, if both components of the pair are used with different sensitivities in the body of the pattern match, then the sensitivity estimation may give imprecise results. To illustrate these limitations, consider the following examples as seen by Fuzz:

Limitations related to linear sums.

In addition to imprecision with the product types, Fuzz-like systems exhibit imprecision with sum types. In these systems, the sensitivity analysis for a sum introduction is straightforward: The sensitivity of \(\color {green}{\texttt {in(l/r)}e}\) is simply the sensitivity of e. The sensitivity analysis for a sum elimination via expression

is, however, more involved. First, it computes the sensitivity of \({ {{{\color {green}e}}}_{\color{\blue}{i}}}\) in binder x_i for \(\color{blue}{ i={{{\color {green}1}}},{{{\color {green}2}}}}\) and retains the greatest, say, r. The sensitivity of the overall case expression in some variable, say, x, is then computed as the sum between (1) the max sensitivity of \({ {{{\color {green}e}}}_{\color{blue}{{i}}}}\) in x for \(\color{blue}{ i={{{\color {green}1}}},{{{\color {green}2}}}}\), and (2) the sensitivity of e in x, scaled by r. This brings both unsound and imprecise estimations.

*Example 4.3* (Discontinuous Predicate).

Although DFuzz and derivative systems do not account for this corner case and are, therefore, unsound, this soundness problem is not present in Fuzz, as its type system is constrained to non-null sensitivities (therefore, leaving the program out of its scope). Follow-up work such as Reference [26] and Fuzz^ϵδ introduce rules that recover the analysis soundness by interpreting \(\color{blue}{ \infty \mathord {\cdotp } 0 = 0 \mathord {\cdotp } \infty = \infty}\) rather than \(\color{blue}{ \infty \mathord {\cdotp } 0 = 0}\), but this leads to imprecision elsewhere in the system. For example, with this fix the program \(\color {green}{\texttt {let}}\ \color{blue}{y = x \le 10}\ \color {green}{\texttt {in}}\ \color{blue}{1}\) reports sensitivity ∞ in x despite the term being equivalent to the constant 1:

A more recent work [28] defines a non-commutative multiplication operator where \(\color{blue}{ 0\mathord {\cdotp }\infty = \infty}\) but \(\color{blue}{ \infty \mathord {\cdotp }0 = 0}\). In doing so, it addresses the soundness problem for \(\color {green}{\texttt {case}}\) expressions, and even though not supporting \(\color {green}{\texttt {let}}\)-like operations, it could be extended to do so in a precise manner (e.g.,

). This multiplication operator is, however, awkward to manipulate and not amenable to automation due to lack of support for non-commutative ring theories in SMT solvers. Even still, imprecisions continue to arise in this design, as we will see in the forthcoming Example 4.4.

Besides the corner case described above leading to unsound estimates, imprecise estimates can also arise when eliminating sums. Imprecision arises because, loosely speaking, the analysis approximates the sensitivity of a sum elimination via a case expression as the maximum sensitivity of its branches. As illustrated by the following example, this analysis can dismiss significant information.

4.3 Latent Contextual Effects for Precise Sensitivity Tracking in Jazz

Jazz adopts a novel approach to sensitivity tracking for product and sum types, which can address the previous limitations without the need to rely on scaling of types. The key insight is to delay the tracking of sensitivities whenever possible and to split it into two separate analyses: one for each side of the product or sum. Technically, the main idea is to encode latent sensitivity effects at the type-connective level. For instance, for multiplicative pairs Jazz has type \({ {{\color {blue}{\tau _{1}}}} \mathrel {^{{{{\color {green}{\Sigma _{1}}}}}}\color{blue}{\otimes} ^{{{{\color {green}{\Sigma _{2}}}}}}} {{\color {blue}{\tau _{2}}}}}\), where \(\color {green}{\Sigma _{1}}\) and \(\color {green}{\Sigma _{2}}\) denote the latent sensitivity effects of each of the pair components. This is in contrast to the Fuzz type \({ {{\color {blue}{\tau _{1}}}}\ {{\color {blue}{\otimes\ \tau _{2}}}}}\), which pays for all of its sensitivity effects upfront when the pair is created.

Precise products.

Consider the three related multiplicative pair constructions:

For the purpose of sensitivity analysis, Fuzz is unable to distinguish them, as it derives the very same type judgment for all three, namely

The type judgment says that the pairs are 2-sensitive in x and 1-sensitive in y (the subscript annotations in the type environment) but does not say how this sensitivity effect is distributed between the pair components. In other words, Fuzz treats pairs as a whole. In contrast, Jazz can derive three different type judgments, precisely capturing the sensitivity of each pair component:

Recall from Section 3.2 that in Jazz, we use linear formulas to denote sensitivity effects and therefore, in, e.g., the first type judgment above, 2x + y refers to the sensitivity effect \(\color {green}{\Sigma } \color{blue}{\triangleq \lbrace x \mapsto 2,y \mapsto 1\rbrace}\), meaning 2-sensitive in x and 1-sensitive in y. Moreover, we elide null sensitivity effects like 0x + 0y. This fine-grained tracking of the sensitivity of each pair component allows, in turn, deferring the payment of the pair sensitivity effect to the precise point where the pair is used, i.e., eliminated, and therefore paying only for what (and how it) is used. For example, if pair e₃ is used in a context where only its first component is referred, we pay for sensitivity effect x. Fuzz, in contrast, would always pay 2x + y.

Let us discuss the benefits that this fine-grained tracking brings to Examples 4.1 and 4.2. Consider first the program from Example 4.1, more concretely, the variant with additive pairs. The sensitivity of the then-branch is calculated as 6x from scaling by 3 the (latent) sensitivity effect 2x of the left component of pair p. Likewise, the sensitivity of the else-branch is calculated also as 6x from scaling by 2 the sum of (latent) sensitivity effects 2x and x of the respective left and right component of the pair. As a result, Jazz reports the precise sensitivity of 6x for the whole program. An analogous fine-grained tracking for the program from Example 4.2 gives also precise sensitivity 2x + 2y.

Precise sums.

The use of latent sensitivity effects yields tighter sensitivity bounds also for sums. However, the handling of sums impose an additional technical challenge related to the impossibility of delaying sensitivity effects. To illustrate this phenomenon, consider expressions:

All three expressions are ∞-sensitive in x. Fuzz sensitivity analysis conflates the three expressions to the same type, and some Fuzz derivative systems with support for 0-sensitivities, such as DFuzz, derive an unsound type (w.r.t. the embodied sensitivity analysis) for e₆: \(\color{blue}{ x \mathrel {:}_{0} {\mathbb {R}} \vdash {{{\color {green}e_{6}}}} \mathrel {:} {\mathbb {R}} \oplus {\mathbb {R}}}\).

Jazz derives instead:

The types of e₄ and e₅ encode a latent sensitivity effect for each side of the sum. In contrast, the type of e₆ is not able to represent its ∞-sensitivity in x as a latent effect, because x influences which injection is used to create the sum itself, not the value inside the injection. Instead, the effect must be paid for eagerly in the so-called ambient sensitivity effect (which was elided in previous examples). Therefore, type judgments in Jazz have shape \({{{\color {blue}\Gamma }}}\ {{\color {green}{\vdash e}} \mathrel {:}} {{{\color {blue}\tau }}} \ {{\color {green}{;\Sigma} }}\), where Σ represents the ambient sensitivity effect and Γ is a “traditional” environment, mapping variables to types. Thus, expression e₆ is formally typed as:

with ambient sensitivity effect ∞x. e₄ is typed as \(\color{green}{x : {\mathbb {R}}} \vdash {{\color {green}e_{4}}} \mathrel {:} {{{\color {blue} {{ {\mathbb {R}} \mathrel {^{{{\color {blue}\infty x}}}\oplus } {\mathbb {R}} }}}}} \mathrel {;} {{{\color {blue}\varnothing }}}\), i.e., with an empty ambient sensitivity effect, and analogously for e₅.

To showcase the benefits of this design, let us re-examine Example 4.4. In Jazz, the type for a is \(\color{blue}{ {\mathbb {R}} \mathrel {^{{{\color {blue}\infty x}}}\oplus ^{{{\color {blue}x}}}} {\mathbb {R}}}\) with ambient effect b.⁴ To compute the sensitivity of the \(\color {green}{\texttt {case}}\)-expression over a, we join—by taking the variable-wise maximum—the ambient effect of a, namely, b, with the “global” sensitivity effect of the second branch, namely, [b + x/x₂]x₂—the first branch is dismissed because it has no ambient effect. To compute the purported sensitivity effect of the second branch, we take its ambient effect x₂ and replace every occurrence of the branch binder, also x₂, with the effect b + x of the right component of a, computed as the sum between its ambient effect b and its latent effect x. This yields an overall sensitivity of b + x = b⊔[b + x/x₂]x₂ for the \(\color {green}{\texttt {case}}\)-expression.

Consider now Example 4.3. The guard x ≤ 10 of the conditional expression has type \(\color{blue}{{\mathtt {unit}} \oplus {\mathtt {unit}}}\) with ambient effect ∞x. Since the branches are constant and have no ambient effect, they do not contribute to the sensitivity of the conditional. Jazz analysis then concludes that the sensitivity of the conditional reduces to the ambient sensitivity of the guard, namely, ∞x, recovering soundness (and precision).

Jazz recovers soundness and precision for all four examples discussed in Section 4.2, as summarized in Table 2. With this observation, we conclude our motivation for the design of the Jazz sensitivity type system based on latent contextual effects.

Table 2.

Fuzz(*)		DFuzz-like TS(**)		Jazz
Reported Sensitivity	Bound Quality	Reported Sensitivity	Bound Quality	Reported Sensitivity	Bound Quality
Example 4.1 (additive)
8x	loose	8x	loose	6x	tight
Example 4.2 (multiplicative)
2x + 2y	tight	4x + 2y	loose (in x)	2x + 2y	tight
Example 4.3
∞x	tight	0x	unsound	∞x	tight
Example 4.4
∞x + b	loose (in x)	∞x + 0b	loose (in x)	x + b	tight
			unsound in b
Example 4.5
p	tight	p	tight	2p	loose (latent)
				p	tight (prepay)

Table 2. Comparison of Sensitivity Type-system: Fuzz and DFuzz-like Type Systems vs. Jazz

(*): sensitivities strictly greater than 0, programs transformed using scaling.(**): sensitivities can be greater or equal to 0, no scaling allowed.

As a final remark, note that, contrary to Fuzz, Jazz does not currently support recursive types; such functions are required to be primitives, as illustrated in Section 8.

The following section presents the formal development of latent contextual effects for sensitivity typing and includes a step-by-step type derivation for all four examples.

5 Sax: Jazz’S Sensitivity Type System, Formally

In this section, we present a core sensitivity sublanguage of Jazz, called Sax, for which we develop the sensitivity metatheory. In particular, we prove the type soundness property known as sensitivity metric preservation [49]. The core subset of Jazz that extends Sax with privacy is presented in later sections.

5.1 Syntax and Type System

The Sax type system is technically a type-and-effect system [36]. It supports real numbers, functions, sums, and products. As Sax only deals with ambient effects, all metavariables and keywords are typeset in green. Syntax. Figure 1 presents the syntax of Sax.

Fig. 1.

Expressions \({ {{{\color {green}e}}}}\) are mostly standard and include: real number \({ {{{\color {green}r}}}}\), addition \({ {{{\color {green}e}}} \color{blue}+ {{{\color {green}e}}}}\), multiplication \({ {{{\color {green}e}}} \color{blue}{*} {{{\color {green}e}}}}\), comparison \(\color{blue}{ {{{\color {green}e}}} \le {{{\color {green}e}}}}\), variable x, sensitivity lambda \(\color{blue}{ {{\lambda ^{\!^{{}_{\mathrm{s}}}}}} (x\mathrel {:}\tau).{{{\color {green}e}}}}\), application \(\color{blue}{ {{{\color {green}e}}}\ {{{\color {green}e}}}}\), unit value \(\color {green}{\texttt {t t}}\), sum constructors \(\color {green}{\texttt {inl}}^{\color{blue}{\tau _{2}}}\ \color {green}{e}\) and \(\color {green}{\texttt {inr}}^{\color{blue}{\tau _{2}}}\ \color {green}{e}\), and the sum destructor

Sax also supports two linear products types: additive and multiplicative. With additive products, the sensitivity of a pair may be approximated as the max of the sensitivities of each side; this sensitivity is paid for every projection. With multiplicative products, the sensitivity of a pair may be approximated as the sum of the sensitivities of each side; this sensitivity is paid for every tuple pattern match, scaled by the sensitivities of pattern variables in the body. We write additive product constructions \(\color{blue}{ {{\color {blue}{{({{{\color {green}e}}},{{{\color {green}e}}})}}}}}\) and destructions \(\color {green}{\texttt {fst}}\ \color {green}{e}\) and \(\color {green}{\texttt {snd}}\ \color {green}{e}\), and multiplicative product constructions \(\color{blue}{ {{\color {blue} {{\langle {{{\color {green}e}}},{{{\color {green}e}}}\rangle }}}}}\) and destructions \(\color {green}{\texttt {let}}\ \color{blue}{x,x=}\color {green}{e}\ \color {green}{\texttt {in}}\ \color {green}{e}\).

Finally, an expression \({ {{{\color {green}e}}}}\) can be an ascription \(\color{blue}{ {{{\color {green}e}}} \mathrel {:: } {{{\color {green}\tau }}}}\), or a derived expression such as a Boolean b, a conditional \(\color {green}{\texttt {if}\ e\ \texttt {then}\ e\ \texttt {else}\ e}\), or a let expression \(\color{green}{\texttt {let}}\ \color{blue}{x=}\color{green}{e\ \texttt{in}\ e}\). Booleans are encoded as \(\color{blue}{ \text{true} \triangleq} \color {green}{\mathtt {inl\ tt}}\), \(\color{blue}{\text{false} \triangleq} \color {green}{\mathtt {inr\ tt}}\), and \(\color{blue}{ {\mathbb {B}}}\) as \(\color {blue}{\texttt {unit}^{\varnothing }\oplus ^{\varnothing }\texttt {unit}}\), conditionals as

, and let expressions as

A sensitivity \({ {{{\color {green}s}}}}\) is either a non-negative real number or the symbol ∞, which represents an unbounded sensitivity; we notate this set \(\color{blue}{ {{{\color {blue} {{{\mathbb {R}}}}}}}^{\infty }_{\ge 0} \triangleq {{{\color {blue} {{{\mathbb {R}}}}}}}_{\ge 0} \uplus \lbrace \infty \rbrace}\). A sensitivity environment \({ {{{\color {green}\Sigma }}}}\) is a mapping from variables to their sensitivities. For convenience, we write sensitivity environments as first-order polynomials, e.g., \(\color{blue}{ {{{\color {green}\Sigma }}} = 1x + 2y}\) corresponds to an environment \({ {{{\color {green}\Sigma }}}}\) such that \(\color{blue}{ {{{\color {green}\Sigma }}}(x) = 1}\) and \(\color{blue}{ {{{\color {green}\Sigma }}}(y) = 2}\). A type τ is either the real number type \({ {{{\color {blue} {{{\mathbb {R}}}}}}}}\), the Boolean type \({ {{{\color {blue} {{{\mathbb {B}}}}}}}}\), the unit type \(\color {blue}{\texttt {unit}}\), a function type

, a sum type \(\color{blue}{ \tau \mathrel {^{{{{\color {green}\Sigma }}}}\oplus ^{{{{\color {green}\Sigma }}}}} \tau}\), an additive product type \(\color{blue}{\tau}\ {}^{\color {green}{\Sigma}}\color{blue}{\&} ^{{{{\color {green}{\Sigma}}}}}\ {\color {blue}{\tau}}\), or a multiplicative product type \(\color{blue}{ \tau \mathrel {^{{{{\color {green}\Sigma }}}}\otimes ^{{{{\color {green}\Sigma }}}}} \tau}\). The sensitivity environment annotation \({ {{{\color {green}\Sigma }}}}\) is called the latent contextual sensitivity effect (also called latent effect when clear from the context) and represents a delayed effect that emerges when a term of said type is eliminated. The latent effect \({ {{{\color {green}\Sigma }}}}\) of a function of type

corresponds to the effects of applying the function, i.e., a static approximation of the sensitivity of each variable used in its body. The sensitivity environment \(\color {green}{\Sigma _{1}}\) (respectively, \(\color {green}{\Sigma _{2}}\)) in \(\color{blue}{ \tau _{1} \mathrel {^{{{{\color {green}\Sigma _{1}}}}}\oplus ^{{{{\color {green}\Sigma _{2}}}}}} \tau _{2}}\) corresponds to the latent effect of the injected value using inl (respectively, inr). And similarly, \(\color {green}{\Sigma _{1}}\) and \(\color {green}{\Sigma _{2}}\) in \(\color{blue}{ \tau _{1} \mathrel {^{{{{\color {green}\Sigma _{1}}}}}\&^{{{{\color {green}\Sigma _{2}}}}}} \tau _{2}}\) or \(\color{blue}{ \tau _{1} \mathrel {^{{{{\color {green}\Sigma _{1}}}}}\otimes ^{{{{\color {green}\Sigma _{2}}}}}} \tau _{2}}\) correspond to the latent effect of accessing the first and second components of the pair, respectively. Finally, a type environment Γ is, as usual, a mapping from variables to types.

Type system. The Sax type system is presented in Figure 2. The judgment \({ {{{{\color {blue}\Gamma }}}\ {{{\color {green}\vdash }}}\ {{{\color {green}e}}}\ \color{blue}{:}\ {{{\color {blue}\tau }}} \mathrel {;} {{{\color {green}\Sigma }}}}}\) says that the term \({ {{{\color {green}e}}}}\) has type τ and ambient sensitivity effect \({ {{{\color {green}\Sigma }}}}\) (or ambient effect when clear from the context) under type environment Γ. The ambient effect \({ {{{\color {green}\Sigma }}}}\) represents an upper bound (conservative approximation) of the real sensitivity of \({ {{{\color {green}e}}}}\) after executing the program. The use of a sensitivity environment \({ {{{\color {green}\Sigma }}}}\) is different from Duet, where sensitivities are tracked in Γ and presented as a necessary condition to typecheck the expression. In other words, in Sax \({ {{{\color {green}\Sigma }}}}\) is used to infer sensitivities, whereas in Duet Γ is used to check sensitivities.

Fig. 2.

–

Rulesrlit andunit are standard and report no effect, as no variable is accessed. These two rules present no novelty with respect to Duet.

–

Rule var is mostly standard; it reports an ambient effect 1x.

For example,

–

Rule plus computes the resulting ambient effect as the addition of the ambient effects of both subterms. To add sensitivity environments, we use the + operator, which is simply defined as the addition of polynomials, e.g., (1x + 2y) + (3x) = 4x + 2y.

For example, in the following type derivation

we write 2x instead of x + x.

–

Rulestimes andleq are similar toplus, but the resulting sensitivity effect is scaled by infinity because (1) the sensitivity of a multiplication when neither side is a constant is unbounded, and (2) the distance between distinct Boolean values is deemed infinite, as explained in Section 4.2. Scaling a sensitivity environment \({ {{{\color {green}\Sigma }}}}\) by sensitivity \({ {{{\color {green}s}}}}\), written \({ {{{\color {green}s}}}{{{\color {green}\Sigma }}}}\), produces a new sensitivity environment in which each sensitivity in \({ {{{\color {green}\Sigma }}}}\) is multiplied by \({ {{{\color {green}s}}}}\). For multiplication, we assume that 0s = s0 = 0 for all \(\color{blue}{s \in \mathbb {R}^{\infty }_{\ge 0}}\) and we deem ∞s = s∞ = ∞ for s ≠ 0.

Rulesl-scale andr-scale address the overapproximation yielded by ruletimes when one of the factors is a real number. For instance, for program 0.5*x rulel-scale reports a (precise) sensitivity of 0.5x, whereas rulestimes would report ∞x.

–

Rule lam typechecks sensitivity functions and is novel with respect to Duet. The type of the function is annotated with a latent effect \({ {{{\color {green}\Sigma }}}}\), computed as a subset of the effect of its body. On a fully latent discipline, the whole body effect is left as latent and the ambient effect \({ {{{\color {green}\Sigma ^{\prime }}}}}\) of the function is empty. However, full eagerness of effects, as in Duet, is achieved when the latent effect \({ {{{\color {green}\Sigma }}}}\) is empty and the full effect of the body is paid upon construction.

Since the splitting of \(\color{blue}{ {{{\color {green}\Sigma }}} + {{{\color {green}\Sigma ^{\prime }}}}}\) is non-deterministic, a lambda expression can be given many types, ranging from fully latent to fully eager disciplines. We show this behavior later when explaining rulespair andtup. The implementation addresses this issue through the use of additional type annotations. Without loss of generality, in this article, we assume the fully latent derivation for all lambdas unless stated otherwise. The same applies to other language constructs that exhibit this kind of non-deterministic prepayment of latent effects.

For example, consider program \(\color{blue}{ {{{\color {green}{\color {green}\lambda ^{\!^{{}_{\mathrm{s}}}}}}}} (x\mathrel {:}{{{\color {blue} {{{\mathbb {R}}}}}}}).x+x}\) and its type derivation:

The ambient effect of the program is empty (values are pure) but its latent effect is 2x, the ambient effect of its body.

–

Rule app deals with function application. Unlike Duet, as variable x may be free in τ₂ (e.g., τ₂ can be a function type whose latent effect includes x), the resulting type replaces x with the ambient effect \(\color {green}{\Sigma _{2}}\) of its argument using the sensitivity environment substitution operator defined in Figure 3. For instance, consider type

. After application, if \(\color{blue}{ {{{\color {green}\Sigma _{2}}}} = 3y}\), then the resulting type would be

. The ambient effect of an application is computed as the ambient effect of the function \(\color {green}{\Sigma _{1}}\) plus its latent effect; but as x is free, we substitute it by \(\color {green}{\Sigma _{2}}\), e.g., if \({ {{{\color {green}\Sigma }}} = {{{\color {green}\Sigma ^{\prime }}}}+{{{\color {green}s}}}x}\), then \(\color{blue}{ [{{{\color {green}\Sigma _{2}}}}/x]({{{\color {green}\Sigma _{1}}}} + {{{\color {green}\Sigma ^{\prime }}}} + {{{\color {green}s}}}x) = {{{\color {green}\Sigma _{1}}}} + ({{{\color {green}\Sigma ^{\prime }}}} + {{{\color {green}s}}}{{{\color {green}\Sigma _{2}}}})}\). This is different from Duet, as where the latent effect of the function \({ {{{\color {green}\Sigma }}}}\) is paid when the function is created.

Fig. 3.

For instance, consider the open program \(\color{blue}{ ({{{\color {green}{\color {green}\lambda ^{\!^{{}_{\mathrm{s}}}}}}}} (x\mathrel {:}{{{\color {blue} {{{\mathbb {R}}}}}}}).2*x+y)\ (3*y)}\) and the following type derivation:

The resulting ambient effect cannot depend on x (otherwise it would be free), therefore, it is computed as the substitution of x by the ambient effect of the argument [3y/x](2x + y) = 7y.

–

Contrary to previous work [34, 49], and in particular Duet, Ruleinl does not necessarily report the effect of its body. The payment of effects for the subexpression (or a subset of it) can be delayed and eventually paid only if the sum is accessed or used. The term is tagged with type τ₂ to aid type inference. The resulting type is just a sum type where the latent effect of the left type is \({ {{{\color {green}\Sigma }}}}\), a subset of the ambient effect of its subterm, and the latent effect of the right type is empty (as it will never be used/accessed, so we choose the tighter ambient effect). Non-determinism is addressed similarly to rulelam. Ruleinr is defined similarly.

For instance, consider the type derivations of expressions \(\color{blue}{ e_{4} =} \color {green}{\texttt {inl}}\ \color{blue}{(x*x)}\) and \(\color{blue}{ e_{4} =} \color {green}{\texttt {inr}}\ \color{blue}{(x*x)}\) of Section 4.3:

For expression e₄, the latent effect of the left type is ∞x and of the right type is empty (it is the tighter upper bound, as the right component cannot be accessed). An analogous argument is used for e₅.

–

Rule case is more involved. The resulting type of the case is just the least upper bound (join) of the branch types τ₂ and τ₃. The join operator is defined in Figure 4. Note that similarly to rule app, τ₂ and τ₃ may have x and y as free variables, respectively, thus, we replace those variables with the ambient effects of using the sum term \({ {{{\color {green}e_{1}}}}}\): \(\color{blue}{ {{{\color {green}\Sigma _{1}}}} + {{{\color {green}\Sigma _{1 1}}}}}\) and

, respectively. The resulting ambient effect is computed as follows: We join the cost of reducing \({ {{{\color {green}e_{1}}}}}\): \(\color {green}{\Sigma _{1}}\), with the join of the cost of taking each branch. This is different from Duet, where \(\color {green}{\Sigma _{1}}\) is added to the cost of taking each branch, leading to a looser bound. Similarly to types τ₂ and τ₃, ambient effects \(\color {green}{\Sigma _{2}}\) and \({ {{{\color {green}\Sigma _{3}}}}}\) may have x and y free, so we substitute them away from the effects. Note that we use the join between Σ₁ and the cost of the branches (instead of the addition for instance), otherwise the result would be less precise when the branches use x or y.

For instance, the type derivation of Example 4.4 is described below:

where \(\color{blue}{ \Gamma = b: {{{\color {blue} {{{\mathbb {B}}}}}}}, x:{{{\color {blue} {{{\mathbb {R}}}}}}}}\), and

. As 0∞ = 0, the resulting ambient effect is b⊔(0b + 0(∞x))⊔(1b + 1(x)) = b⊔(0b + 0x)⊔(b + x) = b + x, where previous work reported ∞ on x.

Notice that if we change the program to \({ {{{\color {green}{\texttt {case}}}}}\ {{{\color {green}e}}}\ {{{\color {green}{\texttt {of}}}}}\color{blue}{\lbrace x_{1} \Rightarrow 0\rbrace \lbrace x_{2} \Rightarrow 1\rbrace}}\), then the resulting ambient effect is b⊔(0b + 0(∞x))⊔(0b + 0(x)) = b, i.e., the payment is not zero but b, the cost of reducing expression \({ {{{\color {green}e}}}}\) to a value.

Example 4.3 is desugared and typechecked as follows:

where \({ {{{\color {blue} {{{\mathbb {B}}}}}}} = {{\color {blue}{\texttt {unit}}}} \color{blue}{ {^{\varnothing }\oplus ^{\varnothing }}} {{\color {blue}{\texttt {unit}}}}}\). As \(\color{blue}{ \infty x \sqcup (0(\infty x) + 0(\varnothing)) \sqcup (0(\infty x) + 0(\varnothing)) = \infty x \sqcup 0x = \infty x}\), the expression is ∞-sensitive in x.

–

Rules pair andtup are novel and non-deterministic: the ambient effects are computed using subsets of the ambient effect of each component. If the ambient effects of the left component is \({\color {green}{\Sigma _{1}}\color{blue}{+}\color {green}{\Sigma ^{\prime }_{1}}}\) and of the right component is \({ {{{\color {green}{\Sigma _{2}}}}}\color{blue}{+}{{{\color {green}{\Sigma ^{\prime }_{2}}}}}}\) (for some \({ {{{\color {green}{\Sigma _{1}}}}}\color{blue}{,} {{{\color {green}{\Sigma ^{\prime }_{1}}}}}\color{blue}{,} {{{\color {green}{\Sigma _{2}}}}}\color{blue}{,} {{{\color {green}{\Sigma ^{\prime }_{2}}}}}}\)), then the latent effects of using the left component is \({ {{{\color {green}{\Sigma _{1}}}}}}\), and for the right component is \({ {{{\color {green}{\Sigma _{2}}}}}}\). For additive products, the ambient effect is the maximum between \({ {{{\color {green}{\Sigma ^{\prime }_{1}}}}}}\) and \({ {{{\color {green}{\Sigma ^{\prime }_{2}}}}}}\), and for multiplicative products, the sum between \({ {{{\color {green}{\Sigma ^{\prime }_{1}}}}}}\) and \({ {{{\color {green}{\Sigma ^{\prime }_{2}}}}}}\).

For instance, let us consider examples e₁ = (2*x + y, 0), e₂ = (0, 2*x + y), and e₃ = (x, x + y) from Section 4.3. We present next “lazy” type derivations for each of the examples:

where \(\color{blue}{ \Gamma = x:{\mathbb {R}}, y:{\mathbb {R}}}\).

Now, let us consider examples (2x, x) and ⟨2x, x⟩. Here are six possible type derivations of paying eagerly for effects:

Note that the difference between the two form of products is only present when effects are paid eagerly for both components.

–

Rules proj1 andproj2 typecheck the deconstruction of an additive product. The ambient effect is computed as the cost of reducing the product (\({ {{{\color {green}\Sigma }}}}\)) plus the cost of accessing either the first or the second component correspondingly (\({\color {green}{\Sigma _{1}}}\) or \({\color {green}{\Sigma _{2}}}\)). This differs from Duet, where, and conservatively, the cost of accessing both components are paid when the pair is created. In Sax, we only paid for the component we are accessing.

For instance, let us consider the first projections of last examples:

Contrary to previous work, the ambient effects of all three projections are different, as they capture precisely the variables accessed on the corresponding component.

–

Rule untup typechecks the deconstruction of a multiplicative product and is a little more involved. To compute the ambient effect, we start by paying for \({\color {green}{\Sigma _{2}}}\), the ambient effect of subexpression \({\color {green}{e _{2}}}\). We also want to pay \({\color {green}{\Sigma _{11}}}\) and \({\color {green}{\Sigma _{12}}}\), the cost of accessing the left and the right components, respectively, proportionally to the sensitivity of the left and right variables x₁ and x₂ in \({\color {green}{e _{2}}}\), i.e., \({ {{{\color {green}{s_{1}}}}}{{{\color {green}{\Sigma _{1 1}}}}} \color{blue}{+} {{{\color {green}{s_{2}}}}}{{{\color {green}{\Sigma _{1 2}}}}}}\). But, we also have to pay for \(\color {green}{\Sigma _{1}}\), the ambient effect of \({ {{{\color {green}{e_{1}}}}}}\). We could pay \({ \color{blue}{(}{{{\color {green}{s_{1}}}}} \color{blue}{+} {{{\color {green}{s_{2}}}}}\color{blue}{)}{{{\color {green}{\Sigma _{1}}}}}}\), but that would be an unnecessary over-approximation. For instance, program \({ {{{\color {green}{\texttt {let}}}}}\ \color{blue}{x_{1}, x_{2} = p}\ {{{\color {green}{\texttt {in}}}}}\ \color{blue}{x_{1} + x_{2}}}\) would pay twice for p (the ambient effect of \({ {{{\color {green}{e_{1}}}}}}\)), even though the whole pair is used only once. Instead, we want to pay proportional to the the maximum sensitivity between x₁ and x₂, i.e., \({ \color{blue}{(}{{{\color {green}{s_{1}}}}} \color{blue}{\sqcup} {{{\color {green}{s_{2}}}}}) {{{\color {green}{\Sigma _{1}}}}}}\). Finally, the ambient effect of the let expression is \({ \color{blue}{(}{{{\color {green}{s_{1}}}}} \color{blue}{\sqcup} {{{\color {green}{s_{2}}}}}\color{blue}{)} {{{\color {green}{\Sigma _{1}}}}} \color{blue}{+} {{{\color {green}{s_{1}}}}}{{{\color {green}{\Sigma _{1 1}}}}} \color{blue}{+} {{{\color {green}{s_{2}}}}}{{{\color {green}{\Sigma _{1 2}}}}} \color{blue}{+} {{{\color {green}{\Sigma _{2}}}}}}\).

For instance, let us consider the typing derivation of Example 4.2:

where \(\color{blue}{ \Gamma = x:{{{{{{\mathbb {R}}}}}}}, y:{{{{{{\mathbb {R}}}}}}}}\). The resulting ambient effect is 1(2x) + 2(y) = 2x + 2y.

Now consider \(\color{blue}{ \Gamma = p \mathrel {:} {{{{{{\mathbb {R}}}}}}} \mathrel {^{\varnothing }\otimes ^{\varnothing }} {{{{{{\mathbb {R}}}}}}}}\), and the following typing derivation:

We can typecheck subexpression \({ {{{\color {green}{\texttt {let}}}}}\ \color{blue}{x_{1},x_{2}=p}\ {{{\color {green}{\texttt {in}}}}}\ \color{blue}{(x_{1},x_{2})}}\) in different ways. If we do not prepay effects, then \({ {{{\color {green}{\Sigma _{1}}}}} \color{blue}{= x_{1},} {{{\color {green}{\Sigma _{2}}}}} \color{blue}{= x_{2}}}\), and \({ {{{\color {green}{\Sigma _{3}}}}} \color{blue}{= \varnothing}}\). Thus, \({ {{{\color {green}{\Sigma ^{\prime }_{1}}}}} \color{blue}{=} {{{\color {green}{\Sigma ^{\prime }_{2}}}}} \color{blue}{= p}}\), and \({ {{{\color {green}{\Sigma ^{\prime }_{3}}}}} \color{blue}{= \varnothing}}\). Finally the ambient effect of the program is 2p.

If we prepay the accesses of x₁ and x₂, then \({ {{{\color {green}{\Sigma _{1}}}}} \color{blue}{= \varnothing ,} {{{\color {green}{\Sigma _{2}}}}} \color{blue}{= \varnothing}}\), and \({ {{{\color {green}{\Sigma _{2}}}}} \color{blue}{= x_{1} + x_{2}}}\). Thus, \({ {{{\color {green}{\Sigma ^{\prime }_{1}}}}} \color{blue}{=} {{{\color {green}{\Sigma ^{\prime }_{2}}}}} \color{blue}{= \varnothing}}\), and \({ {{{\color {green}{\Sigma ^{\prime }_{3}}}}} \color{blue}{= (1 \sqcup 1)p = p}}\). Finally the ambient effect of the program is p.

–

Rules for Booleans, conditionals, and let expressions are derived rules from sums, case, and application rules, respectively, and can be found in Figure 5.

–

Finally, Ruleascr is the only rule that supports the use of subtyping and takes the role of checking whether the subexpression is subtype of a given type.

Fig. 4.

Fig. 5.

Subtyping for types and sensitivity environments is presented in Figure 6 and is mostly standard. We only allow subtyping for the sensitivity parts of types. A sensitivity environment is subtype of another if their sensitivities are less than or equal to the other for each variable. For instance, \(\color{blue}{ (x:{{{ {{{\mathbb {R}}}}}})} \mathrel {{{{\color {green}{\xrightarrow {{{{\color {blue}{x+y}}}}}}}}}} {{{\color {blue}{ {{{\mathbb {R}}}}}}} \lt : (x:{{{ {{{\mathbb {R}}}}}}})} \mathrel {{{{\color {green}{\xrightarrow {{{{\color {blue}{x+2y+3z}}}}}}}}}} {{{\color {blue}{{{{\mathbb {R}},}}}}}}}\) because x + y < : x + 2y + 3z (x ≤ x, y ≤ 2y, and 0z ≤ 3z).

Fig. 6.

5.2 Type Safety

Type safety is established relative to the runtime semantics of Sax. We adopt a big-step semantics with explicit substitutions. Concretely, we use \(\color{blue}{ \gamma \vdash }\ {{{\color {green}{e}}}\ \color{blue}{\Downarrow}\ {{{\color {green}{v}}}}}\) to represent that configuration \(\color{blue}{ \gamma \vdash}\ {{{\color {green}{e}}}}\) —formed by expression \({ {{{\color {green}e}}}}\) and value environment γ mapping variables to values— reduces to value \({ {{{\color {green}v}}}}\) after some number of steps. Reduction rules are rather standard and can be found in Appendix B, Figure 27.

To establish Sax type safety, we employ simple unary logical relations, called the type safety logical relations, that characterize well-typed, non-stuck execution. This relation is well defined, because it is defined by induction over the structure of types. The type safety result itself is derived as a corollary of the fundamental property of the type safety logical relations.

The type safety logical relations is defined in Figure 7. For simplicity, we only present the cases for real numbers, variables, functions, and sums. The other cases are similar and straightforward. The unary logical relations are split into mutually recursive value relations \(\color{blue}{ {\mathcal {V}}}\), computation relation \(\color{blue}{ {\mathcal {E}}}\), and environment relation \(\color{blue}{ {\mathcal {G}}}\), and defined as follows:

Fig. 7.

–

Any value is in \(\color{blue}{ Atom[\![ \tau ]\!]}\) if the value typechecks to some τ′ < : τ under an empty type environment.

–

A real number is in the value relation at type \(\color{blue} {\mathbb {R}}\) if the number is in \(\color{blue}{ Atom[\![ {\mathbb {R}}]\!]}\).

–

Similarly, a unit value \({ {{{\color {green}{\texttt {tt}}}}}}\) is always related at type \({ {{\color {blue}{\texttt {unit}}}}}\).

–

An inl(respectively, inr) value is in the value relation at \(\color{blue}{ \tau _{1} \mathrel {^{\varnothing }\oplus ^{\varnothing }} \tau _{2}}\) if the value is in \(\color{blue}{ Atom[\![ \tau _{1} \mathrel {^{\varnothing }\oplus ^{\varnothing }} \tau _{2}]\!]}\) and the underlying value v is in the value relation at τ₁ (respectively, τ₂).

–

A closure is in the value relation at type \(\color{blue}{ {(x\mathrel {:}\tau _{1})} \mathrel {{{{\color {green}{\xrightarrow {{{{{{\color {green}{s}}}}\color{blue}{x}}}}}}}}} \color{blue}{\tau _{2}}}\) if it satisfies \(\color{blue}{ {Atom[\![ (x\mathrel {:}\tau _{1}) }\mathrel {{{{\color {green}{\xrightarrow {{{{{{{\color {green}{s}}}}\color{blue}{x}}}}}}}}}} \color{blue}{\tau _{2}]\!]}}\), and given any value \({ {{{\color {green}v}}}}\) in the value relation at argument type τ₁, the extended configuration \(\color{blue}{{ \gamma [x \mapsto}\ {{{\color {green}{v}}}}] \color{blue}{\vdash}\ {{{\color {green}{e}}}}}\) is in the computation relation at type τ₂/(x: τ₁). We use the ./Γ operator to remove variables from a type and is defined as follows:

–

A configuration is in the computation relation at type τ if the configuration reduces to some value v, which is itself in the value relation at type τ.

–

Finally, a value environment γ is in the environment relation at Γ if the domains of γ and Γ are the same, and for each variable in the domain of γ the underlying value γ(x) is in the value relation at type Γ(x)/Γ (we use the ./Γ operator to emphasize that the type is closed).

As usual, the fundamental property of the type safety logical relation states that well-typed open terms are in the relation closed by an adequate environment γ:

Proposition 5.1 (Fundamental Property of the Type Safety Logical Relation).

Let \(\color{blue}{ {\Gamma \vdash} {{{\color {green}{e}}}} \color{blue}{: \tau ;}\ {{{\color {green}{\Sigma} }}}}\), and \(\color{blue}{ \gamma \in {\mathcal {G}}[\![ \Gamma ]\!]}\). Then \(\color{blue}{ \gamma \vdash}\ {{{\color {green}{e}}}} \ \color{blue}{\in {\mathcal {E}}[\![ \tau /\Gamma ]\!]}\).

Type safety for closed terms follows immediately as a corollary:

Corollary 5.1 (Type Safety and Normalization of Sax)

Let \(\color{blue}{ \vdash} {{{\color {green}{e}}}} \color{blue}{: \tau ; \varnothing}\), then \(\color{blue}{ \vdash}\ {{{\color {green}{e}}}}\ \color{blue}{\Downarrow}\ {{{\color {green}{v}}}}\) for some \({ {{{\color {green}v}}}}\) and τ′, such that \(\color{blue}{ \vdash}\ {{{\color {green}{v}}}}\ \color{blue}{: \tau ^{\prime };\varnothing}\) and τ′ < : τ.

5.3 Type Soundness

This section establishes the type soundness of Sax, stated in terms of a metric preservation result. Loosely speaking, metric preservation captures the maximum variation of an open term when it is closed under two different (but related) environments.

Logical relations. To establish this soundness result, we make use of logical relations [4, 6]. In particular, we define (mutually recursive) logical relations for sensitivity values, computations, and environments; see Figure 8. The logical relations for values (\(\color{blue}{ {\mathcal {V}}}_{{{{\color {green}{d}}}}}\color{blue}{[\![ \sigma ]\!]}\)) and computations (\(\color{blue}{{ {\mathcal {E}}}_{{{{\color {green}{d}}}}}\color{blue}{[\![ \sigma ]\!]}}\)) are indexed by a relational distance \({ {{{\color {green}{d}}}} \color{blue}{\in} {{{\color {blue}{ {{{\mathbb {R}}}}}}}^{\infty }_{\ge 0}}}\) and a so-called relational distance type σ, which is a regular type where sensitivity environments are enriched with a constant \({ {{{\color {green}{d}}}} \color{blue}{\in} {{{\color {blue}{{{{\mathbb {R}}}}}}}^{\infty }_{\ge 0}}}\) denoting the distance induced by pair of substitutions. Formally, the syntax of relational distance types is defined as follows:

Fig. 8.

Notice that the logical relations do not mention sensitivity environments \({ {{{\color {green}\Sigma }}}}\), because they are defined over closed terms and values. Nevertheless, relational distance types σ do mention sensitivity environments \({ {{{\color {green}\Sigma }}}}\). We use a combination of sensitivity environments and relational distances (\({ {{{\color {green}{\Sigma} }}} \color{blue}{+} {{{\color {green}{d}}}}}\)), because function types introduce binders that cannot be substituted until application. For instance, consider type \(\color{blue}{ (x\mathrel {:}{\mathbb {R}}) }\mathrel {{{{\color {green}{\xrightarrow {{{{\color {blue}{x + y}}}}}}}}}} \color{blue}{({\mathbb {R}} \mathrel {^{x + 3z}\oplus ^{2x + 2y}} {\mathbb {R}})}\) and two pair of substitutions for y and z, at distance 2 and 1, respectively, e.g., γ₁ = y↦1, z↦1 and γ₂ = y↦3, z↦2, where |γ₁(y) − γ₂(y)| ≤ 2 and |γ₁(z) − γ₂(z)| ≤ 1. The corresponding relational distance type after substitution is

. For notation simplicity, in the rest of the section, we name relational distance types as types when the accompanying relational distances can be inferred from the context. Also, we omit the environment notations when they are empty. However, the logical relation for environments (\(\color{blue}{ {{\mathcal {G}}}_{{{{\color {green}{\Delta }}}}}\color{blue}{[\![ \Gamma ]\!]}}\)) is indexed by a relational distance environment \({ {{{\color {green}\Delta }}}}\), mapping variables to relational distances in \(\color {blue}{\mathbb {R}}^{\infty }_{\ge 0}\) and a type environment Γ. We use \(\color{blue}{ (}{{{\color {green}{v_{1}}}}}\color{blue}{,}{{{\color {green}{v_{2}}}}}\color{blue}{)}\ \color{blue}{\in {\mathcal {V}}}_{{{{\color {green}{d}}}}}\ \color{blue}{[\![ \sigma ]\!]}\) to denote that value \(\color {green}{v_{1}}\) is related to value \(\color {green}{v_{2}}\) at type σ and relational distance \({ {{{\color {green}d}}}}\) and likewise for expressions (i.e., computations) and environments.

To define logical relations, we also make use of relational distance instantiations, which have shape \({ {{{\color {green}{\Delta} }}} \color{blue}{\cdot }\color{blue}{(}{{{\color {green}{\Sigma} }}} \color{blue}{+} {{{\color {green}{d}}}}\color{blue}{)}}\) and act by replacing free variables in sensitivity environment \({ {{{\color {green}\Sigma }}}}\) with the distances provided by distance environment \({ {{{\color {green}\Delta }}}}\). Relational distance instantiations only close variables defined in \({ {{{\color {green}\Delta }}}}\) and are formally defined as:

Furthermore, to close a type under a sensitivity environment, we use the relational distance type instantiation operator \({ {{{\color {green}{\Delta} }}}\color{blue}{(\sigma)}}\) (note that a τ is also an σ assuming that the “default” relational distance \({ {{{\color {green}d}}}}\) is 0) defined below:

Now that we have all the prerequisites, we briefly go through the definition of the logical relations (in Figure 8)⁵:

–

Two real numbers are related at type \({ {{{\color {blue}{{{\mathbb {R}}}}}}}}\) and distance \({ {{{\color {green}d}}}}\), if and only if the absolute difference between both numbers is at most \({ {{{\color {green}d}}}}\). For instance, \(\color{blue}{ (1,3) \in {\mathcal {V}}_{2}[\![ {{{ {{{\mathbb {R}}}}}}}]\!]}\) and \(\color{blue}{ (3, 1) \in {\mathcal {V}}_{2}[\![ {{{ {{{\mathbb {R}}}}}}}]\!]}\), as the logical relations are reflexive.

–

Unit value \(\color {green}{\texttt {tt}}\) is always related to itself at type \(\color {blue}{\texttt {unit}}\) under any distance.

–

Two inl (respectively, inr) values are related at \(\color{blue}{\sigma _{1}} \mathrel {^{{{{\color {green}{d_{1}}}}}}\color{blue}{\oplus} ^{{{{\color {green}{d_{2}}}}}}}\ \color{blue}{\sigma _{2}}\) and distance \({ {{{\color {green}d}}}}\) if the underlying values are related at type σ₁(respectively, σ₂) and distance \({ {{{\color {green}{d}}}} \color{blue}{+} {{{\color {green}{d_{1}}}}}}\) (respectively, \({ {{{\color {green}{d}}}} \color{blue}{+} {{{\color {green}{d_{2}}}}}}\)). The intuition is that \({ {{{\color {green}d}}}}\) can be treated as the distance between two computations that reduce to the given sums, and \({ {{{\color {green}{d_{1}}}}}}\) can be treated as the distance between the underlying values; thus, the total cost is the addition of both distances. For instance, for any \({ {{{\color {green}d}}}}\) and σ, we have \(\color{blue}{(} \color{green}{\mathtt{inl}}\ \color{blue}{1,}\ \color{green}{\mathtt{inl}}\ \color{blue}{3)}\ \color{blue}{\in\ \mathcal{V}_{0}[\![ \mathbb{R}\ {}^{2}\oplus}^{\color{green}{d}}\ \color{blue}{\sigma]\!]}\) because they are at immediate distance zero (both are inl) and latent distance 2; instead of delaying the distance, one also has \(\color{blue}{(} \color{green}{\mathtt{inl}}\ \color{blue}{1,}\ \color{green}{\mathtt{inl}}\ \color{blue}{3)}\ \color{blue}{\in\ \mathcal{V}_{2}[\![ \mathbb{R}\ {}^{0}\oplus}^{\color{green}{d}}\ \color{blue}{\sigma]\!]}\), i.e., both values are at distance 2 with zero latent distance between their content.

–

Two additive (respectively, multiplicative) products are related at type \(\color{blue}{ \sigma _{1}} \mathrel {^{{{{\color {green}{d_{1}}}}}}\color{blue}{\&}^{{{{\color {green}{d_{2}}}}}}} \color{blue}{\sigma _{2}}\) (respectively, \(\color{blue}{ \sigma _{1}} \mathrel {^{{{{\color {green}{d_{1}}}}}}\color{blue}{\otimes} ^{{{{\color {green}{d_{2}}}}}}} \color{blue}{\sigma _{2}}\)) and distance \({ {{{\color {green}d}}}\color{blue}{+}{{{\color {green}{d^{\prime }_{1}}}}} \color{blue}{\sqcup} {{{\color {green}{d^{\prime }_{2}}}}}}\) (respectively, \({ {{{\color {green}d}}}\color{blue}{+}{{{\color {green}{d^{\prime }_{1}}}}} \color{blue}{+} {{{\color {green}{d^{\prime }_{2}}}}}}\)) if both first components are related at type σ₁ and distance \({ {{{\color {green}d}}} \color{blue}{+} {{{\color {green}{d_{1}}}}}\color{blue}{+}{{{\color {green}{d^{\prime }_{1}}}}}}\) and both second components are related at type σ₂ and distance \({ {{{\color {green}d}}} \color{blue}{+} {{{\color {green}{d_{2}}}}}\color{blue}{+}{{{\color {green}{d^{\prime }_{2}}}}}}\). For instance,

are at distance 0 and

and \(\color{blue}{ (4,5) \in {\mathcal {V}}_{1}[\![ {{{ {{{\mathbb {R}}}}}}}]\!]}\).

–

Two sensitivity closures are related if, given related inputs, they produce related computations. In more detail, first the environments have to be related at some Γ and distance environment \({ {{{\color {green}\Delta }}}}\). Note that \({ {{{\color {green}\Delta }}}}\) has to be the same environment that closes the latent effect of the function \({ {{{\color {green}{\Delta} }}}\color{blue} {\cdot }{{{\color {green}{\Sigma} }}} \color{blue}{+} {{{\color {green}{s^{\prime }}}}}\color{blue}{x}}\) and the one that closes the input type (\(\color{blue}{ {\sigma _{1} =} {{{\color {green}{\Delta} }}}\color{blue}{(\tau _{1})}}\)). Second, inputs \({ {{{\color {green}{v^{\prime} _{1}}}}}}\) and \({ {{{\color {green}{v^{\prime }_{2}}}}}}\) have to be related at argument type σ₁ and any distance \({ {{{\color {green}{d^{\prime }}}}}}\). Finally, the body of the functions in environments extended with inputs \({ {{{\color {green}{v^{\prime }_{1}}}}}}\) and \({ {{{\color {green}{v^{\prime }_{2}}}}}}\) have to be related computations at type \({ {{{\color {green}{d^{\prime }}}}}\color{blue}{x(\sigma _{2})}}\) and distance \(\color {green}{d}\color{blue}{+}\color {green}{\Delta} \color{blue}{\cdot }\color {green}{\Sigma }\color{blue}{+}\color {green}{sd^{\prime }}\). Note that, as the variable x is out of scope after the application, we replace any instance of x with the distance of the inputs \(\color {green}{d^{\prime }}\) using the distance type instantiation operator. The new distance at which both computations are now related is computed as the addition of the distance of the values \({ {{{\color {green}d}}}}\) and the closed latent effect \(\color {green}{d^{\prime }}\ \color{blue}{x(}\color {green}{\Delta} \color{blue}{\cdot }\color {green}{\Sigma }\color{blue}{+}\color {green}{s}\color{blue}{x) =} \color {green}{\Delta } \color{blue}{\cdot }\color {green}{\Sigma }\color{blue}{+} \color {green}{sd^{\prime }}\). For instance,

, as in this case \(\color {green}{\Delta } \color{blue}{= 2y}\), \(\color {green}{\Sigma } \color{blue}{= 1y}\), and \(\color {green}{\Delta } \color{blue}{\cdot} \color {green}{\Sigma } \color{blue}{= 2y {\cdot } 1y = 2}\).

–

Two sensitivity configurations are related computations at type σ and distance \({ {{{\color {green}d}}}}\), noted \(\color{blue}{ (\gamma _{1} \vdash} \color {green}{e_{1}}\color{blue}{, \gamma _{2} \vdash} \color {green}{e_{2}}\color{blue}{) \in {\mathcal {E}}_{\color {green}{d}}\color{blue}{[\![ \sigma ]\!]}}\), when the distance is infinite, or if the first configuration reduces to a value, then the second configuration also reduces to a value, and these values are related at type σ and distance \({ {{{\color {green}d}}}}\).

–

Finally, value environment γ₁ is related to value environment γ₂ at type environment Γ and distance environment \(\color {green}{\Delta }\), written \(\color{blue}{ (\gamma _{1},\gamma _{2}) \in {\mathcal {G}}_{\color {green}{\Delta} }}\color{blue}{[\![ \Gamma ]\!]}\), if they both map each variable x in the type environment to values related at their corresponding type (closed with \({ {{{\color {green}\Delta }}}}\)) and at distance \(\color {green}{\Delta }\color{blue}{(x)}\).

Sensitivity Metric Preservation. Armed with these logical relations, we can establish the notion of type soundness and prove the fundamental property—well-typed terms are related with themselves—which corresponds to metric preservation [49]. As usual, we state this property appealing to open terms, where free variables indicate input parameters, which are then closed by related value environments.

Theorem 5.1 (Sensitivity Metric Preservation).

If \(\color{blue}{ \Gamma \vdash}\ {{{\color {green}{e}}}} \color{blue}{\mathrel {:} \tau \mathrel {;}} {{{\color {green}{\Sigma }}}}\), then for any distance environment \({ {{{\color {green}\Delta }}}}\) with \(\color{blue}{ dom(\Gamma) \subseteq dom(}{{{\color {green}{\Delta} }}}\color{blue}{)}\) and any pair of value environments \(\color{blue}{ (\gamma _{1},\gamma _{2}) \in {\mathcal {G}}}_{{{{\color {green}{\Delta} }}}}\color{blue}{[\![ \Gamma ]\!]}\), it holds that \(\color{blue}{ (\gamma _{1}\vdash}\ {{{\color {green}{e}}}},\color{blue}{\gamma _{2}\vdash }\ {{{\color {green}{e}}}}\color{blue}{) \in {\mathcal {E}}}_{{{{\color {green}{\Delta} }}}\color{blue}{\mathord {\cdotp }}{{{\color {green}{\Sigma} }}}}\color{blue}{[\![} {{{\color {green}{\Delta }}}(\tau)]\!]}\).

In other words, if a sensitivity term is well-typed, then for any valid distance environment \({ {{{\color {green}\Delta }}}}\) (that “fits” Γ) and any two value environments γ₁, γ₂ related at Γ and \({ {{{\color {green}\Delta }}}}\), configurations \(\color{blue}{ \gamma _{1}\vdash}\ \color {green}{e,}\ \color{blue}{\gamma _{2}\vdash}\ \color {green}{e}\) represent related computations at type \(\color {green}{\Delta }\color{blue}{(\tau)}\) (closing all free variables) and distance \(\color {green}{\Delta }\color{blue}{\mathord {\cdotp }}\color {green}{\Sigma }\). Note that, since \(\color{blue}{ dom(}\color {green}{\Sigma }\color{blue}{) \subseteq dom(\Gamma) \subseteq dom(}\color {green}{\Delta }\color{blue}{)}\), we have \(\color {green}{\Delta }\color{blue}{\mathord {\cdotp }}\color {green}{\Sigma } \color{blue}{\in \mathbb {R}^{\infty }_{\ge 0}}\).

From the above theorem it is easy to derive a corollary that only characterizes closed terms:

Corollary 5.2.1 (FP for Closed Sensitivity Terms).

If \(\color{blue}{ \varnothing \vdash}\ \color {green}{e} \color{blue}{\mathrel {:} \tau \mathrel {;} \varnothing}\), then \(\color{blue}{ (\varnothing \vdash}\ \color {green}{e,} \color{blue}{\varnothing \vdash}\ \color {green}{e}\color{blue}{) \in {\mathcal {E}}_{\color {green}{0}}}\color{blue}{[\![ \tau ]\!]}\).

As a direct consequence of Theorem 5.1, we can also establish the sensitivity type soundness at base types:

Theorem 5.3 (Sensitivity Type Soundness at Base Types).

Let us illustrate metric preservation by revisiting some examples. Consider Example 4.2:

If we know that in two different executions x may differ in at most 1, and y in at most 3, i.e., Δ = 1x + 3y, then the result will differ in at most \(\color{blue}{ \Delta \mathord {\cdotp } (2x + 2y) = 1\mathord {\cdot }2 + 3\mathord {\cdot }2 = 8}\). For instance, if in one execution x is bound to 0 and y to 4, then the result will be 8. In a second execution, if x is bound to 1 and y to 6, then the result will be 14. Comparing both results, we get |8 − 14| = 6 ≤ 8. Finally, in a third execution, if x is bound to 1 and y to 7, then the result will be 16. Comparing with the first execution, we have |8 − 16| = 8 ≤ 8, and with the second |14 − 16| = 2 ≤ 8.

Now consider Example 4.3:

In this case, if x varies in two different executions (Δ(x) > 0), then the outcome will differ in at most \(\color{blue}{ \Delta (x)\mathord {\cdot }\infty =\infty}\). For instance, if in one execution x is bound to 0 the result will be true, and if in a second execution x is bound to 1, then the result is also going to be true, and true is at distance zero with respect to itself, and 0 ≤ ∞. If in a third execution x is bound to 11, then the result will be false, and false is at distance infinity with respect to true. Now, if we now that x is constant across multiple executions (Δ(x) = 0), then we know from metric preservation that the result will differ in \(\color{blue}{ 0\mathord {\cdot }\infty = 0}\), i.e., the result will be constant.

6 Design of Jazz’S Privacy Type System

In this section, we review the limitations of prior approaches related to the tracking of privacy and then discuss how they are addressed by Jazz. In this section, we color expressions and metavariables red as they pertain to the privacy fragment of Jazz.

6.1 Privacy Closures

Consider a family of looping combinators parameterized by the number of loop iterations n, e.g., where \(\color{blue}{ {\text{loop}}_{3}\ x\ f = f(f(f\ x))}\). In Fuzz, loop_n would have the type \(\color {blue}{\tau \rightarrow (\tau\rightarrow {\scriptstyle \bigcirc } \tau ) \multimap _{n} {\scriptstyle \bigcirc } \tau}\). In this type, regular arrows → mean no sensitivity is tracked for the argument. The linear arrow \(\color{blue}{ \multimap _{{{\color {blue}n}}}}\) means the result is n-sensitive (where n is the number of loop iterations) in the closure variables of the supplied function of type \(\color{blue}{(\tau \rightarrow {\scriptstyle \bigcirc } \tau )}\). This allows for instantiating loop with a closure capturing a sensitive variable, like db. So, \(\color{blue}{\text{loop}_{n}0(\lambda x.x+{\mathtt{laplace}_{\epsilon }\ db)}}\) will give nϵ differential privacy for db by scaling ϵ—the privacy cost of closure variable db—by the loop iteration n. When supporting advanced variants of differential privacy like (ϵ, δ), a different metric must be chosen to recover this kind of scaling; otherwise, this argument only holds for pure ϵ-differential privacy.

In Duet, to support (ϵ, δ)-differential privacy (and disallow problematic scaling), privacy closures immediately report unbounded privacy (∞) for any captured variables in privacy lambdas. The principle of loop’s type above is justified in Duet, but not via a scaling argument, and instead via a primitive type rule—it cannot be expressed as a type. This is problematic for two reasons: First, it is not possible to extend Duet’s implementation with new looping primitives by adding terms with axiomatically justified types, leading to a bloated set of core typing rules, and second, it is not possible to lambda abstract looping combinators, e.g., to chain or compose them in helper functions.

To see the root cause for the limitation in Duet, we show the type rules for looping (advanced composition) and function introduction (from Reference [46]):

In the rule for advanced composition shown above (left), e₁ is the initial value for the looping state of type τ, and e₂ is the loop body that updates the looping state \(\color {blue}{\tau \rightarrow \tau }\) and may mention closure variables in Γ₂. Parameter δ′ is a meta-parameter for the advanced composition formula—this parameter is unique to looping in (ϵ, δ)-differential privacy. The notation

means there must exist some privacy cost ϵ and δ which upper-bounds any individual cost for each of these closure variables. The privacy cost of the whole loop is calculated based on this upper bound for closure variables with the formula

. An attempt to turn loop into a primitive (or abstract over loop, e.g., eta-expand via lambda abstraction) fails, because privacy types in Duet do not track privacy effects for closure variables; instead, they are just thrown away. In the rule for function introduction shown above (right), the function type τ₁@(ϵ, δ)⊸^*τ₂ is a probabilistic function from elements in τ₁ to elements in τ₂, which satisfies (ϵ, δ)-differential privacy in its argument. Notice the closure environment Γ above the line that is bumped to ∞ in

below the line. This has the effect of tossing out privacy bounds for anything with non-zero privacy in Γ, i.e., any closure variables that are used in the function’s definition. Privacy is only tracked for the function parameter x (or possibly multiple parameters; privacy functions in Duet are n-ary).

A deeper limitation in Duet is that the iterated 1-ary function space does not generalize to support encoding n-ary functions (i.e., currification is not supported). For this reason, n-ary functions are primitive in Duet. Implementing n-ary from 1-ary functions is computationally possible in Duet, but results in discarding bounds on privacy effects. For example, the Duet term

in a context where

has type (τ₁@∞)⊸^*(τ₂@(ϵ₂, δ₂))⊸^*τ₃, i.e., the privacy bounds (ϵ₁, δ₁) for the first argument τ₁ get discarded due to the Duet: Privacy-Fun-I rule.

Privacy Closures in Jazz. In Jazz, both privacy and sensitivity effects are delayed and attached to type-level connectives, including for privacy functions. Whereas, in Duet, privacy functions are written

, privacy functions in Jazz are written simply

where Σ is a latent contextual effect that can mention x. A type can now be given to loop (a named constant, analogous to the loop primitive from Duet) in Jazz, and abstracting over loop is possible due to the function introduction rule, also shown below.

N-ary functions are now recoverable from 1-ary ones using latent contextual effects in closures. The relational distance \(\color {green}{d}\) defaults to 1 when omitted. The encoding of lambda-abstracted gauss then follows the usual approach of nested lambda abstractions, but with sensitivity lambdas on the outside with a single privacy lambda on the inside. A 3-ary abstraction of the Gaussian mechanism applied to the sum of three arguments is as follows:

Notice here that the latent contextual effect is computed using a syntactic join operator (ϵ, δ)x⊔(ϵ, δ)y⊔(ϵ, δ)z, which computes the pointwise maximum, instead of the sum ((ϵ, δ)x + (ϵ, δ)y + (ϵ, δ)z). One of the novelties of Jazz is that we can reason about two executions where more than one input is at relational distance greater than 0. In particular, if x, y and z are at relational distance 1, i.e., the argument of \(\color{blue}{ {\text{gauss}}3\epsilon \delta}\) is at relational distance 3, then using addition would yield an over-approximated latent privacy of (3ϵ, 3δ), while using the join, we obtain a latent privacy of (ϵ, δ), as desired.

Abstracting Privacy Mechanisms. Even with support for privacy closures, there are still challenges in supporting lambda abstraction around privacy mechanisms in full generality. In Fuzz, the type assigned to the family of Laplace differential privacy mechanisms parameterized by ϵ is \(\color{blue}{\text{laplace}_{\epsilon } \mathrel {:} \mathbb {R} \multimap _{\epsilon } {\scriptstyle \bigcirc } \mathbb {R}}\) for achieved privacy ϵ. This mechanism does not need a dedicated type rule in the core calculus—it can be axiomatized as a primitive with the right type—and lambda-abstracting this primitive is natural via eta-expansion \(\color{blue}{ \lambda (x \mathrel {:}{\mathbb {R}}). \text{laplace}_{\epsilon }x}\) resulting in the same type and guarantee for privacy. However, this approach does not support (ϵ, δ)-differential privacy directly. Fuzz^ϵδ shows how to extend Fuzz to recover (ϵ, δ)-differential privacy by using graded comonadic liftings and path construction. In particular, the type assigned to the family of Gaussian differential privacy mechanisms parametrized by ε and δ is

, where \(\color{blue}{ \lceil {\mathbb {R}}\rceil}\) is \(\color{blue}{ {\mathbb {R}}}\) but with the metric rounded up to the nearest integer. In Duet, to support (ϵ, δ)-differential privacy, the Gaussian mechanism requires its own typing rule, shown below. Furthermore, a use of the mechanism looks like

where the argument e is a term in the sensitivity language with sensitivity bounded by s. Using privacy closures as described above, we can write

, however, note that we have lost the ability to be parametric in s—it must be fixed to 1. This assumption that gauss will be called only with a 1-sensitive argument is enforced in the function application rule in Duet, also shown below:

In the rule for gauss (left) it allows an argument of any sensitivity s, however, the privacy function application rule (right) restricts that arguments must have sensitivity equal to 1. Restricting gauss to only 1-distance arguments can be overly restrictive (e.g., \(\color {red}{\texttt {gauss}}\ \color{blue}{2\epsilon \delta (x + x)}\)), and relaxing the restriction on function application to an arbitrary s ≠ 1 in Duet would be unsound.

In Jazz, we extend function introduction to include an explicit bound on the sensitivity of the parameter and enforce this restriction in the application rule. Function introduction syntax introduces the bound and allows us to eta-expand the Gaussian mechanism with relational distance \({ {{{\color {green}d}}}}\) as a parameter, as shown below. The bound \({ {{{\color {green}d}}}}\) for the lambda argument is then enforced in function application as the upper bound of argument relational distance, instead of being fixed to 1 as in Duet. Now the use of a variable—like x in the body of eta-expanded gauss below—is not always considered 1-sensitive. To communicate non-zero sensitivities to variables in the type system, an environment of relational distances on lambda arguments must be threaded through the type system, which we notate \({ {{{\color {green}\Delta }}}}\). After extending this \({ {{{\color {green}\Delta }}}}\) to remember that x has relational distance \({ {{{\color {green}d}}}}\) in Jazz lambda abstraction, \(\color{blue}{ {\text{gauss}}\ s\ \epsilon\ \delta\ x}\) will see x as \({ {{{\color {green}d}}}}\) distant inside the lambda body. To do this, we allow lambda-abstracting gauss (including the distance parameter \({ {{{\color {green}d}}}}\) via singleton types) and extend the structure of typing for sensitivity and privacy terms respectively as follows:

6.2 Sensitivity Binding in Privacy Contexts

Jazz improves on prior systems by supporting let-binding intermediate sensitivity computations within the privacy language while also supporting (ϵ, δ)-differential privacy. Fuzz and DFuzz encode let-binding through function application, which scales the sensitivity of the right-hand side of the let with the sensitivity of the let-variable in the body. So, \(\color{blue}{\texttt {let}\ y = 2 * x\ {\texttt {in}}\ 3 * y}\) is 6-sensitive in x, because the right-hand side is 2-sensitive, and this is scaled by 3, the sensitivity of y in the body. However, monadic return and bind in Fuzz can also be used to encode let-binding, e.g., \(\color{blue}{ x \leftarrow {\text{return}}\ e_{1} \mathrel {;} e_{2}}\) instead of \(\color{blue}{\texttt {let}\ x=e_{1}\texttt {in}\ e_{2}}\). Unfortunately, this encoding of let using return and monadic bind does not preserve typeability in Fuzz; instead, it destroys the sensitivity/privacy analysis of the right-hand side, bumping its privacy cost unnecessarily to ∞. For this reason, let statements are encoded exclusively through function application in Fuzz and not through monadic return/bind.

In Fuzz, let-binding a sensitivity computation (the pure fragment) inside a privacy computation (the monadic fragment)—via encoding through function application—is supported seamlessly without the addition of extra rules. This flexibility can be extended to advanced privacy variants as shown by de Amorim et al. [29]. In Duet, however, the privacy/monadic fragment of Fuzz is pulled out into its own language with explicit typing rules; the primary reason to do this is to place restrictions on function application to support advanced privacy variants, as described in the previous subsection. This leaves the need for either an explicit typing rule for let-binding inside the privacy language or an escape hatch so privacy analysis is not destroyed for let-binding in privacy contexts à la Fuzz. Duet solves this issue by introducing a boxed type that delays the payment of a sensitivity term at the point it is “boxed” and pays for it later when it is “unboxed.” This avoids the issue but is unfriendly to program with: Every let-binding requires an explicit box, and every use of a let-bound variable requires an explicit unbox. So, instead of writing the program below on the left, Duet programmers are forced to write the program on the right.

In this program, it is essential to let-bind the expensive result, since inlining it would unnecessarily duplicate the computation, and many real programs in differential privacy require support for this pattern [46].

In Jazz, we recover the expressiveness that box types provide while eliminating the need for the programmer to explicitly introduce and eliminate them. In this way, our design can also be seen as a powerful box-inference capability, although we do not demonstrate explicit embeddings between a core language with box types. To recover the expressiveness of boxes without requiring the programmer to write them down, we add new information to typing judgments that has the effect of automatically boxing let-bound variables in privacy contexts and unboxing them at their use. The added information extends typing judgments with a new component Φ that tracks the sensitivities of all let-bound variables w.r.t. the sensitivities of all lambda-bound variables. All sensitivity contexts that mention both let-bound and lambda-bound variables are then reduced using Φ as needed to contexts that only mention lambda-bound variables. Φ can be seen as a matrix, and the reduction of contexts to only lambda-bound variables is then just matrix multiplication—a beautiful coincidence for a linear type system. The final form of type judgments for the sensitivity and privacy type systems are then:

Although the prototype implementation adopts the typing rules with Φ, and because the manipulation of Φ is more tedious than insightful, we omit it in the following technical presentation.

7 Jazz’S Differential Privacy Type System, Formally

In this section, we present a core subset of Jazz, dubbed \(\lambda _{\text{{J}}}\). \(\lambda _{\text{ {J}}}\) is an extension of Sax with support for reasoning about differential privacy. Similarly to Sax, we prove the type safety and type soundness property of \(\lambda _{\text{ {J}}}\). We discuss how to bridge the gap between \(\lambda _{\text{ {J}}}\) and Jazz in Section 8. Note that our formalism is fixed to (ϵ, δ)-differential privacy, but our design can be instantiated to other forms of advanced differential privacy disciplines, as illustrated in Section 8.

7.1 Syntax and Type System

\(\lambda _{\text{ {J}}}\) is divided in two mutually embedded sublanguages: the sensitivity sublanguage—an extension of Sax—used to reason about the sensitivity of computations, and the privacy sublanguage, used to reason about differential privacy. Thus, the type system of \(\lambda _{\text{ {J}}}\) contains two mutually embedded type systems, one for each of the sublanguages. Expressions of the sensitivity sublanguage remain typeset in green and expressions of the privacy sublanguage are typeset in red.

Syntax. Figure 9 presents the syntax of \(\lambda _{\text{ {J}}}\). Expressions of the language are divided into two mutually embedded expressions: sensitivity expressions \(\color {green}{e}\) and privacy expressions \(\color {red}{\mathsf {e}}\). Sensitivity expressions are defined the same way as in Sax, except that functions are split into sensitivity lambdas

and privacy lambdas

. Note that the only difference between a sensitivity lambda and a privacy lambda is that the body of a privacy lambda is a privacy expression \(\color {red}{\mathsf {e}}\). Also, both sensitivity lambdas and privacy lambdas are parametrized by a relational distance \({ {{{\color {green}d}}}}\) that represents an upper bound on distance between inputs pertained to the binary relational property of differential privacy: the maximum argument variation for each of two executions.

Fig. 9.

A privacy expression \(\color {red}{\mathsf {e}}\) can be a point distribution \(\color {red}{\texttt {return}}\ \color {green}{e}\), a sequential composition \(\color{blue}{ x:\tau \leftarrow} \color {red}{\mathsf {e}}\color{blue}{\mathrel {;}}\color {red}{\mathsf {e}}\), an application \(\color{red}{\mathsf {e\ e}}\), a conditional \(\color{red}{{\texttt{if}}\ \color {green}{e}\ \color{red}{\texttt {then}}\ {\mathsf{e}}\ {\texttt{else}}\ {\mathsf {e} }}\), a case expression

, or a let

A privacy cost \(\color{red}{p} \color{blue}{= (}\color {red}{\epsilon }, \color {red}{\delta }\color{blue}{)}\) is a pair of two (possibly infinite) real numbers, where the first component corresponds to the epsilon and the second to the delta in (ϵ, δ)-differential privacy. We use notation \(\color{red}{p}\color{blue}{.}\color {red}{\epsilon }\) and \(\color{red}{p}\color{blue}{.}\color {red}{\delta }\) to extract \({ {{{\color {red}\epsilon }}}}\) and \({ {{{\color {red}\delta }}}}\), respectively. A privacy environment \({ {{{\color {red}\Sigma }}}}\) is either an empty environment \(\color{blue}{ \varnothing}\), a pair \(\color {red}{p}\color{blue}{x}\) representing that variable x has privacy cost \({ {{{\color {red}p}}}}\), the addition \(\color {red}{\Sigma } \color{blue}{+} color {red}{\Sigma }\) of two privacy environments, the join \(\color {red}{\Sigma }\ \color{blue}{\sqcup}\ \color{red}{\Sigma }\) of two privacy environments, and the meet \(\color {red}{\Sigma }\ \color{blue}{\sqcap}\ \color{red}{\Sigma}\) of two privacy environments. Similarly to sensitivity environments, we also write privacy environments as first-order polynomials when possible. For instance, \(\color {red}{p_{1}}\color{blue}{x +} \color {red}{p_{2}}\color{blue}{x}\) can be written as \(\color{blue}{ (}\color{red}{p_{1}}\ \color{blue}{+}\ \color {red}{p_{2}}\color{blue}{)x}\), but

cannot be rewritten as a polynomial without losing precision. Function types are now divided into sensitivity function types

and privacy function types

Sensitivity type system. The type system for the sensitivity sublanguage is presented in Figure 10. The judgment

now includes a novel relational distance environment \(\color {green}{\Delta }\). The relational distance environment \(\color {green}{\Delta }\) stores how much each variable in Γ can vary in every two executions of a program. Most of the rules are straightforward extensions of the type system of Sax to include relational distance environments. We only present interesting cases.

Fig. 10.

Some of the rules use the sensitivity environment substitution operator \(\color{blue}{ [}\color {green}{\Sigma }\color{blue}{/x]\tau}\). We extend the definition of Sax to support privacy functions, as shown in Figure 11. Substitution on privacy function types depends on the definition of sensitivity environment substitution on privacy environments \(\color{blue}{ [}\color {green}{\Sigma }\color{blue}{/x]}\color {red}{\Sigma }\). \(\color{blue}{ [}\color {green}{\Sigma }\color{blue}{/x]}\color {red}{\Sigma }\) is defined inductively on the structure of \({ {{{\color {red}\Sigma }}}}\), where the only interesting case is when \(\color {red}{\Sigma } \color{blue}{=} \color {red}{p}\color{blue}{x}\). Substitution \(\color{blue}{ [}\color {green}{\Sigma }\color{blue}{/x]}\color {red}{p }\color{blue}{x}\) is defined using the lift operator:

. Intuitively, if we wiggle⁶ x on \(\color {red}{p}\color{blue}{x}\), then the privacy obtained is at most \({ {{{\color {red}p}}}}\) (no scaling and zero if x does not change). After substitution, as x depends on all variables on \({ {{{\color {green}\Sigma }}}}\), if we wiggle all variables in \({ {{{\color {green}\Sigma }}}}\) at the same time, then the privacy obtained should still be \({ {{{\color {red}p}}}}\) (scaling \({ {{{\color {red}p}}}}\) would be an over-approximation). Because of this,

is defined as the join

, where \(\color{blue}{ x_{i} \in dom(}\color {green}{\Sigma }\color{blue}{)}\). If all x_i wiggle, then the privacy obtained would be at most \({ {{{\color {red}p}}}}\). But as any \(\color {green}{\Sigma }\color{blue}{(x_{i})}\) can be zero (it means that variable x_i is not used), we multiply each \({ {{{\color {red}p}}}}\) in \(\color {red}{p}\color{blue}{x_{i}}\), by

to remove those variables from the resulting privacy environment.

along other lift operators used by Near et al. [46] are defined in Figure 11. For instance, suppose that x depends on 2y + 0z, then \(\color{blue}{ [(2y+0z)/x]}\color{red}{p}\color{blue}{x}\) is computed as

Fig. 11.

We now turn to describe the main changes of each type rule with respect to Sax.

–

Rule var now requires variable x to be present in the relational distance environment \(\color {green}{\Delta }\). This way, if

, we can compute how much the result of evaluating \({ {{{\color {green}e}}}}\) can change if we wiggle input x by multiplying \(\color {green}{\Delta }\color{blue}{(x)}\) by \(\color {green}{\Sigma }\color{blue}{(x)}\). For instance, consider program x + x and the following type derivations:

Then, we know that (1) x can change at most by 3, and (2) the expression is 2-sensitive in x, therefore the result can change at most by 6.

–

Rule s-lam typechecks sensitivity functions. As the body of the lambda has x as a free variable, the relational distance environment \(\color {green}{\Delta }\) is extended with distance \(\color {green}{d}\) obtained from the type annotation on the argument. For example, consider program

and its type derivation:

The program is a sensitivity lambda that takes as argument a real with an allowed variation of at most 2 and has a latent contextual effect of 2x.

–

Rule p-lam is defined analogously to s-lam, except that its body is a privacy term, therefore it is typechecked using the privacy type system, explained later.

–

Rule s-app deals with sensitivity applications. Note that, from the type of the function, we know that \(\color {green}{d}\) is an upper bound on the allowed argument variation, therefore, we require that the dot product between the relational distance environment and the sensitivity effect of the argument be less or equal than \(\color {green}{d}\). Intuitively, as \(\color {green}{\Delta }\) represents how much the input can change, and \(\color {green}{\Sigma _{2}}\) represent the sensitivity of variables used in the argument, \(\color {green}{\Delta }\color{blue}{ \mathord {\cdot}} \color {green}{\Sigma _{2}}\) represents how much the argument can change. For example, consider program

and its type derivation:

The outermost lambda allows a maximum variation of 1 in its argument y. The inner lambda allows a maximum variation of 2 on its argument and its being applied to 2*y. As 2*y is 2-sensitive on y and y can wiggle at most by 1, then we know that the argument is going to wiggle at most by 2, which matches the maximum permitted argument variation. If the argument were 3*y, then the program would not typecheck, as the argument of the application could wiggle at most by 1*3 = 3.

–

Rule s-case typechecks subterms \(\color {green}{e_{2}}\) and \(\color {green}{e_{3}}\) by extending the relational distance environment with a sound bound for x and y, respectively. For x (respectively, y), we use the dot product between the relational distance on all variables in scope \(\color {green}{\Delta }\) and the cost of using \(\color {green}{e_{1}}\): the cost \(\color {green}{Sigma_{1}}\) of reducing the expression, plus the latent cost of using its subterm \(\color {green}{Sigma_{11}}\) (respectively, \(\color {green}{Sigma_{12}}\)). For example, consider the type derivation of Example 4.4 given, e.g. relational distance environment x + 2b:

The relational distance for x₁ on the first branch is computed as the dot product between the maximum distance of all variables in scope, x + 2b, and the cost of using \({ {{{\color {green}e}}}}\) if it were an inl expression, i.e., \(\color{blue}{ ((x+2b)\mathord {\cdotp }(\infty x+b)) = \infty}\). Analogously, the bound for x₂ on the second branch is computed as \(\color{blue}{ ((x+2b)\mathord {\cdotp }(x+b)) = 3}\).

–

In Ruleuntup, as expression \(\color {green}{e_{2}}\) has in scope new variables x₁ and x₂, the relational distance environment \(\color {green}{\Delta }\) is extended accordingly. The relational distance for x₁ is computed as the dot product between the relational distance environment \(\color {green}{\Delta }\) and the cost \(\color {green}{\Sigma _{1} \color{blue}{+} \color {green}{\Sigma _{1 1}}}\) of accessing the first component (we proceed similarly with x₂). For instance, consider the type derivation of Example 4.2 given some arbitrary relational distance environment 2x + 3y:

The relational distance for x₁ is computed as the dot product between the relational distance of all variables in scope 2x + 3y and the effect of using the left component of the pair 2x + 0y, i.e., \(\color{blue}{ (2x+3y) \mathord {\cdotp } (2x+0y) = 4}\). Similarly, the bound of x₂ is computed as \(\color{blue}{ (2x+3y) \mathord {\cdotp } (y+0x) = 3.}\)

Subtyping is extended accordingly and presented in Figure 12. Parameterized relational distances on function types are contravariant, and subtyping for privacy function types relies on the definition of subtyping for privacy environment also defined in Figure 12, where \(\color {red}\bullet\) is an operator to close privacy environments defined below:

Fig. 12.

Privacy type system. The type system of the privacy part of the language is presented in Figure 13. The judgment

says that privacy term \(\color {red}{\mathsf {e}}\) has type τ and ambient privacy effect \({ {{{\color {red}\Sigma }}}}\) under type environment Γ and relational distance environment \(\color {green}{\Delta }\).

Fig. 13.

–

Rule return uses the type system of the sensitivity language to typecheck its subexpression \({ {{{\color {green}e}}}}\). Operationally, \(\color {red}{\texttt {return}}\) constructs a point-distribution, and any sensitive variables in the subexpression \({ {{{\color {green}e}}}}\) will have their privacy violated, i.e., privacy cost ∞. Notice that \({ {{{\color {red}\infty }}}}\) corresponds to the pair

. The resulting ambient privacy effect is computed by lifting to infinity the ambient effect of the subexpression as well as all free variable in \({ {{{\color {green}e}}}}\). The operator that lifts to \({ {{{\color {red}\infty }}}}\) free variables is written FP^∞ and defined in Figure 13. As we pay infinity for every free variable in \({ {{{\color {green}e}}}}\), we remove those variables from the reported type τ using the sensitivity environment substitution operator defined in Figure 11. For instance, consider the following type derivation:

The resulting type and effect environment is computed by paying in advance for the free variables in scope: The type \(\color{blue}{ [\varnothing /y](2x+y) = 2x}\) is computed by erasing the free variables, and the effect environment \({ {{{\color {red}\infty }}}y}\) is computed by lifting the free variables to infinity.

–

Rule bind typechecks both subexpressions using the type system for the privacy language, as they are privacy expressions. To typecheck \({ {{{\color {red}{\mathsf {e}_{2}}},} }}\) we extend type environment with variable x, therefore the relational distance environment \(\color {green}{\Delta }\) is also extended. We extend \(\color {green}{\Delta }\) with \({ {{{\color {green}0}}}\color{blue}{x,}}\) as the value bound to x is no longer considered sensitive—it has been declassified and can be used without restriction. Finally, as x is out of scope, we remove it from τ₂ and from the resulting ambient privacy effect. For instance, consider the type derivation of program

, similar to the example presented in Section 6.2, given an arbitrary relational distance environment x.

Each laplace call has an effect environment of \(\color{blue}{ (}\color {red}{\epsilon _{1}}\color{blue}{,0)x}\) and \(\color{blue}{ (}\color{red}{\epsilon _{2}}\color{blue}{,0)x}\), respectively. The \({ {{{\color {red}{\texttt {return}}}}}}\) subexpression lifts to infinite the privacy of variables y and z, but to typecheck the innermost bind expression the privacy on z is dropped:

. Then, to typecheck the outermost bind expression, now the privacy on y is dropped getting a final effect environment of

–

Rule p-case is similar to rules-case. Here, we lift the ambient sensitivity effect of the sum expression to infinity, i.e., we pay infinity for all non-zero-sensitive variables used in \(\color {green}{e_{1}}\). For the additional cost of each branch, we compute the join between the cost of each branch by substituting each binder by their appropriated cost:

for the first branch and

for the second. Note that we do not use

and

as we do in rules-case, because we are already lifting to infinity (or paying for) every cost associated with \(\color {green}{Sigma_{1}}\). For example, consider the following type derivation:

Note that the variation bound of x₁ is computed as \(\color{blue}{ (x+2y)\mathord {\cdotp }(x+y) = 3}\) and that of x₂ as \(\color{blue}{ (x+2y)\mathord {\cdotp }(x) = 1}\). As

, and

, then the resulting effect environment is

–

Rule p-app uses the sensitivity type system to typecheck both subterms. The first subterm has to be typed as a privacy function. Just as s-app, it checks that the sensitivity cost of the argument is bounded by \(\color {green}{d}\) by computing the dot operation \(\color {green}{\Delta } \color{blue}{\mathord {\cdot }} \color {green}{\Sigma _{2}}\) between relational distance environment \(\color {green}{\Delta }\) and sensitivity environment \(\color {green}{\Sigma _{2}}\). The resulting ambient privacy effect is computed as the lift to infinite of the ambient sensitivity effect of \(\color {green}{e_{1}}\), plus the latent contextual effect of the privacy function, where we substitute \(\color {green}{\Sigma _{2}}\) by x. Similarly to rules-app, rulep-app also enforces that the relational distance of the argument is bounded by \(\color {green}{d}\), i.e., \(\color {green}{\Delta } \color{blue}{\mathord {\cdot }} \color {green}{\Sigma _{2}} \color{blue}{\le} \color {green}{d}\). For instance, consider the following type derivation:

The resulting effect environment is computed as

, which is equivalent to \(\color {red}{\infty }\color{blue}{y}\). If y does not wiggle, then the ambient privacy effect will be zero. If the relational distance environment for y were 3, then this program would be ill-typed, since \(\color{blue}{ 3y\mathord {\cdotp }2y \not\le 4}\).

7.2 \(\lambda _{\text{{J}}}\): Type Safety

Type safety is defined in the same line of Section 5.2. To establish type safety of the privacy language, we define a non-deterministic sampling big-step semantics of privacy expressions; see Figure 14. We naturally extend the type safety logical relations of Figure 8 to support for both sensitivity and privacy lambdas and privacy expressions as shown in Figure 15. The fundamental property of the type safety logical relation is defined similarly to Proposition 5.1, but now accounting for relational distance environments and expressions:

Fig. 14.

Fig. 15.

Proposition 7.1 (Fundamental Property of the Type Safety Logical Relation).

Let \(\color{blue}{ \Gamma ;}\color {green}{\Delta }\ \color{blue}{\vdash}\ \color {green}{e}\ \color{blue}{: \tau ;}\color {green}{\Sigma }\), and \(\color{blue}{ \gamma \in {\mathcal {G}}[\![ \Gamma ]\!]}\). Then \(\color{blue}{ \gamma\ \vdash}\ \color{green}{e}\ \color{blue}{\in\ {\mathcal {E}}[\![ \tau /\Gamma ]\!]}\).

Let \(\color{blue}{ \Gamma ;}\color {green}{\Delta }\ \color{blue}{\vdash}\ \color {red}{\mathsf e}\ \color{blue}{: \tau ;}\color {red}{\Sigma }\), and \(\color{blue}{ \gamma \in {\mathcal {G}}[\![ \Gamma ]\!]}\). Then \(\color{blue}{ \gamma\ \vdash}\ \color{red}{\mathsf e}\ \color{blue}{\in\ {\mathcal {E}}[\![ \tau /\Gamma ]\!]}\).

Finally, type safety for closed terms is just a corollary of the fundamental property above:

Corollary 7.1 (Type Safety and Normalization of \(\lambda _{\text{{J}}}\))

Let \(\color{blue}{ \vdash} \color {green}{e}\ \color{blue}{: \tau ; \varnothing}\), then \(\color{blue}{ \vdash}\ \color {green}{e}\ \color{blue}{\Downarrow}\ \color {green}{v}\), and \(\color{blue}{ \vdash}\ \color {green}{v}\ \color{blue}{: \tau ^{\prime };\varnothing}\), where τ′ < : τ.

Let \(\color{blue}{ \vdash}\ \color {red}{\mathsf {e}}\ \color{blue}{: \tau ; \varnothing}\), then \(\color{blue}{ \vdash}\ \color {red}{\mathsf {e}}\ \color{blue}{\Downarrow}\ \color {green}{v}\), and \(\color{blue}{ \vdash {{{\color {green}v}}}: \tau ^{\prime };\varnothing}\), where τ′ < : τ.

7.3 Soundness of \(\lambda _{\text{ {J}}}\): Metric Preservation

This section establishes the soundness of \(\lambda _{\text{ {J}}}\), named metric preservation. Metric preservation for \(\lambda _{\text{{J}}}\) extends the notion of metric preservation of Sax. In addition to reasoning about sensitivity terms, given a privacy term with free variables, we can reason about the achieved privacy level when closing the privacy term under different (but related) environments.

Contrary to Sax, we establish soundness for \(\lambda _{\text{ {J}}}\) using a step-indexed logical relation [3]. Although \(\lambda _{\text{{J}}}\) is a strongly normalizing language, step indexing is still required to prove the bind case of the fundamental property of the logical relation.

Probabilistic Semantics.

A first step to define the soundness property of \(\lambda _{\mathsf {J}}\) is to endow privacy expressions with a probabilistic semantics. An important observation here is that even though to match their traditional (theoretical) presentation, we have introduced the Laplacian and Gaussian mechanisms as sampling from the (uncountable) set of real numbers; for the formal account of the language, we consider discrete versions thereof over the set of integers [22]. This discretization is not only a natural but also a necessary requirement for any implementation of the language (on “finite” computers), since it is well-known that the naïve use of finite-precision approximations may result in fatal privacy breaches [43]. (However, for the sake of uniformity, in the rest of the presentation, we refer to these mechanisms—at the type level—as operating over the set of real numbers.) Therefore, privacy expressions in \(\lambda _{\mathsf {J}}\) sample values only from discrete distributions and can be interpreted as discrete distributions over values.

The probabilistic semantics of a privacy expression \({\color {red}{\mathsf e}}\) is formally defined in Figure 16. Judgment \(\color{blue}{ \gamma \vdash {{{\color {red}{\mathsf e}}}} \Downarrow ^{k} D_{}}\) denotes that privacy expression \({\color {red}{\mathsf e}}\) reduces to distribution D within k steps; the probability that the privacy configuration \(\color {blue}{ \gamma \vdash {\color {red}{\mathsf e}}}\) reduces to value \({\color {green}v}\) is then computed as \(\color{blue}{ D_{}({\color {green}v})}\). We encode discrete distributions D as probability mass functions(PMF), i.e., a discrete distribution over A is modeled as an element of the set \(\color{blue}{ {\mathcal {D}}(A) = \lbrace f :A \rightarrow [0,1] \mid \sum _{a \in A} f(a) \,=\, 1\rbrace}\).

Fig. 16.

Let us briefly explain the set of rules in Figure 16. For simplicity, we omit the underlying step indices. The probabilistic semantics of \({\color{blue}{ \color {red}{\mathsf return}\ {\color {green}e}}}\) assigns probability 1 to the (necessarily unique) value to which expression \({\color {green}e}\) reduces. The probabilistic semantics of a bind \(\color{blue}{ y: \tau _{1} \leftarrow \color{red}{\mathsf {e_1}};\color{red}{\mathsf {e_2}}}\) operates as follows: To compute the probability that it assigns to x, it ranges over the set of values in the support of D₁ denoted as \(\color{blue}{ \mathtt {Sup}(D_{1})}\), i.e., the set of values \(\color{blue}{\color {green}v}\) such that \(\color{blue}{ D_{1}({\color {green}v})\gt 0}\), and for each \(\color{blue}{ {{{\color {green}{v_i}}}} \in \mathtt {Sup}(D_{1})}\) it sums the product between the probability that \(\color {red}{\mathsf {e}_{1}}\) reduces to \({\color {green}{v_i}}\) with the probability that \({\color {red}{\mathsf {e}_{2}}}\) reduces to x in an extended environment where y is bound to \({\color {green}{v_i}}\). The discrete Gauss distribution with mean μ and scale σ² assigns probability proportional to \(\color{blue}{ e^{\frac {-(x - \mu)^{2}}{2\sigma ^{2}}}}\) to each integer x. The probabilistic semantics of a privacy application is defined as the probabilistic semantics of the body of the resulting privacy closure in an extended environment where the closure formal argument is bound to the value of the real argument. Finally, the probabilistic semantics of a \({\color {green}{\mathsf {case}}}\) term is simply the probabilistic semantics of the corresponding branch in an extended environment with the corresponding association for the branch binder variable.

We consider a step-indexed semantics to establish the language metatheory. In particular, the metric preservation theorem is proved by induction on the step index of the logical relation. However, step indices might not interact very well with the bind rule: When reducing \(\color{blue}{ y: \tau _{1} \leftarrow} \color{red}{\mathsf{e_1}} \color{blue};\color{red}{\mathsf{e_2}}\), the reduction of \({\color {red}{\mathsf {e_2}}}\) requires a possibly different number of steps (k_i) for each value (\({\color {green}{v_i}}\)) to which \({\color {red}{\mathsf {e_1}}}\) reduces. This set of steps could in principle be unbounded, making \(\mathsf {max}_{i}~k_i\) undefined and thus rendering the semantics partial. However, this is not an issue for our technical development, because all formal results are concerned with programs that reach the distribution of final values within a finite number of steps, only.

For convenience throughout this section, we also introduce D(S) to denote the probability of observing S in D, computed as \({\color{blue}{ \sum _{{{{\color {green}v}}} \in S} D_{}({\color {green}v})}}\). Also, we define

as \({\color{blue}{ D_{}({\color {green}v)}}}\) if

for some k′ ≤ k (undefined otherwise).

Logical relation.

The logical relations for sensitivity computations, privacy computations, values, and environments are mutually recursive and presented in Figure 17.

Fig. 17.

Note that each logical relation is also indexed by a relational distance type that now accounts for sensitivity and privacy lambdas:

Note that similarly to sensitivity environments, privacy environments \({\color {red}\Sigma }\) are also extended to include partially instantiated data, for instance, \({\color{blue}{\color {red}{p}x + {\color {red}{p^{\prime }}}}}\). Notation \({ \color{blue}{ ({{{\color {green}{v_1}}}},{{{\color {green}{v_2}}}}) \in {\mathcal {V}}_{{{{\color {green}d}}}}^{k}[\![ \sigma]\!]}}\) indicates that value \({\color {green}{v_1}}\) is related to \({\color {green}{v_2}}\) at type σ and distance \({\color {green}d}\) for k steps.

The sensitivity parts of the logical relations are defined analogously to Figure 8 with the addition of a step index k. We only present relevant changes:

–

Two sensitivity closures are also related if, given related inputs, they produce related computations. Specifically, first the environments have to be related for any step j < k. Second, inputs \(\color{green}{v^{\prime }_{1}}\) and \(\color{green}{v^{\prime }_{2}}\) have to be related at distance \({\color {green}{d^{\prime \prime }}}\) not greater than \({\color {green}d}\)’ and for j steps. Finally, the bodies of the functions in extended environments have to be related computations for j steps.

–

Similarly to sensitivity closures, two privacy closures are related if they produce related computations when applied to related inputs. The computations are related at privacy

. Note that we lift \({\color {green}d}\) to infinite, because we cannot record relational distances as a privacy result. In addition to that, we also pay the latent contextual effect of the function instantiated to \({\color {green}{d^{\prime \prime }}x}\), i.e.,

Two sensitivity configurations are related computations at type σ and distance \({\color {green}d}\) for k steps, noted \(\color{blue}{ (\gamma _{1} \vdash {{{\color {green}{e_1}}}}, \gamma _{2} \vdash {{{\color {green}{e_2}}}}) \in {\mathcal {E}}_{{{{\color {green}d}}}}^{k}[\![ \sigma ]\!]}\), when for any j < k, if the first configuration reduces in j steps to a value, then the second configuration also reduces to a value in any number of steps, and these values are related for the remaining k − j steps at type σ and distance \({ {{{\color {green}d}}}}\). We write \(\color{blue}{ \gamma \vdash {\color {green}e} \Downarrow ^{k} {\color {green}v}}\) to say that the configuration \(\color{blue}{ \gamma \vdash {\color {green}e}}\) reduces to value \(\color{green}{v}\) in k steps.

We now turn to the definition of related privacy computations. Notation

indicates that two privacy configurations are related computations at type σ and privacy \(\color{blue}{{\color{red}p}=({\color{red}\epsilon},{\color {red}\delta})}\) for k steps. Two privacy configurations are related when, for any j < k, if the first privacy configuration reduces in j steps to a distribution, then the second configuration also reduces to a distribution in any number of steps, and the probability of observing S in the first distribution is no greater than \(\color{blue}{e^{\color{red}\epsilon}}\) times the probability of observing S in the second distribution, plus \(\color{red}{\delta}\).

Metric Preservation. Armed with these logical relations, we can establish the notion of type soundness for \(\lambda _{\text{{J}}}\) and prove the fundamental property.

where

. The theorem says that if a sensitivity term (respectively, privacy term) is well-typed, then for any number of steps k, valid relational distance environment \(\color{green}{\Delta ^{\prime }}\) (not greater than \(\color{green}{\Delta}\)) and value environments (γ₁, γ₂), the two configurations are related computations at distance type \({\color {green}{\Delta ^{\prime }}\color{blue}{(\tau)}}\) (closing all free sensitivity or privacy variables) and at relational distance \(\color{green}{ {\Delta ^{\prime }}\mathord {\cdotp }{\Sigma }}\) (respectively, Δ′\({\color{green}{\varDelta^{\prime}}\color{red}{\cdot\varSigma}}\)\(\Sigma\)). Note that as \(\color{blue}{ dom({\color {green}\Sigma}) \subseteq dom({\color {green}{\Delta ^{\prime }}}) = dom({\color {green}\Delta })}\) and \(\color{blue}{dom(}\color {red} {\Sigma }\color{blue}{) \subseteq dom(}{\color {green}\Delta ^{\prime }\color{blue}{) = dom(}{\color {green}\Delta }\color{blue}{)}}\), then \({{\color {green}{\Delta ^{\prime }}} \color{blue}{\mathord {\cdotp }}{\color {green}\Sigma} \color{blue}{\in {\mathbb {R}}^{\infty }}}\) and

To prove the fundamental property, we rely on the following three lemmas that connect types, sensitivity, and privacy environments from the type system, with distances and privacy costs from the logical relations:

Lemma 7.3.

If \(\color{green}{ {\Delta} \mathord {\cdotp } {\Sigma} \color{blue}= {d}}\) and \(\color{blue}{ x \notin dom({\color {green}\Sigma})\cup dom({\color {green}\Delta })}\), then \({\color{blue}{ {\color {green}\Delta} \mathord {\cdotp } ([{\color {green}\Sigma}/x]{\color {green}{\Sigma ^{\prime }}}) = ({\color {green}\Delta } + {\color {green}d}x) \mathord {\cdotp } {\color {green}{\Sigma ^{\prime }}.}}}\)

Lemma 7.5.

Let \(\color{green}{{\Delta }\mathord {\cdotp }{\Sigma } \color{blue} = {\color {green}{d}}}\) and \({\color{blue}{ x \notin dom({\color {green}\Sigma })\cup dom({\color {green}\Delta })}}\), then \({\color{blue}{ {\color {green}\Delta }\color {blue}([{\color {green}\Sigma }/x]\tau) = {\color {green}d}x({\color {green}\Delta }(\tau)).}}\)

We can also derive from the fundamental property some corollaries about closed terms.

Corollary 7.6 (FP for Closed Sensitivity Terms).

Corollary 7.7 (FP for Closed Privacy Terms).

In addition to sensitivity type soundness at base types (Prop 5.3), from the fundamental property, we can now establish privacy type soundness at base types:

Theorem 7.8 (Privacy Type Soundness at Base Types).

Finally, we observe that our technical development relies on the specific variant of differential privacy considered in restricted places: the bind case of the soundness theorem (Theorem 7.2); the definition of subtyping for privacy costs, specially the base case (Figure 23); operations over privacies p, such as dot product, addition, meet, join, lifting (Figures 25 and 26); monotonicity of meet and join of privacies w.r.t. subtyping (Reference [54, Lemmas C.7 and C.8]); monotonicity of dot product w.r.t. privacy ordering (Reference [54, Lemma C.13]); distributivity of dot product w.r.t. substitution (Lemma 7.4); weakening of related private computations (Reference [54, Lemma C.10]). The remaining definitions and lemmas are independent and could be reused as such to adapt this work to deal with other variants of differential privacy.

8 FROM \(\lambda \mathsf{_J}\) TO JAZZ

The full prototype implementation of Jazz includes several extensions to the core language \(\lambda \mathsf{_J}\) and address the non-determinism of multiplicative and additive products by using type annotations.

Type Polymorphism. Jazz implements System F (universal quantification over well-kinded types) and parametric polymorphism over all compound types, including vector/matrix schemas, allowing all data objects and functions in the language to be fully generic. This feature requires the use of type-level quantifiers and application.

Value Dependency. Jazz supports type-level dependency on values through singleton types—an approach we borrow directly from DFuzz [34]. This allows differentially private algorithms to be verified with respect to privacy parameters that are not fixed and instead are function arguments.

More specifically, singleton types [33] are a technique for supporting limited forms of value dependency that builds on standard (System-F-style) polymorphic type system features and an enriched kind system. In a type system with native support for dependent types, a dependent function with a real-valued argument is written \(\color{blue}{(x:\mathbb {R}) \rightarrow {\tau }}\) where the return type τ can use x to refer symbolically to the eventual runtime value of x. In a singleton type encoding of dependent types, the same function is written \(\color{blue}{x :\mathbb {R}[\hat{x}]\rightarrow {\tau}}\), where the return type τ can use \(\color{blue}{ \hat{x}}\) to refer symbolically to the eventual runtime value of x. In essence, there is still a syntactic split between term-level variables (x) and type-level variables (\(\color{blue}{ \hat{x}}\)), and the type declaration \(\color{blue}{ ( x:\mathbb{R}[\hat{x}])}\) links them, so \(\color{blue}{ \hat{x}}\) is the type-level proxy for the term-level variable x.

Let-binding Sensitivity Terms in Privacy Contexts. As described in Section 6.2, we implement latent sensitivity via local bindings in the privacy language. We implement this feature using an environment \({\color {green}\Phi }\) to delay the “payments” of a value’s sensitivity, which fulfills the same role as the boxed type introduced by Near et al. [46]. Unlike boxed types, this feature requires no additional annotations—sensitivity is inferred automatically. The complete type systems that includes \({\color {green}\Phi }\) are presented in Appendix A, Figures 19 and 20.

Context Polymorphism. When implementing flexible primitives in Jazz, it becomes convenient to abstract over latent contextual effects. We label this form of abstraction as context polymorphism. This form of polymorphism in our language is what enables us to give a single generalized type to the primitives gauss and seqloop (a looping combinator that uses sequential composition, further discussed in Section 9.2). Because Jazz implements quantification over latent contextual effects, it is possible to afford privacy in type signatures to closed-over variables involved in the differentially private computation. This feature requires the use of type-level quantifiers, application, substitution, and annotations for context schemas. A context schema is an angle bracket enclosed list of variables. Angle bracket context schemas in Jazz denote the set of variables that we care about preserving privacy for and are used in the introduction forms for sums, pairs, and functions, as well as in type-level application. For example, gauss <x> (x + y * y) is 1-sensitive in x, and bumps y to infinity privacy. This demonstrates the use of context polymorphism to indicate which variables we care about preserving privacy for. The use of seqloop in Section 9 provides another example of context polymorphism in action. Note that context-polymorphic functions in Jazz are required to be primitives.

Variants of Differential Privacy. In addition to (ϵ, δ)-differential privacy, Jazz supports zero-concentrated differential privacy [21] and Rényi differential privacy [44] and has built-in constructs for mixing the variants. Each variant has different privacy parameters and rules for composition, but all of them follow the same basic pattern as (ϵ, δ)-differential privacy. For example, we can give the Gaussian mechanism the following types for Rényi differential privacy (RDP; privacy parameters α and ϵ) and zero-concentrated differential privacy (zCDP; privacy parameter ρ):

Since RDP and zCDP guarantees can be converted to (ϵ, δ) guarantees, Jazz provides constructs for converting between variants. For example, the following code uses the Gaussian mechanism twice, each time satisfying (20, 0.25)-RDP. By sequential composition, the total cost is (20, 0.5)-RDP. The program then converts this guarantee to (ϵ, δ)-differential privacy, using δ = 10^{− 5}.

Jazz automatically finds the privacy cost of this function, in (ϵ, δ)-differential privacy, by performing the appropriate conversion. The ability to mix privacy variants in Jazz makes it easy to frame the privacy guarantee of any program in terms of (ϵ, δ) privacy cost, allowing privacy costs to be directly compared. In addition, it enables embedding iterative RDP and zCDP algorithms inside of (ϵ, δ) programs, allowing these programs to take advantage of the improved composition properties RDP and zCDP provide. This approach—leveraging recent variants for composition, but reporting privacy costs in terms of ϵ and δ—is extremely common in recent work on differentially private machine learning [2]. We make extensive use of variant-mixing in our case studies, described next.

9 Implementation & Case Studies

Jazz enables programmers to implement and verify largely the same set of applications as Duet, but Jazz empowers the programmer to construct these applications in simpler ways, e.g., via composition of reusable library functions. This is possible, because Jazz gives types to many privacy functions and looping combinators that required custom typing rules in Duet. Our case studies demonstrate that instead of encoding these applications as a single monolithic function, Jazz enables their implementation through composition of multiple helper functions and their verification without the use of custom typing rules.

In particular, we highlight two important features of Jazz that enable refactoring programs to reuse library functions:

•

Jazz gives types to privacy primitives and looping combinators that are not typeable in Duet, enabling privacy functions to be parameterized by these components (see Section 9.2).

•

Jazz’s privacy functions allow sensitivity arguments with arbitrary sensitivity bounds, enabling code reuse in more places than Duet’s sensitivity-1 privacy functions (see Section 9.3).

In addition, we select realistic algorithms previously verified using other systems to demonstrate that Jazz maintains the capabilities of previous work.

A summary of our case study programs appears in Table 3. We present two new representative case study algorithms we have implemented and verified using Jazz: the MWEM algorithm [38] for a workload of linear queries, and a recently proposed algorithm for differentially private deep learning with adaptive clipping [53]. In both case studies, privacy mechanisms (e.g., laplace and exponential) and looping constructs (e.g., aloop—advanced composition) can be expressed with regular functions, provided in a library of primitives. We mark these two case studies with a ^* in Table 3 and describe them in detail later in this section. The other case study programs are available in our source code repository.

Table 3.

			Typecheck
Technique	Ref.	Privacy Concept	Time
Machine Learning Algorithms
Noisy Gradient Descent	[18]	Composition	4.1 ms
Gradient Descent w/ Output Perturbation	[57]	Parallel comp. (sens.)	4.2 ms
Noisy Frank-Wolfe	[52]	Exponential mechanism	5.9 ms
Variations on Gradient Descent
Minibatching	[18]	Privacy amplification	5.5 ms
Parallel-composition minibatching	—	Parallel composition	5.9 ms
Gradient clipping	[2]	Sensitivity bounds	4.5 ms
Adaptive gradient clipping^* (Section 9.3)	[53]	Advanced variants	5.6 ms
Preprocessing & Deployment
Hyperparameter tuning	[23]	Exponential mechanism	6.9 ms
Adaptive clipping	—	Sparse Vector Technique	7.7 ms
Z-Score normalization	[1]	Composition	6.9 ms
Algorithms for Linear Queries
Multiplicative Weights (MWEM)^* (Section 9.2)	[38]	Exponential mechanism	5.2 ms

Table 3. List of Case Studies Included with the Jazz Implementation

Case studies marked with a ^* are described in detail in this section.

9.1 Implementation

We have implemented a prototype of the Jazz typechecker in Haskell and used it to verify the case studies from Table 3. The prototype implementation is available on GitHub.⁷ Table 3 lists the time needed to typecheck each of the case studies; our typechecker takes just a few milliseconds for each one.

Type inference & annotations. Our prototype implements type inference for both Sax and Jazz. Type annotations for sensitivity and privacy are required for inputs at top-level functions and lambda-expressions, but no additional annotations are required for function outputs or elsewhere in the program. The case studies described later in this section have been typeset for readability, but are otherwise identical to the input for our prototype; in particular, the actual examples typechecked by our prototype have the same annotations as the examples in this section, except for the input sensitivity annotations on inputs to top-level functions. The types given for primitives in this section are also drawn directly from our implementation.

Constraint solving. As described earlier, and detailed in Section 10, prior work has made extensive use of SMT solvers for the equations over real expressions that arise in type inference for sensitivity. Because SMT solvers are incomplete for non-linear operations (like the logarithms and square roots used in advanced composition), our implementation does not follow the same path.

Instead, we implement a custom solver for inequalities over symbolic real expressions, based on the solver from Duet [46]. Our custom solver is based on a simple decidable (but incomplete) theory; it supports logarithms, square roots, and polynomial formulas over real numbers. The solver is transparent to the programmer and produces readable output expressions for the privacy costs in our case studies.

9.2 MWEM

The MWEM algorithm [38] generates differentially private synthetic data approximating the target sensitive data by iteratively optimizing the accuracy of a set of workload queries on the synthetic data. In each iteration, the algorithm uses the exponential mechanism to pick a query from the workload for which the synthetic data produces an inaccurate result, uses the Laplace mechanism to run that query on the real data, and uses the result to update the synthetic data via the multiplicative weights update rule. The Jazz program shown below implements the MWEM algorithm. Its inputs are a sensitive dataset X over a domain D, a workload Q of linear queries, the number of iterations to be performed k, the privacy parameter ϵ, initial synthetic data Y₀ (n times the uniform distribution over D), and the dimensions of the input data set (matrix) m rows by n columns. The algorithm performs k iterations, invoking laplace and exponential in each iteration. The privacy parameter for each invocation is \({\color {red}\epsilon} /{\color {blue}{2k}}\), yielding a total privacy cost of ϵ. We omit the sensitivity annotations on the function’s inputs for readability.

The Jazz typechecker produces the following type for this implementation, indicating that the algorithm satisfies ϵ-differential privacy. Note the homogenous matrix type notation used here is \(\color{blue}{\mathbb {M} {\color {blue} {[m,n] \tau}}}\), where m denotes the number of rows, n the number of columns, and τ the type of each entry. < X > is the context schema argument for type-application of loop and indicates the program variable we want to preserve privacy for in this expression.

On an average of 10 runs, it takes the Jazz typechecker 5.2 ms to produce this type for the MWEM algorithm.

Beyond Duet. This example demonstrates Jazz’s ability to define algorithms in terms of library functions and to parameterize algorithms by the choice of component pieces. In this case, we define MWEM in terms of a generic looping privacy combinator loop; the caller of MWEM can specify looping combinators based on sequential composition, advanced composition, or even a custom combinator. Jazz similarly allows functions like MWEM to be parameterized by the choice of basic mechanism (e.g., laplace vs. gauss) with the appropriate privacy function type. In both cases, the relevant functions can be pulled from libraries or defined by the programmer.

This kind of modularity is impossible in Duet. Functions cannot be parameterized by basic privacy mechanisms or looping combinators, because it is not possible to write their types in Duet.

Primitives used. This case study demonstrates the composition of a complex iterative algorithm from basic privacy mechanisms encoded as Jazz primitives (e.g., laplace and exponential) and privacy combinators (e.g., seqloop, which implements looping with sequential composition for privacy). These primitives with types shown above would require explicit typing rules in the core Duet language. In Jazz, they can be given regular types, as shown below:

To typecheck MWEM, the privacy closure rule in Jazz creates a function type for the \(\color{red}{\color {cbsafeCBright}{\color {cbsafeCBright}\lambda ^{\!^{_{\mathrm{p}}}}}}\) that has a privacy effect for the body of \(\color{blue}{ \frac{\epsilon }{k}}\) because of the two uses of mechanisms that give \(\color{blue}{ \frac{\epsilon }{2k}}\) differential privacy. If we pass seqloop as the looping combinator loop, then this privacy effect is multiplied by the loop iteration number k as a result of the type of

and the type rule for privacy function application. Finally, the new let rule for the Jazz privacy fragment that tracks latent contextual sensitivities is used in the let-binding for q_i to precompute an intermediate value that is used multiple times without the need for explicit boxing.

9.3 Differentially Private Deep Learning with Adaptive Clipping

The current state-of-the-art in differentially private machine learning is noisy gradient descent [2]: At each iteration of training, compute the gradient, clip the gradient to have bounded L2 norm, and add noise in proportion to the clipping parameter. The clipping parameter is typically treated as a hyperparameter set by the analyst before training.

Recent work by Thakkar et al. [53] proposed an algorithm for adaptively determining the clipping parameter during training by adaptively improving the clipping parameter based on a differentially private estimate of the percentage of gradients clipped in each iteration.

In each iteration, the implementation computes the gradients for a batch of examples gs, clips each gradient using the current parameter C^t, and uses the Gaussian mechanism to compute a differentially private average gradient g_p. Then, the algorithm updates the clipping parameter for the next iteration C^t′ using \({{\color {red}{\mathsf{clipUpdate}}}}\), which computes a noisy count β of the number of gradients in gs that are clipped under the clipping parameter C^t and uses the count to update the parameter. The inputs to the algorithm are the training data X, the training labels y, the number of iterations k, and the target percentage of gradients remaining un-clipped γ. ϵ, δ, and δ′ are the privacy cost parameters. We omit the sensitivity annotations on top-level function inputs for readability. The Jazz typechecker typechecks DPAL in 5.6 ms (averaged over 10 runs).

Beyond Duet. In this case study, we implement a library function for the differentially private average (\({{\color{red}{\mathsf{DPMean}}}}\)) and use it in two places. This refactoring is not possible in Duet, because Duet’s privacy functions require all sensitive arguments to have a sensitivity of 1. In this algorithm, one of the uses of \({{\color{red}{\mathsf{DPMean}}}}\) in fact has data-dependent sensitivity (the sensitivity of \({{\color {green}{\text{mclip}}}\ {{{\color {blue}C^{t}}}}\ {\color {blue}{gs}}}\) is \(\color{blue}{ {{{\color {blue}C^{t}}}}}\)).

Practical implementations of algorithms like this one often rely on libraries of differentially private functions like \({{\color {red}{{\mathsf{DPMean}}}}}\) (e.g., the Opacus library for differentially private deep learning [58], or the OpenDP library for differentially private analytics [35]). By lifting the limitations of Duet’s privacy functions, Jazz makes it possible to implement and use such libraries.

Primitives used. This algorithm demonstrates the use of privacy combinators (e.g., \({{\color{red}{\mathsf{aloop}}}}\)) that can be specified as primitives in Jazz but require special typing rules in Duet:

This type for \({{\color{red}{\mathsf{aloop}}}}\) encodes the advanced composition theorem (introduced in Section 2). Our advanced composition combinator runs an (ϵ, δ)-differentially private function (\(\color{blue}{ {\mathcal {M}}}\)) representing the body of the loop k times, for a total privacy cost of \(\color{blue}{ (}\color {red}{2\epsilon (\sqrt {2k(\log (1/\delta ^{\prime }))}), k\delta + \delta ^{\prime }}\color{blue} {)}\) (a significant improvement over the sequential composition cost of \(\color{blue}{ (}\color {red}{k\epsilon , k\delta }\color{blue}{)}\)).

Our version of the adaptive clipping gradient descent algorithm uses advanced composition and (ϵ, δ)-differential privacy to demonstrate the encoding of \({\color {red}{\mathsf{aloop}}}\) as a regular function in Jazz. Our source code repository contains an alternative implementation that uses zero-concentrated differential privacy for improved composition and converts the privacy guarantee to (ϵ, δ)-differential privacy at the end of the algorithm.

10 Related Work

Verification techniques based on type systems.

There are two threads of prior work in type-system-based verification of differential privacy for high-level programs: those based on linear types and those based on relational refinement types. Reed and Pierce [49] proposed Fuzz, the first type system for differential privacy based on linear typing; its fundamental components are a linear type system with an indexed “scaling” modality !_s for tracking the sensitivity of programs and a monadic connective \(\color{blue}{ {\scriptstyle \bigcirc }}\) to model randomized computations. An s-sensitive function is encoded in Fuzz as a linear function with scaled domain !_sA⊸B and often notated A⊸_sB. An ϵ-differential privacy mechanism is represented as an ϵ-sensitive function with monadic return type as in \(\color{blue}{ A \multimap _{\epsilon } {\scriptstyle \bigcirc }B}\). DFuzz [34] extends Fuzz with dependent types to encode sensitivity and privacy bounds that depend on the values of function arguments. This allows, e.g., reasoning about the privacy of iterative algorithms whose privacy cost depend on the number of iterations. Fuzz and DFuzz can be characterized by strong support for higher-order programming and potential for automation via type inference. They support pure differential privacy but approximate, and any other recent variants of differential privacy fall out of their scope due to nonlinear scaling. Several recently proposed approaches allow a Fuzz-like analysis for (ϵ, δ)-differentially private programs: Azevedo de Amorim et al. [29] leverage a path construction and a Fuzz-like type system. Fuzzi [60] integrates a Fuzz-like type system with an expressive program logic. Fuzzi directly connects (automated) type-based proofs of composition for sensitivity and privacy properties with (manual) apRHL proofs for basic constructs such as sequential composition and the Laplace mechanism. In our approach, however, properties of basic mechanisms must be axiomatized. The Fuzzi system targets imperative programs and does not provide support for higher-order programming with privacy functions. Finally, Duet [46] proposes a two-language design with linear types for tracking sensitivity and privacy; crucial restrictions in the typing rules of the privacy language allow encoding advanced differential privacy variants such as approximate, Rényi, zero-concentrated, and truncated-concentrated differential privacy. In Duet privacy functions are n-ary and written \(\color{blue}{ ({{\color {blue}\tau _{1}}}@{{{\color {red}{p_1}}}},\ldots ,{{\color {blue}\tau _{n}}}@{{{\color {red}{p_n}}}}) \multimap ^{*} {{\color {blue}\tau }}}\) for privacy quantities p_i such as (ϵ_i, δ_i) in the case of approximate differential privacy.

Unlike previous works based on linear type systems, HOARe² [14] uses relational refinement types to encode arbitrary relational properties of programs, including differential privacy. In HOARe², an \({ {{{\color {green}s}}}}\)-sensitive function type is written \(\color{blue}{ \Pi s^{\prime }. \lbrace x \mathrel {:: } \tau _{1} \mathrel {|} {\mathfrak {D}} _{\tau _{1}}(x_{\vartriangleleft },x_{\vartriangleright }) \le s^{\prime }\rbrace \rightarrow \lbrace y \mathrel {:: } \tau _{2} \mathrel {|} {\mathcal {D}}_{\tau _{2}}(y_{\vartriangleleft },y_{\vartriangleright }) \le {{{\color {green}s}}} s^{\prime }\rbrace}\) where \(\color{blue}{ {\mathfrak {D}} _{\tau }}\) is a type-indexed distance metric, and x_⊲ and x_⊳ are explicit symbolic representations of the “left” and “right” executions of the program in support of encoding relational properties. To account for probabilistic private computations, HOARe² uses an indexed monad \(\color{blue}{ {\mathfrak {M}} _{\epsilon ,\delta }[\tau ]}\): The type of an (ϵ, δ)-differentially private function is written \(\color{blue}{ \lbrace x \mathrel {:: } \tau _{1} \mathrel {|} {\mathfrak {D}} _{\tau _{1}}(x_{\vartriangleleft },x_{\vartriangleright }) \le 1\rbrace \rightarrow {\mathfrak {M}} _{\epsilon ,\delta }[\lbrace y \mathrel {:: } \tau _{2} \mathrel {|} y_{\vartriangleleft } = y_{\vartriangleright }\rbrace ]}\). A limitation of this encoding is that a function of two arguments that provides different privacy bounds for each argument (as described in Section 3.3) will report a summed, global privacy bound, because the tracking of privacy occurs in a single global index to the privacy monad \(\color{blue}{ {\mathfrak {M}} _{\epsilon ,\delta }}\). Because privacy is proved as a relational property, rather than as a sensitivity/Lipschitz continuity property, HOARe² is also capable of placing relational distance bounds on arguments to functions. Regarding the implementation, both HOARe² and Jazz use dependent types to capture sizes. Regarding typechecking, even though some automation has been achieved [24], the automation relies heavily on what is achievable with SMT solvers and has limited application to programs that make generous use of compositional or higher-order programming techniques or metric-distance relationships between values at non-base types. SMT solvers are not complete for non-linear operations, which are common in complex differential privacy mechanisms. Consider, for example, the expression for privacy cost under advanced composition: \(\color{blue}{ \epsilon ^{\prime } = k\epsilon (e^\epsilon -1) + \epsilon \sqrt {2 k \ln (1/\delta ^{\prime })}}\); SMT solvers are not capable of proving universally quantified qualities between equations like these, which limits their ability to automate reasoning about privacy cost.

It is important to note that all of these type systems support some form of recursion. Adding any form of recursion in the formalism of Jazz would make the technical development even more complicated. We just focused on a small core that could illustrate the main novelties of the latent or contextual approach.

To conclude the overview about type-system-based verification techniques, we refer the reader to Table 4, comparing different aspects of the reviewed type systems.

Table 4.

Techniques based on couplings and program logics.

Approximate couplings [12] are a probabilistic abstraction that witnesses differential privacy properties of programs and have been successfully exploited for verification purposes. The relational Hoare logic apRHL [16] and its successors apRHL⁺ [15] and span-apRHL [50] internalize the compositional construction of such couplings and capture from pure and approximate differential privacy to more recent variants such as Rényi, zero-concentrated, and truncated-concentrated differential privacy. While compared to other methods, these program logics are rather expressive going beyond the composition of (a set of predefined) basic mechanisms, derivations in the logics involve complex quantitative reasoning, not always amenable to automation. Even though there has been a successful report on partial automation [11], e.g., the synthesis of (quantitative relational) loop invariants for iterative algorithms remains challenging. To synthesize couplings, Albarghouthi and Hsu [5] use an alternative approach based on constraint solving, which is highly amenable to automation; the approach is, however, confined to ϵ-DP. Finally, to verify programs that achieve (ϵ, δ)-DP composing basic mechanisms, Barthe et al. [13] use a customized program product construction and traditional (non-relational and non-probabilistic) Hoare logic augmented with mechanism-specific rules. In recent work, Barthe et al. [9] show that checking differential privacy for imperative programs is undecidable in general but present a reduction to a decidable fragment of first-order logic for a restricted class of programs. Barthe et al. [10] also show that checking accuracy bounds for differentially private programs is also undecidable (in general). A common limitation of all these approaches is that they are restricted to first-order imperative programs.

Techniques based on randomness alignment.

LightDP [59] and ShadowDP [55] take a third approach to verifying differential privacy based on randomness alignments. A randomness alignment is an injective function relating the randomness from one execution of a differentially private mechanism to a second execution of the same mechanism (i.e., \(\color{blue}{ {\mathcal {M}}(x)}\) outputs the same result with noise H as \(\color{blue}{ {\mathcal {M}}(x^{\prime })}\) outputs with noise f(H), where f is the randomness alignment). Both LightDP and ShadowDP are capable of verifying complex low-level mechanisms like the sparse vector technique in just a few seconds. However, both tools target a first-order imperative programming language and have limited support for higher-order programming.

Techniques based on testing.

Since differential privacy mechanisms are randomized, traditional methods of software testing do not apply. Two recent works by Bichsel et al. [19] and Ding et al. [30] address this challenge by automatically generating neighboring inputs for the mechanism being tested and sampling from their outputs many times to approximate their output distributions. For privacy mechanisms with major bugs, these tools are able to show that the approximated distributions do not satisfy the claimed differential privacy guarantee. Wilson et al. [56] have implemented a testing tool based on this approach in their open-source library. DPCheck [61] combines static analysis with instrumented concrete execution of the target program to detect bugs in even more complex algorithms.

Type systems and contextual information.

The technical device of contextual latent effects used in Sax and Jazz is related to prior approaches to expose information about captured variables in function types. Leroy [40] and Hannan et al. [37] use function types augmented with the set of captured variables, the former for tracking dangerous type variables for polymorphic generalization and the latter for lifetime analysis. Scherer and Hoffmann [51] introduce open closure types to track additional information about closed-over variables in first-class functions. An open closure type augments the traditional arrow type with a lexical environment of closed over variables, further decorated by a mapping characterizing the use of each variable. For instance, they formalize a system where the mapping marks each variable with a Boolean indicating whether the evaluation of the body of the function depends on the variable or not. An open closure type also includes the name of the function argument similarly decorated. For the considered system, they prove a non-interference property, which states that closing an open term with two valuations that coincide on used variables yield the same result, up to used variables. Function types in Sax and Jazz can be seen as specific cases of open closure types, in which the information attached to captured variables (and arguments) is not a Boolean value, but sensitivity and privacy information. Consequently, the type soundness result we establish is more general than noninterference; for instance, in Sax, for two valuations that are at a given distance apart, the distance between the results is bounded by the sensitivities of each variable. The technical development of contextual linear types shares many concerns with that of open closure types, notably regarding the proper handling of the typing environment—in which order matters—and of scoping. However, our soundness results rely on logical relations, while they adopt a more restricted technique, sufficient for the purpose of the simple type system tracking Boolean information.

As observed by Scherer and Hoffmann, using open closure types allows delaying the accounting of information flow into closures from abstraction time to application time. Likewise, contextual sensitivity in Sax delays sensitivity accounting to application time. We extend this principle to positive type constructors such as products and sums, generally deferring accounting to elimination forms; we believe this would likewise apply to the simpler information-flow control setting they study.

Recently, Bao et al. [8] use a similar context-annotation technique to track reachability information in types. For instance, they augment the type of reference cell with the variables in scope that alias the cell. This information is tracked on function types as well to track the reachability information from closures. In all these approaches, type information depends on variable names; this is quite different from dependent types, however, where types depend on arbitrary terms. The telescope nature of the typing environment is similar, but many of the deep challenges of dependent types do not manifest in this restricted setting.

Hoare Type Theory [45] supports specifying effectful, heap-manipulating computations by introducing Hoare-style pre/postconditions in types. Computation types include contexts of variables and heap locations to track footprints of logical assertions. In a similar vein as Leroy [40], the context of variables is not decorated with any information, so it is unclear whether one could handle contextual sensitivity and privacy in this approach.

Finally, it would be interesting to study if generic approaches such as coeffects [48] and graded modal types [47] can express delayed sensitivity and privacy tracking as developed here. These generic approaches use resource algebras (such as the semiring of natural numbers) for capturing modalities in types. To the best of our knowledge, these are not contextual: For instance, the arrow type is annotated with a scalar, label, and so on, not with a decorated environment as in open closure types and this work. Extending these generic approaches to support contextual information on all type constructors could bring the benefits of lazy accounting to a wide range of type-based quantitative program reasoning.

11 Conclusion

We have presented Jazz, a language and type system for differentially private programming with strong support for both higher-order programming and advanced variants of differential privacy. The key insight of our approach is latent contextual tracking of both privacy and sensitivity, which enables sum, product, and function types to describe their privacy effects—even for closure variables—making it possible to delay the payment of the effects until actual elimination, sometimes yielding advantages on the precision of the analysis and the annotation burden.

We have formalized a core subset of Jazz and proved its soundness using a step-indexed logical relation, following a novel strategy. Case studies demonstrate the ability to encode basic privacy mechanisms and privacy combinators as primitives in Jazz and to compose them to develop more complicated iterative differentially private algorithms.

Jazz extends the expressive power of systems like Fuzz [49] to advanced variants of differential privacy. Like Fuzz, it remains incapable of proving the correctness of basic privacy mechanisms like the Laplace mechanism. One interesting avenue for future work lies in combining Jazz with an expressive program logic like apRHL [16, 17] in the style of Fuzzi [60]. Such a combined system would provide a complete programming framework supporting higher-order programming and end-to-end privacy proofs. Another line of future work is to incorporate some form of recursion to Jazz such as recursive types.

Footnotes

In addition to the color red, we notate the arrow in privacy function types with a double head \(\color {red}\twoheadrightarrow\) to further visually distinguish them from sensitivity function arrows \(\color {green}\rightarrow\). We describe details such as the definition of the

notation later.

We restrict ourselves to discrete distributions, because considering continuous distributions would complicate the language semantics (continuous distributions interact badly with higher-order functions), and our main focus here is on the type system.

We chose colors following the 24-Color Palette from http://mkweb.bcgsc.ca/colorblind. E.g., to persons with deuteranopia (the most common form of color blindness), colorschemes ■/ ■/ ■ appear as ■/ ■/ ■, respectively.

⁴

At first sight, one might think that a is ∞-sensitive in b because a change in b may flip the direction of the returned injection. However, any change on the value of b necessarily results in an infinite variation, since true and false are ∞ far apart. Therefore, the induced variation on the value of a is trivially bounded by ∞, scaled by 1, turning a 1-sensitive in b.

⁵

For simplicity, we use “distance” instead of “relational distance.”

⁶

We show how to wiggle variables on privacy environment with the relational distance instantiation operator, later in Section 7.3.

⁷

https://github.com/uvm-plaid/contextual-duet/.

Appendices

Throughout the appendix, we use symbol \({ {{{\color {green}s}}}}\) instead of \({ {{{\color {green}d}}}}\) and \({ {{{\color {green}\Sigma }}}}\) instead of \(\color {green}{\Delta,}\) as they are interchangeable.

A \(\lambda _{\text{J}}\): Static Semantics

In this section, we present some definitions of the static semantics of \(\lambda _{\text{J}}\) not presented in the main document. Figure 18 presents the syntax of \(\lambda _{\text{J}}\). Figures 19 and 20 present the complete sensitivity type system of \(\lambda _{\text{J}}\). Figure 21 presents the complete type system of \(\lambda _{\text{J}}\). They include the usage of Φ and the type rules for the derived expressions: Boolean, conditional, and let expressions. Figure 22 presents the sensitivity instantiation or dot product operator and the sensitivity type instantiation operator. Figure 23 presents the subtyping rules for \(\lambda _{\text{J}}\). Figure 24 presents the sensitivity environment substitution environment. Figure 25 presents the different lift operators. Finally, Figure 26 presents the join and meet operator between types.

Fig. 18.

Fig. 19.

Fig. 20.

Fig. 21.

Fig. 22.

Fig. 23.

Fig. 24.

Fig. 25.

Fig. 26.

B \(\lambda _{\text{J}}\): Dynamic Semantics

Figure 27 presents the dynamic semantics of the sensitivity language. Figure 28 presents the dynamic semantics of the privacy language.

Fig. 27.

Fig. 28.

C \(\lambda _{\text{J}}\): Soundness

In this section, we present auxiliary definitions used in Section 7.3 and the proof of the fundamental property. Figure 29 presents the join and meet operators for sensitivity environments and sensitivities \({ {{{\color {green}\Sigma }}} + {{{\color {green}s}}}}\). Figure 30 presents the subtyping relation between sensible types.

Fig. 29.

Fig. 30.

References

[1]

F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, and E. Duchesnay. 2011. Scikit-learn: Machine learning in Python. Journal of Machine Learning Research 12 (2011), 2825–2830. https://scikit-hep.org/citing.

Digital Library

Google Scholar

[2]

Martin Abadi, Andy Chu, Ian Goodfellow, H. Brendan McMahan, Ilya Mironov, Kunal Talwar, and Li Zhang. 2016. Deep learning with differential privacy. In Proceedings of the ACM SIGSAC Conference on Computer and Communications Security. 308–318.

Digital Library

Google Scholar

[3]

Amal Ahmed. 2004. Semantics of Types for Mutable State. Ph.D. Dissertation. Princeton University.

Google Scholar

[4]

Amal Ahmed. 2006. Step-indexed syntactic logical relations for recursive and quantified types. In Programming Languages and Systems, 15th European Symposium on Programming, ESOP 2006, Held as Part of the Joint European Conferences on Theory and Practice of Software, ETAPS 2006, Vienna, Austria, March 27–28, 2006, Proceedings (Lecture Notes in Computer Science), Vol. 3924. Springer, 69–83.

Google Scholar

[5]

Aws Albarghouthi and Justin Hsu. 2018. Synthesizing coupling proofs of differential privacy. Proc. ACM Program. Lang. 2, POPL (2018), 58:1–58:30. DOI:

Digital Library

Google Scholar

[6]

Andrew W. Appel and David McAllester. 2001. An indexed model of recursive types for foundational proof-carrying code. ACM Trans. Program. Lang. Syst. 23, 5 (Sept.2001), 657–683.

Digital Library

Google Scholar

[7]

Borja Balle and Yu-Xiang Wang. 2018. Improving the Gaussian mechanism for differential privacy: Analytical calibration and optimal denoising. In Proceedings of the International Conference on Machine Learning. PMLR, 394–403.

Google Scholar

[8]

Yuyan Bao, Guannan Wei, Oliver Bracevac, Yuxuan Jiang, Qiyang He, and Tiark Rompf. 2021. Reachability types: Tracking aliasing and separation in higher-order functional programs. Proc. ACM Program. Lang. 5, OOPSLA (2021), 1–32.

Digital Library

Google Scholar

[9]

Gilles Barthe, Rohit Chadha, Vishal Jagannath, A. Prasad Sistla, and Mahesh Viswanathan. 2020. Deciding differential privacy for programs with finite inputs and outputs. In Proceedings of the 35th Annual ACM/IEEE Symposium on Logic in Computer Science. 141–154.

Digital Library

Google Scholar

[10]

Gilles Barthe, Rohit Chadha, Paul Krogmeier, A. Prasad Sistla, and Mahesh Viswanathan. 2021. Deciding accuracy of differential privacy schemes. Proc. ACM Program. Lang. 5, POPL (2021), 1–30.

Digital Library

Google Scholar

[11]

Gilles Barthe, George Danezis, Benjamin Grégoire, César Kunz, and Santiago Zanella Béguelin. 2013. Verified computational differential privacy with applications to smart metering. In Proceedings of the IEEE 26th Computer Security Foundations Symposium. IEEE Computer Society, 287–301. DOI:

Digital Library

Google Scholar

[12]

Gilles Barthe, Thomas Espitau, Justin Hsu, Tetsuya Sato, and Pierre-Yves Strub. 2019. Relational ⋆⋆\star-liftings for differential privacy. Logic. Meth. Comput. Sci. 15, 4 (2019). Retrieved from https://lmcs.episciences.org/5989.

Google Scholar

[13]

G. Barthe, M. Gaboardi, E. J. G. Arias, J. Hsu, C. Kunz, and P. Strub. 2014. Proving differential privacy in Hoare logic. In Proceedings of the IEEE 27th Computer Security Foundations Symposium. 411–424. DOI:

Digital Library

Google Scholar

[14]

Gilles Barthe, Marco Gaboardi, Emilio Jesús Gallego Arias, Justin Hsu, Aaron Roth, and Pierre-Yves Strub. 2015. Higher-order approximate relational refinement types for mechanism design and differential privacy. In Proceedings of the 42nd Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages (POPL’15). Association for Computing Machinery, New York, NY, 55–68. DOI:

Digital Library

Google Scholar

[15]

Gilles Barthe, Marco Gaboardi, Benjamin Grégoire, Justin Hsu, and Pierre-Yves Strub. 2016. Proving differential privacy via probabilistic couplings. In Proceedings of the 31st Annual ACM/IEEE Symposium on Logic in Computer Science (LICS’16). Association for Computing Machinery, New York, NY, 749–758. DOI:

Digital Library

Google Scholar

[16]

Gilles Barthe, Boris Köpf, Federico Olmedo, and Santiago Zanella Béguelin. 2012. Probabilistic relational reasoning for differential privacy. In Proceedings of the 39th Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages (POPL’12). Association for Computing Machinery, New York, NY, 97–110. DOI:

Digital Library

Google Scholar

[17]

Gilles Barthe, Boris Köpf, Federico Olmedo, and Santiago Zanella-Béguelin. 2013. Probabilistic relational reasoning for differential privacy. ACM Trans. Program. Lang. Syst. 35, 3, (Nov.2013). DOI:

Digital Library

Google Scholar

[18]

Raef Bassily, Adam Smith, and Abhradeep Thakurta. 2014. Private empirical risk minimization: Efficient algorithms and tight error bounds. In Proceedings of the IEEE 55th Annual Symposium on Foundations of Computer Science (FOCS’14). IEEE, 464–473.

Digital Library

Google Scholar

[19]

Benjamin Bichsel, Timon Gehr, Dana Drachsler-Cohen, Petar Tsankov, and Martin Vechev. 2018. DP-finder: Finding differential privacy violations by sampling and optimization. In Proceedings of the ACM SIGSAC Conference on Computer and Communications Security. 508–524.

Digital Library

Google Scholar

[20]

Mark Bun, Cynthia Dwork, Guy N. Rothblum, and Thomas Steinke. 2018. Composable and versatile privacy via truncated CDP. In Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing. 74–86.

Digital Library

Google Scholar

[21]

Mark Bun and Thomas Steinke. 2016. Concentrated differential privacy: Simplifications, extensions, and lower bounds. In Proceedings of the Theory of Cryptography Conference. Springer, 635–658.

Digital Library

Google Scholar

[22]

Clément L. Canonne, Gautam Kamath, and Thomas Steinke. 2020. The discrete Gaussian for differential privacy. In Proceedings of the Annual Conference on Neural Information Processing Systems (NeurIPS’20).

Google Scholar

[23]

Kamalika Chaudhuri and Staal A. Vinterbo. 2013. A stability-based validation procedure for differentially private machine learning. In Proceedings of the Conference on Advances in Neural Information Processing Systems. 2652–2660.

Google Scholar

[24]

Ezgi Çiçek, Weihao Qu, Gilles Barthe, Marco Gaboardi, and Deepak Garg. 2019. Bidirectional type checking for relational properties. In Proceedings of the 40th ACM SIGPLAN Conference on Programming Language Design and Implementation. ACM, 533–547. DOI:

Digital Library

Google Scholar

[25]

Michael R. Clarkson and Fred B. Schneider. 2010. Hyperproperties. J. Comput. Secur. 18, 6 (Sept.2010), 1157–1210.

Crossref

Google Scholar

[26]

Arthur Azevedo de Amorim, Emilio Jesús Gallego Arias, Marco Gaboardi, and Justin Hsu. 2015. Really natural linear indexed type checking. CoRR abs/1503.04522 (2015).

Google Scholar

[27]

Arthur Azevedo de Amorim, Marco Gaboardi, Emilio Jesús Gallego Arias, and Justin Hsu. 2014. Really natural linear indexed type checking. In Proceedings of the 26th International Symposium on Implementation and Application of Functional Languages (IFL’14). ACM, New York, NY. DOI:

Digital Library

Google Scholar

[28]

Arthur Azevedo de Amorim, Marco Gaboardi, Justin Hsu, Shin-ya Katsumata, and Ikram Cherigui. 2017. A semantic account of metric preservation. In Proceedings of the 44th ACM SIGPLAN Symposium on Principles of Programming Languages. 545–556.

Digital Library

Google Scholar

[29]

Arthur Azevedo de Amorim, Marco Gaboardi, Justin Hsu, and Shin-ya Katsumata. 2019. Probabilistic relational reasoning via metrics. In Proceedings of the 34th Annual ACM/IEEE Symposium on Logic in Computer Science (LICS’19). IEEE, 1–19.

Digital Library

Google Scholar

[30]

Zeyu Ding, Yuxin Wang, Guanhong Wang, Danfeng Zhang, and Daniel Kifer. 2018. Detecting violations of differential privacy. In Proceedings of the ACM SIGSAC Conference on Computer and Communications Security. 475–489.

Digital Library

Google Scholar

[31]

Cynthia Dwork, Moni Naor, Omer Reingold, Guy N. Rothblum, and Salil Vadhan. 2009. On the complexity of differentially private data release: Efficient algorithms and hardness results. In Proceedings of the 41st Annual ACM Symposium on Theory of Computing (STOC’09). Association for Computing Machinery, New York, NY, 381–390. DOI:

Digital Library

Google Scholar

[32]

Cynthia Dwork and Aaron Roth. 2014. The algorithmic foundations of differential privacy. Found Trends® Theoret. Comput. Sci. 9, 3–4 (2014), 211–407.

Digital Library

Google Scholar

[33]

Richard A. Eisenberg and Stephanie Weirich. 2012. Dependently typed programming with singletons. In Proceedings of the Haskell Symposium. Association for Computing Machinery, New York, NY, 117–130.

Digital Library

Google Scholar

[34]

Marco Gaboardi, Andreas Haeberlen, Justin Hsu, Arjun Narayan, and Benjamin C. Pierce. 2013. Linear dependent types for differential privacy. In Proceedings of the 40th Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages. 357–370.

Digital Library

Google Scholar

[35]

Marco Gaboardi, Michael Hay, and Salil Vadhan. 2020. A programming framework for OpenDP. 6th Workshop on the Theory and Practice of Differential Privacy (TPDP’20). https://salil.seas.harvard.edu/publications/programming-framework-opendp.

Google Scholar

[36]

David K. Gifford and John M. Lucassen. 1986. Integrating functional and imperative programming. In Proceedings of the ACM Conference on Lisp and Functional Programming. 28–38.

Digital Library

Google Scholar

[37]

J. Hannan, P. Hicks, and D. Liben-Nowell. 1997. A Lifetime Analysis for Higher-order Languages. Technical Report. Pennsylvania State University.

Google Scholar

[38]

Moritz Hardt, Katrina Ligett, and Frank McSherry. 2012. A simple and practical algorithm for differentially private data release. In Proceedings of the Conference on Advances in Neural Information Processing Systems. 2339–2347.

Google Scholar

[39]

Susumu Hayashi. 1991. Singleton, union and intersection types for program extraction. In Theoretical Aspects of Computer Software, International Conference TACS’91, Sendai, Japan, September 24–27, 1991, Proceedings (Lecture Notes in Computer Science), Takayasu Ito and Albert R. Meyer (Eds.), Vol. 526. Springer, 701–730.

Crossref

Google Scholar

[40]

Xavier Leroy. 1992. Polymorphic Typing of an Algorithmic Language. Technical Report Research Report 1778. INRIA.

Google Scholar

[41]

Min Lyu, Dong Su, and Ninghui Li. 2017. Understanding the sparse vector technique for differential privacy. PVLDB 10, 6 (2017), 637–648. DOI:

Digital Library

Google Scholar

[42]

Frank McSherry. 2009. Privacy integrated queries: An extensible platform for privacy-preserving data analysis. In Proceedings of the ACM SIGMOD International Conference on Management of Data. ACM, 19–30. DOI:

Digital Library

Google Scholar

[43]

Ilya Mironov. 2012. On significance of the least significant bits for differential privacy. In Proceedings of the ACM Conference on Computer and Communications Security (CCS’12). ACM, 650–661.

Digital Library

Google Scholar

[44]

Ilya Mironov. 2017. Rényi differential privacy. In Proceedings of the IEEE 30th Computer Security Foundations Symposium (CSF’17). IEEE, 263–275.

Crossref

Google Scholar

[45]

Aleksandar Nanevski, J. Gregory Morrisett, and Lars Birkedal. 2008. Hoare type theory, polymorphism and separation. J. Funct. Program. 18, 5-6 (2008), 865–911.

Digital Library

Google Scholar

[46]

Joseph P. Near, David Darais, Chike Abuah, Tim Stevens, Pranav Gaddamadugu, Lun Wang, Neel Somani, Mu Zhang, Nikhil Sharma, Alex Shan et al. 2019. Duet: An expressive higher-order language and linear type system for statically enforcing differential privacy. Proc. ACM Program. Lang. 3, OOPSLA (2019), 1–30.

Digital Library

Google Scholar

[47]

Dominic Orchard, Vilem-Benjamin Liepelt, and Harley Eades III. 2019. Quantitative program reasoning with graded modal types. Proc. ACM Program. Lang. 3, ICFP (2019), 110:1–110:30.

Digital Library

Google Scholar

[48]

Tomas Petricek, Dominic A. Orchard, and Alan Mycroft. 2014. Coeffects: A calculus of context-dependent computation. In Proceedings of the 19th ACM SIGPLAN International Conference on Functional Programming. ACM, 123–135.

Digital Library

Google Scholar

[49]

Jason Reed and Benjamin C. Pierce. 2010. Distance makes the types grow stronger: A calculus for differential privacy. In Proceedings of the 15th ACM SIGPLAN International Conference on Functional Programming. 157–168.

Digital Library

Google Scholar

[50]

T. Sato, G. Barthe, M. Gaboardi, J. Hsu, and S. Katsumata. 2019. Approximate span liftings: Compositional semantics for relaxations of differential privacy. In Proceedings of the 34th Annual ACM/IEEE Symposium on Logic in Computer Science (LICS’19). 1–14. DOI:

Crossref

Google Scholar

[51]

Gabriel Scherer and Jan Hoffmann. 2013. Tracking data-flow with open closure types. In Proceedings of the 19th International Conference on Logic for Programming, Artificial Intelligence, and Reasoning (LPAR’13) (Lectures Notes in Computer Science). Springer, 710–726.

Crossref

Google Scholar

[52]

Kunal Talwar, Abhradeep Guha Thakurta, and Li Zhang. 2015. Nearly optimal private lasso. In Proceedings of the Conference on Advances in Neural Information Processing Systems. 3025–3033.

Google Scholar

[53]

Om Thakkar, Galen Andrew, and H. Brendan McMahan. 2019. Differentially private learning with adaptive clipping. arXiv preprint arXiv:1905.03871 (2019).

Google Scholar

[54]

Matías Toro, David Darais, Chike Abuah, Joseph P. Near, Damián Árquez, Federico Olmedo, and Éric Tanter. 2023. Contextual Linear Types for Differential Privacy — Extended with Proofs. Technical Report TR/DCC-2023-1. University of Chile.

Google Scholar

[55]

Yuxin Wang, Zeyu Ding, Guanhong Wang, Daniel Kifer, and Danfeng Zhang. 2019. Proving differential privacy with shadow execution. In Proceedings of the 40th ACM SIGPLAN Conference on Programming Language Design and Implementation. 655–669.

Digital Library

Google Scholar

[56]

Royce J. Wilson, Celia Yuxing Zhang, William Lam, Damien Desfontaines, Daniel Simmons-Marengo, and Bryant Gipson. 2020. Differentially private SQL with bounded user contribution. Proc. Privac. Enhanc. Technol. 2020, 2 (2020).

Google Scholar

[57]

Xi Wu, Fengan Li, Arun Kumar, Kamalika Chaudhuri, Somesh Jha, and Jeffrey Naughton. 2017. Bolt-on differential privacy for scalable stochastic gradient descent-based analytics. In Proceedings of the ACM International Conference on Management of Data (SIGMOD’17). ACM, New York, NY, 1307–1322. DOI:

Digital Library

Google Scholar

[58]

Ashkan Yousefpour, Igor Shilov, Alexandre Sablayrolles, Davide Testuggine, Karthik Prasad, Mani Malek, John Nguyen, Sayan Ghosh, Akash Bharadwaj, Jessica Zhao, et al. 2021. Opacus: User-friendly differential privacy library in PyTorch. arXiv preprint arXiv:2109.12298 (2021).

Google Scholar

[59]

Danfeng Zhang and Daniel Kifer. 2017. LightDP: Towards automating differential privacy proofs. In Proceedings of the 44th ACM SIGPLAN Symposium on Principles of Programming Languages. 888–901.

Digital Library

Google Scholar

[60]

Hengchu Zhang, Edo Roth, Andreas Haeberlen, Benjamin C. Pierce, and Aaron Roth. 2019. Fuzzi: A three-level logic for differential privacy. Proc. ACM Program. Lang. 3, ICFP (2019), 1–28.

Digital Library

Google Scholar

[61]

Hengchu Zhang, Edo Roth, Andreas Haeberlen, Benjamin C. Pierce, and Aaron Roth. 2020. Testing differential privacy with dual interpreters. Proc. ACM Program. Lang. 4, OOPSLA (2020), 1–26.

Digital Library

Google Scholar

Cited By

View all

Lobo-Vesga ERusso AGaboardi MCortiñas C(2024)Sensitivity by ParametricityProceedings of the ACM on Programming Languages10.1145/36897268:OOPSLA2(415-441)Online publication date: 8-Oct-2024
https://dl.acm.org/doi/10.1145/3689726
Toro MOlmedo FTanter É(2024)Gradual Differentially Private ProgrammingCommunications of the ACM10.1145/365332867:8(49-53)Online publication date: 18-Jul-2024
https://dl.acm.org/doi/10.1145/3653328

Index Terms

Contextual Linear Types for Differential Privacy
1. Security and privacy
  1. Formal methods and theory of security
    1. Logic and verification
2. Theory of computation
  1. Logic
    1. Linear logic
  2. Semantics and reasoning
    1. Program constructs
      1. Type structures
    2. Program semantics

Recommendations

Linear dependent types for differential privacy
POPL '13: Proceedings of the 40th annual ACM SIGPLAN-SIGACT symposium on Principles of programming languages

Differential privacy offers a way to answer queries about sensitive information while providing strong, provable privacy guarantees, ensuring that the presence or absence of a single individual in the database has a negligible statistical effect on the ...
Linear dependent types for differential privacy
POPL '13

Differential privacy offers a way to answer queries about sensitive information while providing strong, provable privacy guarantees, ensuring that the presence or absence of a single individual in the database has a negligible statistical effect on the ...
An Emerging Strategy for Privacy Preserving Databases: Differential Privacy
HCI for Cybersecurity, Privacy and Trust
Abstract
Data De-identification and Differential Privacy are two possible approaches for providing data security and user privacy. Data de-identification is the process where the personal identifiable information of individuals is extracted to create ...

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Programming Languages and Systems

ACM Transactions on Programming Languages and Systems Volume 45, Issue 2

June 2023

378 pages

ISSN:0164-0925

EISSN:1558-4593

DOI:10.1145/3597018

Editor:
Jan Vitek
Northeastern University, USA

Issue’s Table of Contents

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 May 2023

Online AM: 06 April 2023

Accepted: 25 February 2023

Revised: 21 February 2023

Received: 22 February 2021

Published in TOPLAS Volume 45, Issue 2

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

ANID FONDECYT
ANID Millennium Science Initiative Program code
NSF

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

2
Total Citations
View Citations
1,121
Total Downloads

Downloads (Last 12 months)646
Downloads (Last 6 weeks)91

Reflects downloads up to 10 Nov 2024

Other Metrics

View Author Metrics

Citations

Cited By

View all

Lobo-Vesga ERusso AGaboardi MCortiñas C(2024)Sensitivity by ParametricityProceedings of the ACM on Programming Languages10.1145/36897268:OOPSLA2(415-441)Online publication date: 8-Oct-2024
https://dl.acm.org/doi/10.1145/3689726
Toro MOlmedo FTanter É(2024)Gradual Differentially Private ProgrammingCommunications of the ACM10.1145/365332867:8(49-53)Online publication date: 18-Jul-2024
https://dl.acm.org/doi/10.1145/3653328

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Abstract

1 Introduction

2 A Differential Privacy Primer

3 Overview of Jazz

3.1 A Two-language Design

3.2 Sensitivity

3.3 Privacy

4 Design of Jazz’S Sensitivity Type System

4.1 Linear Products and Sums

Linear products.

Linear sums.

4.2 Limitations of Prior Sensitivity Linear Type Systems

Limitations related to linear products.

Limitations related to linear sums.

4.3 Latent Contextual Effects for Precise Sensitivity Tracking in Jazz

Precise products.

Precise sums.

5 Sax: Jazz’S Sensitivity Type System, Formally

5.1 Syntax and Type System

5.2 Type Safety

5.3 Type Soundness

6 Design of Jazz’S Privacy Type System

6.1 Privacy Closures

6.2 Sensitivity Binding in Privacy Contexts

7 Jazz’S Differential Privacy Type System, Formally

7.1 Syntax and Type System

7.2 \(\lambda _{\text{{J}}}\): Type Safety

7.3 Soundness of \(\lambda _{\text{ {J}}}\): Metric Preservation

Probabilistic Semantics.

Logical relation.

8 FROM \(\lambda \mathsf{_J}\) TO JAZZ

9 Implementation & Case Studies

9.1 Implementation

9.2 MWEM

9.3 Differentially Private Deep Learning with Adaptive Clipping

10 Related Work

Verification techniques based on type systems.

Techniques based on couplings and program logics.

Techniques based on randomness alignment.

Techniques based on testing.

Type systems and contextual information.

11 Conclusion

Footnotes

Appendices

A \(\lambda _{\text{J}}\): Static Semantics

B \(\lambda _{\text{J}}\): Dynamic Semantics

C \(\lambda _{\text{J}}\): Soundness

References

Cited By

Index Terms

Recommendations

Linear dependent types for differential privacy

Linear dependent types for differential privacy

An Emerging Strategy for Privacy Preserving Databases: Differential Privacy

Comments

Information

Published In

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Funding Sources

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

View options

PDF

eReader

Get Access

Login options

Full Access

Figures

Other

Share