
Zero Inflation as a Missing Data Problem: a Proxy-based Approach

Trung Phung tphung1@jhu.edu Johns Hopkins Whiting School of Engineering, Baltimore, MD Jaron J.R. Lee Johns Hopkins Whiting School of Engineering, Baltimore, MD Opeyemi Oladapo-Shittu Johns Hopkins University School of Medicine, Baltimore, MD Eili Y. Klein Johns Hopkins University School of Medicine, Baltimore, MD Ayse Pinar Gurses Johns Hopkins Whiting School of Engineering, Baltimore, MD Johns Hopkins University School of Medicine, Baltimore, MD Johns Hopkins Bloomberg School of Public Health, Baltimore, MD Johns Hopkins Medicine, Baltimore, MD Susan M. Hannum Johns Hopkins Bloomberg School of Public Health, Baltimore, MD Kimberly Weems Johns Hopkins Health System, Baltimore, MD Vassar Brothers Medical Center, Poughkeepsie, NY Jill A. Marsteller Johns Hopkins Bloomberg School of Public Health, Baltimore, MD Johns Hopkins Medicine, Baltimore, MD Sara E. Cosgrove Johns Hopkins University School of Medicine, Baltimore, MD Johns Hopkins Medicine, Baltimore, MD Johns Hopkins Health System, Baltimore, MD Sara C. Keller Johns Hopkins University School of Medicine, Baltimore, MD Johns Hopkins Medicine, Baltimore, MD Ilya Shpitser Johns Hopkins Whiting School of Engineering, Baltimore, MD
Abstract

A common type of zero-inflated data has certain true values incorrectly replaced by zeros due to data recording conventions (rare outcomes assumed to be absent) or details of data recording equipment (e.g. artificial zeros in gene expression data).

Existing methods for zero-inflated data either fit the observed data likelihood via parametric mixture models that explicitly represent excess zeros, or aim to replace excess zeros by imputed values. If the goal of the analysis relies on knowing true data realizations, a particular challenge with zero-inflated data is identifiability, since it is difficult to correctly determine which observed zeros are real and which are inflated.

This paper views zero-inflated data as a general type of missing data problem, where the observability indicator for a potentially censored variable is itself unobserved whenever a zero is recorded. We show that, without additional assumptions, target parameters involving a zero-inflated variable are not identified. However, if a proxy of the missingness indicator is observed, a modification of the effect restoration approach of Kuroki and Pearl allows identification and estimation, given the proxy-indicator relationship is known.

If this relationship is unknown, our approach yields a partial identification strategy for sensitivity analysis. Specifically, we show that only certain proxy-indicator relationships are compatible with the observed data distribution. We give an analytic bound for this relationship in cases with a categorical outcome, which is sharp in certain models. For more complex cases, sharp numerical bounds may be computed using methods in Duarte et al. [2023].

We illustrate our method via simulation studies and a data application on central line-associated bloodstream infections (CLABSIs).

1 Introduction

Zero-inflated (ZI) data is prevalent in many empirical sciences such as public health, epidemiology, computational biology, and medical research. An important type of zero inflation occurs when some observed zeros of an outcome of interest do not represent true zero values.

As an example, consider patient surveillance for complications in outpatient settings, where any complication developed outside the hospital is of interest. One such complication is a central line-associated bloodstream infection (CLABSI) which can occur in patients undergoing therapies involving central venous catheters (CVCs). Such complications are fairly rare, but are associated with significant morbidity and mortality, and their prevalence is often assessed retrospectively. Because of this, absence of sufficient information on whether such a complication is present in a particular patient is often coded as a “presumed negative” rather than a “missing value” [Keller et al., 2020]. Since this type of value differs from a true negative value, indicating actual absence of a complication in a patient, the result is zero-inflated data. Another prominent example is single-cell RNA sequence data, whose zeros may signify either genuine values (representing, e.g. lack of gene expression) or artificial zeros resulting from technical artifacts of experimental protocols or recording equipment [Wagner et al., 2016, Jiang et al., 2022]. In all these cases, naive analysis of ZI data that does not distinguish true from artificial zeros can lead to markedly biased conclusions.

Existing approaches for zero inflation focus on observed data likelihood modelling using either hurdle models or zero-inflation models [Neelon et al., 2016, Greene, 2005]. Hurdle models are mixture models of a distribution truncated at zero and another distribution modeling the occurrence of 0 values [Mullahy, 1986]. In genomics applications, Yu et al. [2023], Dai et al. [2023] use graphical models to represent the zero-inflated likelihood for the purposes of causal discovery. On the other hand, zero inflation models [Lambert, 1992, Young et al., 2022] assume two sources of zeros, either structural (or inflated) zeros or true zeros due to sampling. More recent work has extended this type of approach to include semi-parametric models [Arab et al., 2012, Lam et al., 2006]. Kleinke and Reinecke [2013] apply an augmentation of the chained equations imputation approach to correct the bias introduced by inflated zeros. Lukusa et al. [2017] review methods in settings where inflated zeros co-occur with missing data; however, these settings do not include the cases considered here, where the excess zeros represent a censored realization.

The disadvantage of the first type of approach is that it does not aim to reconstruct underlying values, which are often of interest. The disadvantage of the second type of approach is that correctly distinguishing true from inflated zero values relies on assumptions that are unlikely to hold in practice, e.g., strict parametric assumptions. Moreover, these assumptions may not be congenial and may fail to yield a coherent full data distribution, which guarantees model misspecification. This is a more general issue than zero inflation, and occurs in standard missing data problems as well. In contrast, our approach to modeling inflated zeros has two important features. First, we aim to distinguish true from inflated zeros, and thus identify underlying realizations in the data. Second, we avoid imposing strong parametric assumptions to do so.

Specifically, we propose to model zero inflation using a generalization of missing data models. In standard missing data, the relationship between an observed variable and its corresponding underlying variable is determined by an observability indicator. If the indicator is 1, the observed and the underlying variables coincide, while if the indicator is 0, the observed variable is recorded as a missing value. In zero inflated problems, we view improperly recorded zero values as missing values denoted by a zero. Hence, in this view, we cannot tell a zero indicating an actual value from a zero indicating missingness, and observing a zero means the observability indicator is itself unobserved.

This complication implies that even if we assume a missing data model where the full data distribution would have been identified absent zero inflation, such as the Missing-Completely-At-Random (MCAR) model, we would generally not obtain identification in the presence of zero inflation. Thus, the variant of the missing data problem we consider is significantly more complicated than standard missing data.

We approach this problem using recent theory of graphical models applied to missing data, which gives general identification results in the absence of zero inflation [Mohan et al., 2013, Bhattacharya et al., 2019, Nabi et al., 2020]. We first note that zero inflation problems viewed in this framework could be arranged in a hierarchy similarly to missing data problems [Rubin, 1976]: Zero-Inflated Missing-Completely-At-Random (ZI MCAR), Zero-Inflated Missing-At-Random (ZI MAR), and Zero-Inflated Missing-Not-At-Random (ZI MNAR).

We then show that if zero inflation is present, target parameters involving zero inflated variables are not identified without additional assumptions, even in the relatively simple ZI MCAR model. We further show that if an informative proxy for a missingness indicator exists and the true proxy-indicator relationship is known, identification of the target parameters becomes possible whenever the missing data model (sans zero inflation) is identified, via a modification of the effect restoration approach in Kuroki and Pearl [2014].

If this relationship is not known, we show that only certain proxy-indicator relationships are compatible with the overall model, which provides a natural sensitivity analysis strategy. In particular, in the case of a categorical outcome, we provide an analytic bound for the proxy-indicator relationship in the presence of zero inflation in a number of missing data models, and show that in some models our bound is sharp. In more general cases, we show that the numeric approach for obtaining bounds detailed in Duarte et al. [2023] may be used instead.

Finally, we demonstrate an application of our method on simulated data, as well as a real world dataset on CLABSIs.

2 Graphical Models of Missing Data and Zero Inflated Data

In this section we briefly review relevant existing works on missing data, and describe difficulties posed by zero inflation.

2.1 Missing data and identification

Let $X^{(1)}=\{X_{1}^{(1)},\ldots,X_{n}^{(1)}\}$ be a set of random variables (r.v.s) of interest. Denote by $\mathcal{X}^{(1)}_{i}$ the state space of $X^{(1)}_{i}$, which we assume is categorical and, without loss of generality, includes the value $0$. Samples of $X^{(1)}$ are systematically missing, with true values being replaced by a special symbol “?”. To better represent missing data problems, it is convenient to use two additional sets of r.v.s: the proxies $X=\{X_{1},\ldots,X_{n}\}$, where each proxy $X_{i}\in X$ has the state space $\mathcal{X}_{i}=\mathcal{X}_{i}^{(1)}\cup\{\text{``?''}\}$, and the binary observability indicators $R=\{R_{1},\ldots,R_{n}\}$.
Each proxy $X_{i}$ is deterministically defined in terms of the underlying variable $X^{(1)}_{i}$ and the observability indicator $R_{i}$ via the missing data version of the consistency rule: $X_{i}=X^{(1)}_{i}$ when $R_{i}=1$ and $X_{i}=\text{``?''}$ when $R_{i}=0$. Thus, a variable $X_{i}^{(1)}$ may be described as "$X_{i}$ had it (hypothetically) been observed", i.e., a counterfactual. The superscript notation is deliberately chosen to make the connection to counterfactuals in causal inference explicit. In addition to $X^{(1)},R,X$, let $C$ represent other fully observed variables.
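The consistency rule can be made concrete with a small sketch. The function name `observe`, the constant `MISSING`, and the five example units below are illustrative assumptions, not part of the paper.

```python
from typing import Union

MISSING = "?"  # special symbol for a missing value, as in the text


def observe(x_true: int, r: int) -> Union[int, str]:
    """Missing-data consistency: X = X^(1) if R = 1, and X = "?" if R = 0."""
    if r not in (0, 1):
        raise ValueError("observability indicator must be 0 or 1")
    return x_true if r == 1 else MISSING


# Hypothetical underlying values X^(1) and indicators R for five units.
x_true = [0, 2, 1, 0, 3]
r = [1, 0, 1, 0, 1]
x_obs = [observe(x, ri) for x, ri in zip(x_true, r)]
print(x_obs)  # [0, '?', 1, '?', 3]
```

Here every missing entry is visibly marked, so the indicator R can always be read off the recorded data.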

We define $R_{-i}$ as $\{R_{1},\ldots,R_{i-1},R_{i+1},\ldots,R_{n}\}$, $R_{<i}$ as $\{R_{1},\ldots,R_{i-1}\}$, and $R_{\geq i}$ as $\{R_{i},\ldots,R_{n}\}$, with analogous subsets of $X$ and $X^{(1)}$ defined similarly. Following the nomenclature in Nabi et al. [2020], Bhattacharya et al. [2019], we call $p(X^{(1)},R,C)$ the full law, $p(R,X,C)$ the observed law, and $p(X^{(1)})$ the target law. A missing data model is a set of distributions over the variables $\{X^{(1)},R,X,C\}$ that satisfy the above consistency rule.

Following Mohan et al. [2013], we consider missing data models defined using a class of directed acyclic graphs (DAGs) called missing data DAGs (m-DAGs). Specifically, an m-DAG $\mathcal{G}(V)$ consists of nodes $V=\{X^{(1)},R,X,C\}$. Like all DAGs, m-DAGs only have directed edges and lack directed cycles, but also have a number of additional restrictions: each proxy $X_{i}$ has exactly 2 incoming edges $X^{(1)}_{i}\rightarrow X_{i}\leftarrow R_{i}$ (due to consistency); there is no edge from any $X_{i}$ or $R_{i}$ to any $X^{(1)}_{i}$. A joint $p(X^{(1)},R,X)$ in the missing data model corresponding to the m-DAG $\mathcal{G}$ factorizes as

$$\prod_{V\in\{R,X^{(1)}\}}p(V\mid\operatorname{pa}_{\mathcal{G}}(V))\prod_{X_{i}\in X}p(X_{i}\mid R_{i},X^{(1)}_{i})$$

where all terms $p(X_{i}\mid R_{i},X^{(1)}_{i})$ are deterministic. Using m-DAGs, one can represent many interesting missing data scenarios; see Fig. 1 for examples.
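As a worked instance of this factorization, consider the single-variable MCAR setting of Fig. 1 (a). The sketch below assumes the graph contains only the two edges into $X_1$, so that $R_1$ and $X_1^{(1)}$ have no parents. The indicator-function form of the deterministic term is simply the consistency rule written as a conditional distribution.

```latex
\begin{align*}
p(X_1^{(1)}, R_1, X_1)
  &= p(X_1^{(1)})\, p(R_1)\, p(X_1 \mid R_1, X_1^{(1)}), \\
p(X_1 = x \mid R_1 = r, X_1^{(1)} = x')
  &= \begin{cases}
       \mathbb{1}[x = x'] & \text{if } r = 1, \\
       \mathbb{1}[x = \text{``?''}] & \text{if } r = 0.
     \end{cases}
\end{align*}
```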

An important goal in missing data problems, prior to statistical inference, is to ensure the target parameter, which is generally some function of the target law, is identified from the observed law. It follows by definition that the target law $p(X^{(1)})$ is identified if and only if the propensity score $p(R\mid X^{(1)})$ evaluated at $R=1$ is identified, while the full law $p(X^{(1)},R)$ is identified if and only if the propensity score $p(R\mid X^{(1)})$ at all values of $R$ is identified. While identification of the target law is still an open problem, Nabi et al. [2020] showed a sound and complete method for identification of the full law $p(X^{(1)},R)$ from the observed law $p(R,X)$ in missing data models represented by m-DAGs and hidden variable m-DAGs.
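To see why the propensity score at $R=1$ is the key quantity, the following short derivation may help. It is a sketch that assumes the propensity score is strictly positive at $R=1$ (a positivity condition not stated in the passage above), with $R=1$ meaning that all indicators equal one.

```latex
\begin{align*}
p(X = x, R = 1) &= p(X^{(1)} = x, R = 1) && \text{(consistency)} \\
                &= p(R = 1 \mid X^{(1)} = x)\, p(X^{(1)} = x), \\
\text{hence}\quad
p(X^{(1)} = x)  &= \frac{p(X = x, R = 1)}{p(R = 1 \mid X^{(1)} = x)}.
\end{align*}
```

In the MCAR case, $p(R=1\mid X^{(1)}=x)=p(R=1)$, so the expression reduces to $p(X^{(1)}=x)=p(X=x\mid R=1)$, the familiar complete-case formula.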

Figure 1: Missing data scenarios represented by m-DAGs. Circle nodes denote observed variables, while other nodes are unobserved. Gray edges denote the deterministic nature of $p(X_{i}\mid R_{i},X^{(1)}_{i})$ due to consistency. (a) $X^{(1)}_{1}$ is MCAR since $R_{1}\perp\!\!\!\perp X^{(1)}$. (b) $X^{(1)}_{1}$ is MAR since $R_{1}\perp\!\!\!\perp X^{(1)}\mid C$.
(c) $X^{(1)}_{1},X^{(1)}_{2},X^{(1)}_{3}$ are MNAR, since the observability indicators $R_{1},R_{2},R_{3}$ are not independent of these variables, either marginally or given observed variables.

2.2 Zero Inflation Non-identifiability

A zero inflated (ZI) model associated with an m-DAG is a variant of the missing data model associated with that m-DAG, with the following important difference: the missing data consistency relating variables $X^{(1)}_{i}\in X^{(1)}$, $X_{i}\in X$, $R_{i}\in R$ is replaced by a zero inflation version, where $X_{i}=X^{(1)}_{i}$ if $R_{i}=1$, and $X_{i}=0$ if $R_{i}=0$. (Footnote 1: Note that we consider ZI models with categorical state spaces only, unless stated otherwise.)

There are several important consequences of zero inflated consistency. Firstly, both $X^{(1)}_{i}\in X^{(1)}$ and $X_{i}\in X$ take values in $\mathcal{X}_{i}$, and no variable in a ZI problem takes the value “?”. Secondly, as in missing data, the ZI-variable $X^{(1)}_{i}$ is counterfactual, and according to the ZI consistency rule, its true realizations are observed only when $R_{i}=1$. In particular, if $X_{i}=x\neq 0$, we deduce $R_{i}=1$ and $X_{i}^{(1)}=x$.
However, since it is not possible to tell whether a realization $X_{i}=0$ corresponds to the situation where $0$ is the true value of $X^{(1)}_{i}$, or corresponds to a censored realization of $X^{(1)}_{i}$, $R_{i}$ is unobserved whenever $X_{i}=0$. Moreover, while we still refer to $p(X^{(1)})$ and $p(X^{(1)},R)$ as the target law and the full law, respectively, we will refer to $p(R,X)$ as the zero-inflated law (ZI law), rather than the observed law, since $R$ is not always observed. Thirdly, the ZI consistency imposes the following important restriction on the ZI law $p(R,X)$:

  • (Z) For every $i$ and $x\neq 0$, $p(R_{i}=0,X_{i}=x)=0$.
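The ZI consistency rule and restriction (Z) can be illustrated with a small sketch. The helper name `zi_observe` and the example values are hypothetical; the point is only to show which indicator values remain recoverable from the recorded data.

```python
def zi_observe(x_true: int, r: int) -> int:
    """ZI consistency: X = X^(1) if R = 1, and X = 0 if R = 0."""
    if r not in (0, 1):
        raise ValueError("indicator must be 0 or 1")
    return x_true if r == 1 else 0


# Hypothetical underlying values X^(1) and indicators R for five units.
x_true = [0, 2, 1, 0, 3]
r = [1, 0, 1, 0, 1]
x_obs = [zi_observe(x, ri) for x, ri in zip(x_true, r)]
print(x_obs)  # [0, 0, 1, 0, 3]

# Restriction (Z): a nonzero record can never co-occur with R = 0.
violations = [(ri, xi) for ri, xi in zip(r, x_obs) if ri == 0 and xi != 0]

# What can be deduced about R from X alone: 1 if X != 0, unknown (None) otherwise.
r_known = [1 if xi != 0 else None for xi in x_obs]
print(r_known)  # [None, None, 1, None, 1]
```

The first unit is a true zero and the second is an inflated zero, yet both are recorded identically, so the indicator is unknown for every zero record.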

We classify a ZI model as ZI MCAR, ZI MAR, or ZI MNAR if its missing data version is MCAR, MAR, or MNAR, respectively. Examples of ZI models are shown in Fig. 2 and Fig. 3.

Just as in missing data problems, the goal in ZI problems is to identify (a function of) the target law or the full law from the observed law and possibly additional objects. We focus on the full law identification in this paper. Unsurprisingly, ZI problems are significantly harder than missing data problems, in the sense that both the target law and the full law are non-parametrically non-identified even in the simplest setting (ZI MCAR), as shown by the following result.

Lemma 1 (Non-identifiability).

Given a ZI model associated with any m-DAG $\mathcal{G}$, both the target law $p(X^{(1)})$ and the full law $p(X^{(1)},R,C)$ are non-parametrically non-identified.
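A numerical illustration of the lemma in the simplest case may be useful. The sketch below is an illustrative construction, not the authors' proof: it assumes a ZI MCAR model with a single binary variable and no proxy, and exhibits two different full laws that induce exactly the same distribution over the recorded variable. The function name and parameter values are hypothetical.

```python
from fractions import Fraction as F
from itertools import product


def observed_law(p_x1, p_r1):
    """Distribution of the recorded X in a ZI MCAR model with binary X^(1),
    where R is independent of X^(1) and X = X^(1) if R = 1, else X = 0."""
    law = {0: F(0), 1: F(0)}
    for x_true, r in product((0, 1), (0, 1)):
        p_x = p_x1 if x_true == 1 else 1 - p_x1
        p_r = p_r1 if r == 1 else 1 - p_r1
        law[x_true if r == 1 else 0] += p_x * p_r
    return law


# Two different full laws: (p(X^(1) = 1), p(R = 1)).
law_a = observed_law(p_x1=F(1, 2), p_r1=F(2, 5))
law_b = observed_law(p_x1=F(1, 4), p_r1=F(4, 5))
print(law_a == law_b)  # True
```

Since R = 1 is deducible only when X is nonzero, the recorded X carries all the observable information here, and the two parameterizations cannot be told apart from data.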

Figure 2: Examples of proxy-augmented ZI MCAR model (a) and ZI MAR models (b and c). A1, A2 hold in (a), A1, A2 hold in (b), and A1, A2 hold in (c). Unlike missing data, indicator $R$ is partially observed.
Figure 3: Examples of proxy-augmented ZI MNAR models. (a) ZI bivariate block-parallel model. (b) ZI bivariate block-sequential MAR model.

3 Proxy-based Identification

We first demonstrate our approach to proxy-based identification with the simplest ZI missing data model, ZI MCAR, and then generalize it to arbitrary ZI m-DAGs.

3.1 Identification in the ZI MCAR Model

Lemma 1 implies that any identification method must rely on additional assumptions beyond those implied by the m-DAG. To illustrate additional assumptions that will be employed, consider a simple ZI MCAR model with a single ZI variable $X^{(1)}$ taking values in $\{0,\ldots,K\}$, and the corresponding inflation indicator $R$, where $R\perp\!\!\!\perp X^{(1)}$.

To simplify subsequent presentation, we will use the following notational shorthand: $p_{a_{i}\mid b_{j}}$ to mean $p(A=i\mid B=j)$, and $\mathbf{p}_{A\mid B}$ to mean the stochastic matrix whose elements are $p_{a_{i}\mid b_{j}}$. Similarly for $p_{a_{i},b_{j}}$ and $p_{A,B}$.
We also use a matrix multiplication shorthand, where $p_{a_{i}\mid b_{j}}p_{b_{j}\mid c_{k}}=p_{a_{i}\mid c_{k}}$ is taken to mean $\sum_{j}p(A=i\mid B=j)\,p(B=j\mid C=k)=p(A=i\mid C=k)$.

We will assume the existence of an observed binary proxy variable $W$ informative for $R$ with the following properties:

  • (A1) $W \perp\!\!\!\perp X^{(1)} \mid R$,

  • (A2) The matrix $\mathbf{p}_{W \mid R}$ is invertible.

Note that since $W$ and $R$ are binary, A2 is equivalent to $p_{w_0 \mid r_0} \neq p_{w_0 \mid r_1}$. Due to the existence of the proxy variable $W$, we call this ZI MCAR model "proxy-augmented"; its graph is shown in Fig. 2 (a).
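The equivalence can be checked directly from the determinant of the $2 \times 2$ proxy matrix. The short derivation below uses only the fact that each column of $\mathbf{p}_{W \mid R}$ sums to one, so that $p_{w_1 \mid r} = 1 - p_{w_0 \mid r}$:

$$\det \mathbf{p}_{W \mid R} = p_{w_0 \mid r_0}\, p_{w_1 \mid r_1} - p_{w_0 \mid r_1}\, p_{w_1 \mid r_0} = p_{w_0 \mid r_0}\,(1 - p_{w_0 \mid r_1}) - p_{w_0 \mid r_1}\,(1 - p_{w_0 \mid r_0}) = p_{w_0 \mid r_0} - p_{w_0 \mid r_1}.$$

Hence the matrix is invertible exactly when the two conditional probabilities differ, i.e., when $W$ is not independent of $R$.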

Kuroki and Pearl [2014] considered assumptions A1 and A2 in the context of obtaining identification of causal effects in the presence of unobserved confounding. In that work, the proxy variable $W$ was related to an unobserved categorical variable which was a common cause of the treatment and outcome variables.

In this paper, we adopt the method of Kuroki and Pearl [2014] to express the ZI law $p(R,X)$ in terms of the observed law $p(X,W)$ and the conditional distribution $p(W \mid R)$. In addition to A1 and A2, the Kuroki-Pearl method requires that the observed law $p(X,W)$ and $p(W \mid R)$ are derived from the same full law (i.e., are compatible), and that $p(W \mid R)$ is known.

To see that point identification is then possible, we write $p(w,x)=\sum_{r}p(w\mid r)\,p(r,x)$ in matrix form

$$\underbrace{\begin{pmatrix} p_{w_0,x_0} & \cdots & p_{w_0,x_K} \\ p_{w_1,x_0} & \cdots & p_{w_1,x_K} \end{pmatrix}}_{\mathbf{p}_{WX}} = \underbrace{\begin{pmatrix} p_{w_0\mid r_0} & p_{w_0\mid r_1} \\ p_{w_1\mid r_0} & p_{w_1\mid r_1} \end{pmatrix}}_{\mathbf{p}_{W\mid R}} \: \underbrace{\begin{pmatrix} p_{r_0,x_0} & \cdots & 0 \\ p_{r_1,x_0} & \cdots & p_{r_1,x_K} \end{pmatrix}}_{\mathbf{p}_{RX}}, \tag{1}$$

where the $0$ entry in $\mathbf{p}_{RX}$ is due to the restriction Z. Since $\mathbf{p}_{W\mid R}$ is invertible, we can solve for $\mathbf{p}_{RX}$ by $[\mathbf{p}_{W\mid R}]^{-1}\mathbf{p}_{WX}$, leading to the following result.

Theorem 1 (ZI law restoration in ZI MCAR).

For the ZI MCAR model in Fig. 2 (a) under A1, A2, the ZI law $p(R,X,W)$ is point identified given the observed law $p(X,W)$ and a compatible proxy-indicator conditional distribution $p(W\mid R)$, as follows

$$p(r,x,w)=p(w\mid r)\left[\mathbf{p}_{W\mid R}^{-1}\,\mathbf{p}_{WX}\right]_{r,x}. \tag{2}$$

After the ZI law $p(R,X,W)$ is identified, the full law is identified, $p(X^{(1)},R,W)=p(X,W\mid R=1)\,p(R)$, by standard assumptions of the MCAR model.
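The restoration step of Theorem 1 can be illustrated numerically. The following is a minimal sketch, assuming a toy ZI MCAR model with $K=2$; all probabilities and variable names (`p_x1`, `p_r`, `p_w_given_r`) are hypothetical values chosen for illustration and do not come from the paper's experiments.

```python
import numpy as np

# Hypothetical ground truth for a ZI MCAR model with K = 2 (X in {0, 1, 2}).
p_x1 = np.array([0.2, 0.5, 0.3])   # p(X^(1) = x), assumed for illustration
p_r = np.array([0.4, 0.6])         # p(R = r); R = 0 means the value was zeroed out
# Rows index w, columns index r: p_w_given_r[w, r] = p(W = w | R = r).
p_w_given_r = np.array([[0.9, 0.2],
                        [0.1, 0.8]])

# ZI law p(R, X): restriction Z forces X = 0 when R = 0; X = X^(1) when R = 1.
p_rx = np.zeros((2, 3))
p_rx[0, 0] = p_r[0]
p_rx[1, :] = p_r[1] * p_x1

# Observed law implied by A1: p(w, x) = sum_r p(w | r) p(r, x), as in (1).
p_wx = p_w_given_r @ p_rx

# Restoration: solve p_w_given_r @ p_rx_hat = p_wx for p_rx_hat.
p_rx_hat = np.linalg.solve(p_w_given_r, p_wx)

# ZI law of (2), indexed as [r, x, w]: p(r, x, w) = p(w | r) p(r, x).
p_rxw_hat = p_rx_hat[:, :, None] * p_w_given_r.T[:, None, :]

# Under MCAR, the target marginal is p(X^(1) = x) = p(X = x | R = 1).
p_x1_hat = p_rx_hat[1] / p_rx_hat[1].sum()
```

In this sketch the true $p(W \mid R)$ is supplied, so the inversion recovers `p_rx` and `p_x1` up to floating-point error. Using `np.linalg.solve` rather than forming the inverse explicitly is a numerical convenience, not part of the result.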

Remark.

There are two difficulties with this result. First, since $R$ is potentially unobserved, it is not always reasonable to specify the true $p(W\mid R)$ in applications. Second, since our working model corresponds to a hidden variable DAG, the model imposes restrictions on the pair $\left(p(X,W),\,p(W\mid R)\right)$, meaning that not every potential distribution $p(W\mid R)$ would be consistent with the observed data law under our model. Using an inconsistent $p(W\mid R)$ in the matrix inversion equation places us outside the model, and can yield inconsistent results, such as invalid negative probabilities $p(R,X)$. An example of such an inconsistency is provided in the Appendix. Kuroki and Pearl [2014] noted the latter issue in the context of causal inference, but did not provide bounds.
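The second difficulty is easy to reproduce on a toy example. The sketch below is not the Appendix example; it assumes a hypothetical observed law generated from one proxy matrix and then inverts a different, misspecified (but still invertible and column-stochastic) candidate.

```python
import numpy as np

# Hypothetical true proxy matrix, rows w, columns r: entry [w, r] = p(W = w | R = r).
true_p_w_given_r = np.array([[0.9, 0.2],
                             [0.1, 0.8]])
# Hypothetical ZI law p(R, X) with the zero restriction in the R = 0 row.
p_rx = np.array([[0.40, 0.00, 0.00],
                 [0.12, 0.30, 0.18]])
p_wx = true_p_w_given_r @ p_rx   # observed law p(W, X)

# A misspecified candidate for p(W | R): valid and invertible, but not compatible.
wrong_p_w_given_r = np.array([[0.9, 0.5],
                              [0.1, 0.5]])
q_rx = np.linalg.solve(wrong_p_w_given_r, p_wx)

print(q_rx)        # the R = 0 row contains negative entries
print(q_rx.sum())  # the entries still sum to one
```

The recovered matrix sums to one yet has negative entries, so it is not a probability distribution. This shows that normalization alone cannot detect an incompatible candidate.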

3.2 Proxy-based identification in general ZI missing data models

In this section, we generalize our previous proxy-based approach to an arbitrary graphical ZI model corresponding to an m-DAG, given that the full law is point identified in the missing data model associated with that m-DAG.

Consider any ZI model associated with an arbitrary m-DAG, with a set of fully observed covariates $C$, a set of ZI variables $X^{(1)}=\{X^{(1)}_{1},\ldots,X^{(1)}_{n}\}$, inflation indicators $R=\{R_{1},\ldots,R_{n}\}$, observed versions $X=\{X_{1},\ldots,X_{n}\}$ of the variables in $X^{(1)}$, and proxies $W=\{W_{1},\ldots,W_{n}\}$ for the variables in $R$.

We make the following assumptions which generalize A1 and A2:

  • (A1$^{*}$) $\forall i$, $W_{i}\perp\!\!\!\perp X^{(1)},C,R_{-i}\mid R_{i}$.

  • (A2$^{*}$) The matrix $\mathbf{p}_{W\mid R}$ is invertible.

In addition, we will provide alternatives to A1$^{*}$ and A2$^{*}$ which allow the proxies $W$ to potentially depend on $C$:

  • (A1$^{\dagger}$) $\forall i$, $W_{i}\perp\!\!\!\perp X^{(1)},R_{-i}\mid C,R_{i}$.

  • (A2$^{\dagger}$) The matrix $\mathbf{p}_{W\mid R,c}$ is invertible for every value $c$.

The identification strategy we adopt proceeds in two stages:

  1. ZI law restoration: point identify (if the true $p(W\mid R)$ is known) or partially identify the ZI law $p(R,X,W,C)$ from the observed law $p(X,W,C)$.

  2. Downstream identification: identify the full law $p(X^{(1)},R,W,C)$ from the ZI law $p(R,X,W,C)$.

Since every $R_{i}$ is unobserved whenever $X_{i}=0$ in ZI problems, the purpose of the first stage is to recover the ZI law involving $R$ and other observed variables. Under the proxy assumptions above and knowledge of the true $p(W\mid R)$, point identification of this law is possible. Otherwise, partial identification bounds are computed. If point or partial identification is possible, variables in $R$ may now be treated as observed data, and the problem is reduced to classical identification in a missing data model. In particular, we adopt the sound and complete identification procedure described by Nabi et al. [2020] to point identify the full law in the second stage.

While we focus on non-parametric point identification results for the full law, one could instead employ any point or partial identification procedure developed for missing data problems for the second stage. We leave these types of extensions to future work.

3.2.1 ZI law restoration

Under the proxy assumptions, we have the following identification result, which generalizes Theorem 1.

Theorem 2 (ZI law restoration).

Given a ZI model satisfying assumptions A1$^{*}$ and A2$^{*}$ (or A1$^{\dagger}$ and A2$^{\dagger}$), the ZI law $p(R,X,W,C)$ is point identified given the observed law $p(X,W,C)$ and a compatible proxy-indicator conditional distribution $p(W\mid R)$ (or $p(W\mid R,C)$),

$$\begin{aligned} p(r,x,w,c) &= p(w\mid r)\left[\mathbf{p}_{W\mid R}^{-1}\,\mathbf{p}_{WXC}\right]_{r,x,c} && \text{under A1}^{*},\ \text{A2}^{*}, \\ p(r,x,w,c) &= p(w\mid r,c)\left[\mathbf{p}_{W\mid R,C}^{-1}\,\mathbf{p}_{WXC}\right]_{r,x,c} && \text{under A1}^{\dagger},\ \text{A2}^{\dagger}. \end{aligned} \tag{3}$$
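To make the covariate-dependent variant concrete, the following sketch applies the second line of (3) to a toy model with a single ZI variable and a binary covariate, inverting one proxy matrix per stratum of $C$. It is an illustration under assumed values only: the arrays `p_w_given_rc` and `p_rxc` are hypothetical, and the case of several ZI variables, where the proxy matrix is indexed by joint values of $R$, is not covered.

```python
import numpy as np

# Hypothetical proxy matrices, one per value of C:
# p_w_given_rc[c, w, r] = p(W = w | R = r, C = c).
p_w_given_rc = np.array([[[0.9, 0.2],
                          [0.1, 0.8]],
                         [[0.7, 0.4],
                          [0.3, 0.6]]])

# Hypothetical joint p(R = r, X = x, C = c), stored as p_rxc[c, r, x],
# with the zero restriction in each R = 0 row.
p_rxc = np.array([[[0.10, 0.00, 0.00],
                   [0.05, 0.10, 0.05]],
                  [[0.30, 0.00, 0.00],
                   [0.10, 0.20, 0.10]]])

# Observed law p(W = w, X = x, C = c) as p_wxc[c, w, x]: a matrix product per stratum.
p_wxc = p_w_given_rc @ p_rxc

# Restoration: solve the 2x2 system separately for every value of C.
restored = np.linalg.solve(p_w_given_rc, p_wxc)

# ZI law indexed as [c, r, x, w]: p(r, x, w, c) = p(w | r, c) p(r, x, c).
p_w_given_rc_t = np.swapaxes(p_w_given_rc, 1, 2)          # [c, r, w]
zi_law = restored[:, :, :, None] * p_w_given_rc_t[:, :, None, :]
```

Both `@` and `np.linalg.solve` operate on the leading axis as a batch here, so no explicit loop over the values of $C$ is needed.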

Theorem 2 suffers from the same issue as Theorem 1: it is unlikely that the true distribution $p(W\mid R)$ (or $p(W\mid R,C)$) will always be available, and given a candidate distribution, it is not obvious how to verify that it is compatible with the model and the observed law.

If the true $p(W\mid R)$ (or $p(W\mid R,C)$) is not given, we must find the set of distributions $p(W\mid R)$ (or $p(W\mid R,C)$) that are compatible with the model and the observed law. In general, bounds on $p(W\mid R)$ (or $p(W\mid R,C)$) may be computed numerically by encoding the model as a system of polynomial equations and finding extrema of this system using polynomial programming. A method for solving such systems of equations using a primal/dual method is described in Duarte et al. [2023]. These bounds lead to a natural sensitivity analysis strategy according to our two-stage approach. In particular, each compatible $p(W\mid R)$ within the bounds implies a valid ZI law by Theorem 2, which in turn implies a full law by Proposition 1 of the next section. In Section 4, we conduct a grid search of the compatible set to illustrate this point.

While numeric bound computation is a general approach, finding such bounds is computationally challenging due to the need to solve polynomial programs. Fortunately, we show that in certain ZI models, it is possible to derive analytic bounds on $p(W\mid R)$ (or $p(W\mid R,C)$) instead. We also show that these bounds are sharp in some cases.

3.2.2 Downstream identification

After the ZI law $p(R,X,W,C)$ is recovered in the restoration step, one may consider this law as the "observed law" in the missing data problem corresponding to the same m-DAG, and invoke missing data identification to obtain the full law $p(X^{(1)},R,W,C)$. We note that this second identification stage is not precisely the same as that for standard missing data problems, because identification relies on consistency, and consistency under ZI differs from missing data consistency whenever $R=0$.

Fortunately, consistency when $R=1$ coincides in ZI problems and missing data problems, and, as the following result shows, suffices for identification.

Proposition 1 (ZI full law identification).

The full law $p\left(X^{(1)},R,W,C\right)$ exhibiting zero inflation that is Markov relative to an m-DAG $\mathcal{G}$ is identified given the ZI law $p(R,X,W,C)$ if and only if $\mathcal{G}$ does not contain edges of the form $X_{i}^{(1)}\rightarrow R_{i}$ (no self-censoring) and structures of the form $X_{j}^{(1)}\rightarrow R_{i}\leftarrow R_{j}$ (no colluders), and the positivity assumption holds. Moreover, the identifying functional for the full law coincides with the functional given in Malinsky et al. [2021].
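The graphical part of Proposition 1 can be checked mechanically. The sketch below assumes a hypothetical encoding in which the m-DAG is given as a collection of directed edges between string-named vertices, with `"X{i}(1)"` standing for $X_i^{(1)}$ and `"R{i}"` for $R_i$; it checks only for self-censoring edges and colluders, and does not verify positivity.

```python
from typing import Iterable, Tuple

Edge = Tuple[str, str]  # (parent, child), e.g. ("X1(1)", "R2")


def satisfies_graph_condition(edges: Iterable[Edge], n: int) -> bool:
    """Return True if the edge set has no self-censoring edge and no colluder.

    Vertex naming is an assumption of this sketch: "X{i}(1)" is the ZI
    variable X_i^(1) and "R{i}" is its inflation indicator, for i = 1..n.
    """
    edge_set = set(edges)
    for i in range(1, n + 1):
        # Self-censoring: X_i^(1) -> R_i.
        if (f"X{i}(1)", f"R{i}") in edge_set:
            return False
        for j in range(1, n + 1):
            if j == i:
                continue
            # Colluder: X_j^(1) -> R_i <- R_j.
            if (f"X{j}(1)", f"R{i}") in edge_set and (f"R{j}", f"R{i}") in edge_set:
                return False
    return True


# Hypothetical bivariate examples.
block_parallel = [("X1(1)", "R2"), ("X2(1)", "R1")]
self_censoring = [("X1(1)", "R1")]
colluder = [("X2(1)", "R1"), ("R2", "R1")]
```

Edges that involve proxies or covariates are ignored by this check, since the condition of the proposition concerns only the two listed structures.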

3.3 Partial Identification in ZI MCAR

In this subsection, we relax the requirement that the true $p(W\mid R)$ must be given in the ZI law restoration step, and provide bounds for this conditional distribution in the proxy-augmented ZI MCAR model.

Consider the proxy-augmented ZI MCAR model in Fig. 2 (a). This model is equivalently described by the following model $\mathcal{P}$, satisfying Z, A1, and A2:

$$\mathcal{P}=\left\{\begin{aligned} (\mathbf{q}_{W\mid R},\mathbf{q}_{RX}): &&& \mathbf{q}_{W\mid R}\geq 0,\ \textstyle\sum_{w}q_{w\mid r}=1,\ \forall r, \\ &&& \mathbf{q}_{RX}\geq 0,\ \textstyle\sum_{rx}q_{rx}=1, \\ &&& \forall x\neq 0\ (q_{r_{0}x}=0);\ q_{w_{0}\mid r_{0}}\neq q_{w_{0}\mid r_{1}}. \end{aligned}\right\} \tag{4}$$

Given an observed law $p(X,W)$, we are interested in the following subset $\mathcal{Q}\subseteq\mathcal{P}$ of distributions yielding the observed law,

$$\mathcal{Q}=\left\{(\mathbf{q}_{W\mid R},\mathbf{q}_{RX})\in\mathcal{P}:\ \mathbf{q}_{W\mid R}\,\mathbf{q}_{RX}=\mathbf{p}_{WX}\right\}. \tag{5}$$

In particular, our goal is to find all $\mathbf{q}_{W\mid R}\in\mathcal{Q}$, which is the partial identification of $q(W\mid R)$ w.r.t. the given observed law. This is equivalent to projecting $\mathcal{Q}$ onto the probability simplex of $\mathbf{q}_{W\mid R}$. From (5) and (4), one way to check whether an invertible $\mathbf{q}_{W\mid R}\in\mathcal{Q}$ is to compute $\mathbf{q}_{RX}=(\mathbf{q}_{W\mid R})^{-1}\mathbf{p}_{WX}$ and check whether $(\mathbf{q}_{W\mid R},\mathbf{q}_{RX})\in\mathcal{Q}$. First, $\mathbf{q}_{RX}$ must be a stochastic matrix for any problem under A1 and A2. Second, $\mathbf{q}_{RX}$ must also satisfy the ZI-consistency constraint Z.
If these conditions hold, there is a joint distribution in the model that generates both $\mathbf{q}_{W\mid R}$ and $\mathbf{p}_{WX}$, and they are said to be compatible. After the compatible set of $\mathbf{q}_{W\mid R}$ is derived, the partial identification of $\mathbf{q}_{RX}$ can be obtained using (2).
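The compatibility check described above can be sketched as a small function. This is an illustration under assumed conventions, not the paper's implementation: matrices are stored with rows indexed by $w$ (or $r$) and columns by $r$ (or $x$), column 0 of `p_wx` corresponds to $x=0$, and the tolerance `tol` and the toy observed law are hypothetical.

```python
import numpy as np


def is_compatible(q_w_given_r, p_wx, tol=1e-9):
    """Check whether a candidate q(W | R) is compatible with the observed p(W, X).

    q_w_given_r[w, r] = q(W = w | R = r); p_wx[w, x] = p(W = w, X = x).
    """
    q = np.asarray(q_w_given_r, dtype=float)
    p = np.asarray(p_wx, dtype=float)
    # The candidate itself must be a column-stochastic matrix.
    if (q < -tol).any() or not np.allclose(q.sum(axis=0), 1.0):
        return False
    # A2: the candidate must be invertible.
    if abs(np.linalg.det(q)) < tol:
        return False
    q_rx = np.linalg.solve(q, p)
    nonnegative = (q_rx >= -tol).all()
    sums_to_one = np.isclose(q_rx.sum(), 1.0)
    # Restriction Z: q(R = 0, X = x) = 0 for every x != 0.
    zero_restriction = (np.abs(q_rx[0, 1:]) <= tol).all()
    return bool(nonnegative and sums_to_one and zero_restriction)


# Hypothetical observed law with X in {0, 1, 2}.
p_wx = np.array([[0.384, 0.060, 0.036],
                 [0.136, 0.240, 0.144]])

print(is_compatible([[0.90, 0.2], [0.10, 0.8]], p_wx))
print(is_compatible([[0.95, 0.2], [0.05, 0.8]], p_wx))
print(is_compatible([[0.90, 0.5], [0.10, 0.5]], p_wx))
```

For this toy law the first two candidates are accepted and the third is rejected. That two distinct candidates are compatible with the same observed law illustrates why only partial identification is available when $p(W \mid R)$ is unknown.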

We note that Z implies, for all $x\neq 0$, $q_{x}=q_{r_{1},x}$, so $q_{r_{1}\mid x}=1$. Then by considering $q_{w_{0}\mid r_{0}}\,q_{r_{0},x}+q_{w_{0}\mid r_{1}}\,q_{r_{1},x}=p_{w_{0},x}$, we obtain the point identification $q_{w_{0}\mid r_{1}}=p_{w_{0}\mid x_{1}}$ and the marginal constraints $\forall x\neq 0,\ p_{w_{0}\mid x}=p_{w_{0}\mid x_{1}}$. Note that these constraints may be used to design a falsification test of the model.
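At the population level, the marginal constraints say that $p(W=0\mid X=x)$ is the same for every non-zero $x$. A minimal sketch of the corresponding check is given below; the function names and the two toy laws are hypothetical, and with finite samples the exact comparison would have to be replaced by a statistical test of equality across the non-zero levels of $X$.

```python
import numpy as np


def proxy_given_nonzero_x(p_wx):
    """Return p(W = 0 | X = x) for each x != 0, from p_wx[w, x] = p(W = w, X = x)."""
    p = np.asarray(p_wx, dtype=float)
    return p[0, 1:] / p[:, 1:].sum(axis=0)


def violates_marginal_constraints(p_wx, tol=1e-8):
    """True if p(W = 0 | X = x) is not constant over x != 0 (up to tol)."""
    ratios = proxy_given_nonzero_x(p_wx)
    return bool(ratios.max() - ratios.min() > tol)


# Hypothetical law generated by the model: the conditional is 0.2 for x = 1, 2.
consistent_law = np.array([[0.384, 0.060, 0.036],
                           [0.136, 0.240, 0.144]])
# Hypothetical law that breaks the constraint for x = 2.
falsifying_law = np.array([[0.384, 0.060, 0.090],
                           [0.136, 0.240, 0.090]])
```

When the constraint holds, the common value of the conditional is the point-identified $q_{w_0 \mid r_1}$.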

However, $q_{w_{0}\mid r_{0}}$ is not identified, and its bounds must be obtained by solving the following polynomial program:

$$\begin{aligned} \max_{q_{w_{0}\mid r_{0}}}\quad & \pm q_{w_{0}\mid r_{0}} \\ \text{s.t.}\quad & \mathbf{q}_{W\mid R}\,\mathbf{q}_{RX}=\mathbf{p}_{WX}, \\ & \mathbf{q}_{W\mid R}\geq 0,\ \forall r\,\big(\textstyle\sum_{w}q_{w\mid r}=1\big),\ q_{w_{0}\mid r_{0}}\neq q_{w_{0}\mid r_{1}}, \\ & \mathbf{q}_{RX}\geq 0,\ \textstyle\sum_{rx}q_{rx}=1,\ q_{w_{0}\mid r_{1}}=p_{w_{0}\mid x_{1}}. \end{aligned} \tag{6}$$

Since both $q_{w_0 \mid r_0}$ and $\mathbf{q}_{RX}$ are unknown, the equality constraint $\mathbf{q}_{W \mid R}\,\mathbf{q}_{RX} = \mathbf{p}_{WX}$ is bilinear in the unknowns, so the above problem corresponds to a quadratic program, which is difficult to solve in general.

However, it is possible to transform this optimization into an equivalent linear program with the following observations:

  1. 1.

    A specific solution for $\mathbf{q}_{RX}$ is not required. One merely needs to check if $\mathbf{q}_{W \mid R}^{-1}\,\mathbf{p}_{WX}$ is a stochastic matrix.

  2. 2.

    If $\mathbf{q}_{W \mid R}\,\mathbf{q}_{RX} = \mathbf{p}_{WX}$, where all matrices are non-negative, the entries of $\mathbf{p}_{WX}$ sum to 1, and $\sum_{w} q_{w \mid r} = 1$ for every $r$, then the entries of $\mathbf{q}_{RX}$ sum to 1. The proof of this fact is in the Appendix.

  3. 3.

    The inverse $\left[\mathbf{q}_{W \mid R}\right]^{-1}$ is

    $$
    \frac{1}{q_{w_0 \mid r_0} - q_{w_0 \mid r_1}}
    \begin{pmatrix}
    1 - q_{w_0 \mid r_1} & -q_{w_0 \mid r_1} \\
    q_{w_0 \mid r_0} - 1 & q_{w_0 \mid r_0}
    \end{pmatrix}
    \tag{7}
    $$
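As a quick sanity check on (7) and on Observation 2, the following sketch builds a small hypothetical example in exact rational arithmetic. The numbers, the helper names `inverse_q_w_given_r` and `matmul`, and the matrix layout (rows indexed by $W$, columns by $R$ for $\mathbf{q}_{W \mid R}$; rows by $R$, columns by $X$ for $\mathbf{q}_{RX}$) are illustrative assumptions rather than part of the original analysis.

```python
from fractions import Fraction


def inverse_q_w_given_r(q00, q01):
    """Closed-form inverse of [[q00, q01], [1 - q00, 1 - q01]], as in (7).

    q00 stands for q_{w0|r0} and q01 for q_{w0|r1}; they must differ.
    """
    det = q00 - q01
    if det == 0:
        raise ValueError("q_{w0|r0} must differ from q_{w0|r1}")
    return [[(1 - q01) / det, -q01 / det],
            [(q00 - 1) / det, q00 / det]]


def matmul(a, b):
    """Plain nested-list matrix product."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


# Hypothetical numbers, chosen only for illustration.
q00, q01 = Fraction(4, 5), Fraction(3, 10)
q_w_given_r = [[q00, q01], [1 - q00, 1 - q01]]   # rows: W, columns: R
q_rx = [[Fraction(1, 5), Fraction(0)],           # rows: R, columns: X
        [Fraction(3, 10), Fraction(1, 2)]]       # R = 0 forces X = 0
p_wx = matmul(q_w_given_r, q_rx)                 # implied law of (W, X)

inv = inverse_q_w_given_r(q00, q01)
identity = matmul(inv, q_w_given_r)              # should be the identity
recovered_q_rx = matmul(inv, p_wx)               # should equal q_rx
total_p_wx = sum(map(sum, p_wx))                 # should be 1
```

Here `recovered_q_rx` reproduces `q_rx`, and its entries sum to 1 exactly when those of `p_wx` do, which is the content of Observation 2 for this particular example.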

Observations 1 and 2 imply that checking compatibility involves only checking non-negativity of $\mathbf{q}_{W \mid R}^{-1}\,\mathbf{p}_{WX}$, reducing the unknowns in our optimization problem to only $q_{w_0 \mid r_0}$. The entries of $\mathbf{q}_{W \mid R}^{-1}\,\mathbf{p}_{WX}$ are still non-linear in $\mathbf{q}_{W \mid R}$, but (7) suggests an equivalent procedure consisting of two separate problems, where $q_{w_0 \mid r_0} > q_{w_0 \mid r_1}$ or $q_{w_0 \mid r_0} < q_{w_0 \mid r_1}$, respectively. Concretely, for each of the cases $s = 1$ and $s = -1$, we consider two linear programs

$$
\begin{aligned}
\max_{q_{w_0 \mid r_0}} \quad & \pm q_{w_0 \mid r_0} \\
\text{s.t.} \quad & s \cdot
\begin{pmatrix}
1 - q_{w_0 \mid r_1} & -q_{w_0 \mid r_1} \\
q_{w_0 \mid r_0} - 1 & q_{w_0 \mid r_0}
\end{pmatrix}
\mathbf{p}_{WX} \geq \mathbf{0}, \\
& s \cdot q_{w_0 \mid r_0} > s \cdot q_{w_0 \mid r_1},\ 0 \leq q_{w_0 \mid r_0} \leq 1, \\
& q_{w_0 \mid r_1} = p_{w_0 \mid x_1}.
\end{aligned}
\tag{8}
$$
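To make the constraints in (8) concrete, the sketch below checks them directly for a candidate value of $q_{w_0 \mid r_0}$ and scans a coarse grid. The function name `is_compatible`, the layout of `p_wx` (row 0 for $W = 0$, column 0 for $X = 0$, column 1 for the reference value $x_1$), the grid, and the numbers are all assumptions made for illustration. Exact rational arithmetic is used because one of the inequalities holds with equality and would be sensitive to floating-point rounding.

```python
from fractions import Fraction


def is_compatible(q00, p_wx, s):
    """Check the constraints of (8) for a candidate q00 = q_{w0|r0}.

    p_wx is a 2 x |X| nested list holding p(W = w, X = x); s is +1 or -1.
    """
    # q_{w0|r1} is pinned to the observed conditional p(W = 0 | X = x_1).
    q01 = p_wx[0][1] / (p_wx[0][1] + p_wx[1][1])
    if not (0 <= q00 <= 1 and s * q00 > s * q01):
        return False
    # Entrywise sign constraint on the matrix product, column by column.
    for p_w0x, p_w1x in zip(p_wx[0], p_wx[1]):
        row1 = (1 - q01) * p_w0x - q01 * p_w1x
        row2 = (q00 - 1) * p_w0x + q00 * p_w1x
        if s * row1 < 0 or s * row2 < 0:
            return False
    return True


# Hypothetical observed law with p(W=0 | X=0) = 1/2 and p(W=0 | X=x_1) = 3/10.
p_wx = [[Fraction(1, 4), Fraction(3, 20)],
        [Fraction(1, 4), Fraction(7, 20)]]

grid = [Fraction(k, 20) for k in range(21)]
feasible_pos = [q for q in grid if is_compatible(q, p_wx, 1)]
feasible_neg = [q for q in grid if is_compatible(q, p_wx, -1)]
```

For each column $x$, simple algebra shows that the two rows of the constraint reduce to $s\,(p_{w_0 \mid x} - q_{w_0 \mid r_1}) \geq 0$ and $s\,(q_{w_0 \mid r_0} - p_{w_0 \mid x}) \geq 0$ after dividing by $p(x) > 0$, so in this example only the case $s = 1$ is feasible and the feasible grid points run from $1/2$ to $1$.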

These problems can be solved analytically, or numerically with fast linear program solvers, yielding the following partial identification result for $p(W \mid R)$. A detailed proof is in the Appendix.

Theorem 3 (ZI MCAR compatibility bound).

Consider a ZI MCAR model in Fig. 2 (a) under proxy assumptions A1, A2, with categorical $X$ and binary $R, W$. Given a consistent observed law $p(X, W)$ satisfying the positivity assumption $\forall x,\ p(x) > 0$, the set of compatible proxy-indicator conditionals $q(W \mid R)$ is given by

$$
\begin{aligned}
q_{w_0 \mid r_1} &= p_{w_0 \mid x_1} \\
q_{w_0 \mid r_0} &\in
\begin{cases}
[p_{w_0 \mid x_0}, 1] & \text{if } p_{w_0 \mid x_0} > p_{w_0 \mid x_1} \\
[0, p_{w_0 \mid x_0}] & \text{if } p_{w_0 \mid x_0} < p_{w_0 \mid x_1} \\
(0, 1) \setminus \{p_{w_0 \mid x_0}\} & \text{if } p_{w_0 \mid x_0} = p_{w_0 \mid x_1}
\end{cases}
\end{aligned}
$$

These bounds are sharp. Moreover, if $p_{w_0 \mid x_0} = p_{w_0 \mid x_1}$, then $p(X, W)$ must satisfy $0 < p_{w_0 \mid x_0} < 1$, and zero inflation does not occur, i.e., $q(R = 0) = 0$.
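The case analysis in Theorem 3 translates directly into a small lookup. The sketch below is one possible encoding; the `Bound` container and the function name `zi_mcar_bound` are hypothetical, and the exact equality test is only meaningful for population quantities (with estimated probabilities one would need a tolerance or a formal test instead).

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Bound:
    lower: float
    upper: float
    closed: bool                      # True: [lower, upper]; False: (lower, upper)
    excluded: Optional[float] = None  # single point removed from the interval


def zi_mcar_bound(p_w0_x0: float, p_w0_x1: float) -> Bound:
    """Compatible set for q_{w0|r0} following the three cases of Theorem 3.

    p_w0_x0 is p(W = 0 | X = 0) and p_w0_x1 is p(W = 0 | X = x_1);
    q_{w0|r1} is point identified as p_w0_x1 in every case.
    """
    if p_w0_x0 > p_w0_x1:
        return Bound(p_w0_x0, 1.0, closed=True)
    if p_w0_x0 < p_w0_x1:
        return Bound(0.0, p_w0_x0, closed=True)
    # Equality case: open unit interval with one point removed.
    return Bound(0.0, 1.0, closed=False, excluded=p_w0_x0)
```

For instance, `zi_mcar_bound(0.5, 0.3)` encodes the interval $[0.5, 1]$, while `zi_mcar_bound(0.1, 0.3)` encodes $[0, 0.1]$.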

3.4 Partial Identification in ZI MAR

We compute analytical bounds for two versions of the proxy-augmented ZI MAR model, illustrated in Fig. 2 (b) and (c). The first model has $C \rightarrow W$ and satisfies A1 and A2, while $C \not\rightarrow W$ in the second model, where the proxy assumptions are again A1 and A2.

In the first proxy-augmented ZI MAR model, the set of compatible $p_{W \mid R, C}$ is given by the Cartesian product of the independently determined ZI MCAR bounds for each value $c$. This leads to the following direct analogue of Theorem 3. The proof is deferred to the Appendix.

Theorem 4 (ZI MAR compatibility bound 1).

Consider a ZI MAR model in Fig. 2 (b) under proxy assumptions A1 and A2, with categorical $X, C$ and binary $R, W$. Given a consistent observed law $p(X, W, C)$ satisfying the positivity assumption $\forall x, c,\ p(x, c) > 0$, the set of compatible proxy-indicator conditional distributions $q(W \mid R, C)$ is given by, for each value $c$,

$$
\begin{aligned}
q_{w_0 \mid r_1, c} &= p_{w_0 \mid x_1, c} \\
q_{w_0 \mid r_0, c} &\in
\begin{cases}
[p_{w_0 \mid x_0, c}, 1] & \text{if } p_{w_0 \mid x_0, c} > p_{w_0 \mid x_1, c} \\
[0, p_{w_0 \mid x_0, c}] & \text{if } p_{w_0 \mid x_0, c} < p_{w_0 \mid x_1, c} \\
(0, 1) \setminus \{p_{w_0 \mid x_0, c}\} & \text{if } p_{w_0 \mid x_0, c} = p_{w_0 \mid x_1, c}
\end{cases}
\end{aligned}
$$

These bounds are sharp. Moreover, if $p_{w_0 \mid x_0, c} = p_{w_0 \mid x_1, c}$, then $p(X, W, C)$ must satisfy $0 < p_{w_0 \mid x_0, c} < 1$, and zero inflation does not occur for stratum $C = c$, i.e., $q(R = 0 \mid c) = 0$.

On the other hand, the compatibility bound for the second ZI MAR model is the intersection of the ZI MCAR bounds over all values $c$. The proof is deferred to the Appendix.

Theorem 5 (ZI MAR compatibility bound 2).

Consider a ZI MAR model in Fig. 2 (c) under proxy assumptions A1 and A2, with categorical $X, C$ and binary $R, W$. Given a consistent observed law $p(X, W, C)$ satisfying the positivity assumption $\forall x, c,\ p(x, c) > 0$, the set of compatible proxy-indicator conditional distributions $q(W \mid R)$ is given by

$$
\begin{aligned}
q_{w_0 \mid r_1} &= p_{w_0 \mid x_1} \\
q_{w_0 \mid r_0} &\in
\begin{cases}
[\max_{c} p_{w_0 \mid x_0, c}, 1] & \text{if } \exists \tilde{c},\ p_{w_0 \mid x_0, \tilde{c}} > p_{w_0 \mid x_1}, \\
[0, \min_{c} p_{w_0 \mid x_0, c}] & \text{if } \exists \tilde{c},\ p_{w_0 \mid x_0, \tilde{c}} < p_{w_0 \mid x_1}, \\
(0, 1) \setminus \{p_{w_0 \mid x_1}\} & \text{if } \forall c,\ p_{w_0 \mid x_0, c} = p_{w_0 \mid x_1}.
\end{cases}
\end{aligned}
$$

These bounds are sharp. Moreover, if $\forall c,\ p_{w_0 \mid x_0, c} = p_{w_0 \mid x_1}$, then $p(X, W, C)$ must satisfy $\forall c,\ 0 < p_{w_0 \mid x_0, c} < 1$, and zero inflation does not occur, i.e., $q(R = 0) = 0$.
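The bound in Theorem 5 can be sketched as a function of the stratum-specific conditionals. The name `zi_mar_bound_no_c_to_w`, the tuple encoding `(lower, upper, closed, excluded)`, and the choice to return `None` when strata point in opposite directions (in which case the intersection of the per-stratum ZI MCAR intervals is empty) are assumptions of this illustration. As with any exact comparison, the equality case only makes sense for population quantities.

```python
def zi_mar_bound_no_c_to_w(p_w0_x0_by_c, p_w0_x1):
    """Compatible set for q_{w0|r0} following the cases of Theorem 5.

    p_w0_x0_by_c maps each stratum c to p(W = 0 | X = 0, C = c);
    p_w0_x1 is p(W = 0 | X = x_1).
    Returns (lower, upper, closed, excluded), or None if no value is compatible.
    """
    values = list(p_w0_x0_by_c.values())
    above = any(v > p_w0_x1 for v in values)
    below = any(v < p_w0_x1 for v in values)
    if above and below:
        # Per-stratum intervals [v, 1] and [0, v'] with v' < v do not overlap.
        return None
    if above:
        return (max(values), 1.0, True, None)
    if below:
        return (0.0, min(values), True, None)
    # All strata agree with p_w0_x1: open unit interval minus that point.
    return (0.0, 1.0, False, p_w0_x1)
```

For example, with hypothetical strata `{"a": 0.6, "b": 0.8}` and `p_w0_x1 = 0.3`, the function returns the encoding of $[0.8, 1]$, the intersection of $[0.6, 1]$ and $[0.8, 1]$.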

Note that the first two cases are mutually exclusive due to the following lemma.

Lemma 2.

For a ZI MAR model in Fig. 2 (b) under A1 and A2, the observed law $p(X, W, C)$ obeys

$$
\forall c, \forall x \neq 0,\ p_{w_0 \mid x, c} = p_{w_0 \mid x_1, c}. \tag{9}
$$

For a ZI MAR model in Fig. 2 (c) under A1 and A2, the observed law $p(X, W, C)$ obeys

$$
\forall c, \forall x \neq 0,\ p_{w_0 \mid x, c} = p_{w_0 \mid x_1}, \tag{10}
$$

$$
\text{either } \forall c \left(p_{w_0 \mid x_0, c} \leq p_{w_0 \mid x_1}\right) \text{ or } \forall c \left(p_{w_0 \mid x_0, c} \geq p_{w_0 \mid x_1}\right).
$$

Note that, as before, the marginal constraints described may be used to design a model falsification test.
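One way to picture such a falsification check for the model in Fig. 2 (c) is sketched below, working with population-level conditionals. The function name `violates_lemma2_no_c_to_w`, the dictionary layout, and the numerical tolerance `tol` are hypothetical; with finite samples, a proper statistical test accounting for sampling variability would be needed in place of the tolerance comparison.

```python
def violates_lemma2_no_c_to_w(p_w0_given_xc, p_w0_x1, tol=1e-9):
    """Return True if the constraints in (10) are violated.

    p_w0_given_xc maps (x, c) to p(W = 0 | X = x, C = c), where x == 0 is
    the zero value; p_w0_x1 is p(W = 0 | X = x_1).
    """
    # First constraint: all non-zero x share the same conditional, in every c.
    nonzero_ok = all(abs(p - p_w0_x1) <= tol
                     for (x, _), p in p_w0_given_xc.items() if x != 0)
    # Second constraint: the x = 0 conditionals lie on one side of p_w0_x1.
    zeros = [p for (x, _), p in p_w0_given_xc.items() if x == 0]
    all_le = all(p <= p_w0_x1 + tol for p in zeros)
    all_ge = all(p >= p_w0_x1 - tol for p in zeros)
    return not (nonzero_ok and (all_le or all_ge))


# Hypothetical conditionals for X in {0, 1, 2} and C in {"a", "b"}.
consistent = {(0, "a"): 0.6, (0, "b"): 0.8,
              (1, "a"): 0.3, (1, "b"): 0.3,
              (2, "a"): 0.3, (2, "b"): 0.3}
mixed_sides = dict(consistent)
mixed_sides[(0, "a")] = 0.1      # strata now fall on both sides of 0.3
unequal_nonzero = dict(consistent)
unequal_nonzero[(2, "a")] = 0.4  # breaks the equality for x != 0
```

Under these assumptions, `consistent` passes the check, while `mixed_sides` and `unequal_nonzero` are flagged as incompatible with the model.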

3.5 Partial Identification In ZI MNAR

Consider the ZI version of any MNAR model represented by an m-DAG where the target law is identified. In missing data, an important subclass of such models are submodels of the no-self-censoring model in Malinsky et al. [2021] due to the results in Nabi et al. [2020]. The ZI versions of such models exhibit a crucial complication not found in previously discussed ZI models, namely that multiple variables may be zero inflated. For these models, we posit a set of proxies $W = \{W_1, \ldots, W_n\}$ corresponding to $R = \{R_1, \ldots, R_n\}$, and assume that A1 and A2 in Section 3.2.1 are satisfied. Fig. 3 (a) and (b) show two bivariate examples of such models. We use the shorthand $p_{w_{ka} \mid x_{kb}} = p(W_k = a \mid X_k = b)$.

Given the observed law $p(X, W, C)$, we seek the compatible set of $\{q(W_k \mid R_k)\}_{k=1}^{n}$, whose elements allow restoration of $p(R, X, W, C)$ via Theorem 2. Although sharp bounds for $\{q(W_k \mid R_k)\}_{k=1}^{n}$ are unknown, the ZI MAR partial identification procedure can be applied to each $R_k$ independently to obtain bounds for $q_{w_{k0} \mid r_{k0}}$. Moreover, due to the usual properties of ZI, $q_{w_{k0} \mid r_{k1}}$ is point identified for each $k$.

For each $k$, we apply Theorem 5 with $X_k, R_k, W_k$ playing the roles of $X, R, W$, respectively, and $Z_k \triangleq \{X, W, C\} \setminus \{X_k, W_k\}$ playing the role of the covariates $C$. These bounds are not sharp, as structural constraints of the MNAR model are not considered. However, these bounds are valid in the sense that the Cartesian product of these bounds contains the true model-compatible set of distributions $\{p(W_k \mid R_k) : k = 1, \ldots, n\}$.

In addition, we note that (11) below holds in the observed law under our model, and may be used as a falsification test for our ZI model.

Lemma 3.

Consider any ZI model in Section 3.2.1 under A1 and A2. Denote $Z_k \triangleq \{X, W, C\} \setminus \{W_k, X_k\}$. The observed law $p(X, W, C)$ must satisfy, for each $k$,

$$
\forall z_k, \forall x \neq 0,\ p_{w_{k0} \mid x_k = x, z_k} = p_{w_{k0} \mid x_{k1}}, \tag{11}
$$

$$
\forall z_k \left(p_{w_{k0} \mid x_{k0}, z_k} \leq p_{w_{k0} \mid x_{k1}}\right) \text{ or } \forall z_k \left(p_{w_{k0} \mid x_{k0}, z_k} \geq p_{w_{k0} \mid x_{k1}}\right).
$$

3.6 Identification Given A Known Zero Inflation Probability

For the ZI MCAR models in Theorem 3 and the ZI MAR model in Theorem 5, we provided the identification $q_{w_0 \mid r_1} = p_{w_0 \mid x_1}$ and the bounds for $q_{w_0 \mid r_0}$, which lead to partial identification of the full law $p(X^{(1)}, R, X, W, C)$.

If $q_{w_0 \mid r_0}$ is known a priori, the full law is point identified. Alternatively, point identification of the full law may be obtained if the zero inflation probability, or $p(R = 0)$, is known.

This is because the joint distribution $p(W, R)$ for binary $W, R$ has dimension $3$, and one (variationally dependent) parameterization for this joint is via the following $3$ parameters: $p(R = 0)$, $p(W = 0)$, and $q_{w_0 \mid r_1}$. This is easy to see by noting that we can compute $p(R = 0, W = 0) = p(W = 0) - q_{w_0 \mid r_1}(1 - p(R = 0))$, and $p(R = 0)$, $p(W = 0)$, and $p(R = 0, W = 0)$ are the Möbius parameters for $p(W, R)$ [Evans and Richardson, 2014].

In particular, we have the following: $q_{w_0 \mid r_0} = \frac{p_{w_0} - q_{w_0 \mid r_1} p_{r_1}}{p_{r_0}} = \frac{p_{w_0} - p_{w_0 \mid x_1} p_{r_1}}{p_{r_0}}$, which in turn implies point identification of the full law.
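As a hedged illustration of the formula above, the following Python sketch computes $q_{w_0 \mid r_0}$ from $p(W = 0)$, $p(W = 0 \mid X = 1)$ and an externally supplied $p(R = 0)$. The function names and the toy numbers are assumptions made for this example only; they are not taken from the paper's code.

```python
def q_w0_given_r0(p_w0: float, p_w0_given_x1: float, p_r0: float) -> float:
    """Return p(W=0 | R=0) implied by p(W=0), p(W=0 | X=1) and a known p(R=0)."""
    if not 0.0 < p_r0 < 1.0:
        raise ValueError("p(R=0) must lie strictly between 0 and 1")
    p_r1 = 1.0 - p_r0
    # q_{w0|r1} is identified as p(W=0 | X=1), so plug it in directly.
    return (p_w0 - p_w0_given_x1 * p_r1) / p_r0


def is_compatible(p_w0: float, p_w0_given_x1: float, p_r0: float,
                  tol: float = 1e-12) -> bool:
    """Check that the implied p(W=0 | R=0) is a valid probability."""
    q = q_w0_given_r0(p_w0, p_w0_given_x1, p_r0)
    return -tol <= q <= 1.0 + tol


# Toy full law (hypothetical): p(R=0)=0.25, q_{w0|r1}=0.2, q_{w0|r0}=0.9,
# hence p(W=0) = 0.9*0.25 + 0.2*0.75 = 0.375.
recovered = q_w0_given_r0(p_w0=0.375, p_w0_given_x1=0.2, p_r0=0.25)
```

In this toy example the function recovers a value of approximately 0.9. Supplying $p(R = 0) = 0.1$ with the same observed quantities yields a value above 1, so that zero inflation probability would not be compatible with the observed law.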

Note that not every zero inflation probability $p(R = 0)$ is compatible with the model. This is easily seen by noting that the Möbius parameterization is variationally dependent, and two parameters, namely $p(W = 0)$ and $q_{w_0 \mid r_1}$, are known. However, our derived bounds for $q_{w_0 \mid r_0}$ naturally imply bounds for $p(R = 0)$, with sharp bounds for the former implying sharp bounds for the latter.
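The implied range for $p(R = 0)$ can be sketched by solving $p(W = 0) = q_{w_0 \mid r_1}(1 - p(R = 0)) + q_{w_0 \mid r_0}\, p(R = 0)$ for $p(R = 0)$, which gives $p(R = 0) = (p(W = 0) - q_{w_0 \mid r_1}) / (q_{w_0 \mid r_0} - q_{w_0 \mid r_1})$. The Python sketch below maps an interval for $q_{w_0 \mid r_0}$ to an interval for $p(R = 0)$. It assumes the interval lies strictly on one side of $q_{w_0 \mid r_1}$; the function name and the numbers are hypothetical.

```python
def p_r0_range(p_w0: float, q_w0_r1: float, q_lb: float, q_ub: float):
    """Map an interval [q_lb, q_ub] for p(W=0 | R=0) to an interval for p(R=0).

    Assumes the interval excludes q_{w0|r1}, so that p(R=0) is a monotone
    function of p(W=0 | R=0) on the interval.
    """
    if q_lb <= q_w0_r1 <= q_ub:
        raise ValueError("the interval must lie strictly on one side of q_{w0|r1}")
    # p(R=0) = (p(W=0) - q_{w0|r1}) / (q_{w0|r0} - q_{w0|r1}) at each endpoint.
    ends = [(p_w0 - q_w0_r1) / (q - q_w0_r1) for q in (q_lb, q_ub)]
    return min(ends), max(ends)


# Toy observed quantities (hypothetical): p(W=0)=0.375, q_{w0|r1}=0.2,
# and an interval [0.55, 1.0] for q_{w0|r0}.
lo, hi = p_r0_range(p_w0=0.375, q_w0_r1=0.2, q_lb=0.55, q_ub=1.0)
```

In this toy example the interval for $p(R = 0)$ is approximately $[0.219, 0.5]$: the endpoint of the $q_{w_0 \mid r_0}$ interval closest to $q_{w_0 \mid r_1}$ gives the largest zero inflation probability.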

4 Experiments

We confirmed the validity of our analytical results for inflated zero models by sampling data generating processes (DGPs) and by numerical methods. In addition, we used our methods to perform sensitivity analyses on CLABSI data. Details of these experiments are in the Appendix. The code can be found at https://github.com/trungpq-ci/zero-inflation-bounds.

4.1 Bound Validity In Random DGPs

We verified the results of Theorem 3, Theorem 5 and related observed law constraints by randomly generating DGPs in the models we described. We generated $10^{8}$ DGPs in the model in Fig. 2 (a), satisfying ZI-consistency, A1, and A2, and $10^{8}$ DGPs in the model in Fig. 2 (b), satisfying ZI-consistency, A1, and A2. For both cases, we verified identification of $q_{w_0 \mid r_1}$ and the bounds for $q_{w_0 \mid r_0}$ as predicted by the corresponding theorem. For the bounds, two tests were conducted:

  1. Bound validity: is the true $p_{w_0 \mid r_0}$ inside the bounds?

  2. Model consistency: grid search the bound, compute $p(r, x)$ (or $p(r, x, c)$) according to Theorem 1 (or 2), and verify that these are probability distributions.

Additionally, for ZI MAR, we checked marginal constraints in (10). We found that all considered results held up to floating point precision in every single DGP.
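The bound validity test can be illustrated with a small Python sketch for the binary ZI MCAR case. It is a simplified stand-in, not the paper's experiment code. It assumes $X^{(1)}$ is independent of $R$, that $W$ depends only on $R$, and that $X = R \cdot X^{(1)}$. The bound form used here, $[p(W=0 \mid X=0), 1]$ or $[0, p(W=0 \mid X=0)]$ depending on how $p(W=0 \mid X=0)$ compares with $p(W=0 \mid X=1)$, is obtained from a mixture argument under these assumptions and may differ in detail from the statement of Theorem 3. All function names are hypothetical.

```python
import random


def random_zi_mcar_dgp(rng):
    """Draw one binary full law: X^(1) independent of R, W depends on R only."""
    p_r0 = rng.uniform(0.05, 0.95)  # zero inflation probability p(R=0)
    p_x1 = rng.uniform(0.05, 0.95)  # true rate p(X^(1)=1)
    q_r0 = rng.uniform(0.0, 1.0)    # p(W=0 | R=0)
    q_r1 = rng.uniform(0.0, 1.0)    # p(W=0 | R=1)
    return p_r0, p_x1, q_r0, q_r1


def observed_law(p_r0, p_x1, q_r0, q_r1):
    """Return p(W=0 | X=1) and p(W=0 | X=0) when X = R * X^(1)."""
    p_r1 = 1.0 - p_r0
    p_obs_x0 = 1.0 - p_r1 * p_x1
    # X=1 forces R=1, so W given X=1 follows p(W | R=1).
    p_w0_x1 = q_r1
    # X=0 mixes inflated zeros (R=0) and true zeros (R=1, X^(1)=0).
    p_w0_and_x0 = q_r0 * p_r0 + q_r1 * p_r1 * (1.0 - p_x1)
    return p_w0_x1, p_w0_and_x0 / p_obs_x0


def bounds_for_q_r0(p_w0_x1, p_w0_x0):
    """One-sided interval for p(W=0 | R=0) from the observed law."""
    return (p_w0_x0, 1.0) if p_w0_x0 >= p_w0_x1 else (0.0, p_w0_x0)


def check_bound_validity(n_dgps, seed=0, tol=1e-12):
    """Return True if identification and bound validity hold in every DGP."""
    rng = random.Random(seed)
    for _ in range(n_dgps):
        p_r0, p_x1, q_r0, q_r1 = random_zi_mcar_dgp(rng)
        p_w0_x1, p_w0_x0 = observed_law(p_r0, p_x1, q_r0, q_r1)
        lb, ub = bounds_for_q_r0(p_w0_x1, p_w0_x0)
        identified = abs(p_w0_x1 - q_r1) <= tol
        inside = lb - tol <= q_r0 <= ub + tol
        if not (identified and inside):
            return False
    return True


all_valid = check_bound_validity(10_000)
```

The sketch uses far fewer DGPs than the $10^{8}$ reported above and covers only the first of the two tests.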

4.2 Bounds By Numerical Methods

We compared our analytical bounds with numerical bounds computed using the autobounds package of Duarte et al. [2023] for a subset of the DGPs used for verification of bound validity in Section 4.1.

In particular, 20 DGPs were randomly selected for each model (ZI MCAR and ZI MAR), and their observed laws were computed. For each DGP, 2 polynomial programs were constructed, whose objective functions are maximizing or minimizing $p(W = 0 \mid R = 0)$, respectively, and whose constraints are (i) structural constraints from the corresponding graph, (ii) probability constraints, (iii) the ZI-consistency constraint, and (iv) constraints resulting from the structure imposed on the observed law by the structure of the full law. The solutions to these programs are the numerical lower and upper bounds of $p(W = 0 \mid R = 0)$. We refer the reader to Duarte et al. [2023] for details of the program's construction and the methods used by the polynomial program solver.

For all DGPs, the numerical bounds coincided with our analytical bounds up to the 4th decimal place. Since the algorithm in the autobounds package is an anytime algorithm, our analytic bounds were always contained inside the numerical bounds. Table 1 shows a selection of these results.

| MCAR | lb | ub | num lb | num ub | $p_{w_0 \mid r_0}$ |
|---|---|---|---|---|---|
| 0 | 0.5564 | 1 | 0.5564 | 1 | 0.8207 |
| 1 | 0.3578 | 1 | 0.3578 | 1 | 0.4936 |
| 2 | 0 | 0.5206 | 0 | 0.5206 | 0.4536 |
| 3 | 0.6064 | 1 | 0.6064 | 1 | 0.6826 |

| MAR | lb | ub | num lb | num ub | $p_{w_0 \mid r_0}$ |
|---|---|---|---|---|---|
| 0 | 0 | 0.4290 | 0 | 0.4290 | 0.4132 |
| 1 | 0.8346 | 1 | 0.8346 | 1 | 0.8486 |
| 2 | 0 | 0.3404 | 0 | 0.3404 | 0.3192 |
| 3 | 0.3002 | 1 | 0.3002 | 1 | 0.5155 |

Table 1: Comparison of our analytical lower and upper bounds (lb/ub) with numerical bounds (num lb/num ub) for a randomly selected set of DGPs. The true $p_{w_0 \mid r_0}$ is also reported.

4.3 Data Application

Patients receiving therapies involving central venous catheters (CVCs) through home infusion agencies may develop CLABSI. Though relatively rare, CLABSIs are potentially dangerous. Knowing true CLABSI rates is essential in deploying and testing the impact of CLABSI prevention activities. Recorded CLABSI rates undercount true positive cases. This is because adjudicators performing CLABSI surveillance often lack access to the full information required to determine whether a CLABSI has occurred [Hannum et al., 2022, 2023]. If the available information does not meet the CLABSI definition criteria, the adjudicator typically records the CLABSI status as a presumed negative, since CLABSIs are relatively rare.

We apply our zero inflation correction method to data on patients undergoing CVC therapies and thus potentially susceptible to a CLABSI. Our data contains 652 unique patient records obtained from five different home infusion agencies across 14 states and the District of Columbia; see Keller et al. [2023] for additional details. These records correspond to patients who presented to a hospital due to a complication and on whom blood cultures were drawn and were positive. Many patients with CVCs who presented to the hospital due to a complication, and on whom blood cultures were drawn and were positive, do have CLABSIs. In fact, the observed CLABSI rate in our data was more than 65%, much higher than the prevalence in the population undergoing CVC therapies. However, due to zero inflation, even the elevated observed CLABSI rate undercounts the true CLABSI rate in this cohort.

Variables in our data included covariates $C$, which indicated home infusion therapy type and CVC type, coded as binary variables. A description of these covariates is found in the Appendix. The outcome of interest is the true CLABSI probability (had zero inflation not occurred), which we denote by $X^{(1)}$. This outcome is not directly observed. Instead, our data contains the observed CLABSI status $X$, recorded as $0$ and $1$. Given this variable, we define the inflation indicator $R$, which corresponds to the adjudicator having enough information to make a CLABSI determination for a particular case. The information could come from private meetings with patients and specialists, or from reading patients' test results and other data in health record systems. Recording conventions dictate that this indicator has a known value whenever the observed CLABSI is $1$, and is unobserved otherwise (since we cannot distinguish true negatives from inflated zeros). We considered two candidates for the proxy $W$: (i) adjudicator access to the shared electronic health record system EPIC, (ii) either adjudicator access to EPIC, or the statewide health information exchange CRISP. Since $R$ encodes the state of knowing all required information from all sources, we have $R \rightarrow W$.

Our working model is the proxy-augmented ZI MAR under assumptions A1 and A2, shown in Fig. 2 (b). Using the analytic bounds for the ZI MAR model derived in Section 3.4, we perform a sensitivity analysis to understand how the true CLABSI rate $p(X^{(1)} = 1)$ changes as the proxy-indicator relationship $p(W \mid R)$ varies within its compatibility range. First, we use the EM algorithm [Dempster et al., 1977] to maximize the observed data likelihood $\hat{\mathcal{L}}_{EM}(X, W, C)$ defined via the full data distribution consistent with our assumptions. Next, we invoke Theorem 5 to obtain the plug-in estimate for $p(W = 0 \mid R = 1)$ and the bounds for $p(W = 0 \mid R = 0)$. Finally, we do a grid search over the bounds interval, compute the full data distribution $p(R, X, W, C)$ for each value of $p(W = 0 \mid R = 0)$ via (2), and obtain $p(X^{(1)})$ using standard g-formula adjustment in MAR models. The sensitivity analysis curve is shown in Fig. 4. (Footnote 2: This plot differs somewhat from the plot in the published version of the paper, due to a corrected data processing error. The conclusions on the underlying CLABSI rate were not substantially affected.)
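The grid search and g-formula step can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it takes the observed law as given (skipping the EM step), assumes $W$ is independent of $(X^{(1)}, C)$ given $R$ and $X^{(1)}$ is independent of $R$ given $C$, and covers only the case where $p(W = 0 \mid X = 0, c)$ exceeds $p(W = 0 \mid R = 1)$ in every stratum $c$. The function name and all numbers are made up for the example and are unrelated to the CLABSI data.

```python
def true_rate_curve(strata, q_r1, n_grid=5):
    """Sensitivity curve of p(X^(1)=1) over a grid of p(W=0 | R=0) values.

    strata: list of (p(c), p(X=1 | c), p(W=0 | X=0, c)) tuples.
    q_r1:   p(W=0 | R=1), identified as p(W=0 | X=1).
    """
    if min(p_w0_x0_c for _, _, p_w0_x0_c in strata) <= q_r1:
        raise ValueError("sketch only covers p(W=0 | X=0, c) > p(W=0 | R=1) for all c")
    # Assumed lower bound: p(W=0 | R=0) must dominate every stratum's mixture.
    lb = max(p_w0_x0_c for _, _, p_w0_x0_c in strata)
    curve = []
    for i in range(n_grid):
        q_r0 = lb + (1.0 - lb) * i / (n_grid - 1)
        rate = 0.0
        for p_c, p_x1_c, p_w0_x0_c in strata:
            # Share of observed zeros in stratum c that are inflated: p(R=0 | X=0, c).
            alpha = (p_w0_x0_c - q_r1) / (q_r0 - q_r1)
            p_r1_c = 1.0 - alpha * (1.0 - p_x1_c)  # p(R=1 | c)
            # g-formula term: p(c) * p(X^(1)=1 | c) = p(c) * p(X=1 | c) / p(R=1 | c).
            rate += p_c * p_x1_c / p_r1_c
        curve.append((q_r0, rate))
    return curve


# Toy observed law with two strata (hypothetical numbers).
toy_strata = [(0.5, 0.6, 0.8), (0.5, 0.8, 0.9)]
curve = true_rate_curve(toy_strata, q_r1=0.3)
```

In this toy example the observed rate is 0.70, and the corrected rate decreases from about 0.95 at the lower end of the interval for $p(W = 0 \mid R = 0)$ to about 0.90 at the upper end.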

The values of $p(W = 0 \mid R = 0)$ consistent with the model show that inability to make a CLABSI determination is strongly associated with access to patient data via electronic health records. For proxy EPIC, our obtained (sharp) bound for the nuisance parameter $p(W = 0 \mid R = 0)$ is $[0.88, 1]$, yielding the estimated range of the true CLABSI rate to be $[0.69, 0.79]$. Compared with the baseline rate of 65% under the no-zero-inflation assumption, the rate's bound implies that anywhere from 4% to 14% of true CLABSI cases are undercounted, even in our patient cohort with a highly elevated CLABSI prevalence.

We have repeated the analysis using the proxy-augmented ZI MAR model under assumptions A1 and A2, shown in Fig. 2 (c). In this case, bounds for $p(W = 0 \mid R = 0, c)$ were obtained for each value $c$. The narrowest bound $[0.985, 1.0]$ corresponds to adult patients receiving outpatient parenteral antimicrobial therapy (OPAT) via a peripherally inserted central catheter (PICC). On the other hand, the widest bound $[0.03, 1.0]$ corresponds to pediatric patients receiving chemotherapy via tunneled CVC, a type of catheter under the skin. We performed a search of $10^{6}$ points over the polytope formed by these bounds and found the estimated range of the true CLABSI rate to be $[0.77, 0.93]$. That is, anywhere from 12% to 28% of true CLABSI cases are undercounted.

All derived bounds for the true CLABSI rate were deemed to be medically plausible by our medical collaborators.

Figure 4: CLABSI rate consistent with model compatible distributions $p(W \mid R)$ under the ZI MAR model with assumptions A1 and A2.

5 Conclusion

In this paper, we considered inference on data with inflated zeros as a missing data problem where censored realizations are indicated by a $0$ rather than by a special token such as "?". This leads to a situation where the censoring indicator for a variable is unobserved any time the value $0$ is observed for such a variable. We have shown that this significantly complicates the problem, and results in lack of identification even in simple missing data models such as MCAR.

To address this, we proposed a generalization of the approach in Kuroki and Pearl [2014] which assumes the existence of an informative proxy for the censoring indicator. We show that only some relationships between this proxy and the indicator are compatible with the model, derive analytic bounds for this relationship in a number of cases, and show that in some cases our bound is sharp. Our bounds directly imply bounds on the zero inflated mean parameter. We verified our results by deriving bounds numerically using the autobounds package described in Duarte et al. [2023]. Finally, we applied our methods to CLABSI data, which exhibits significant zero inflation. Our methods led to informative bounds on the true CLABSI rate, and provided a natural sensitivity analysis strategy.

Zero inflation is common in many types of data, particularly in electronic health records. Our approach provides a principled strategy for deriving informative conclusions from such data without reliance on unrealistic modeling assumptions.

Acknowledgements.
This research is funded in part by ONR N00014-21-1-2820, NSF 2040804, NSF CAREER 1942239, NIH R01 AI127271-01A1, AHRQ R01 HS027819.

References

  • Arab et al. [2012] Ali Arab, Scott H. Holan, Christopher K. Wikle, and Mark L. Wildhaber. Semiparametric bivariate zero-inflated Poisson models with application to studies of abundance for multiple species. Environmetrics, 23(2):183–196, March 2012. ISSN 1180-4009, 1099-095X. 10.1002/env.1142.
  • Bhattacharya et al. [2019] Rohit Bhattacharya, Razieh Nabi, Ilya Shpitser, and James Robins. Identification in missing data models represented by directed acyclic graphs. In Proceedings of the Thirty Fifth Conference on Uncertainty in Artificial Intelligence (UAI-35th). AUAI Press, 2019.
  • Dai et al. [2023] Haoyue Dai, Ignavier Ng, Gongxu Luo, Peter Spirtes, Petar Stojanov, and Kun Zhang. Gene Regulatory Network Inference in the Presence of Dropouts: A Causal View. In The Twelfth International Conference on Learning Representations (ICLR 2024), October 2023.
  • Dempster et al. [1977] A.P. Dempster, N.M. Laird, and D.B. Rubin. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society, Series B, 39:1–38, 1977.
  • Duarte et al. [2023] Guilherme Duarte, Noam Finkelstein, Dean Knox, Jonathan Mummolo, and Ilya Shpitser. An automated approach to causal inference in discrete setting. Journal of the American Statistical Association, 2023.
  • Evans and Richardson [2014] Robin J. Evans and Thomas S. Richardson. Markovian acyclic directed mixed graphs for discrete data. Annals of Statistics, pages 1–30, 2014.
  • Greene [2005] William H. Greene. Censored Data and Truncated Distributions, 2005.
  • Hannum et al. [2022] Susan M. Hannum, Opeyemi Oladapo-Shittu, Alejandra B. Salinas, Kimberly Weems, Jill Marsteller, Ayse P Gurses, Sara E. Cosgrove, and Sara C. Keller. A task analysis of central line-associated bloodstream infection (CLABSI) surveillance in home infusion therapy. American Journal of Infection Control, 50(5):555–562, May 2022. ISSN 0196-6553. 10.1016/j.ajic.2022.01.008.
  • Hannum et al. [2023] Susan M. Hannum, Opeyemi Oladapo-Shittu, Alejandra B. Salinas, Kimberly Weems, Jill Marsteller, Ayse P. Gurses, Ilya Shpitser, Eili Klein, Sara E. Cosgrove, and Sara C. Keller. Controlling the chaos: Information management in home-infusion central-line–associated bloodstream infection (CLABSI) surveillance. Antimicrobial Stewardship & Healthcare Epidemiology, 3(1):e69, January 2023. ISSN 2732-494X. 10.1017/ash.2023.134.
  • Jiang et al. [2022] Ruochen Jiang, Tianyi Sun, Dongyuan Song, and Jingyi Jessica Li. Statistics or biology: The zero-inflation controversy about scRNA-seq data. Genome Biology, 23(1):31, January 2022. ISSN 1474-760X. 10.1186/s13059-022-02601-5.
  • Keller et al. [2020] Sara Keller, Alejandra Salinas, Deborah Williams, Mary McGoldrick, Lisa Gorski, Mary Alexander, Anne Norris, Jennifer Charron, Roger Scott Stienecker, Catherine Passaretti, Lisa Maragakis, and Sara E. Cosgrove. Reaching consensus on a home infusion central line-associated bloodstream infection surveillance definition via a modified Delphi approach. American Journal of Infection Control, 48(9):993–1000, September 2020. ISSN 0196-6553. 10.1016/j.ajic.2019.12.015.
  • Keller et al. [2023] Sara C. Keller, Susan M. Hannum, Kimberly Weems, Opeyemi Oladapo-Shittu, Alejandra B. Salinas, Jill A. Marsteller, Ayse P. Gurses, Eili Y. Klein, Ilya Shpitser, Christopher J. Crnich, Nitin Bhanot, Clare Rock, Sara E. Cosgrove, and the Home Infusion CLABSI Prevention Collaborative. Implementing and validating a home-infusion central-line–associated bloodstream infection surveillance definition. Infection Control & Hospital Epidemiology, 44(11):1748–1759, November 2023. ISSN 0899-823X, 1559-6834. 10.1017/ice.2023.70.
  • Kleinke and Reinecke [2013] Kristian Kleinke and Jost Reinecke. Multiple imputation of incomplete zero-inflated count data. Statistica Neerlandica, 67(3):311–336, 2013. ISSN 1467-9574. 10.1111/stan.12009.
  • Kuroki and Pearl [2014] Manabu Kuroki and Judea Pearl. Measurement bias and effect restoration in causal inference. Biometrika, 101:423–437, 2014.
  • Lam et al. [2006] K. F. Lam, Hongqi Xue, and Yin Bun Cheung. Semiparametric Analysis of Zero-Inflated Count Data. Biometrics, 62(4):996–1003, December 2006. ISSN 0006-341X. 10.1111/j.1541-0420.2006.00575.x.
  • Lambert [1992] Diane Lambert. Zero-Inflated Poisson Regression, With an Application to Defects in Manufacturing. Technometrics, February 1992. ISSN 1048-5228. 10.2307/1269547.
  • Lukusa et al. [2017] T. Martin Lukusa, Shen-Ming Lee, and Chin-Shang Li. Review of Zero-Inflated Models with Missing Data. Current Research in Biostatistics, 7(1):1–12, October 2017. ISSN 2524-2229. 10.3844/amjbsp.2017.1.12.
  • Malinsky et al. [2021] Daniel Malinsky, Ilya Shpitser, and Eric J Tchetgen Tchetgen. Semiparametric inference for nonmonotone missing-not-at-random data: the no self-censoring model. Journal of the American Statistical Association, pages 1–9, 2021.
  • Mohan et al. [2013] Karthika Mohan, Judea Pearl, and Jin Tian. Graphical models for inference with missing data. In C.J.C. Burges, L. Bottou, M. Welling, Z. Ghahramani, and K.Q. Weinberger, editors, Advances in Neural Information Processing Systems 26, pages 1277–1285. Curran Associates, Inc., 2013.
  • Mullahy [1986] John Mullahy. Specification and Testing of some Modified Count Data Models. Journal of Econometrics, 33(3):341–365, December 1986. ISSN 0304-4076. 10.1016/0304-4076(86)90002-3.
  • Nabi et al. [2020] Razieh Nabi, Rohit Bhattacharya, and Ilya Shpitser. Full law identification in graphical models of missing data: Completeness results. In Proceedings of the 37th International Conference on Machine Learning, pages 7153–7163. PMLR, November 2020.
  • Neelon et al. [2016] Brian Neelon, A. James O’Malley, and Valerie A. Smith. Modeling zero-modified count and semicontinuous data in health services research Part 1: Background and overview. Statistics in Medicine, 35(27):5070–5093, 2016. ISSN 1097-0258. 10.1002/sim.7050.
  • Rubin [1976] D. B. Rubin. Causal inference and missing data (with discussion). Biometrika, 63:581–592, 1976.
  • Wagner et al. [2016] Allon Wagner, Aviv Regev, and Nir Yosef. Revealing the vectors of cellular identity with single-cell genomics. Nature Biotechnology, 34(11):1145–1160, November 2016. ISSN 1546-1696. 10.1038/nbt.3711.
  • Young et al. [2022] Derek S. Young, Eric S. Roemmele, and Peng Yeh. Zero-inflated modeling part I: Traditional zero-inflated count regression models, their applications, and computational tools. WIREs Computational Statistics, 14(1):e1541, 2022. ISSN 1939-0068. 10.1002/wics.1541.
  • Yu et al. [2023] Shiqing Yu, Mathias Drton, and Ali Shojaie. Directed Graphical Models and Causal Discovery for Zero-Inflated Data. In Proceedings of the Second Conference on Causal Learning and Reasoning, pages 27–67. PMLR, August 2023.

Supplementary Material

Appendix A Proofs

A.1 Downstream identification

Proposition 1 The full law $p\left(X^{(1)}, R, W, C, X\right)$ exhibiting zero inflation that is Markov relative to an m-DAG $\mathcal{G}$ is identified given $p(R, X, W, C)$ if and only if $\mathcal{G}$ does not contain edges of the form $X_i^{(1)} \rightarrow R_i$ (no self-censoring) and structures of the form $X_j^{(1)} \rightarrow R_i \leftarrow R_j$ (no colluders), and the positivity assumption holds. Moreover, the identifying functional for the full data law coincides with the functional given in Malinsky et al. [2021].

Proof.

Following the proof in Nabi et al. [2020], the full law factorizes as

$$
\begin{aligned}
p\left(R \mid X^{(1)}, C, W\right)
&= \frac{1}{Z} \times \prod_{k=1}^{K} p\left(R_k \mid R_{-k} = 1, X^{(1)}, C, W\right) \\
&\quad \times \prod_{R_k, R_l \in R} \operatorname{OR}\left(R_k, R_l \mid R_{-(k,l)} = 1, X^{(1)}, C, W\right) \\
&\quad \times \prod_{R_k, R_l, R_m \in R} f\left(R_k, R_l, R_m \mid R_{-(k,l,m)} = 1, X^{(1)}, C, W\right) \\
&\quad \times \prod_{R_k, R_l, R_m, R_n \in R} f\left(R_k, R_l, R_m, R_n \mid R_{-(k,l,m,n)} = 1, X^{(1)}, C, W\right) \times \cdots \times f\left(R_1, \ldots, R_K \mid X^{(1)}, C, W\right),
\end{aligned}
\tag{12}
$$

where $R_{-k} = R \setminus \{R_k\}$, and similarly for $X^{(1)}$.

  • The no-colluder condition implies $R_k \perp\!\!\!\perp X^{(1)}_k \mid R_{-k}, X^{(1)}_{-k}, C, W$, so $p\left(R_k \mid R_{-k} = 1, X^{(1)}, C, W\right) = p\left(R_k \mid R_{-k} = 1, X^{(1)}_{-k}, C, W\right)$. Hence these factors use only the $R = 1$ case of consistency.

  • The 2-way odd-ratio OR(Rk,RlR(k,l)=1,X(1),C,W)ORsubscript𝑅𝑘conditionalsubscript𝑅𝑙subscript𝑅𝑘𝑙1superscript𝑋1𝐶𝑊\operatorname{OR}\left(R_{k},R_{l}\mid R_{-(k,l)}=1,X^{(1)},C,W\right)roman_OR ( italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_R start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ∣ italic_R start_POSTSUBSCRIPT - ( italic_k , italic_l ) end_POSTSUBSCRIPT = 1 , italic_X start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT , italic_C , italic_W ) is not a function of {Xk(1),Xl(1)}subscriptsuperscript𝑋1𝑘subscriptsuperscript𝑋1𝑙\{X^{(1)}_{k},X^{(1)}_{l}\}{ italic_X start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_X start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT }. Therefore, only case R=1𝑅1R=1italic_R = 1 of consistency is used.

  • the 3-way interaction term f(Rk,Rl,Rm,RnR(k,l,m,n)=1,X(1),C,W)f\left(R_{k},R_{l},R_{m},R_{n}\mid R_{-(k,l,m,n)}=1,X^{(1)},C,W\right)italic_f ( italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_R start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT , italic_R start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT , italic_R start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∣ italic_R start_POSTSUBSCRIPT - ( italic_k , italic_l , italic_m , italic_n ) end_POSTSUBSCRIPT = 1 , italic_X start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT , italic_C , italic_W ) is not a function of {Xk(1),Xl(1),Xm(1)}subscriptsuperscript𝑋1𝑘subscriptsuperscript𝑋1𝑙subscriptsuperscript𝑋1𝑚\{X^{(1)}_{k},X^{(1)}_{l},X^{(1)}_{m}\}{ italic_X start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_X start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT , italic_X start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT }. Therefore, only case R=1𝑅1R=1italic_R = 1 of consistency is used. Similarly for any k-way interaction term.

Hence the proof in Nabi et al. [2020] applies to ZI problems, whose consistency differs from missing-data consistency only in the $R=0$ case. ∎

A.2 Non-identifiability proof

Lemma 1 Given a ZI model associated with any m-DAG $\mathcal{G}$, both the target law $p(X^{(1)})$ and the full law $p(X^{(1)},R,C)$ are non-parametrically non-identified.

Proof.

Let $\mathcal{G}$ be an m-DAG over $X^{(1)}, R, X, C$ and $\mathcal{P}$ its associated ZI model. The m-DAG $\mathcal{G}_{\text{mcar}}$, obtained from $\mathcal{G}$ by deleting all edges except $X^{(1)} \rightarrow X \leftarrow R$, defines a sub-model $\mathcal{P}_{\text{mcar}} \subseteq \mathcal{P}$ in which $X^{(1)}, R, C$ are jointly independent. If $p(X^{(1)})$ and $p(X^{(1)},R,C)$ are non-parametrically non-identified in this sub-model, they are also non-identified in $\mathcal{P}$.

It suffices to prove non-identification for binary variables. The target is $p(X^{(1)}=1)$, and the observed marginals are

$$\begin{aligned}
p(X=1)\,p(c) &= p(X^{(1)}=1)\,p(R=1)\,p(c) \\
p(X=0)\,p(c) &= p(X^{(1)}=0)\,p(R=1)\,p(c) + p(R=0)\,p(c),
\end{aligned} \tag{13}$$

using d-separation in $\mathcal{G}_{\text{mcar}}$. Since the second equation is just $p(c)$ minus the first, the proof is finished once the quantity

$$p(X=1) = p(X^{(1)}=1)\,p(R=1) \tag{14}$$

is shown to be identical for two joint distributions in $\mathcal{P}_{\text{mcar}}$ with different target laws. Indeed, for any $p_1 \in \mathcal{P}_{\text{mcar}}$ with $p_1(X^{(1)}=1)$ and $p_1(R=1)$ strictly between 0 and 1, we pick any real number $m$ with $1 > m \geq \max\{p_1(X^{(1)}=1), p_1(R=1)\}$ and construct $p_2 \in \mathcal{P}_{\text{mcar}}$ as follows:

$$p_2(X^{(1)}=1) = \frac{1}{m}\,p_1(X^{(1)}=1); \qquad p_2(R=1) = m\,p_1(R=1); \qquad p_2(C) = p_1(C). \tag{15}$$

Evidently, the target laws are different, $p_1(X^{(1)}) \neq p_2(X^{(1)})$, yet the observed marginals are the same, $p_1(X,C) = p_2(X,C)$. Moreover, the full laws are also different:

$$\begin{aligned}
p_2(X^{(1)}=0)\,p_2(R=1) &= \left(1-\frac{1}{m}\,p_1(X^{(1)}=1)\right) m\,p_1(R=1) \\
&= m\,p_1(R=1) - p_1(X^{(1)}=1)\,p_1(R=1) \\
&\neq p_1(R=1) - p_1(X^{(1)}=1)\,p_1(R=1) \\
&= p_1(X^{(1)}=0)\,p_1(R=1).
\end{aligned} \tag{16}$$

Hence, $p(X^{(1)})$ and $p(X^{(1)},R,C)$ are non-parametrically non-identified in $\mathcal{P}_{\text{mcar}}$. ∎
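The rescaling construction in Eq. (15) can be checked numerically. The following is a minimal sketch rather than code from the paper: the helper names `observed_p_x1` and `rescale`, and the values 0.3, 0.6 and m = 0.8, are arbitrary assumptions chosen for illustration.

```python
import math
from typing import Tuple


def observed_p_x1(p_x1_true: float, p_r1: float) -> float:
    """Observed p(X=1) = p(X^(1)=1) p(R=1) in the binary MCAR sub-model, as in Eq. (14)."""
    return p_x1_true * p_r1


def rescale(p_x1_true: float, p_r1: float, m: float) -> Tuple[float, float]:
    """Build (p2(X^(1)=1), p2(R=1)) from p1 following Eq. (15)."""
    if not (max(p_x1_true, p_r1) <= m < 1.0):
        raise ValueError("m must satisfy max{p1(X^(1)=1), p1(R=1)} <= m < 1")
    return p_x1_true / m, m * p_r1


# Arbitrary illustrative values (assumptions, not taken from the paper).
p1 = (0.3, 0.6)            # (p1(X^(1)=1), p1(R=1))
p2 = rescale(*p1, m=0.8)   # (p2(X^(1)=1), p2(R=1))

# The target laws and the full-law cell p(X^(1)=0, R=1) differ,
# while the observed p(X=1) agrees up to rounding.
print("target p(X^(1)=1):", p1[0], p2[0])
print("observed p(X=1):", observed_p_x1(*p1), observed_p_x1(*p2))
print("p(X^(1)=0, R=1):", (1 - p1[0]) * p1[1], (1 - p2[0]) * p2[1])
print("observed laws agree:", math.isclose(observed_p_x1(*p1), observed_p_x1(*p2)))
```

Any other admissible `m` gives a further distribution with the same observed law, which is the content of the non-identification argument.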

A.3 Examples of Compatibility Issue

Consider the proxy-augmented ZI MCAR model, in which a joint distribution factorizes as

$$p(X^{(1)},R,X,W) = p(X^{(1)},R,X)\,p(W\mid R). \tag{17}$$

Here, the proxy assumptions require that $p(W=0\mid R=0) \neq p(W=0\mid R=1)$. Any $p(W\mid R)$ obeying this inequality is therefore said to be model compatible, and any joint distribution whose $p(W\mid R)$ violates it lies outside the model. Work on marginal models of hidden variable models often considers this type of compatibility.

In our paper, we mentioned another type of compatibility. Any joint distribution in the model yields a pair consisting of an observed law and a proxy-indicator conditional distribution, $(p(X,W), p(W\mid R))$. Both $p(X,W)$ and $p(W\mid R)$ produced this way are model compatible. Furthermore, they are compatible with one another, in the sense that there exists a model-compatible joint distribution producing them. It is possible to construct an incompatible pair $(p(X,W), p(W\mid R))$ whose components are each model compatible, because no joint distribution in the model yields both of them. This is illustrated in the following simple examples.

Example 1:

$$\underbrace{\begin{array}{c|c|c} & X=0 & X=1 \\ \hline W=0 & a & b \\ W=1 & c & d \end{array}}_{p(W,X)} \qquad\qquad \underbrace{\begin{array}{c|c|c} & R=0 & R=1 \\ \hline W=0 & 1 & 0 \\ W=1 & 0 & 1 \end{array}}_{p(W\mid R)} \tag{18}$$

Since ZI MCAR does not impose any restriction on $p(W,X)$ in the binary case (see our proof for the bound in the ZI MCAR case), we can pick any non-negative values summing to 1 for $a,b,c,d$. In particular, let them all be non-zero. Then both $p(W,X)$ and $p(W\mid R)$ are model compatible. However, there is no valid $p(R,X)$ (non-negative, summing to 1, and satisfying the ZI constraint $p(R=0,X=1)=0$) such that the Kuroki-Pearl equation $\mathbf{p}_{WX}=\mathbf{p}_{W\mid R}\,\mathbf{p}_{RX}$ holds. Here $\mathbf{p}_{W\mid R}$ is the identity matrix, so inverting it in this equation yields $\mathbf{p}_{RX}=\mathbf{p}_{WX}$, whose entry $p(R=0,X=1)=b$ is non-zero and therefore violates the ZI constraint.

Example 2:

We choose a joint distribution (DGP) $p(X^{(1)},R,X,W)$ Markov to the proxy-augmented ZI MCAR graph in Figure 2(a), from which we obtain the true $p(W,X)$, the true $p(W\mid R)$, and the true $p(R,X)$.

We calculate $\hat{p}_1(R,X)$ via the matrix inversion equation using the true $p(W,X)$ and the true $p(W\mid R)$. The calculated $\hat{p}_1(R,X)$ is valid and agrees with the true $p(R,X)$ up to floating-point precision. This indicates that the true $p(W,X)$ and the true $p(W\mid R)$ are compatible with one another.

We sample 100000 data points $(W_i,X_i)$ from this DGP and estimate $\hat{p}(W,X)$ by counting, which is the MLE for binary data. Again, this estimate is in the model, since the marginal model for $p(W,X)$ is saturated in the binary case. Then, we calculate $\hat{p}_2(R,X)$ via the matrix inversion equation, using the estimated $\hat{p}(W,X)$ and the true $p(W\mid R)$. This estimated $\hat{p}_2(R,X)$ has a negative entry, which renders it invalid.

The code for this experiment can be found in the supplement of the paper. Its output is printed below.

True p(W,X):
 [[0.42643891  0.31215362]
  [0.14620603  0.11520144]]
True p(W|R):
 [[0.74919143  0.73043156]
  [0.25080857  0.26956844]]
True p(R,X):
 [[0.43502295  0.        ]
  [0.13762199  0.42735506]]
Computed p(R,X) via matrix inv using true p(W,X) and true p(W|R):
 [[0.43502294 -1.81411279e-16]
  [0.13762199  0.42735505]]

Estimated p(W,X):
 [[0.42883     0.30976]
  [0.14496     0.11645]]
Computed p(R,X) via matrix inv using estimated p(W,X) and true p(W|R):
 [[ 0.5178968 -0.08300896]
 [ 0.05589317  0.50921896]]
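
The inversion step behind this output can be reproduced from the printed matrices alone. The sketch below is not the supplement code: it is a dependency-free illustration that copies the printed (rounded) values, and the helper names `solve_2x2` and `is_valid_zi_law` as well as the tolerance `1e-5` are assumptions made here. Rows index $W$ (or $R$) and columns index $R$ (or $X$), matching the printed layout.

```python
def solve_2x2(a, b):
    """Return a^{-1} b for 2x2 nested lists, i.e. solve p_WX = p_{W|R} p_RX for p_RX."""
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    if det == 0:
        raise ValueError("p(W | R) is singular; the proxy carries no information")
    inv = [[a[1][1] / det, -a[0][1] / det],
           [-a[1][0] / det, a[0][0] / det]]
    return [[sum(inv[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def is_valid_zi_law(p_rx, tol=1e-5):
    """Check non-negativity, total mass 1, and the ZI constraint p(R=0, X=1) = 0."""
    entries = [v for row in p_rx for v in row]
    return (all(v >= -tol for v in entries)
            and abs(sum(entries) - 1.0) <= tol
            and abs(p_rx[0][1]) <= tol)


# Values copied from the printed output (rounded to the digits shown there).
p_w_given_r = [[0.74919143, 0.73043156], [0.25080857, 0.26956844]]
p_rx_true = [[0.43502295, 0.0], [0.13762199, 0.42735506]]
p_wx_true = [[0.42643891, 0.31215362], [0.14620603, 0.11520144]]
p_wx_est = [[0.42883, 0.30976], [0.14496, 0.11645]]

rx_from_true = solve_2x2(p_w_given_r, p_wx_true)
rx_from_est = solve_2x2(p_w_given_r, p_wx_est)

print("from true p(W,X):", rx_from_true, is_valid_zi_law(rx_from_true))
print("from estimated p(W,X):", rx_from_est, is_valid_zi_law(rx_from_est))
```

Because the two columns of $p(W\mid R)$ are nearly equal here, the determinant is small and the sampling error in $\hat{p}(W,X)$ is strongly amplified by the inversion.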

A.4 ZI MCAR model and bounds

In this section, $X^{(1)}$ and $C$ are categorical, while $R$ and $W$ are binary.

A.4.1 Model definition

Both the ZI MCAR model and the ZI MAR model are Cartesian products of the model for $p(W\mid R)$ with the model for $p(X^{(1)},R,X)$ or $p(X^{(1)},R,X,C)$, respectively. First, the adjustment formula establishes a 1-to-1 relation between the model for $p(X^{(1)},R,X,C)$ and the model for $p(R,X,C)$. The constraints on the latter are fully understood.

Lemma 4.

For the 1-variable ZI MCAR and ZI MAR models, the full law model for $p(X^{(1)},R,X,C)$ is 1-to-1 to the model for $p(R,X,C)$ satisfying Z: $\forall x\neq 0, \forall c,\; p(X=x,R=0,C=c)=0$.

Proof.

We only need to prove the lemma for the ZI MAR model.

  • $\mathcal{P}$ includes all full laws $p(X^{(1)},R,X,C)$ factorizing as

    $$p(X^{(1)},R,X,C) = p(X\mid X^{(1)},R)\,p(X^{(1)}\mid C)\,p(R\mid C)\,p(C), \tag{19}$$

    where $p(X\mid X^{(1)},R)$ denotes the deterministic ZI-consistency.

  • $\mathcal{Q}$ includes all laws $p(R,X,C)$ factorizing as

    $$p(R,X,C) = p(X\mid R,C)\,p(R\mid C)\,p(C) \tag{20}$$

    and obeying Z: $\forall c, \forall x\neq 0:\; p(X=x,R=0,C=c)=0$.

These 2 models are 1-to-1:

  • ($\mathcal{P}\mapsto\mathcal{Q}$): This is just the summation $p(R,X,C)=\sum_{X^{(1)}}p(X^{(1)},R,X,C)$. ZI-consistency implies Z.

  • ($\mathcal{Q}\mapsto\mathcal{P}$): By d-separation, $p(X^{(1)}\mid C)=p(X\mid R=1,C)$.
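
The two directions of this correspondence can be illustrated on a small binary example. This is a sketch under stated assumptions: all three variables are taken to be binary, ZI-consistency is encoded as $X = X^{(1)} \cdot R$ (so $X=0$ whenever $R=0$), and the parameter values and the helper `bern` are hypothetical.

```python
from itertools import product

# Hypothetical binary ZI MAR parameters (illustrative assumptions).
p_c = {0: 0.4, 1: 0.6}            # p(C=c)
p_x1_given_c = {0: 0.2, 1: 0.7}   # p(X^(1)=1 | C=c)
p_r1_given_c = {0: 0.9, 1: 0.5}   # p(R=1 | C=c)


def bern(p_one: float, value: int) -> float:
    """Probability of `value` under a Bernoulli distribution with p(1) = p_one."""
    return p_one if value == 1 else 1.0 - p_one


# P -> Q: sum the full law over X^(1); ZI-consistency sets X = X^(1) * R.
q = {key: 0.0 for key in product((0, 1), repeat=3)}  # keys are (r, x, c)
for x1, r, c in product((0, 1), repeat=3):
    x = x1 * r
    q[(r, x, c)] += bern(p_x1_given_c[c], x1) * bern(p_r1_given_c[c], r) * p_c[c]

# Constraint Z: no mass on (R=0, X=1, C=c).
z_holds = all(q[(0, 1, c)] == 0.0 for c in (0, 1))

# Q -> P: recover p(X^(1)=1 | C=c) as q(X=1 | R=1, C=c).
recovered = {c: q[(1, 1, c)] / (q[(1, 0, c)] + q[(1, 1, c)]) for c in (0, 1)}

print("Z holds:", z_holds)
print("recovered p(X^(1)=1 | C):", recovered)
```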

In principle, asking whether $p(W\mid R)$ and $p(X,W,C)$ are compatible means exhibiting a full law $p(X^{(1)},R,X,C)\,p(W\mid R)$ which yields both of them. The above lemma allows us to reformulate this compatibility question as exhibiting a joint $p(R,X,C)\,p(W\mid R)$ in the model. This simplifies the original compatibility question, whose polynomial program is of higher degree. Moreover, we do not sacrifice bound sharpness by invoking this lemma, since the joint $p(R,X,C)\,p(W\mid R)$ satisfying Z is 1-to-1 to the full law.

Lemma 5.

The ZI MCAR model with categorical $X$ and binary $R, W$, which is Markov to the proxy-augmented Fig. 2 (a) (reproduced in Fig. 5) under proxy assumptions A1, A2, is described by

$$\mathcal{P}=\left\{(\mathbf{p}_{W\mid R},\mathbf{p}_{RX}) \;\middle|\; \begin{aligned} &\mathbf{p}_{W\mid R}\geq 0,\ \forall r\left(\textstyle\sum_{w}p_{w\mid r}=1\right),\ p_{w_0\mid r_0}\neq p_{w_0\mid r_1},\\ &\mathbf{p}_{RX}\geq 0,\ \textstyle\sum_{r,x}p_{rx}=1,\ \forall x\neq 0\,(p_{r_0 x}=0) \end{aligned}\right\}. \tag{21}$$

Proof.

Due to A1,

$$p(X^{(1)},R,X,W) = p(X^{(1)},R,X)\,p(W\mid R). \tag{22}$$

Therefore, the model for $p(X^{(1)},R,X,W)$ is a Cartesian product of the model for $p(X^{(1)},R,X)$ and the model for $p(W\mid R)$. By Lemma 4, the former is 1-to-1 to the model for $p(R,X)$ with restriction Z,

$$\left\{\mathbf{p}_{R,X} \;\middle|\; \mathbf{p}_{R,X}\geq 0,\ \sum_{x,r}p_{r,x}=1,\ \forall x\neq 0\,(p_{r_0,x}=0)\right\}, \tag{23}$$

while the latter is

$$\left\{\mathbf{p}_{W\mid R} \;\middle|\; \mathbf{p}_{W\mid R}\geq 0,\ \forall r\left(\sum_{w}p_{w\mid r}=1\right),\ \det\mathbf{p}_{W\mid R}\neq 0\right\}. \tag{24}$$

We just need to rewrite $\det\mathbf{p}_{W\mid R}\neq 0$. Since $W, R$ are binary,

$$\begin{aligned}
\det\mathbf{p}_{W\mid R} &= p_{w_0\mid r_0}\,p_{w_1\mid r_1} - p_{w_0\mid r_1}\,p_{w_1\mid r_0} \\
&= p_{w_0\mid r_0}\,(1-p_{w_0\mid r_1}) - p_{w_0\mid r_1}\,(1-p_{w_0\mid r_0}) \\
&= p_{w_0\mid r_0} - p_{w_0\mid r_1} \neq 0.
\end{aligned} \tag{25}$$
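
As a quick sanity check of this simplification, the determinant can be evaluated both ways. The sketch below is illustrative only: the function name `det_p_w_given_r` is an assumption, and the two probabilities are the first-row entries of the true $p(W\mid R)$ printed in Example 2 of Section A.3.

```python
import math


def det_p_w_given_r(p_w0_r0: float, p_w0_r1: float) -> float:
    """Determinant of the 2x2 matrix p(W | R), written as in the first line of Eq. (25)."""
    p_w1_r0 = 1.0 - p_w0_r0
    p_w1_r1 = 1.0 - p_w0_r1
    return p_w0_r0 * p_w1_r1 - p_w0_r1 * p_w1_r0


# p(W=0 | R=0) and p(W=0 | R=1) from the printed output of Example 2.
p_w0_r0, p_w0_r1 = 0.74919143, 0.73043156
det = det_p_w_given_r(p_w0_r0, p_w0_r1)

print("determinant:", det)
print("difference of first-row entries:", p_w0_r0 - p_w0_r1)
print("agree:", math.isclose(det, p_w0_r0 - p_w0_r1, rel_tol=1e-9))
```

The determinant vanishes exactly when the two conditional probabilities coincide, which is why the proxy assumption can be stated as $p_{w_0\mid r_0}\neq p_{w_0\mid r_1}$.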

A.4.2 Bounds for ZI MCAR

Figure 5: The graph considered in Theorem 3 (nodes $X^{(1)}$, $X$, $R$, $W$): the proxy-augmented ZI MCAR model satisfying A1 and A2 (Fig. 2 (a) in the main paper).

Before proving the bound theorem, we have the following useful lemma:

Lemma 6.

For the ZI MCAR model in Theorem 3, the Z constraint $\forall x\neq 0\,(q_{r_0 x}=0)$ is equivalent to $\forall x\neq 0\,(q_{w_0\mid r_1}=q_{w_0\mid x})$. This means: (i) there is a marginal constraint $\forall x\neq 0\,(q_{w_0\mid x}=q_{w_0\mid x_1})$, and (ii) $q_{w_0\mid r_1}$ is point-identified.

Proof.

($\Rightarrow$) direction: Suppose $\forall x \neq 0\,(q_{r_0 x} = 0)$. Then for all $x \neq 0$

$$
q_{r_0 x} = 0 \;\Leftrightarrow\; q_x = q_{r_0, x} + q_{r_1, x} = q_{r_1, x} \;\Leftrightarrow\; q_{r_1 \mid x} = 1. \tag{26}
$$

Then, for all $x \neq 0$

$$
\begin{aligned}
q_{w_0, x} &= q_{w_0 \mid r_0}\, q_{r_0, x} + q_{w_0 \mid r_1}\, q_{r_1, x} = 0 + q_{w_0 \mid r_1}\, q_{r_1, x} \\
\Rightarrow\; q_{w_0 \mid x} &= q_{w_0 \mid r_1}\, q_{r_1 \mid x} = q_{w_0 \mid r_1}.
\end{aligned} \tag{27}
$$

($\Leftarrow$) direction: Suppose $\forall x \neq 0\,(q_{w_0 \mid r_1} = q_{w_0 \mid x})$. Then, for all $x \neq 0$

$$
\begin{aligned}
q_{w_0, x} &= q_{w_0 \mid r_0}\, q_{r_0, x} + q_{w_0 \mid r_1}\, q_{r_1, x} \\
\Rightarrow\; q_{w_0 \mid x} &= q_{w_0 \mid r_0}\, q_{r_0 \mid x} + q_{w_0 \mid r_1}\, q_{r_1 \mid x} \\
\Rightarrow\; q_{w_0 \mid r_1} &= q_{w_0 \mid r_0}\, q_{r_0 \mid x} + q_{w_0 \mid r_1}\, q_{r_1 \mid x} \\
\Rightarrow\; 0 &= (q_{w_0 \mid r_0} - q_{w_0 \mid r_1})\, q_{r_0 \mid x}.
\end{aligned} \tag{28}
$$

Since $q_{w_0 \mid r_0} \neq q_{w_0 \mid r_1}$, we must have $q_{r_0 \mid x} = 0 \Rightarrow q_{r_0, x} = 0$. This is true for all $x \neq 0$.
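The lemma can be checked numerically on a small example. The sketch below is illustrative only: the numbers, the three-valued $X$, and the helper name `p_w0_given_x` are assumptions introduced here, not taken from the paper. It builds $p(W=0 \mid X=x)$ from a chosen $q(W \mid R)$ and $q(R, X)$, once with the Z constraint satisfied and once with it violated.

```python
import math

def p_w0_given_x(q_w0_given_r, q_rx):
    """Return [p(W=0 | X=x) for each x] implied by q(W | R) and q(R, X).

    q_w0_given_r = [q(w0 | r0), q(w0 | r1)]; q_rx[r][x] = q(R=r, X=x).
    """
    result = []
    for x in range(len(q_rx[0])):
        joint_w0_x = q_w0_given_r[0] * q_rx[0][x] + q_w0_given_r[1] * q_rx[1][x]
        marginal_x = q_rx[0][x] + q_rx[1][x]
        result.append(joint_w0_x / marginal_x)
    return result

# Hypothetical parameters with q(w0 | r0) != q(w0 | r1).
q_w0_given_r = [0.9, 0.2]

# Case 1: constraint Z holds, i.e. q(r0, x) = 0 for every x != 0.
q_rx_z = [[0.3, 0.0, 0.0],
          [0.2, 0.4, 0.1]]
cond_z = p_w0_given_x(q_w0_given_r, q_rx_z)
z_holds = all(math.isclose(c, q_w0_given_r[1]) for c in cond_z[1:])

# Case 2: constraint Z is violated at x = 1.
q_rx_bad = [[0.2, 0.1, 0.0],
            [0.2, 0.3, 0.2]]
cond_bad = p_w0_given_x(q_w0_given_r, q_rx_bad)
z_fails = not math.isclose(cond_bad[1], q_w0_given_r[1])

print(z_holds, z_fails)
```

In the first case every $p(W=0 \mid X=x)$ with $x \neq 0$ coincides with $q_{w_0 \mid r_1}$; in the second the equality breaks at the value of $x$ where $q_{r_0 x} > 0$.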

Due to this lemma, an observed law $p(X, W)$ is consistent with the model if and only if $\forall x \neq 0,\ p(W=0 \mid X=x) = p(W=0 \mid X=1)$. We also require positivity, $\forall x,\ p(X=x) > 0$, so that $p(W \mid X)$ is well-defined.
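As a rough illustration, the consistency condition can be tested on an empirical table. The sketch below reads the condition as equality of the conditionals $p(W=0 \mid X=x)$ across $x \neq 0$, which is what the lemma's marginal constraint suggests. The function name `is_consistent`, the table layout, and the tolerance are assumptions made for this example; with finite samples a statistical test would be needed instead of a fixed tolerance.

```python
def is_consistent(p_wx, tol=1e-9):
    """Check positivity and equality of p(W=0 | X=x) over all x != 0.

    p_wx[w][x] is the observed joint p(W=w, X=x); column 0 is x = 0
    and at least two columns are assumed.
    """
    n_x = len(p_wx[0])
    p_x = [p_wx[0][x] + p_wx[1][x] for x in range(n_x)]
    if any(px <= 0 for px in p_x):
        return False  # positivity fails, p(W | X) is undefined
    cond = [p_wx[0][x] / p_x[x] for x in range(n_x)]
    # Compare every nonzero x against the reference value x = 1.
    return all(abs(cond[x] - cond[1]) <= tol for x in range(1, n_x))

# Hypothetical tables: the first satisfies the constraint, the second does not.
p_ok = [[0.31, 0.08, 0.02],
        [0.19, 0.32, 0.08]]
p_bad = [[0.31, 0.08, 0.05],
         [0.19, 0.32, 0.05]]
print(is_consistent(p_ok), is_consistent(p_bad))
```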

Theorem 3 Consider a ZI MCAR model in Fig. 2 (a) (reproduced in Fig. 5) under proxy assumptions A1, A2, with categorical $X$ and binary $R, W$. Given a consistent observed law $p(X, W)$ satisfying the positivity assumption, $\forall x,\ p(x) > 0$, the set of compatible proxy-indicator conditionals $q(W \mid R)$ is given by

$$
\begin{aligned}
q_{w_0 \mid r_1} &= p_{w_0 \mid x_1} \\
q_{w_0 \mid r_0} &\in
\begin{cases}
[p_{w_0 \mid x_0},\, 1] & \text{if } p_{w_0 \mid x_0} > p_{w_0 \mid x_1} \\
[0,\, p_{w_0 \mid x_0}] & \text{if } p_{w_0 \mid x_0} < p_{w_0 \mid x_1} \\
(0, 1) \setminus \{p_{w_0 \mid x_0}\} & \text{if } p_{w_0 \mid x_0} = p_{w_0 \mid x_1}
\end{cases}
\end{aligned}
$$

These bounds are sharp. Moreover, if $p_{w_0 \mid x_0} = p_{w_0 \mid x_1}$, $p(X, W)$ must satisfy $0 < p_{w_0 \mid x_0} < 1$, and zero inflation does not occur, i.e., $q(R=0) = 0$.
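The statement of the theorem translates directly into a small routine. The following is a minimal sketch, assuming the observed law is stored as a table `p_wx[w][x]` and is already consistent and positive; the function name `theorem3_set`, the return format, and the tolerance used to detect equality are choices made here for illustration.

```python
def theorem3_set(p_wx, tol=1e-9):
    """Return (q_w0_r1, (kind, low, high, excluded)) as described in Theorem 3.

    p_wx[w][x] is the observed joint p(W=w, X=x). `kind` is "closed" for a
    closed interval [low, high] and "open" for (low, high) minus {excluded}.
    """
    p0 = p_wx[0][0] / (p_wx[0][0] + p_wx[1][0])  # p(w0 | x0)
    p1 = p_wx[0][1] / (p_wx[0][1] + p_wx[1][1])  # p(w0 | x1)
    if abs(p0 - p1) <= tol:
        bounds = ("open", 0.0, 1.0, p0)      # (0, 1) \ {p(w0 | x0)}
    elif p0 > p1:
        bounds = ("closed", p0, 1.0, None)   # [p(w0 | x0), 1]
    else:
        bounds = ("closed", 0.0, p0, None)   # [0, p(w0 | x0)]
    return p1, bounds

# Hypothetical observed law generated with q(w0 | r0) = 0.9, q(w0 | r1) = 0.2.
p_wx = [[0.31, 0.08, 0.02],
        [0.19, 0.32, 0.08]]
q_w0_r1, (kind, low, high, excluded) = theorem3_set(p_wx)
print(q_w0_r1, kind, low, high)
```

For this table the generating value $q_{w_0 \mid r_0} = 0.9$ falls inside the returned interval, while $q_{w_0 \mid r_1}$ is recovered exactly, as the theorem states.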

Proof.

In the following, $q(\cdot)$ denotes an element in a model, while $p(\cdot)$ is derived from the given marginal $p(X, W)$.

In principle, any $q(W \mid R)$ compatible with $p(X, W)$ must be derived from some full joint distribution $q(X^{(1)}, R, X)\, q(W \mid R)$ such that $q(X, W) = p(X, W)$. Since the model for $q(X^{(1)}, R, X)$ is in 1-to-1 correspondence with the model for $q(R, X)$ under restriction Z, we can simplify this process by considering the marginal model for $q(R, X, W)$

$$
\mathcal{P} = \left\{ (\mathbf{q}_{W \mid R}, \mathbf{q}_{RX}) \;\middle|\;
\begin{aligned}
&\mathbf{q}_{W \mid R} \geq 0,\ \forall r \left(\textstyle\sum_{w} q_{w \mid r} = 1\right),\ q_{w_0 \mid r_0} \neq q_{w_0 \mid r_1}, \\
&\mathbf{q}_{RX} \geq 0,\ \textstyle\sum_{rx} q_{rx} = 1,\ \forall x \neq 0\,(q_{r_0 x} = 0)
\end{aligned}
\right\}. \tag{29}
$$

The subset of $q(R, X, W)$ yielding the observed law $p(X, W)$ is

$$
\mathcal{Q} = \left\{ (\mathbf{q}_{W \mid R}, \mathbf{q}_{RX}) \;\middle|\; \mathbf{q}_{W \mid R}\, \mathbf{q}_{RX} = \mathbf{p}_{WX},\ (\mathbf{q}_{W \mid R}, \mathbf{q}_{RX}) \in \mathcal{P} \right\}. \tag{30}
$$

Polynomial program

Since $\mathbf{q}_{W \mid R} \in \mathcal{P}$ is invertible, $\mathcal{Q}$ is the set of all pairs $(\mathbf{q}_{W \mid R}, \mathbf{q}_{RX})$ with $\mathbf{q}_{RX} = [\mathbf{q}_{W \mid R}]^{-1} \mathbf{p}_{WX}$ and $\mathbf{q}_{W \mid R} \in \mathcal{B}$,

$$
\mathcal{B} = \left\{ \mathbf{q}_{W \mid R} \;\middle|\;
\begin{aligned}
&\mathbf{q}_{W \mid R} \geq 0,\ \forall r \left(\textstyle\sum_{w} q_{w \mid r} = 1\right),\ q_{w_0 \mid r_0} \neq q_{w_0 \mid r_1}, \\
&\mathbf{q}_{RX} \geq 0,\ \textstyle\sum_{rx} q_{rx} = 1,\ \forall x \neq 0\,(q_{r_0 x} = 0), \\
&\text{where } \mathbf{q}_{RX} = [\mathbf{q}_{W \mid R}]^{-1} \mathbf{p}_{WX}.
\end{aligned}
\right\}. \tag{31}
$$

$\mathcal{B}$ is called the compatibility set of $q(W \mid R)$ w.r.t. $p(X, W)$. As mentioned in the main paper, one can directly solve for $\mathcal{B}$ via the following polynomial program, where $\mathbf{q}_{RX}$ are slack variables.

$$
\begin{aligned}
\max_{q_{w_0 \mid r_0}} \quad & \pm q_{w_0 \mid r_0} \\
\text{s.t.} \quad & \mathbf{q}_{W \mid R}\, \mathbf{q}_{RX} = \mathbf{p}_{WX}, \\
& \mathbf{q}_{W \mid R} \geq 0,\ \forall r \left(\textstyle\sum_{w} q_{w \mid r} = 1\right),\ q_{w_0 \mid r_0} \neq q_{w_0 \mid r_1}, \\
& \mathbf{q}_{RX} \geq 0,\ \textstyle\sum_{rx} q_{rx} = 1,\ \forall x \neq 0\,(q_{r_0 x} = 0).
\end{aligned} \tag{32}
$$

As we will show below, the constraint $\forall x \neq 0\,(q_{r_0 x} = 0)$ is equivalent to $q_{w_0 \mid r_1} = p_{w_0 \mid x_1}$. This is a quadratic program due to the first constraint.

Linear program

We will simplify $\mathcal{B}$. We do so by considering its superset and adding constraints to it. Firstly, $\mathcal{B}$ could be parameterized by only 2 numbers, because its superset is

$$
\left\{ \mathbf{q}_{W \mid R} \;\middle|\; \mathbf{q}_{W \mid R} \geq 0,\ \forall r \left(\textstyle\sum_{w} q_{w \mid r} = 1\right) \right\}
= \left\{ \mathbf{q}_{W \mid R} = \begin{pmatrix} q_{w_0 \mid r_0} & q_{w_0 \mid r_1} \\ 1 - q_{w_0 \mid r_0} & 1 - q_{w_0 \mid r_1} \end{pmatrix} \;\middle|\; 0 \leq q_{w_0 \mid r_0} \leq 1,\ 0 \leq q_{w_0 \mid r_1} \leq 1 \right\}. \tag{33}
$$

Secondly, when $q_{w_0 \mid r_0} \neq q_{w_0 \mid r_1}$, the $2 \times 2$ matrix $\mathbf{q}_{W \mid R}$ has inverse

$$
\left[\mathbf{q}_{W \mid R}\right]^{-1} = \frac{1}{q_{w_0 \mid r_0} - q_{w_0 \mid r_1}} \underbrace{\begin{pmatrix} 1 - q_{w_0 \mid r_1} & -q_{w_0 \mid r_1} \\ q_{w_0 \mid r_0} - 1 & q_{w_0 \mid r_0} \end{pmatrix}}_{M}. \tag{34}
$$
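The closed-form inverse can be sanity-checked by multiplying it with $\mathbf{q}_{W \mid R}$ and comparing against the identity. The sketch below uses plain Python lists; the helper names and the values of $a = q_{w_0 \mid r_0}$ and $b = q_{w_0 \mid r_1}$ are hypothetical.

```python
import math

def q_w_given_r(a, b):
    """2x2 matrix q(W | R) with a = q(w0 | r0) and b = q(w0 | r1)."""
    return [[a, b], [1 - a, 1 - b]]

def inverse_closed_form(a, b):
    """Inverse of q(W | R) written as M / (a - b); requires a != b."""
    d = a - b
    return [[(1 - b) / d, -b / d],
            [(a - 1) / d, a / d]]

def matmul2(A, B):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

a, b = 0.9, 0.2  # hypothetical values with a != b
product = matmul2(inverse_closed_form(a, b), q_w_given_r(a, b))
is_identity = all(
    math.isclose(product[i][j], 1.0 if i == j else 0.0, abs_tol=1e-12)
    for i in range(2) for j in range(2)
)
print(is_identity)
```

The determinant of $\mathbf{q}_{W \mid R}$ is $a(1-b) - b(1-a) = a - b$, which is why the inverse exists exactly when $q_{w_0 \mid r_0} \neq q_{w_0 \mid r_1}$.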

Therefore, we can transform the quadratic constraint into the following equivalent linear constraints

$$
\begin{aligned}
&\left\{ \mathbf{q}_{W \mid R} \;\middle|\; \mathbf{q}_{W \mid R} \geq 0,\ \forall r \left(\textstyle\sum_{w} q_{w \mid r} = 1\right),\ q_{w_0 \mid r_0} \neq q_{w_0 \mid r_1},\ \mathbf{q}_{RX} = [\mathbf{q}_{W \mid R}]^{-1} \mathbf{p}_{WX} \geq 0 \right\} \\
&= \bigcup_{s \in \{1, -1\}} \left\{ \mathbf{q}_{W \mid R} \;\middle|\; \mathbf{q}_{W \mid R} \geq 0,\ \forall r \left(\textstyle\sum_{w} q_{w \mid r} = 1\right),\ s\, q_{w_0 \mid r_0} > s\, q_{w_0 \mid r_1},\ s M \mathbf{p}_{WX} \geq 0 \right\}.
\end{aligned} \tag{35}
$$
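To see the linear constraints at work, one can scan candidate values of $q_{w_0 \mid r_0}$ on a grid and keep those for which $s M \mathbf{p}_{WX} \geq 0$ and the Z constraint hold. This is a brute-force sketch rather than a linear-programming solver; the table `p_wx`, the grid resolution, the tolerance, and the choice to fix $q_{w_0 \mid r_1}$ at $p_{w_0 \mid x_1}$ are assumptions made for illustration.

```python
def feasible(a, b, p_wx, tol=1e-9):
    """Check s*M*p_WX >= 0 and q(r0, x) = 0 for x != 0, with s = sign(a - b).

    a = q(w0 | r0), b = q(w0 | r1), p_wx[w][x] = p(W=w, X=x).
    """
    if abs(a - b) <= tol:
        return False  # q(W | R) would not be invertible
    s = 1.0 if a > b else -1.0
    M = [[1 - b, -b], [a - 1, a]]
    for x in range(len(p_wx[0])):
        m_r0 = M[0][0] * p_wx[0][x] + M[0][1] * p_wx[1][x]
        m_r1 = M[1][0] * p_wx[0][x] + M[1][1] * p_wx[1][x]
        if s * m_r0 < -tol or s * m_r1 < -tol:
            return False  # recovered q(R, X) would have a negative entry
        if x != 0 and abs(m_r0) > tol:
            return False  # constraint Z violated
    return True

# Hypothetical observed law with p(w0 | x0) = 0.62 and p(w0 | x) = 0.2 for x != 0.
p_wx = [[0.31, 0.08, 0.02],
        [0.19, 0.32, 0.08]]
b = p_wx[0][1] / (p_wx[0][1] + p_wx[1][1])   # fix q(w0 | r1) = p(w0 | x1)
grid = [i / 100 for i in range(101)]
feas = [a for a in grid if feasible(a, b, p_wx)]
print(min(feas), max(feas))
```

On this example the surviving grid points run from $0.62$ to $1$, matching the interval $[p_{w_0 \mid x_0}, 1]$ that Theorem 3 gives when $p_{w_0 \mid x_0} > p_{w_0 \mid x_1}$.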

Next, if $\mathbf{q}_{W\mid R}\mathbf{q}_{RX}=\mathbf{p}_{WX}$ where all terms are non-negative, $\sum_{w}q_{w\mid R}=1$, $\sum_{w,x}p_{w,x}=1$, and $\mathbf{q}_{W\mid R}$ is invertible, then $\sum_{rx}q_{rx}=1$. Hence, $\sum_{rx}q_{rx}=1$ is a redundant constraint.
Proof: the $ij$-th entry is $[\mathbf{p}_{WX}]_{ij}=q_{w_i\mid r_0}q_{r_0x_j}+q_{w_i\mid r_1}q_{r_1x_j}$.
Then $1=\sum_{ij}[\mathbf{p}_{WX}]_{ij}=\sum_{j}\left((\sum_{i}q_{w_i\mid r_0})q_{r_0x_j}+(\sum_{i}q_{w_i\mid r_1})q_{r_1x_j}\right)=\sum_{j}(q_{r_0x_j}+q_{r_1x_j})$.
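The redundancy argument above can be checked numerically. The following is a minimal sketch, not part of the original derivation: the helper name `solve_q_rx` and the table `P_WX` are hypothetical, and the table was constructed by hand from an assumed $\mathbf{q}_{W\mid R}$ and $\mathbf{q}_{RX}$ purely for illustration. It recovers $\mathbf{q}_{RX}=\mathbf{q}_{W\mid R}^{-1}\mathbf{p}_{WX}$ for binary $W$ and $R$ and confirms that its entries sum to one.

```python
from fractions import Fraction as F


def solve_q_rx(q_w0_r0, q_w0_r1, p_wx):
    """Recover q_RX = inverse(q_{W|R}) * p_WX for binary W and R.

    p_wx is a 2 x K table with p_wx[i][j] = p(W = w_i, X = x_j).
    """
    det = q_w0_r0 - q_w0_r1  # determinant of the 2 x 2 matrix q_{W|R}
    if det == 0:
        raise ValueError("q_{W|R} is singular")
    # inverse = adjugate / det
    adj = [[1 - q_w0_r1, -q_w0_r1], [q_w0_r0 - 1, q_w0_r0]]
    k = len(p_wx[0])
    return [
        [(adj[r][0] * p_wx[0][j] + adj[r][1] * p_wx[1][j]) / det for j in range(k)]
        for r in range(2)
    ]


# Hypothetical joint table p(W, X) with three X values (illustration only).
P_WX = [
    [F(19, 100), F(12, 100), F(9, 100)],
    [F(11, 100), F(28, 100), F(21, 100)],
]

# Two different invertible, column-stochastic choices of q_{W|R}.
for a, b in [(F(4, 5), F(3, 10)), (F(9, 10), F(3, 10))]:
    q_rx = solve_q_rx(a, b, P_WX)
    total = sum(sum(row) for row in q_rx)
    print(a, b, total)  # total is 1 for both choices
```

Exact rational arithmetic is used so that the normalization holds as an equality rather than up to floating-point error.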

Finally, Lemma 6 says $\forall x\neq 0\,(q_{r_0x}=0)\Leftrightarrow\forall x\neq 0\,(q_{w_0\mid r_1}=q_{w_0\mid x})$, and $q_{w_0\mid x}=p_{w_0\mid x}$ in $\mathcal{B}$. Note that this lemma also requires $p(X,W)$ to satisfy the marginal constraint $p_{w_0\mid x}=p_{w_0\mid x_1}$.
Therefore, the constraint $\forall x\neq 0\,(q_{r_0x}=0)$ is equivalent to $q_{w_0\mid r_1}=p_{w_0\mid x_1}$.

Putting these together, we can write $\mathcal{B}$ as

$$\mathcal{B}=\bigcup_{s\in\{1,-1\}}\left\{\begin{array}{c|c}
\mathbf{q}_{W\mid R}=\begin{pmatrix}q_{w_0\mid r_0}&q_{w_0\mid r_1}\\ 1-q_{w_0\mid r_0}&1-q_{w_0\mid r_1}\end{pmatrix}
&\begin{aligned}
&s\begin{pmatrix}1-q_{w_0\mid r_1}&-q_{w_0\mid r_1}\\ q_{w_0\mid r_0}-1&q_{w_0\mid r_0}\end{pmatrix}\mathbf{p}_{WX}\geq\mathbf{0},\\
&s\cdot q_{w_0\mid r_0}>s\cdot q_{w_0\mid r_1},\ 0\leq q_{w_0\mid r_0}\leq 1,\\
&q_{w_0\mid r_1}=p_{w_0\mid x_1}
\end{aligned}
\end{array}\right\}.\tag{36}$$

or,

$$\mathcal{B}=\left\{\begin{array}{c|c}
\mathbf{q}_{W\mid R}=\begin{pmatrix}q_{w_0\mid r_0}&q_{w_0\mid r_1}\\ 1-q_{w_0\mid r_0}&1-q_{w_0\mid r_1}\end{pmatrix}
&q_{w_0\mid r_1}=p_{w_0\mid x_1},\ q_{w_0\mid r_0}\in\mathcal{B}_{w_0\mid r_0}
\end{array}\right\},\tag{37}$$
$$\mathcal{B}_{w_0\mid r_0}=\bigcup_{s\in\{1,-1\}}\mathcal{B}_{w_0\mid r_0}^{s}=\bigcup_{s\in\{1,-1\}}\left\{\begin{array}{c|c}
q_{w_0\mid r_0}
&\begin{aligned}
&s\begin{pmatrix}1-q_{w_0\mid r_1}&-q_{w_0\mid r_1}\\ q_{w_0\mid r_0}-1&q_{w_0\mid r_0}\end{pmatrix}\mathbf{p}_{WX}\geq\mathbf{0},\\
&s\cdot q_{w_0\mid r_0}>s\cdot q_{w_0\mid r_1},\ 0\leq q_{w_0\mid r_0}\leq 1,\\
&q_{w_0\mid r_1}=p_{w_0\mid x_1}
\end{aligned}
\end{array}\right\}.$$

The set $\mathcal{B}_{w_0\mid r_0}$ is called the compatible set of $q_{w_0\mid r_0}$ w.r.t. $p(X,W)$. As will be shown, this is an interval in $[0,1]$, hence the name compatibility bound.
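Membership in the compatible set can be tested directly from its defining constraints. The sketch below is only an illustration under stated assumptions: the function name `in_compatible_set` and the table `P_WX` are hypothetical, $W$ is binary, column 0 of the table is $X=0$ and column 1 is $x_1$.

```python
from fractions import Fraction as F


def in_compatible_set(q_w0_r0, p_wx):
    """Return True if q_w0_r0 satisfies the constraints defining B_{w0|r0}.

    p_wx[i][j] = p(W = w_i, X = x_j); column 0 is X = 0, column 1 is x_1.
    """
    if not 0 <= q_w0_r0 <= 1:
        return False
    # Equality constraint: q_{w0|r1} is pinned to p(w0 | x1).
    q_w0_r1 = p_wx[0][1] / (p_wx[0][1] + p_wx[1][1])
    adj = [[1 - q_w0_r1, -q_w0_r1], [q_w0_r0 - 1, q_w0_r0]]
    k = len(p_wx[0])
    for s in (1, -1):
        # Strict sign constraint s * q_{w0|r0} > s * q_{w0|r1}.
        if not s * q_w0_r0 > s * q_w0_r1:
            continue
        # Entrywise constraint s * adj * p_WX >= 0.
        if all(
            s * (adj[r][0] * p_wx[0][j] + adj[r][1] * p_wx[1][j]) >= 0
            for r in range(2)
            for j in range(k)
        ):
            return True
    return False


# Hypothetical joint table p(W, X) (illustration only).
P_WX = [
    [F(19, 100), F(12, 100), F(9, 100)],
    [F(11, 100), F(28, 100), F(21, 100)],
]

print(in_compatible_set(F(4, 5), P_WX))  # True
print(in_compatible_set(F(1, 2), P_WX))  # False
```

For this table $p_{w_0\mid x_0}=19/30$ and $p_{w_0\mid x_1}=3/10$, so $4/5$ is accepted while $1/2$ is rejected.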

To find $\mathcal{B}_{w_0\mid r_0}$, we will find each $\mathcal{B}^{s}_{w_0\mid r_0}$ and take their union. Each $\mathcal{B}_{w_0\mid r_0}^{s}$ can be computed numerically by solving the two linear programs

$$\begin{aligned}
\max_{q_{w_0\mid r_0}}\quad&\pm q_{w_0\mid r_0}\\
\text{s.t.}\quad&s\cdot\begin{pmatrix}1-q_{w_0\mid r_1}&-q_{w_0\mid r_1}\\ q_{w_0\mid r_0}-1&q_{w_0\mid r_0}\end{pmatrix}\mathbf{p}_{WX}\geq\mathbf{0},\\
&s\cdot q_{w_0\mid r_0}>s\cdot q_{w_0\mid r_1},\ 0\leq q_{w_0\mid r_0}\leq 1,\\
&q_{w_0\mid r_1}=p_{w_0\mid x_1}.
\end{aligned}\tag{38}$$

These problems are linear programs because, once $q_{w_0\mid r_1}$ is fixed to $p_{w_0\mid x_1}$, $q_{w_0\mid r_0}$ is the only unknown and all constraints are linear. The set $\mathcal{B}_{w_0\mid r_0}^{s}$ is the interval whose endpoints are the two values returned by these programs.
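Because there is a single unknown, each program reduces to intersecting half-lines, so no general-purpose solver is needed. The following sketch makes that explicit. It is an illustration rather than the authors' implementation: `compat_interval` and `P_WX` are hypothetical names, $W$ is assumed binary, and column 1 of the table is assumed to be $x_1$. The strict constraint is handled by returning the infimum and supremum of the set, so an endpoint equal to $q_{w_0\mid r_1}$ is itself excluded.

```python
from fractions import Fraction as F


def compat_interval(p_wx, s):
    """Infimum and supremum of B^s_{w0|r0}, or None if the set is empty."""
    b = p_wx[0][1] / (p_wx[0][1] + p_wx[1][1])  # q_{w0|r1} = p(w0 | x1)
    lo, hi = F(0), F(1)  # box constraint 0 <= q <= 1
    for p0, p1 in zip(p_wx[0], p_wx[1]):
        # Row 0 of s * adj * p_WX does not involve q: a pure feasibility check.
        if s * ((1 - b) * p0 - b * p1) < 0:
            return None
        # Row 1 reads coef * q + const >= 0 with the coefficients below.
        coef, const = s * (p0 + p1), -s * p0
        if coef > 0:
            lo = max(lo, -const / coef)
        elif coef < 0:
            hi = min(hi, -const / coef)
        elif const < 0:
            return None
    # Strict constraint s * q > s * b tightens one side of the interval.
    if s == 1:
        lo = max(lo, b)
    else:
        hi = min(hi, b)
    if lo > hi or (lo == hi == b):
        return None
    return lo, hi


# Hypothetical joint table p(W, X) (illustration only).
P_WX = [
    [F(19, 100), F(12, 100), F(9, 100)],
    [F(11, 100), F(28, 100), F(21, 100)],
]

print(compat_interval(P_WX, 1))   # (Fraction(19, 30), Fraction(1, 1))
print(compat_interval(P_WX, -1))  # None
```

A column with zero total mass yields `coef == 0` and `const == 0`, so it imposes no restriction on the interval.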

Solutions to linear programs

Solving $\mathcal{B}_{w_0\mid r_0}^{s=1}$: We expand the matrix inequality entrywise

\begin{align}
&p_{w_0,x_0}\left(1-q_{w_0\mid r_1}\right)-p_{w_1,x_0}q_{w_0\mid r_1}\geq 0&&\Leftrightarrow\quad p_{w_0\mid x_0}p_{w_1\mid x_1}-p_{w_1\mid x_0}p_{w_0\mid x_1}\geq 0\tag{39}\\
\forall x\neq 0,\quad&p_{w_0,x}\left(1-q_{w_0\mid r_1}\right)-p_{w_1,x}q_{w_0\mid r_1}\geq 0&&\Leftrightarrow\quad\forall x\neq 0,\ p_{w_0,x}p_{w_1\mid x_1}-p_{w_1,x}p_{w_0\mid x_1}=0\notag\\
&p_{w_0,x_0}\left(q_{w_0\mid r_0}-1\right)+p_{w_1,x_0}q_{w_0\mid r_0}\geq 0&&\Leftrightarrow\quad q_{w_0\mid r_0}\geq\frac{p_{w_0,x_0}}{p_{w_0,x_0}+p_{w_1,x_0}}=p_{w_0\mid x_0}\notag\\
\forall x\neq 0,\quad&p_{w_0,x}\left(q_{w_0\mid r_0}-1\right)+p_{w_1,x}q_{w_0\mid r_0}\geq 0&&\Leftrightarrow\quad\forall x\neq 0,\ q_{w_0\mid r_0}\geq\frac{p_{w_0,x}}{p_{w_0,x}+p_{w_1,x}}=p_{w_0\mid x}.\notag
\end{align}

In the derivations above, we use the positivity assumption $\forall x\,(p(X=x)>0)$. At the very least, we assume there are zeros, i.e., $p(X=0)>0$, otherwise the problem does not make sense. If positivity is violated, e.g., $\exists x\neq 0,\ p(X=x)=0$, one can show that $p_{w_0,x}=p_{w_1,x}=0$, and hence this value $x$ does not place any restriction on $q_{w_0\mid r_0}$, and can be ignored in the following discussion.
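To make the use of positivity explicit, the first equivalence in (39) can be unpacked as follows. This is a sketch that uses only the constraint $q_{w_0\mid r_1}=p_{w_0\mid x_1}$ (so that $1-q_{w_0\mid r_1}=p_{w_1\mid x_1}$ for binary $W$) and division by $p(X=0)>0$:

$$\begin{aligned}
p_{w_0,x_0}\left(1-q_{w_0\mid r_1}\right)-p_{w_1,x_0}q_{w_0\mid r_1}\geq 0
&\Leftrightarrow p_{w_0,x_0}\,p_{w_1\mid x_1}-p_{w_1,x_0}\,p_{w_0\mid x_1}\geq 0\\
&\Leftrightarrow p_{w_0\mid x_0}\,p_{w_1\mid x_1}-p_{w_1\mid x_0}\,p_{w_0\mid x_1}\geq 0.
\end{aligned}$$

The same substitution for a column $x\neq 0$ gives $p(X=x)\left(p_{w_0\mid x}\,p_{w_1\mid x_1}-p_{w_1\mid x}\,p_{w_0\mid x_1}\right)$, which vanishes under the marginal constraint $p_{w_0\mid x}=p_{w_0\mid x_1}$. This is why the second line of (39) holds with equality.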

The first equation shows that the $s=1$ case has no solution if $p_{w_0\mid x_0}p_{w_1\mid x_1}-p_{w_1\mid x_0}p_{w_0\mid x_1}<0$. When the LHS is non-negative, the feasible region $\mathcal{B}_{w_0\mid r_0}^{s=1}$ is $\max_{x}p_{w_0\mid x}\leq q_{w_0\mid r_0}\leq 1$.
We can further split into 2 cases, noting that $q_{w_0\mid r_0}>q_{w_0\mid r_1}$ because $s=1$.

  1. If $p_{w_0 \mid x_0} p_{w_1 \mid x_1} - p_{w_1 \mid x_0} p_{w_0 \mid x_1} = 0 \Leftrightarrow p_{w_0 \mid x_0} = p_{w_0 \mid x_1}$, which is true for all values in $[0,1]$. Then $p_{w_0 \mid x_0} < q_{w_0 \mid r_0} \leq 1$. For this to make sense, we must have $p_{w_0 \mid x_0} < 1$.

  2. If $p_{w_0 \mid x_0} p_{w_1 \mid x_1} - p_{w_1 \mid x_0} p_{w_0 \mid x_1} > 0 \Leftrightarrow p_{w_0 \mid x_0} > p_{w_0 \mid x_1}$, which is true for all $0 \leq p_{w_0 \mid x_1} < 1$. Then $p_{w_0 \mid x_0} \leq q_{w_0 \mid r_0} \leq 1$.

The bounds are sharp because they are the feasible regions $\mathcal{B}_{w_0 \mid r_0}^{s=1}$.

Solving $\mathcal{B}_{w_0 \mid r_0}^{s=-1}$: Similarly,

\begin{align}
p_{w_0, x_0}\left(1 - q_{w_0 \mid r_1}\right) - p_{w_1, x_0}\, q_{w_0 \mid r_1} &\leq 0 \;\Leftrightarrow\; p_{w_0 \mid x_0} p_{w_1 \mid x_1} - p_{w_1 \mid x_0} p_{w_0 \mid x_1} \leq 0 \tag{40} \\
\forall x \neq 0,\quad p_{w_0, x}\left(1 - q_{w_0 \mid r_1}\right) - p_{w_1, x}\, q_{w_0 \mid r_1} &\leq 0 \;\Leftrightarrow\; \forall x \neq 0,\ p_{w_0, x}\, p_{w_1 \mid x_1} - p_{w_1, x}\, p_{w_0 \mid x_1} = 0 \notag \\
p_{w_0, x_0}\left(q_{w_0 \mid r_0} - 1\right) + p_{w_1, x_0}\, q_{w_0 \mid r_0} &\leq 0 \;\Leftrightarrow\; q_{w_0 \mid r_0} \leq \frac{p_{w_0, x_0}}{p_{w_0, x_0} + p_{w_1, x_0}} = p_{w_0 \mid x_0} \notag \\
\forall x \neq 0,\quad p_{w_0, x}\left(q_{w_0 \mid r_0} - 1\right) + p_{w_1, x}\, q_{w_0 \mid r_0} &\leq 0 \;\Leftrightarrow\; \forall x \neq 0,\ q_{w_0 \mid r_0} \leq \frac{p_{w_0, x}}{p_{w_0, x} + p_{w_1, x}} = p_{w_0 \mid x}. \notag
\end{align}

The first equation shows that the $s=-1$ case has no solution if $p_{w_0 \mid x_0} p_{w_1 \mid x_1} - p_{w_1 \mid x_0} p_{w_0 \mid x_1} > 0$. When the LHS is non-positive, the feasible region $\mathcal{B}_{w_0 \mid r_0}^{s=-1}$ is $0 \leq q_{w_0 \mid r_0} \leq \min_x p_{w_0 \mid x}$.
We can further split into 2 cases, noting that $q_{w_0 \mid r_0} < q_{w_0 \mid r_1}$, per $s=-1$.

  1. If $p_{w_0 \mid x_0} p_{w_1 \mid x_1} - p_{w_1 \mid x_0} p_{w_0 \mid x_1} = 0 \Leftrightarrow p_{w_0 \mid x_0} = p_{w_0 \mid x_1}$, which is true for all values in $[0,1]$. Then $0 \leq q_{w_0 \mid r_0} < p_{w_0 \mid x_0}$. For this to make sense, we must have $p_{w_0 \mid x_0} > 0$.

  2. If $p_{w_0 \mid x_0} p_{w_1 \mid x_1} - p_{w_1 \mid x_0} p_{w_0 \mid x_1} < 0 \Leftrightarrow p_{w_0 \mid x_0} < p_{w_0 \mid x_1}$, which is true for all $0 < p_{w_0 \mid x_1} \leq 1$. Then $0 \leq q_{w_0 \mid r_0} \leq p_{w_0 \mid x_0}$.

The bounds are sharp because they are the feasible regions $\mathcal{B}_{w_0 \mid r_0}^{s=-1}$.

Result

Combine these results to get the compatibility bound $\mathcal{B}_{w_0 \mid r_0} = \bigcup_{s \in \{1, -1\}} \mathcal{B}_{w_0 \mid r_0}^{s}$. The bounds are sharp.

\begin{align*}
q_{w_0 \mid r_1} &= p_{w_0 \mid x_1} \\
q_{w_0 \mid r_0} &\in \begin{cases}
[p_{w_0 \mid x_0}, 1] & \text{if } p_{w_0 \mid x_0} > p_{w_0 \mid x_1} \\
[0, p_{w_0 \mid x_0}] & \text{if } p_{w_0 \mid x_0} < p_{w_0 \mid x_1} \\
(0, 1) \setminus \{p_{w_0 \mid x_0}\} & \text{if } 0 < p_{w_0 \mid x_0} = p_{w_0 \mid x_1} < 1
\end{cases}
\end{align*}
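The case analysis above can be sketched in code. The following Python snippet is an illustration only and is not part of the original derivation: the names `Bound` and `compatibility_bound` are hypothetical, and the inputs are assumed to be the exact observed conditionals $p_{w_0 \mid x_0}$ and $p_{w_0 \mid x_1}$. With estimated quantities, the exact equality test would need to be replaced by a tolerance or a statistical test.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Bound:
    """Set of compatible values for q(W=0 | R=0) (hypothetical container)."""
    lower: float
    upper: float
    closed: bool                      # True: [lower, upper]; False: (lower, upper)
    excluded: Optional[float] = None  # single point removed from the interval


def compatibility_bound(p_w0_x0: float, p_w0_x1: float) -> Bound:
    """Compatible set for q(W=0 | R=0) given p(W=0 | X=0) and p(W=0 | X=1)."""
    for p in (p_w0_x0, p_w0_x1):
        if not 0.0 <= p <= 1.0:
            raise ValueError("conditional probabilities must lie in [0, 1]")
    if p_w0_x0 > p_w0_x1:   # s = 1 branch
        return Bound(p_w0_x0, 1.0, closed=True)
    if p_w0_x0 < p_w0_x1:   # s = -1 branch
        return Bound(0.0, p_w0_x0, closed=True)
    if p_w0_x0 in (0.0, 1.0):
        # Degenerate equal case, stated above to be excluded by the model.
        raise ValueError("p(w0|x0) = p(w0|x1) in {0, 1} is not allowed by the model")
    return Bound(0.0, 1.0, closed=False, excluded=p_w0_x0)
```

For instance, `compatibility_bound(0.7, 0.4)` returns the closed interval $[0.7, 1]$, while `compatibility_bound(0.5, 0.5)` returns $(0, 1) \setminus \{0.5\}$.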

The situations $p_{w_0 \mid x_0} = p_{w_0 \mid x_1} \in \{0, 1\}$ are not allowed by the model. Moreover, if $p_{w_0 \mid x_0} = p_{w_0 \mid x_1}$ then $q(R=0) = 0$, i.e., zero inflation does not occur. Proof:

\begin{align}
p_{w_0 \mid x_0} &= q_{w_0 \mid r_0}\, q_{r_0 \mid x_0} + q_{w_0 \mid r_1}\, q_{r_1 \mid x_0} \tag{41} \\
p_{w_0 \mid x_1} &= q_{w_0 \mid r_1} \quad (\text{id}) \notag
\end{align}

Therefore, subtracting both sides,

\begin{equation}
0 = p_{w_0 \mid x_0} - p_{w_0 \mid x_1} = q_{w_0 \mid r_0}\, q_{r_0 \mid x_0} - q_{w_0 \mid r_1}\, q_{r_0 \mid x_0} = \left(q_{w_0 \mid r_0} - q_{w_0 \mid r_1}\right) q_{r_0 \mid x_0}. \tag{42}
\end{equation}

Due to proxy assumption A2: $q_{w_0 \mid r_0} \neq q_{w_0 \mid r_1}$. Then the LHS equals $0$ if and only if $q_{r_0 \mid x_0} = 0$. Moreover, Z implies $\forall x \neq 0,\ q_{r_0 \mid x} = 0$. Then $q_{r_0} = \sum_x q_{r_0 \mid x}\, p_x = 0$.
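The identity in Eq. (42) can be checked numerically. The sketch below is an illustration under stated assumptions, not the paper's procedure: it assumes a ZI MCAR full law in which $R$ is independent of $X^{(1)}$ and $W$ depends on the other variables only through $R$, and the function name `observed_quantities` and its parameters are hypothetical.

```python
import math


def observed_quantities(p_x1_zero, q_r0, q_w0_r0, q_w0_r1):
    """Return (p(w0|x0), p(w0|x1), q(r0|x0)) implied by an assumed ZI MCAR law.

    p_x1_zero : q(X^(1) = 0), probability of a true zero (assumed > 0)
    q_r0      : q(R = 0), probability that a value is zero-inflated
    q_w0_r0   : q(W = 0 | R = 0)
    q_w0_r1   : q(W = 0 | R = 1)
    """
    p_x0_r0 = q_r0                      # R = 0 forces X = 0
    p_x0_r1 = (1.0 - q_r0) * p_x1_zero  # R = 1 and the true value is 0
    p_x0 = p_x0_r0 + p_x0_r1
    q_r0_x0 = p_x0_r0 / p_x0
    p_w0_x0 = q_w0_r0 * q_r0_x0 + q_w0_r1 * (1.0 - q_r0_x0)  # Eq. (41)
    p_w0_x1 = q_w0_r1                   # X = 1 implies R = 1
    return p_w0_x0, p_w0_x1, q_r0_x0


# Eq. (42): the difference factorizes as (q_w0_r0 - q_w0_r1) * q(r0|x0).
p_w0_x0, p_w0_x1, q_r0_x0 = observed_quantities(0.3, 0.2, 0.9, 0.1)
assert math.isclose(p_w0_x0 - p_w0_x1, (0.9 - 0.1) * q_r0_x0)

# Without zero inflation (q_r0 = 0) the two observed conditionals coincide.
p_w0_x0, p_w0_x1, q_r0_x0 = observed_quantities(0.3, 0.0, 0.9, 0.1)
assert q_r0_x0 == 0.0 and math.isclose(p_w0_x0, p_w0_x1)
```

The second check mirrors the direction used in the proof: since A2 rules out $q_{w_0 \mid r_0} = q_{w_0 \mid r_1}$, equal observed conditionals can only arise when $q_{r_0 \mid x_0} = 0$.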

A.5 ZI MAR proofs

Figure 6: The graph considered in Theorem 4, with nodes $X^{(1)}$, $X$, $R$, $C$, $W$: proxy-augmented ZI MAR model satisfying A1 and A2 (Fig. 2 (b) in the main paper).

Theorem 4 Consider a ZI MAR model in Fig. 2 (b) (reproduced in Fig. 6) under proxy assumptions A1 and A2, with categorical X,C𝑋𝐶X,Citalic_X , italic_C and binary R,W𝑅𝑊R,Witalic_R , italic_W. Given a consistent observed law p(X,W,C)𝑝𝑋𝑊𝐶p(X,W,C)italic_p ( italic_X , italic_W , italic_C ) satisfying positivity assumption, x,c,p(x,c)>0for-all𝑥𝑐𝑝𝑥𝑐0\forall x,c,p(x,c)>0∀ italic_x , italic_c , italic_p ( italic_x , italic_c ) > 0, the set of compatible proxy-indicator conditional distributions q(WR,C)𝑞conditional𝑊𝑅𝐶q(W\mid R,C)italic_q ( italic_W ∣ italic_R , italic_C ) is given by, for each value c𝑐citalic_c,

$$
\begin{aligned}
q_{w_0\mid r_1,c} &= p_{w_0\mid x_1,c}\\
q_{w_0\mid r_0,c} &\in
\begin{cases}
[p_{w_0\mid x_0,c},\,1] & \text{if } p_{w_0\mid x_0,c}>p_{w_0\mid x_1,c}\\
[0,\,p_{w_0\mid x_0,c}] & \text{if } p_{w_0\mid x_0,c}<p_{w_0\mid x_1,c}\\
(0,1)\setminus\{p_{w_0\mid x_0,c}\} & \text{if } p_{w_0\mid x_0,c}=p_{w_0\mid x_1,c}
\end{cases}
\end{aligned}
$$

These bounds are sharp. Moreover, if $p_{w_0\mid x_0,c}=p_{w_0\mid x_1,c}$, then $p(X,W,C)$ must satisfy $0<p_{w_0\mid x_0,c}<1$, and zero inflation does not occur for stratum $C=c$, i.e., $q(R=0\mid c)=0$.

Proof.

Model definition.

We assume $C$ is a categorical variable, taking values in a finite set $\mathcal{C}$. Any joint distribution in this ZI MAR model factorizes as

$$
q\left(X^{(1)},R,X,W,C\right)=q\left(X^{(1)},R,X,W\mid C\right)p(C). \tag{43}
$$

Since the Markov factors are variationally independent, the ZI MAR model is a Cartesian product

$$
\begin{aligned}
\mathcal{P}^{(1)}_{\text{ZI MAR}} &= \left(\textstyle\bigotimes_{c\in\mathcal{C}}\mathcal{P}^{(1)}_{\text{ZI MCAR}}(c)\right)\otimes\mathcal{P}_{C}\\
\mathcal{P}_{C} &= \{q(C)\},\\
\mathcal{P}^{(1)}_{\text{ZI MCAR}}(c) &= \left\{q(X^{(1)},R,X,W\mid c)\mid\textbf{A1},\textbf{A2}\right\}
\end{aligned}
\tag{44}
$$

Note that imposing constraints A1, A2 on the model is equivalent to imposing A1, A2 within each stratum $C=c$. Notation: (i) $\mathcal{P}_{C}=\{q(C)\}$ means that $\mathcal{P}_{C}$ is a non-parametric model containing all probability distributions $q(C)$, and (ii) the usual probability constraints (non-negativity and normalization) are assumed to hold.

In this product, the factors $\mathcal{P}^{(1)}_{\text{ZI MCAR}}(c)$ are, for every $c$, the same ZI MCAR model described in Theorem 3, repeated $|\mathcal{C}|$ times. The value $c$ is not a parameter of the model $\mathcal{P}^{(1)}_{\text{ZI MCAR}}(c)$ but a constant; it serves only as book-keeping when constructing the joint distribution in $\mathcal{P}^{(1)}_{\text{ZI MAR}}$. For MAR, the standard adjustment method point-identifies $q(X^{(1)},R,X,W)$ as a functional of $q(R,X,W)$. Therefore, as shown in Lemma 4, the set $\mathcal{P}^{(1)}_{\text{ZI MCAR}}(c)$ is in 1-to-1 correspondence with the set $\mathcal{P}_{\text{ZI MCAR}}(c)=\left\{q(R,X,W\mid c)\mid\mathbf{Z},\mathbf{A1},\mathbf{A2}\right\}$. Hence, we are interested in the marginal model

$$
\mathcal{P}_{\text{ZI MAR}}=\left(\textstyle\bigotimes_{c\in\mathcal{C}}\mathcal{P}_{\text{ZI MCAR}}(c)\right)\otimes\mathcal{P}_{C}. \tag{45}
$$

Finding compatible set.

Given an observed law $\mathbf{p}_{WXC}$, we want to find the compatible set w.r.t. this law

$$
\mathcal{Q}=\left\{\mathbf{q}_{XRWC}\mid\mathbf{q}_{XRWC}\in\mathcal{P}_{\text{ZI MAR}},\ \forall c\in\mathcal{C}\left(\mathbf{q}_{W\mid Rc}\,\mathbf{q}_{RX\mid c}=\mathbf{p}_{WX\mid c}\right),\ \forall c\in\mathcal{C}\left(q(c)=p(c)\right)\right\}. \tag{46}
$$

Geometrically speaking, this set is the intersection of our model $\mathcal{P}_{\text{ZI MAR}}$ with the constraint set $\mathcal{E}$, which is itself a Cartesian product,

$$
\begin{aligned}
\mathcal{E} &= \left\{\mathbf{q}_{XRWC}\mid\forall c\in\mathcal{C}\left(\mathbf{q}_{W\mid Rc}\,\mathbf{q}_{RX\mid c}=\mathbf{p}_{WX\mid c}\right),\text{ and }\forall c\in\mathcal{C}\left(q(c)=p(c)\right)\right\}\\
&= \textstyle\bigotimes_{c\in\mathcal{C}}\left\{\mathbf{q}_{XRW\mid c}\mid\mathbf{q}_{W\mid Rc}\,\mathbf{q}_{RX\mid c}=\mathbf{p}_{WX\mid c}\right\}\otimes\left\{p(c)\right\}.
\end{aligned}
\tag{47}
$$

Here we abuse notation and write $\left\{p(c)\right\}$ for the set with one element, the observed law $p(C)$; this is not the model $\mathcal{P}_{C}$.

Since the constraint $\mathbf{q}_{W\mid Rc}\,\mathbf{q}_{RX\mid c}=\mathbf{p}_{WX\mid c}$ only concerns $q(R,X,W\mid c)$ and does not involve $q(R,X,W\mid c')$ for any other $c'$, we can push each constraint into the corresponding $\mathcal{P}_{\text{ZI MCAR}}(c)$. In other words, we will find the ZI MCAR compatibility bound for each level $c$, as shown below. Formally, an intersection of Cartesian products is the Cartesian product of the intersections: if $A,C\subseteq U$ and $B,D\subseteq V$, then

$$
\begin{aligned}
(A\otimes B)\cap(C\otimes D) &= (A\otimes V)\cap(U\otimes B)\cap(C\otimes V)\cap(U\otimes D)\\
&= \big((A\cap C)\otimes V\big)\cap\big(U\otimes(B\cap D)\big)\\
&= (A\cap C)\otimes(B\cap D).
\end{aligned}
\tag{48}
$$
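The set identity in (48) can be checked on small finite sets. The sketch below is only an illustration: the sets `U`, `V`, `A`, `B`, `C`, `D` and the helper `cartesian` are hypothetical choices made for this example and do not come from the paper.

```python
from itertools import product


def cartesian(left, right):
    """Return the Cartesian product of two finite sets as a set of pairs."""
    return set(product(left, right))


# Hypothetical ambient sets U, V and subsets A, C of U and B, D of V.
U = {1, 2, 3, 4}
V = {"a", "b", "c"}
A, C = {1, 2, 3}, {2, 3, 4}
B, D = {"a", "b"}, {"b", "c"}

# Left-hand side of (48): intersection of two product sets.
lhs = cartesian(A, B) & cartesian(C, D)

# Middle expression of (48): each factor is embedded in the ambient space.
mid = cartesian(A, V) & cartesian(U, B) & cartesian(C, V) & cartesian(U, D)

# Right-hand side of (48): product of the component-wise intersections.
rhs = cartesian(A & C, B & D)

print(lhs == mid == rhs)
```

In the proof, the same identity is applied factor by factor, with one factor per stratum $c$ and one for the law of $C$.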

Applying this identity, we can rewrite

$$
\begin{aligned}
\mathcal{Q} &= \mathcal{P}_{\text{ZI MAR}}\cap\mathcal{E}
= \bigg(\Big(\textstyle\bigotimes_{c\in\mathcal{C}}\mathcal{P}_{\text{ZI MCAR}}(c)\Big)\otimes\mathcal{P}_{C}\bigg)\cap\bigg(\textstyle\bigotimes_{c\in\mathcal{C}}\left\{\mathbf{q}_{XRW\mid c}\mid\mathbf{q}_{W\mid Rc}\,\mathbf{q}_{RX\mid c}=\mathbf{p}_{WX\mid c}\right\}\otimes\left\{p(c)\right\}\bigg)\\
&= \bigg(\Big(\textstyle\bigotimes_{c\in\mathcal{C}}\mathcal{P}_{\text{ZI MCAR}}(c)\Big)\cap\Big(\textstyle\bigotimes_{c\in\mathcal{C}}\left\{\mathbf{q}_{XRW\mid c}\mid\mathbf{q}_{W\mid Rc}\,\mathbf{q}_{RX\mid c}=\mathbf{p}_{WX\mid c}\right\}\Big)\bigg)\otimes\left\{p(c)\right\}\\
&= \bigg(\textstyle\bigotimes_{c\in\mathcal{C}}\Big(\mathcal{P}_{\text{ZI MCAR}}(c)\cap\left\{\mathbf{q}_{XRW\mid c}\mid\mathbf{q}_{W\mid Rc}\,\mathbf{q}_{RX\mid c}=\mathbf{p}_{WX\mid c}\right\}\Big)\bigg)\otimes\left\{p(c)\right\}\\
&= \Big(\textstyle\bigotimes_{c\in\mathcal{C}}\mathcal{Q}_{\text{ZI MCAR}}(c)\Big)\otimes\left\{p(c)\right\},
\end{aligned}
\tag{49}
$$

where

$$
\mathcal{Q}_{\text{ZI MCAR}}(c)=\left\{q(R,X,W\mid c)\mid\mathbf{Z},\mathbf{A1},\mathbf{A2},\ \mathbf{q}_{W\mid Rc}\,\mathbf{q}_{RX\mid c}=\mathbf{p}_{WX\mid c}\right\}. \tag{50}
$$

This is exactly the set $\mathcal{Q}$ described in Theorem 3, so Theorem 3 can be applied to each stratum $C=c$. First, there are marginal constraints: $\forall c,\ \forall x\neq 0,\ p_{w_0\mid x,c}=p_{w_0\mid x_1,c}$. Second,

$$
\mathcal{Q}_{\text{ZI MCAR}}(c)=\left\{(\mathbf{q}_{W\mid Rc},\mathbf{q}_{RX\mid c})\mid\mathbf{q}_{RX\mid c}=[\mathbf{q}_{W\mid Rc}]^{-1}\mathbf{p}_{WX\mid c},\ \mathbf{q}_{W\mid Rc}\in\mathcal{B}\right\} \tag{51}
$$

where $\mathcal{B}$ contains the stochastic matrices $\mathbf{q}_{W\mid Rc}$ satisfying

$$
\begin{aligned}
q_{w_0\mid r_1,c} &= p_{w_0\mid x_1,c}\\
q_{w_0\mid r_0,c} &\in
\begin{cases}
[p_{w_0\mid x_0,c},\,1] & \text{if } p_{w_0\mid x_0,c}>p_{w_0\mid x_1,c}\\
[0,\,p_{w_0\mid x_0,c}] & \text{if } p_{w_0\mid x_0,c}<p_{w_0\mid x_1,c}\\
(0,1)\setminus\{p_{w_0\mid x_0,c}\} & \text{if } 0<p_{w_0\mid x_0,c}=p_{w_0\mid x_1,c}<1.
\end{cases}
\end{aligned}
$$

Moreover, if $p_{w_0\mid x_0,c}=p_{w_0\mid x_1,c}$, then $0<p_{w_0\mid x_0,c}<1$ is an additional condition, and zero inflation does not occur for stratum $C=c$, i.e., $q(R=0\mid c)=0$. Since the compatibility set $\mathcal{Q}$ in this case is a Cartesian product of the compatibility sets described in Theorem 3, each of which is sharp, $\mathcal{Q}$ is sharp. ∎
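The parametrization in (51) can be illustrated numerically for a single stratum. The following is a minimal sketch under stated assumptions, not the authors' implementation: it assumes that rows of $\mathbf{q}_{W\mid Rc}$ index $W$ and columns index $R$, that $\mathbf{p}_{WX\mid c}$ holds the joint values $p(w,x\mid c)$, and that constraint $\mathbf{Z}$ amounts to $q(r_0,x\mid c)=0$ for $x\neq 0$. The function names and the toy numbers are hypothetical.

```python
from typing import List, Tuple


def stratum_bounds(p_w0_x0: float, p_w0_x1: float) -> Tuple[float, float]:
    """Closed interval for q(w0 | r0, c) in the two strict cases."""
    if p_w0_x0 > p_w0_x1:
        return (p_w0_x0, 1.0)
    if p_w0_x0 < p_w0_x1:
        return (0.0, p_w0_x0)
    raise ValueError("equality case: the set is (0, 1) minus one point")


def recover_q_rx(a: float, b: float, p_wx: List[List[float]]) -> List[List[float]]:
    """Solve q_{W|Rc} q_{RX|c} = p_{WX|c} for q_{RX|c} with a 2x2 inverse.

    a = q(w0 | r0, c), b = q(w0 | r1, c); p_wx[w][x] = p(w, x | c).
    """
    det = a - b  # determinant of [[a, b], [1 - a, 1 - b]]
    if det == 0:
        raise ValueError("proxy matrix is singular")
    q_r0 = [((1 - b) * p0 - b * p1) / det for p0, p1 in zip(p_wx[0], p_wx[1])]
    q_r1 = [(a * p1 - (1 - a) * p0) / det for p0, p1 in zip(p_wx[0], p_wx[1])]
    return [q_r0, q_r1]


def is_compatible(q_rx: List[List[float]], tol: float = 1e-9) -> bool:
    """Check non-negativity and q(r0, x | c) = 0 for every x != 0."""
    nonneg = all(v >= -tol for row in q_rx for v in row)
    zero_off_origin = all(abs(v) <= tol for v in q_rx[0][1:])
    return nonneg and zero_off_origin


# Hypothetical observed law p(w, x | c) with X in {0, 1, 2}.
p_wx = [[0.19, 0.12, 0.09],
        [0.11, 0.28, 0.21]]
p_w0_x0 = p_wx[0][0] / (p_wx[0][0] + p_wx[1][0])
p_w0_x1 = p_wx[0][1] / (p_wx[0][1] + p_wx[1][1])

lo, hi = stratum_bounds(p_w0_x0, p_w0_x1)
inside = recover_q_rx(0.8, p_w0_x1, p_wx)   # candidate inside [lo, hi]
outside = recover_q_rx(0.5, p_w0_x1, p_wx)  # candidate below lo
print(is_compatible(inside), is_compatible(outside))
```

In this toy example a candidate value inside the interval yields a valid $\mathbf{q}_{RX\mid c}$, while a candidate below the lower bound yields a negative entry at $x=0$.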

[Figure 7 graph: nodes $X^{(1)}$, $X$, $R$, $C$, $W$.]
Figure 7: The graph considered in Theorem 5: proxy-augmented ZI MAR model satisfying A1 and A2 (Fig. 2 (c) in the main paper).

Theorem 5. Consider a ZI MAR model in Fig. 2 (c) (reproduced in Fig. 7) under proxy assumptions A1 and A2, with categorical $X,C$ and binary $R,W$. Given a consistent observed law $p(X,W,C)$ satisfying the positivity assumption, $\forall x,c,\ p(x,c)>0$, the set of compatible proxy-indicator conditional distributions $q(W\mid R)$ is given by

$$
\begin{aligned}
q_{w_0\mid r_1} &= p_{w_0\mid x_1}\\
q_{w_0\mid r_0} &\in
\begin{cases}
[\max_{c}p_{w_0\mid x_0,c},\,1] & \text{if } \exists\tilde{c},\ p_{w_0\mid x_0,\tilde{c}}>p_{w_0\mid x_1}\\
[0,\,\min_{c}p_{w_0\mid x_0,c}] & \text{if } \exists\tilde{c},\ p_{w_0\mid x_0,\tilde{c}}<p_{w_0\mid x_1}\\
(0,1)\setminus\{p_{w_0\mid x_1}\} & \text{if } \forall c,\ p_{w_0\mid x_0,c}=p_{w_0\mid x_1}
\end{cases}
\end{aligned}
$$

These bounds are sharp. Moreover, if $\forall c,\ p_{w_0\mid x_0,c}=p_{w_0\mid x_1}$, then $p(X,W,C)$ must satisfy $\forall c,\ 0<p_{w_0\mid x_0,c}<1$, and zero inflation does not occur, i.e., $q(R=0)=0$.
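As an illustration of how the three cases of Theorem 5 combine the strata, the sketch below computes the interval for $q_{w_0\mid r_0}$ from the stratum-wise values $p_{w_0\mid x_0,c}$ and the shared value $p_{w_0\mid x_1}$. The function name, the dictionary input and the returned labels are hypothetical. Raising an error when some strata lie above and others below $p_{w_0\mid x_1}$ is an assumption of this sketch: the theorem excludes that situation by requiring a consistent observed law.

```python
from typing import Dict, Tuple


def shared_proxy_bounds(p_w0_x0_by_c: Dict[str, float],
                        p_w0_x1: float,
                        tol: float = 1e-12) -> Tuple[float, float, str]:
    """Return (lower, upper, kind) for q(w0 | r0) when q(W | R) is shared.

    p_w0_x0_by_c maps each stratum label c to p(w0 | x0, c).
    kind is "closed" for the two interval cases and
    "open, excluding p_w0_x1" for the equality case.
    """
    values = list(p_w0_x0_by_c.values())
    above = any(v > p_w0_x1 + tol for v in values)
    below = any(v < p_w0_x1 - tol for v in values)
    if above and below:
        # Assumption of this sketch: such a law cannot be consistent.
        raise ValueError("strata disagree on the direction of the bound")
    if above:
        # Every stratum imposes a lower bound; the largest one binds.
        return (max(values), 1.0, "closed")
    if below:
        # Every stratum imposes an upper bound; the smallest one binds.
        return (0.0, min(values), "closed")
    return (0.0, 1.0, "open, excluding p_w0_x1")


# Hypothetical stratum-wise values of p(w0 | x0, c).
upper_case = shared_proxy_bounds({"c0": 0.6, "c1": 0.75}, 0.3)
lower_case = shared_proxy_bounds({"c0": 0.2, "c1": 0.1}, 0.3)
equal_case = shared_proxy_bounds({"c0": 0.3, "c1": 0.3}, 0.3)
print(upper_case, lower_case, equal_case)
```

Compared with the stratum-wise result, sharing $q(W\mid R)$ across strata can only tighten the interval, since every stratum constrains the same matrix.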

Proof.

Model definition. We assume $C$ is a categorical variable, taking values in a finite set $\mathcal{C}$. Any joint distribution in this ZI MAR model factorizes as

$$
q\left(X^{(1)},R,X,W,C\right)=q\left(X^{(1)},R,X\mid C\right)q(W\mid R)\,q(C) \tag{52}
$$

Since the Markov factors are variationally independent, the ZI MAR model is a Cartesian product

$$
\begin{aligned}
\mathcal{P}^{(1)}_{\text{ZI MAR}} &= \left(\textstyle\bigotimes_{c\in\mathcal{C}}\mathcal{P}_{X^{(1)}XR}(c)\right)\otimes\mathcal{P}_{W\mid R}\otimes\mathcal{P}_{C}\\
\mathcal{P}_{C} &= \{q(C)\},\\
\mathcal{P}_{W\mid R} &= \{q(W\mid R)\mid\det\mathbf{q}_{W\mid R}\neq 0\},\\
\mathcal{P}_{X^{(1)}XR}(c) &= \left\{q(X^{(1)},R,X\mid c)\right\}
\end{aligned}
\tag{53}
$$

Lemma 4 says that $\mathcal{P}_{X^{(1)}XR}(c)$ is in 1-to-1 correspondence with the set $\mathcal{P}_{XR}(c)=\left\{p(R,X\mid c)\mid\mathbf{Z}\right\}$. Hence, we are interested in the marginal model

$$
\mathcal{P}_{\text{ZI MAR}}=\left(\textstyle\bigotimes_{c\in\mathcal{C}}\mathcal{P}_{XR}(c)\right)\otimes\mathcal{P}_{W\mid R}\otimes\mathcal{P}_{C}. \tag{54}
$$

Finding compatible set.

Given an observed law $\mathbf{p}_{WXC}$, we want to find the compatible set w.r.t. this law

$$
\mathcal{Q}=\left\{\mathbf{q}_{XRWC}\mid\mathbf{q}_{XRWC}\in\mathcal{P}_{\text{ZI MAR}},\ \forall c\in\mathcal{C}\left(\mathbf{q}_{W\mid R}\,\mathbf{q}_{RX\mid c}=\mathbf{p}_{WX\mid c}\right),\ \forall c\in\mathcal{C}\left(q(c)=p(c)\right)\right\}. \tag{55}
$$

This is similar to the compatible set we consider when A1, A2 hold (e.g., when $C\rightarrow W$), except the same $\mathbf{q}_{W\mid R}$ is shared between the constraints $\mathbf{q}_{W\mid R}\mathbf{q}_{RX\mid c}=\mathbf{p}_{WX\mid c}$. Each constraint restricts $\mathbf{q}_{W\mid R}$ in a different way, hence we cannot write $\mathcal{Q}$ as a Cartesian product to separate the constraints as we did before.

To proceed, note that $\mathcal{Q}$ is 1-to-1 to a set containing only $\mathbf{q}_{W\mid R}$, just as in ZI MCAR proof.

$$\mathcal{Q}=\left\{(\mathbf{q}_{W\mid R},(\mathbf{q}_{RX\mid c})_{c\in\mathcal{C}})\mid\forall c\left(\mathbf{q}_{RX\mid c}=[\mathbf{q}_{W\mid R}]^{-1}\mathbf{p}_{WX\mid c}\right),\ \mathbf{q}_{W\mid R}\in\mathcal{B}\right\}\otimes\{p(C)\} \tag{56}$$

$$\mathcal{B}=\left\{\begin{array}{c|c}\mathbf{q}_{W\mid R}&\begin{aligned}&\mathbf{q}_{W\mid R}\geq 0,\ \forall r\left(\textstyle\sum_{w}q_{w\mid r}=1\right),\ q_{w_0\mid r_0}\neq q_{w_0\mid r_1},\\&\text{for each }c:\ \mathbf{q}_{RX\mid c}\geq 0,\ \textstyle\sum_{rx}q_{rx\mid c}=1,\ \forall x\neq 0\,(q_{r_0xc}=0),\\&\quad\text{where }\mathbf{q}_{RX\mid c}=[\mathbf{q}_{W\mid R}]^{-1}\mathbf{p}_{WX\mid c}.\end{aligned}\end{array}\right\}.$$

This set is the intersection $\mathcal{B}=\cap_{c\in\mathcal{C}}\mathcal{B}_{c}$, in which each $\mathcal{B}_{c}$ contains only constraints associated with values $c$.

$$\mathcal{B}_{c}=\left\{\begin{array}{c|c}\mathbf{q}_{W\mid R}&\begin{aligned}&\mathbf{q}_{W\mid R}\geq 0,\ \forall r\left(\textstyle\sum_{w}q_{w\mid r}=1\right),\ q_{w_0\mid r_0}\neq q_{w_0\mid r_1},\\&\mathbf{q}_{RX\mid c}\geq 0,\ \textstyle\sum_{rx}q_{rx\mid c}=1,\ \forall x\neq 0\left(q_{r_0xc}=0\right),\\&\text{where }\mathbf{q}_{RX\mid c}=[\mathbf{q}_{W\mid R}]^{-1}\mathbf{p}_{WX\mid c}.\end{aligned}\end{array}\right\}. \tag{57}$$

We have already solved for $\mathcal{B}_{c}$: it is the ZI MCAR compatibility set of $q(W\mid R)$ in Theorem 3. All we need is to take the intersection of these results, one for each $c$. This intersection $\mathcal{B}=\cap_{c\in\mathcal{C}}\mathcal{B}_{c}$ is non-empty, because there is some $q(R,X,W,C)$ that produces the given observed law. First, the identification of $q_{w_0\mid r_1}$ and the marginal constraints are

$$\forall c\in\mathcal{C},\forall x\neq 0,\ q_{w_0\mid r_1}=p_{w_0\mid x_1,c}=p_{w_0\mid x,c}. \tag{58}$$

The last equality is due to the marginal constraint discussed in Theorem 3. Then we can write

$$q_{w_0\mid r_1}=p_{w_0\mid x_1}. \tag{59}$$
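The identification of $q_{w_0\mid r_1}$ can be sketched in two steps. This is our reading of the argument, not a quotation of the proof of Theorem 3. It assumes only the factorization $q(x,r,w,c)=q(w\mid r)\,q(r,x\mid c)\,q(c)$ of the marginal model and the constraint that $R=0$ forces $X=0$:

```latex
\begin{align*}
p_{w_0\mid x,c}
  &= \sum_{r} q_{w_0\mid r}\, q_{r\mid x,c}
  && \text{(since } W \perp (X,C) \mid R \text{)} \\
  &= q_{w_0\mid r_1}
  && \text{(for } x\neq 0,\ q_{r_0\mid x,c}=0 \text{)}.
\end{align*}
```

The right-hand side depends on neither $x$ nor $c$, so averaging over $c$ leaves it unchanged, which is why the conditioning on $c$ can be dropped.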

Next, we consider each case of the bound for $q_{w_0\mid r_0}$.

  1. Suppose $p_{w_0\mid x_0,c'}>p_{w_0\mid x_1,c'}$ for some $c'$. Then by Theorem 3, $q_{w_0\mid r_0}>p_{w_0\mid x_1,c'}=q_{w_0\mid r_1}$, where the last equality follows from equation 58.

  2. Suppose $p_{w_0\mid x_0,c''}<p_{w_0\mid x_1,c''}$ for some $c''$. Then by Theorem 3, $q_{w_0\mid r_0}<p_{w_0\mid x_1,c''}=q_{w_0\mid r_1}$, where the last equality follows from equation 58.

Since $q_{w_0\mid r_0}$ cannot be both larger and smaller than $q_{w_0\mid r_1}$, these two cases are mutually exclusive, i.e., we must have the following marginal constraint

$$\text{either }\forall c\left(p_{w_0\mid x_0,c}\leq p_{w_0\mid x_1}\right)\text{ or }\forall c\left(p_{w_0\mid x_0,c}\geq p_{w_0\mid x_1}\right). \tag{60}$$

The corresponding bounds are

$$\begin{aligned}q_{w_0\mid r_1}&=p_{w_0\mid x_1}\\q_{w_0\mid r_0}&\in\begin{cases}[\max_{c}p_{w_0\mid x_0,c},1]&\text{if }\exists c',\ p_{w_0\mid x_0,c'}>p_{w_0\mid x_1},\\ [0,\min_{c}p_{w_0\mid x_0,c}]&\text{if }\exists c',\ p_{w_0\mid x_0,c'}<p_{w_0\mid x_1},\\ (0,1)\setminus\{p_{w_0\mid x_1}\}&\text{if }\forall c,\ 0<p_{w_0\mid x_0,c}=p_{w_0\mid x_1}<1.\end{cases}\end{aligned}$$

The max/min appears since we take the intersection of the bounds for $c$. Moreover, if $\forall c,\ p_{w_0\mid x_0,c}=p_{w_0\mid x_1}$, then $\forall c,\ 0<p_{w_0\mid x_0,c}<1$ is an additional condition, and zero inflation does not occur, i.e., $q(R=0)=0$. Due to the marginal constraints above, this exhausts all the cases.
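The case analysis above can be turned into a small routine. The following is a minimal sketch, not the authors' implementation: the function name `q_w0_r0_bounds`, the tolerance `tol`, and the returned case labels are hypothetical choices made for illustration. It takes the observed values $p_{w_0\mid x_0,c}$ (one per level of $C$) and $p_{w_0\mid x_1}$, and returns the interval for $q_{w_0\mid r_0}$.

```python
from typing import Sequence, Tuple


def q_w0_r0_bounds(
    p_w0_x0_by_c: Sequence[float],
    p_w0_x1: float,
    tol: float = 1e-12,
) -> Tuple[str, float, float]:
    """Return (case, lower, upper) for q(W=0 | R=0).

    p_w0_x0_by_c[c] is the observed p(W=0 | X=0, C=c);
    p_w0_x1 is the observed p(W=0 | X=1).
    The names and the tolerance are illustrative assumptions.
    """
    above = any(p > p_w0_x1 + tol for p in p_w0_x0_by_c)
    below = any(p < p_w0_x1 - tol for p in p_w0_x0_by_c)
    if above and below:
        # Both orderings occur: the either/or marginal constraint fails.
        raise ValueError("observed law violates the marginal constraint")
    if above:
        # Intersection over c of [p(w0 | x0, c), 1].
        return ("above", max(p_w0_x0_by_c), 1.0)
    if below:
        # Intersection over c of [0, p(w0 | x0, c)].
        return ("below", 0.0, min(p_w0_x0_by_c))
    # All equal: the set is the open interval (0, 1) minus the point
    # p_w0_x1; only the endpoints of that interval are reported here.
    return ("equal", 0.0, 1.0)
```

For instance, `q_w0_r0_bounds([0.7, 0.8], 0.5)` falls in the first case and reports the interval from the largest of the observed values up to 1. In the "equal" case the caller still has to exclude the endpoints and the single point $p_{w_0\mid x_1}$.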

Since the compatibility set $\mathcal{B}$ is the intersection of the sets $\mathcal{B}_{c}$, each of which is sharp in its own ZI MCAR model, the above bound is sharp.

We collect the marginal constraints obtained from the proofs of Theorem 4 and Theorem 5 into the following lemma.

Lemma 2 For a ZI MAR model in Fig. 2 (b) under A1 and A2, the observed law $p(X,W,C)$ obeys

$$\forall c,\forall x\neq 0,\ p_{w_0\mid x,c}=p_{w_0\mid x_1,c}. \tag{61}$$

For a ZI MAR model in Fig. 2 (c) under A1 and A2, the observed law $p(X,W,C)$ obeys

$$\begin{aligned}&\forall c,\forall x\neq 0,\ p_{w_0\mid x,c}=p_{w_0\mid x_1},\\&\text{either }\forall c\left(p_{w_0\mid x_0,c}\leq p_{w_0\mid x_1}\right)\text{ or }\forall c\left(p_{w_0\mid x_0,c}\geq p_{w_0\mid x_1}\right).\end{aligned} \tag{62}$$
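The constraints of Lemma 2 for the model in Fig. 2 (c) are testable on an observed table. The sketch below assumes the observed law is stored as a NumPy array `p_xwc[x, w, c]` with every $p(X=x, C=c)>0$. The function name `check_lemma2_constraints`, the array layout, and the tolerance are hypothetical and not part of the paper.

```python
import numpy as np


def check_lemma2_constraints(p_xwc: np.ndarray, tol: float = 1e-9) -> bool:
    """Check the two Lemma 2 constraints for the Fig. 2 (c) model.

    p_xwc[x, w, c] is the observed joint law p(X=x, W=w, C=c).
    Index 0 on the W axis is w0; index 0 on the X axis is x0.
    """
    p_xc = p_xwc.sum(axis=1)                      # p(X=x, C=c)
    p_w0_given_xc = p_xwc[:, 0, :] / p_xc         # p(W=0 | X=x, C=c)
    # p(W=0 | X=1), marginalised over C.
    p_w0_x1 = p_xwc[1, 0, :].sum() / p_xc[1, :].sum()

    # First constraint: p(w0 | x, c) is the same for all x != 0 and all c.
    equal_ok = np.allclose(p_w0_given_xc[1:, :], p_w0_x1, rtol=0.0, atol=tol)

    # Second constraint: p(w0 | x0, c) lies on one side of p(w0 | x1)
    # for every c simultaneously.
    row0 = p_w0_given_xc[0, :]
    order_ok = np.all(row0 <= p_w0_x1 + tol) or np.all(row0 >= p_w0_x1 - tol)
    return bool(equal_ok and order_ok)
```

A law that fails this check cannot have been generated by the model, so the check can serve as a falsification test before any bounds are computed.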

A.6 ZI MNAR proofs

[Figure 8 graphs. Panel (a) nodes: $X^{(1)}_1$, $X^{(1)}_2$, $R_1$, $R_2$, $X_1$, $X_2$, $W_1$, $W_2$. Panel (b) nodes: $R_1$, $X_1$, $X_2$, $W_1$, $W_2$.]
Figure 8: (a) The ZI bivariate block-parallel model. (b) The model Markov to this graph contains the marginal model for $p(X_1,X_2,R_1,W_1,W_2)$ in (a).

Lemma 3 Consider any ZI model in Section 3.2.1 under A1 and A2. Denote $Z_k\triangleq\{X,W,C\}\setminus\{W_k,X_k\}$. The observed law $p(X,W,C)$ must satisfy, for each $k$,

$$\begin{cases}\forall z_k,\forall x\neq 0,\ p_{w_{k0}\mid x_k=x,z_k}=p_{w_{k0}\mid x_{k1}},\\ \forall z_k\left(p_{w_{k0}\mid x_{k0},z_k}\leq p_{w_{k0}\mid x_{k1}}\right)\text{ or }\forall z_k\left(p_{w_{k0}\mid x_{k0},z_k}\geq p_{w_{k0}\mid x_{k1}}\right).\end{cases} \tag{63}$$
Proof.

Let $\tilde{\mathcal{G}}$ be the graph where $\{X_k,Z_k,R_k\}$ is fully connected, and $R_k\rightarrow W_k$. The Markov model $\mathcal{P}_{\tilde{\mathcal{G}}}$ for this graph contains all joint distributions $q(X_k,W_k,Z_k,R_k)=q(W_k\mid R_k)q(X_k,R_k,Z_k)$, where $q(X_k,R_k,Z_k)$ is from the saturated model restricted by $\mathbf{Z}$. In the original ZI model, we have $p(X,W,C,R_k)=p(W_k\mid R_k)p(X_k,R_k,Z_k)$ with $p(X_k,R_k,Z_k)$ satisfying $\mathbf{Z}$. Hence the model for this joint distribution is contained in $\mathcal{P}_{\tilde{\mathcal{G}}}$.

Subsequently, one could repeat the proof of Theorem 5 to find the bound for $q(W\mid R)$, with $W_k$ as $W$, $X_k$ as $X$, and $Z_k$ as $C$. The bound is not sharp, because we do not know the exact model for $q(X_k,R_k,Z_k)$. Similarly, application of Lemma 2 yields the desired constraints. ∎

Appendix B Simulations

B.1 Bound validity in random DGPs

DGPs for ZI MCAR and ZI MAR are randomly selected according to Fig. 2 (a) and (b), respectively. In particular, a DGP for ZI MAR is a joint distribution which factorizes as

$$p(X,X^{(1)},R,W,C)=p(X\mid R,X^{(1)})\,p(X^{(1)}\mid C)\,p(R\mid C)\,p(C)\,p(W\mid R) \tag{64}$$

Then, the observed law is $p(X,W,C)=\sum_{X^{(1)},R}p(X,X^{(1)},R,W,C)$.

We randomly select a DGP by sampling the following parameters

$$\begin{aligned}p(C=0)&\sim\operatorname{Uniform}[0,1]\\p(X^{(1)}=0\mid C=0)&\sim\operatorname{Uniform}[0,1]\\p(X^{(1)}=0\mid C=1)&\sim\operatorname{Uniform}[0,1]\\p(R=0\mid C=0)&\sim\operatorname{Uniform}[0,1]\\p(R=0\mid C=1)&\sim\operatorname{Uniform}[0,1]\\p(W=0\mid R=0)&\sim\operatorname{Uniform}[0,1]\\p(W=0\mid R=1)&\sim\operatorname{Uniform}[0,1]\end{aligned} \tag{65}$$

Furthermore, to satisfy ZI-consistency, we set

$$p(X = 0 \mid R = 0, X^{(1)}) = 1; \quad p(X = 1 \mid R = 1, X^{(1)} = 1) = 1; \quad p(X = 0 \mid R = 1, X^{(1)} = 0) = 1. \tag{66}$$
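
The sampling scheme above can be sketched in a few lines of NumPy. This is a minimal illustration for the ZI MAR case, not the code used to produce the results in this appendix: the function names, the dictionary keys, and the array layout `[x, x1, r, w, c]` are assumptions made for this example. It draws the seven free parameters of Eq. (65), builds the full law of Eq. (64) with the deterministic ZI-consistency terms of Eq. (66), and marginalizes over $X^{(1)}$ and $R$ to obtain the observed law.

```python
import numpy as np


def sample_zi_mar_dgp(rng):
    """Draw the free parameters of Eq. (65), each from Uniform[0, 1]."""
    return {
        "p_c0": rng.uniform(),                  # p(C = 0)
        "p_x1_0_given_c": rng.uniform(size=2),  # p(X^(1) = 0 | C = c), c = 0, 1
        "p_r0_given_c": rng.uniform(size=2),    # p(R = 0 | C = c), c = 0, 1
        "p_w0_given_r": rng.uniform(size=2),    # p(W = 0 | R = r), r = 0, 1
    }


def full_law(theta):
    """Joint p(X, X^(1), R, W, C) of Eq. (64), indexed as [x, x1, r, w, c]."""
    p_c = np.array([theta["p_c0"], 1.0 - theta["p_c0"]])
    p_x1_c = np.stack([theta["p_x1_0_given_c"], 1.0 - theta["p_x1_0_given_c"]])  # [x1, c]
    p_r_c = np.stack([theta["p_r0_given_c"], 1.0 - theta["p_r0_given_c"]])      # [r, c]
    p_w_r = np.stack([theta["p_w0_given_r"], 1.0 - theta["p_w0_given_r"]])      # [w, r]

    # ZI-consistency, Eq. (66): X = 0 whenever R = 0, and X = X^(1) when R = 1.
    p_x = np.zeros((2, 2, 2))  # [x, x1, r]
    p_x[0, :, 0] = 1.0
    p_x[0, 0, 1] = 1.0
    p_x[1, 1, 1] = 1.0

    # Product of the five factors in Eq. (64); 'a' stands for x1.
    return np.einsum("xar,ac,rc,c,wr->xarwc", p_x, p_x1_c, p_r_c, p_c, p_w_r)


def observed_law(joint):
    """Marginalize X^(1) and R out of the full law, leaving p(X, W, C) as [x, w, c]."""
    return joint.sum(axis=(1, 2))


rng = np.random.default_rng(0)
theta = sample_zi_mar_dgp(rng)
joint = full_law(theta)
obs = observed_law(joint)
```

Because $X = 1$ can only occur when $R = 1$ and $X^{(1)} = 1$, the entries `obs[1, w, c]` reduce to $p(C = c)\, p(X^{(1)} = 1 \mid C = c)\, p(R = 1 \mid C = c)\, p(W = w \mid R = 1)$, which gives a quick sanity check on the construction.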

B.2 Numerical bounds results

We compute numerical bounds using the method of Duarte et al. [2023] and compare them to our analytical bounds for DGPs in the ZI MCAR and ZI MAR models. Because computing the dual bound can take a very long time (more than 36 hours for some DGPs), we report only DGPs for which the primal bound is available (which may take only a few minutes to compute). We refer the reader to the original paper for the distinction between dual and primal bounds.

dgp  lb  ub  num lb  num ub  $p_{w_0 \mid r_0}$
0 0.556406 1.0 0.556411 1.0 0.820732
1 0.357830 1.0 0.357830 1.0 0.493695
2 0.0 0.520689 0.0 0.520689 0.453609
4 0.606499 1.0 0.606499 1.0 0.682699
5 0.0 0.524069 0.0 0.524061 0.496676
6 0.381825 1.0 0.381825 1.0 0.441227
8 0.652288 1.0 0.652288 1.0 0.659347
9 0.698149 1.0 0.698149 1.0 0.738794
10 0.0 0.443595 0.0 0.443595 0.442502
11 0.656867 1.0 0.656867 1.0 0.850498
12 0.211359 1.0 0.211359 1.0 0.856658
14 0.183034 1.0 0.183034 1.0 0.303129
15 0.648430 1.0 0.648430 1.0 0.833933
16 0.292337 1.0 0.292337 1.0 0.307559
17 0.500542 1.0 0.500542 1.0 0.553972
18 0.0 0.102988 0.0 0.102988 0.087253
20 0.0 0.479532 0.0 0.479532 0.238318
21 0.426615 1.0 0.426615 1.0 0.426787
22 0.399169 1.0 0.399169 1.0 0.494816
23 0.0 0.216052 0.0 0.216052 0.158163
24 0.436636 1.0 0.436636 1.0 0.533412
26 0.429579 1.0 0.429579 1.0 0.710488
27 0.0 0.500198 0.0 0.500199 0.451856
28 0.0 0.383471 0.0 0.383471 0.136093
29 0.0 0.325871 0.0 0.325871 0.070747
30 0.363744 1.0 0.363744 1.0 0.374293
Table 2: Comparison of our analytical lower and upper bounds (lb/ub) with the numerical bounds (num lb/num ub) for a randomly selected set of DGPs in the ZI MCAR model corresponding to Fig. 2 (a) (reproduced in Fig. 5). The true $p_{w_0 \mid r_0}$ is reported in the last column.

dgp  lb  ub  num lb  num ub  $p_{w_0 \mid r_0}$
0 0.0 0.429089 0.0 0.429089 0.413267
1 0.834644 1.0 0.834644 1.0 0.848638
2 0.0 0.340484 0.0 0.340484 0.319264
3 0.300217 1.0 0.300217 1.0 0.515513
4 0.582249 1.0 0.582249 1.0 0.688620
5 0.938604 1.0 0.938604 1.0 0.991572
6 0.0 0.147758 0.0 0.147758 0.053637
7 0.534321 1.0 0.534321 1.0 0.569545
8 0.720775 1.0 0.720775 1.0 0.726467
9 0.585611 1.0 0.592962 1.0 0.686385
10 0.261442 1.0 0.261442 1.0 0.303129
11 0.378136 1.0 0.378136 1.0 0.481036
12 0.0 0.729282 0.0 0.729282 0.703234
13 0.425249 1.0 0.425249 1.0 0.426797
14 0.612665 1.0 0.612665 1.0 0.632688
15 0.319180 1.0 0.319180 1.0 0.628988
16 0.0 0.702582 0.0 0.702582 0.660187
17 0.661849 1.0 0.661849 1.0 0.726963
18 0.594456 1.0 0.594456 1.0 0.600720
19 0.531541 1.0 0.532509 1.0 0.536156
21 0.596110 1.0 0.600331 1.0 0.606306
22 0.513144 1.0 0.513144 1.0 0.692834
23 0.0 0.560519 0.0 0.560519 0.536384
24 0.837194 1.0 0.837212 1.0 0.844800
25 0.0 0.443658 0.0 0.443658 0.302563
26 0.469323 1.0 0.469323 1.0 0.479800
28 0.688084 1.0 0.688084 1.0 0.826820
30 0.0 0.230720 0.0 0.230720 0.103594
Table 3: Comparison of our analytical lower and upper bounds (lb/ub) with the numerical bounds (num lb/num ub) for a randomly selected set of DGPs in the ZI MAR model corresponding to Fig. 2 (c) (reproduced in Fig. 7). The true $p_{w_0 \mid r_0}$ is reported in the last column.
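
One way to read Tables 2 and 3 is to check, row by row, that the true $p_{w_0 \mid r_0}$ lies inside both the analytical and the numerical interval, and to measure how much tighter the numerical interval is. The sketch below does this for four rows copied verbatim from the tables. The helper `check_row` and the tolerance `tol = 1e-5` are assumptions introduced for this example (the tolerance only absorbs last-digit differences in the reported values); they are not part of the original analysis.

```python
# Rows copied from Tables 2 and 3: (lb, ub, num_lb, num_ub, true p_{w0|r0}).
rows = {
    ("ZI MCAR", 0): (0.556406, 1.0, 0.556411, 1.0, 0.820732),
    ("ZI MCAR", 27): (0.0, 0.500198, 0.0, 0.500199, 0.451856),
    ("ZI MAR", 9): (0.585611, 1.0, 0.592962, 1.0, 0.686385),
    ("ZI MAR", 19): (0.531541, 1.0, 0.532509, 1.0, 0.536156),
}


def check_row(lb, ub, num_lb, num_ub, truth, tol=1e-5):
    """Return (analytical bound valid, numerical bound valid, slack).

    slack is the total amount by which the numerical interval is narrower
    than the analytical one; values near zero mean the two coincide.
    """
    analytic_valid = lb - tol <= truth <= ub + tol
    numeric_valid = num_lb - tol <= truth <= num_ub + tol
    slack = (num_lb - lb) + (ub - num_ub)
    return analytic_valid, numeric_valid, slack


results = {key: check_row(*values) for key, values in rows.items()}
for (model, dgp), (a_ok, n_ok, slack) in results.items():
    print(f"{model} dgp {dgp}: analytical valid={a_ok}, numerical valid={n_ok}, slack={slack:.6f}")
```

For these rows both intervals contain the true value, and the analytical and numerical bounds either coincide up to the last reported digits or differ by less than 0.01.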

Appendix C Variable descriptions in the CLABSI Data Application

This section describes the covariates used in the CLABSI data application (all coded as binary variables). These covariates correspond to the types of therapy and the types of catheter used.

  • Pediatrics: the CVC therapy is tailored for children.

  • Chemotherapy: the CVC therapy is used to administer chemotherapy.

  • OPAT: outpatient parenteral antimicrobial therapy (IV antibiotics).

  • TPN: total parenteral nutrition delivered via the CVC.

  • Other therapy: any other type of therapy not included in the above categories, such as hydration.

  • Port: a type of CVC in use.

  • PICC: peripherally inserted central catheter, another type of CVC.

  • Tunneled CVC: a CVC tunneled under the skin.