\automark

[section]section \setkomafontpageheadfoot \setkomafontpagenumber \clearpairofpagestyles \cohead\xrfill[0.525ex]0.6pt \theshorttitle \xrfill[0.525ex]0.6pt \cehead\xrfill[0.525ex]0.6pt \theshortauthor \xrfill[0.525ex]0.6pt \cfoot*\xrfill[0.525ex]0.6pt \pagemark \xrfill[0.525ex]0.6pt

Bayesian material flow analysis for systems with multiple levels of disaggregation and high dimensional data

Junyang Wang^{Bayesian material flow analysis for systems with multiple levels of disaggregation and high dimensional data,Bayesian material flow analysis for systems with multiple levels of disaggregation and high dimensional data} Kolyan Ray^{Bayesian material flow analysis for systems with multiple levels of disaggregation and high dimensional data} Pablo Brito-Parada^{Bayesian material flow analysis for systems with multiple levels of disaggregation and high dimensional data} Yves Plancherel^{Bayesian material flow analysis for systems with multiple levels of disaggregation and high dimensional data} Tom Bide^{Bayesian material flow analysis for systems with multiple levels of disaggregation and high dimensional data} Joseph Mankelow^{Bayesian material flow analysis for systems with multiple levels of disaggregation and high dimensional data} John Morley^{Bayesian material flow analysis for systems with multiple levels of disaggregation and high dimensional data} Julia Stegemann^{Bayesian material flow analysis for systems with multiple levels of disaggregation and high dimensional data} Rupert Myers^{Bayesian material flow analysis for systems with multiple levels of disaggregation and high dimensional data}

Abstract

Abstract. Material Flow Analysis (MFA) is used to quantify and understand the life cycles of materials from production to end of use, which enables environmental, social and economic impacts and interventions. MFA is challenging as available data is often limited and uncertain, leading to an underdetermined system with an infinite number of possible stocks and flows values. Bayesian statistics is an effective way to address these challenges by principally incorporating domain knowledge, and quantifying uncertainty in the data and providing probabilities associated with model solutions.

This paper presents a novel MFA methodology under the Bayesian framework. By relaxing the mass balance constraints, we improve the computational scalability and reliability of the posterior samples compared to existing Bayesian MFA methods. We propose a mass based, child and parent process framework to model systems with disaggregated processes and flows. We show posterior predictive checks can be used to identify inconsistencies in the data and aid noise and hyperparameter selection. The proposed approach is demonstrated on case studies, including a global aluminium cycle with significant disaggregation, under weakly informative priors and significant data gaps to investigate the feasibility of Bayesian MFA. We illustrate just a weakly informative prior can greatly improve the performance of Bayesian methods, for both estimation accuracy and uncertainty quantification.

Keywords. Material flow analysis $\bullet$ Bayesian statistics $\bullet$ Probabilistic modelling $\bullet$ Uncertainty quantification $\bullet$ Missing data $\bullet$ Circular Economy

¹¹footnotetext: Department of Civil and Environmental Engineering, Imperial College London, United Kingdom. junyang.wang21@imperial.ac.uk, r.myers@imperial.ac.uk²²footnotetext: Department of Earth Science and Engineering, Imperial College London, United Kingdom. p.brito-parada@imperial.ac.uk,y.plancherel@imperial.ac.uk, john.morley18@imperial.ac.uk³³footnotetext: Department of Mathematics, Imperial College London, London, United Kingdom. kolyan.ray@imperial.ac.uk⁴⁴footnotetext: Department of Civil, Environmental and Geomatic Engineering, University College London, London, United Kingdom. j.stegemann@ucl.ac.uk⁵⁵footnotetext: British Geological Survey, United Kingdom. tode@bgs.ac.uk, jmank@bgs.ac.uk

1 Introduction

Increasingly, the world is facing major shifts in resource utilisation. To prevent the worst consequences of climate change, global carbon emissions must be significantly reduced, which requires a fundamental change in how fossil fuels and other high carbon footprint materials are used in the coming decades. Additionally, the instability of supply chains, changing behaviour caused by Covid-19 (Jowitt (2020)), as well as the growing world population, projected to reach almost 10 billion by 2050 (WPP (2022)), will also likely significantly impact resource utilisation globally (Jowitt et al. (2020)). However, many material flows of interest are poorly understood, which severely impedes the ability of policy makers to identify ways to use important materials or resources more efficiently and sustainably, and plan the transition to more sustainable systems of production and use (Mudd (2021); UNE (2020)).

Material flow analysis (MFA) is a broad terminology encompassing any quantitative method used to map and quantify flows and stocks of a material of interest in a well defined system. MFA is applicable to all materials and a range of economic, environmental and social scales, from a company’s supply chain to entire countries and economies. MFA studies have been conducted on metals, including steel and aluminium (Bertram et al. (2009) and Cullen et al. (2012)), non-metals such as glass and concrete (Westbroek et al. (2021) and da Costa Reis et al. (2019)), for entire countries (Matsubae-Yokoyama et al. (2009)), on a global scale (Miatto et al. (2017)) and for multiple materials across whole economies (Fischer-Kowalski et al. (2011). Common to all MFA models is the precise definition of a system, which includes the system boundary, the processes within this system, as well as stock and flow variables representing the storage within and movement between the processes. A physical assumption common to all MFA models is the conservation of mass, where the outflows and inflows of a process must balance with its change of stock.

Typically in static MFA problems it is difficult or impossible to collect data for all the flow and change in stock variables inside the system. This gives rise to an underdetermined system, in the sense that the size of available data combined with physical constraints, containing at least mass balance of processes, is still less than the number of model parameters. For modelling this places MFA datasets in the domain of high dimensional statistics (Lederer (2022)), which is statistically challenging as the number of unknown parameters exceeds the sample size (Wainwright (2019)). Well known statistical methods in this area include ridge regression, the LASSO, Elastic net, as well as Bayesian approaches (Hoerl and Kennard (2000); Tibshirani (1996); Zou and Hastie (2005); Kruschke (2015)).

Bayesian statistics is particularly well suited for MFA for three main reasons. Firstly, Bayesian statistics works naturally in the underdetermined setting by providing a posterior probability over all possible solutions instead of a single solution. Secondly, while commonly there is a lack of data in MFA problems, there is often domain knowledge or expert opinion that can be used to greatly improve the prediction accuracy of the model, and the Bayesian prior distribution provides a natural framework to incorporate such domain knowledge, which is missed in the traditional MFA approach. Thirdly, Bayesian statistics rigorously quantifies uncertainty in the data and propagates it into the posterior distribution. Additionally, authors including Brunner and Rechberger (2004) and Lupton and Allwood (2018) argued for an incremental approach to MFA, where the system diagram and data is continuously refined and improved until the required level of data certainty has been achieved. The Bayesian paradigm naturally faciliates this through iterative learning of models as new data becomes available by rerunning the model with new data, since the posterior distribution from a previous analysis can be interpreted as the prior distribution for the subsequent analysis.

Non Bayesian approaches to uncertainty quantification in MFA include works such as Laner et al. (2015), Schwab et al. (2016a), Schwab and Rechberger (2017). These approaches focus on qualitative and quantitative methods of assigning confidence onto data or mass balanced flows, but do not provide a modelling procedure that combines prior belief with data and mass balance to produce flow estimates, nor a framework for propagating uncertainty. Bayesian inference provides a modelling procedure that is able to propagate uncertainty, and a rigorous, probabilistic interpretation of uncertainty. The popular software ‘STAN’ (Cencic (2016)) uses least squares minimisation to conduct data reconciliation of MFA systems. However, STAN requires nonlinear data to be approximated by first order Taylor expansion, and a normality assumption is made on flows, which can permit negative flow values that are not physically meaningful.

1.1 Previous work on Bayesian approaches to MFA

The use of Bayesian statistics in MFA is still relatively limited. Perhaps the earliest use of Bayesian statistics in MFA is by Gottschalk et al. (2010), where the authors proposed a model where the mass balance equations are parametrised using specific flow ratios (known as transfer coefficients) in the system, and tested the model on a case study of flows of concentrations of nanoparticles in Switzerland using simulated data. In Lupton and Allwood (2018), the authors used the same mass balance parametrisation as Gottschalk et al. (2010), but with Dirichlet priors on the transfer coefficients rather than uniform or triangular priors, using a HMC (Hamiltonian Monte Carlo) based sampler called the NUTS (No-U-Turn Sampler) algorithm to sample from the posterior, and conducted a case study mapping global steel flows. Recently, Dong et al. (2023) proposed a method which combines prior information from multiple experts in the MFA setting, demonstrating it using the mass balance parametrisation of Gottschalk et al. (2010) and Lupton and Allwood (2018). However, when working with the model of Lupton and Allwood (2018) in practice, we found that many divergent samples were produced with the NUTS algorithm, suggesting the posterior samples have not converged. Dong et al. (2023) also encountered divergent samples in their study when using HMC, and opted to use Sequential Monte Carlo (SMC) instead to attempt to circumvent this problem. However, it is unclear that the samples from SMC are better converged than the samples from HMC, since SMC does not have access to the same divergent sample diagnostics as HMC. Furthermore, SMC appears to significantly increase computational time.

Cencic and Frühwirth (2015) developed a linear Bayesian data conciliation method that can be applied to MFA systems, and a subsequent paper Cencic and Frühwirth (2018) extended this method to include nonlinear constraints. These methods instead parametrise the model directly in terms of the flow variables, by partitioning the set of flow variables in the system into ‘free variables’ $\bm{v_{f}}$ and ‘dependent variables’ $\bm{v_{d}}$ , where the mass balance equations can be expressed as a relationship between the free variables and the dependent variables. For example, in the linear case, $\bm{v_{d}}=-D\bm{v_{f}}-\bm{d}$ can be obtained via Gaussian elimination for some constant matrix $D$ and vector $\bm{d}$ . In both papers the methodology was tested on low dimensional, simulated examples, with Metropolis Hastings (MH) used as the sampling algorithm on the free variables, with the prior distribution $f(\bm{v_{f})})$ of the free variables as the proposal distribution. However, even in low dimensional examples it was reported in Cencic and Frühwirth (2015) that the MH sampler can have a very low acceptance probability. Therefore it is unclear whether the method works well in high dimensional, underdetermined systems with hundreds of flow and change in stock variables, which is important since these systems are typical in MFA. In high dimensions, it becomes increasingly unlikely for proposals from the MH sampling algorithm to satisfy both the mass balance conditions and non negativity of flow variables. To see this, $\bm{v_{d}}=-D\bm{v_{f}}-\bm{d}$ does not guarantee every component of $\bm{v_{d}}$ will be positive for arbitrary fixed $D$ and $\bm{d}$ . As the dimension of $\bm{v_{d}}$ increases, it becomes increasingly likely that randomly sampled proposal $\bm{v_{f}}$ during MH will lead to at least one component of $\bm{v_{d}}$ to be negative, causing the proposal to be in a region of zero posterior probability and the MCMC algorithm to be stuck at the current value. More generally, sampling from constrained posteriors is known to be challenging (Lan and Kang (2023)), and HMC also have difficulties when encountering regions of zero posterior probability formed by the constraints (Hoffman and Gelman (2014)).

1.2 Scope of paper

This paper continues the development of Bayesian methodology for material flow analysis. To address the aforementioned computational issues, we relax the mass balance conditions via a noise term. This reduces regions of zero posterior probability in the parameter space, making the posterior easier to sample using MCMC algorithms. We show computationally with an aluminium cycle case study that this leads the NUTS sampling algorithm to converge well in high dimensions. Simultaneously, the noise term can be interpreted as a way of modelling epistemic uncertainty in the system, which is likely to be present as MFA systems are simplified rather than perfect representations of reality (Schwab et al. (2016a, b)). It is similar to the concept of ‘phantom flows’, which is used to account for unexplainable mass imbalances in MFA studies (e.g. Reck et al. (2008) ). The variance of the noise term can be chosen to be small so good approximate mass balance is still achieved when there is high confidence in the system definition.

We introduce a child and parent process parametrisation framework to model systems with multiple layers of disaggregation in processes and flows, which is a common feature in material flow datasets (Myers et al. (2019a) and Myers et al. (2019b)) but has not been considered in previous Bayesian MFA studies. To this end we assign priors directly on flow mass and change in stock variables in the material system, while retaining the ability to incorporate ratio data between arbitrary flows. We demonstrate our method on a high dimensional aluminium material flow system where change in stock and disaggregation of processes and flows are simultaneously present.

We illustrate how Bayesian posterior predictive checks can reveal inconsistencies in the data and aid hyperparameter selection. We also show the posterior distribution can inform data collection strategies by identifying which flow or changes in stock variables in the system retain the most uncertainty.

Previous work have not investigated under what conditions estimates and uncertainty quantification produced by Bayesian MFA are reliable. This is important as strong theoretical guarantees which hold in low-dimensional parametric models, such as the Bernstein-von Mises theorem (Vaart (1998)), can fail to hold in high-dimensional settings (Johnstone (2010)) such as the present MFA setting. We address this gap by examining how Bayesian MFA performs on a high dimensional aluminium case study under relatively weak assumptions, namely a weakly informative prior and significant data gaps. We also conduct a simulation study on a zinc cycle to examine the estimation accuracy and uncertainty quantification of the posterior distributions of our model from a frequentist perspective, which can be found in the supporting information.

2 Methods

In this section, we present the details of our proposed Bayesian MFA methodology. Our model produces estimates and uncertainty quantification of all flow and change in stock variables of interest in any given material flow system, in the form of a posterior distribution, which mathematically combines prior domain knowledge and expert opinion with available data while simultaneously propagating uncertainty.

A MFA analysis begins with the definition of a system diagram, a graph like structure of nodes representing $processes$ , where the material of interest can be stored as stock, and edges representing $flows$ of the material of interest between processes. Notably however, flows in both directions between any two processes are permitted. The system diagram will also contain a system boundary, which is used to describe the flows between the system and some external environment. This framework is quite broad, and allows processes within a system to not only represent a physical manufacturing process (such as components of a blast furnace), but also examples such as the Earth’s lithosphere, the environment, or various usage outlets like households where the material of interest can be stored or flow in and out of. Similarly, the system’s scope can range from a small supply chain, to a global flow of a metal such as zinc. The level of detail of the system diagram is chosen by the modeller to fit the scope of the problem being examined.

Often in MFA problems, certain processes in the system can be disaggregated into constituent subprocesses (see e.g. Myers et al. (2019b) ). We define a $parent$ $process$ to be a process which contains subprocesses, and a $child$ $process$ be a process which contains no subprocesses instead. By definition, parent and child processes form a partition of the set of all processes in the system. The MFA practitioner should decide the level of disaggregation of each parent process according to their requirements, but parent processes where data is only available on some of its child processes may still be worth disaggregating to incorporate additional data into the model.

The parent and child process structure is useful for modelling the disaggregation of processes. To see this, we assume the stocks and flows of any parent process can be expressed as a linear combination of its constituent child processes. Under this parent and child process framework, multiple levels of disaggregation of processes can be reduced to just two levels (the set of parent processes and the set of child processes), which greatly simplifies modelling of the MFA system. A simple example illustrating the parent and child process framework can be found in the supplementary information.

2.1 Formulation of the physical model

Suppose there are $m$ child processes in the system, indexed by $P_{0},P_{1},\dots P_{m-1}$ . Let $s_{i}(t)$ to be the stock variable associated with the process $P_{i}$ , denoting the amount of stock in process $P_{i}$ at time $t$ . In practice material flow data are typically recorded as the total amount of flow over a period time (such as on a monthly or yearly basis), so let $U_{j,i}$ represents the total amount of flow of the material of interest from process $j$ to process $i$ during the time period $t-\Delta t$ to $t$ . Here $\Delta t$ represents the period during which the total amount of flow was reported, which can for example be in months or years. Note typically in MFA systems not all processes necessarily possess a stock variable, for example if the physical process which it is modelling does not contain a physical stock of the material of interest. Similarly, most processes will not receive flows from or flow to every other process, so the notation introduced in this paragraph such as $s_{i}$ and $U_{j,k}$ are understood to be over existing stocks and flow variables only. For each process $P_{i}$ , we assume mass is conserved between its stock, inflows and outflows over the time period $t-\Delta t$ to $t$ , which we formulate as:

S_{i}=s_{i}(t)-s_{i}(t-\Delta t)=\sum_{j}U_{j,i}-\sum_{k}U_{i,k}

(1)

The left hand side $S_{i}=s_{i}(t)-s_{i}(t-\Delta t)$ is simply the change in stock during the time period $t$ to $t-\Delta t$ . In this paper we only consider stationary models at a snapshot of time $t$ , so the variables of interest in the model are the $S_{i}$ and $U_{j,k}$ . For more compact notation, let $\bm{S}$ be the vector of $q$ change in stock variables $S_{i}$ of the child processes in the system, and $\bm{U}$ a vector of $p-q$ flow variables $U_{j,k}$ of the child processes in the system (for a total of $p$ variables). Note because flows and stocks changes of parent processes can be expressed linearly in terms of its constituent child processes, conservation of mass for all child processes automatically implies conservation of mass for all parent processes. Similarly, Bayesian modelling only needs to be conducted on the child processes, as the posterior samples on flow and stocks change variables of parent processes can be obtained by simply summing the posterior samples of the constituent child processes.

2.2 Data structure

Data in MFA can in principle be any arbitrary function $f(\bm{S},\bm{U})$ of the variables of interest $\bm{S}$ and $\bm{U}$ . However, data and mass balance conditions typically come in the following four forms, which we can express in terms of $\bm{S}$ and $\bm{U}$ :

1. Observations of changes in stock of child processes or observations of flows between child processes. For example, in Figure 2, $S_{0}=-37.2$ Mt could be used to describe the change in stock of the ‘Reserves’ process, while the flow from ‘Reserves’ to ‘Mining’ can be described by $U_{0,1}=37.2$ Mt. Here the process ‘Reserves’ is labelled by $P_{0}$ and the process ‘Mining’ by $P_{1}$ .

2. Observations of changes in stock of parent processes, or observations of flows between processes where at least one process is a parent, which can be treated as the sum of multiple flows. For example, the combined flow of $9.8$ Mt in Figure 2 from ‘WasteManagement’ to ‘Recycling’ consists of ‘WasteManagement’ as the origin process, and the destination processes are ‘Remelting’ and ‘Refining’, two of the subprocesses of ‘Recycling’. This flow can be represented by the sum of flows $U_{31,7}+U_{31,8}=9.8$ Mt. Here we used $P_{31}$ to denote ‘WasteManagement’, and $P_{7},P_{8}$ ‘Remelting’ and ‘Refining’ subprocesses of ‘Recycling’ respectively. So for example, $U_{31,7}$ represents the flow from ‘WasteManagement’ to ‘Remelting’.

3. Conservation of mass. For child process $i$ this is represented by equation (1).

4. Ratio data between flows or sums of flows, for instance transfer coefficients:

\frac{U_{i,j}}{{\sum_{k}U_{i,k}}}=\alpha_{i,j}

(2)

where $\alpha_{i,j}$ is a known transfer coefficient of the flow from process $i$ to process $j$ , and is defined as the ratio between the flow from process $i$ to process $j$ , divided by the total outflow of process $i$ . In case studies of this paper we only consider transfer coefficients of processes without a change in stock variable. In general, a change in stock can be split into two flows (flow into and out of stock), which allows transfer coefficients to be calculated. We also consider ratios between aggregated flows as an extension beyond standard transfer coefficients.

Ratio data can also be alternatively parametrised linearly in the following way:

U_{i,j}-\alpha_{i,j}{\sum_{k}U_{i,k}}=0

(3)

We note it is possible to conveniently formulate all the most common forms of data and mass balance conditions in MFA in terms of linear relationships between the change in stock variables $S_{i}$ and the flow variables $U_{j,k}$ , even when the system contains disaggregation of processes. However, for a more general framework and to demonstrate our model can incorporate nonlinear data as well, we choose to parametrise ratio data as Equation 2 when evaluating the model on the aluminium case study in the Results section.

Using more compact notation, the relationship between the flow and change in stock variables and the data and physical constraints can be represented by the following model:

\displaystyle\bm{Y}=\begin{bmatrix}X\bm{\theta}\\ \bm{R}(\bm{\theta})\end{bmatrix}+\bm{\epsilon},\quad\bm{\theta}=\begin{bmatrix% }\bm{S}\\ \bm{U}\end{bmatrix}

(4)

where $\bm{\theta}$ is a concatenated vector of $\bm{S}$ and $\bm{U}$ of length $p$ , representing all the child flow and change in stock variables the system. $\bm{Y}$ is a vector of length $n$ of observed values, or $0$ for mass balance conditions. $X$ is a design matrix representing linear data and mass balance conditions, and $\bm{R}(\bm{\theta})$ a vector representing nonlinear data. $\bm{\epsilon}$ is a random vector (of length $n$ ) representing uncertainty in the data, which could be caused by measurement or rounding errors. Note in the MFA setting, it is often the case that $n\ll p$ .

The goal of any Bayesian model is to obtain posterior distributions over the variables of interest, in this case $\bm{\theta}$ , in order to perform inference such as point estimation or uncertainty quantification. In the following section we describe how to construct the prior for our model, the form of the likelihood and how to obtain the posterior of $\bm{\theta}$ via Bayes’s theorem.

2.3 Bayesian model detail

Bayesian inference is a statistical framework in which Bayes’s theorem, a fundamental result in probability theory, is used to update one’s beliefs regarding parameters of interest $\bm{\theta}$ based on new data or evidence $\bm{Y}$ . Mathematically the prior distribution $p(\bm{\theta})$ is used to express one’s belief prior to seeing the data, and the likelihood function $p(\bm{Y}|\bm{\theta})$ used to express the probability of observing the data. The goal in Bayesian inference is to obtain the posterior distribution $p(\bm{\theta}|\bm{Y})$ , which represents the state of one’s updated beliefs after observing the data. This is done via Bayes’s theorem:

p(\bm{\theta}|\bm{Y})=\frac{p(\bm{\theta})p(\bm{Y}|\bm{\theta})}{\int p(\bm{% \theta})p(\bm{Y}|\bm{\theta})d\bm{\theta}}

(5)

Most posterior distributions do not have a closed form expression due not being possible to evaluate $\int p(\bm{\theta})p(\bm{Y}|\bm{\theta})d\bm{\theta}$ in closed form. Markov Chain Monte Carlo (MCMC) methods are computational algorithms used to sample from distributions that do not have a closed-form, including posterior distributions. MCMC methods construct a Markov Chain that converges to the target distribution, by iteratively sampling from a proposal distribution and only accepting samples that satisfy a suitable criteria that suggests they could be feasibly generated from the target distribution.

For our Bayesian model, we consider normal priors for the change in stock variables $S_{i}$ , which reflects the fact that the change in stock can be both positive or negative. For the flow variables $U_{j,k}$ , we consider truncated normal priors for the flow variables on the positive interval $[0,L]$ , where $L$ is chosen based on domain knowledge of the scale of flows inside the system being modelled, or simply chosen to be positive infinite if none is available. This is to ensure the posterior distribution of the flow variables have positive support, as flow quantities are always positive in reality. Furthermore, the normal and truncated normal distributions are flexible, in the sense that an informative prior distribution can be assigned by choosing the prior mode as a confident estimate (from expert knowledge), and choosing a small prior variance to create a narrow distribution around the prior mode. On the other hand, an uninformative prior distribution can be assigned by choosing a large prior variance around a rough guess for the prior mode instead.

\displaystyle S_{i}\sim\mathcal{N}(\mu_{i},\sigma_{i}^{2}),\quad U_{j,k}\sim% \mathcal{TN}(\mu_{j,k},\sigma_{j,k}^{2},0,L)

(6)

where $\mu_{i}$ and $\sigma_{i}^{2}$ are the prior mean and variance of $i$ th change in stock variable $S_{i}$ , respectively, and $\mu_{j,k}$ and $\sigma_{j,k}^{2}$ the corresponding prior hyperparameters respectively for the truncated normal distribution for the flow variable $U_{j,k}$ , representing the flow quantity from process $j$ to process $k$ . We also assume prior variables are independent, so the overall prior distribution $p(\bm{\theta})$ over the change in stock and flow variables is of the form:

\displaystyle p(\bm{\theta})\propto\prod_{i}\frac{1}{\sigma_{i}}\exp(\frac{-(S% _{i}-\mu_{i})^{2}}{2\sigma_{i}^{2}})\prod_{(j,k)}\frac{1}{\sigma_{j,k}}\exp(% \frac{-(U_{j,k}-\mu_{j,k})^{2}}{2\sigma_{j,k}^{2}})\mathrm{I}_{U_{j,k}\in[0,L]}

(7)

where $\mathrm{I}_{U_{j,k}\in[0,L]}$ is an indicator function, and the products are over the change in stock variables $S_{i}$ and flow variables $U_{j,k}$ .

For the likelihood, we again use a truncated normal likelihood for any data on observed flow values to ensure the data generated by the likelihood is plausible (since flow data should be strictly positive), and a normal likelihood for data on observed change in stock values and general nonlinear data. We include an additional normally distributed noise term in the mass balance conditions mainly for practical reasons: the noise term reduces the region of zero posterior probability, which makes sampling from the posterior computationally much more tractable. Additionally, the noise term has the interpretation of modelling epistemic or systematic uncertainty in the MFA system. In theory, every process should be exactly mass conserved if the system definition and diagram are a perfect representation of the underlying real system being modelled. However, in reality the system diagram and definition are simplified and approximate models of reality, and so it is reasonable to account for uncertainty in the underlying system itself, which can be modelled through a noise term in the mass balance conditions. In practice, flows reported in MFA studies are rarely exactly mass balanced, so likewise a Bayesian MFA approach does not need to produce perfectly mass balanced flows to provide useful estimates.

$\displaystyle Y_{i^{\prime}}\|\bm{\theta}$	$\displaystyle\sim\mathcal{N}(\bm{x_{i^{\prime}}}^{\top}\bm{\theta},\tau_{i^{% \prime}}^{2})\quad\text{for observations}\ Y_{i^{\prime}}\ \text{on change in % stock data}$	(8)
$\displaystyle Y_{j^{\prime}}\|\bm{\theta}$	$\displaystyle\sim\mathcal{TN}(\bm{x_{j^{\prime}}}^{\top}\bm{\theta},\tau_{j^{% \prime}}^{2},0,\infty)\quad\text{for observations}\ Y_{j^{\prime}}\ \text{on % flow data}$	(9)
$\displaystyle Y_{k^{\prime}}\|\bm{\theta}$	$\displaystyle\sim\mathcal{N}(R(\bm{\theta})_{k^{\prime}},\tau_{k^{\prime}}^{2}% )\quad\text{for observations}\ Y_{k^{\prime}}\ \text{on nonlinear data}$	(10)
$\displaystyle Y_{l^{\prime}}\|\bm{\theta}$	$\displaystyle\sim\mathcal{N}(\bm{x_{l^{\prime}}}^{\top}\bm{\theta},\tau_{l^{% \prime}}^{2})\quad\text{for}\ Y_{l^{\prime}}=0\ \text{on mass balance conditions}$	(11)

Here $Y_{i^{\prime}}$ is the $i^{\prime}$ th row or entry of the observation vector $\bm{Y}$ , and $\bm{x_{i^{\prime}}}^{\top}$ the $i^{\prime}$ th row of the design matrix $X$ , $R(\bm{\theta})_{k^{\prime}}$ the $k^{\prime}$ th row of the vector $\bm{R}(\bm{\theta})$ and $\tau_{i^{\prime}}^{2}$ representing variance of the noise of the $i^{\prime}$ th observation, which are assumed to be independent. The overall likelihood $p(\bm{Y}|\bm{\theta})$ is therefore of the form:

\displaystyle p(\bm{Y}|\bm{\theta})=\prod_{i^{\prime}}p(Y_{i^{\prime}}|\bm{% \theta})\prod_{j^{\prime}}p(Y_{j^{\prime}}|\bm{\theta})\prod_{k^{\prime}}p(Y_{% k^{\prime}}|\bm{\theta})\prod_{l^{\prime}}p(Y_{l^{\prime}}|\bm{\theta})

(12)

where

$\displaystyle p(Y_{i^{\prime}}\|\bm{\theta})$	$\displaystyle\propto\frac{1}{\tau_{i^{\prime}}}\exp(\frac{-(Y_{i^{\prime}}-\bm% {x_{i^{\prime}}}^{\top}\bm{\theta})^{2}}{2\tau_{i^{\prime}}^{2}})$	(13)
$\displaystyle p(Y_{j^{\prime}}\|\bm{\theta})$	$\displaystyle\propto\frac{1}{\tau_{j^{\prime}}}\exp(\frac{-(Y_{j^{\prime}}-\bm% {x_{j^{\prime}}}^{\top}\bm{\theta})^{2}}{2\tau_{j^{\prime}}^{2}})\mathrm{I}_{Y% _{j^{\prime}}\in[0,\infty)}$	(14)
$\displaystyle p(Y_{k^{\prime}}\|\bm{\theta})$	$\displaystyle\propto\frac{1}{\tau_{k^{\prime}}}\exp(\frac{-(Y_{k^{\prime}}-R(% \bm{\theta})_{k^{\prime}})^{2}}{2\tau_{k^{\prime}}^{2}})$	(15)
$\displaystyle p(Y_{l^{\prime}}\|\bm{\theta})$	$\displaystyle\propto\frac{1}{\tau_{l^{\prime}}}\exp(\frac{-(Y_{l^{\prime}}-\bm% {x_{l^{\prime}}}^{\top}\bm{\theta})^{2}}{2\tau_{l^{\prime}}^{2}})$	(16)

Given the prior and likelihood, we can use Bayes’s Theorem 5 to obtain the posterior distribution $p(\bm{\theta}|\bm{Y})$ . However, the posterior induced by this model does not admit an analytical form, and so we employ the No-U-Turn Sampler (NUTS) algorithm of Hoffman and Gelman (2014) to sample from the posterior, implemented via the PyMC3 library Salvatier et al. (2016) in Python. NUTS is a Hamiltonian Monte Carlo that achieves increased sampling efficiency over traditional MCMC methods, such as Metropolis Hastings, by exploiting the gradient information of the target distribution to generate more informed sample proposals and explore the target distribution more efficiently. This is especially important in high dimensions (which applies to many MFA systems), since the probability mass of the target distribution are more likely concentrated in smaller regions, which is inefficient to explore via Metropolis Hastings due to random walk behaviour.

2.4 Posterior predictive checks

For Bayesian modelling of MFA systems, we recommend performing posterior predictive checks to verify whether data generated by the model is similar to the observed data and adequately mass conserved, which gives some assurance that the prior, model and parameters chosen are sensible. Here we briefly describe the method of posterior predictive checking outlined in Chapter 6 of Gelman et al. (2013). Recall that in our model, the observation vector $\bm{Y}$ represents the observed data (such as on flows or change in stocks), as well as physical conditions such as mass balance, and $\bm{\theta}$ the vector of parameters. Let $\bm{Y^{rep}}$ be replicated data that could have been observed, the distribution of $\bm{Y^{rep}}$ given the observed data $\bm{Y}$ , also known as the posterior predictive distribution, is given by:

p(\bm{Y^{rep}}|\bm{Y})=\int p(\bm{Y^{rep}}|\bm{\theta})p(\bm{\theta}|\bm{Y})\,% d\bm{\theta}

(17)

Typically, the check is done on suitable scalar test quantities $T(\bm{Y},\bm{\theta})$ , chosen based on the real problem being modelled. The test quantity of the replicated data $T(\bm{Y^{rep}},\bm{\theta})$ is compared with the test quantity of the observed data $T(\bm{Y},\bm{\theta})$ through statistical tests or graphical checks to look for systematic discrepancies between the simulated and originally observed data. For our Bayesian material flow analysis model, we choose the test quantities to be each individual observed data and mass balance conditions of child processes in the system, in other words we choose test quantities $T_{i}(\bm{Y},\bm{\theta})=Y_{i}$ for each $i$ . This ensures we minimally check using the posterior predictive distribution that the model is consistent with the existing data as well as mass balance of child processes. One way to compare the observed data with the posterior predictive distribution is to calculate the Bayesian posterior predictive p-values for the test statistic, in our case the marginal observations $Y_{i}$ , which are given by

pval_{i}=P(Y_{i}^{rep}\geq Y_{i}|\bm{Y})=\int\int\mathrm{I}_{Y_{i}^{rep}\geq Y% _{i}}p(\bm{Y^{rep}}|\bm{\theta})p(\bm{\theta}|\bm{Y})\,d\bm{Y^{rep}}d\bm{\theta}

(18)

Very large or small p-value (e.g. greater than 0.95 or smaller than 0.05 as suggested by Gelman et al. (2013)) suggests the observed test quantity is unlikely to be replicated in repeated experiments if the model were true, implying there is inconsistency between the model and the data. We also calculate the posterior predictive marginal $95\%$ highest density interval for each $Y_{i}|\bm{Y}$ to see if they contain the observed values $Y_{i}$ (which include the conservation of mass conditions where $Y_{i}=0$ ).

In practice, the integrals in Equation 17 and Equation 18 are analytically intractable, so we again approximate the posterior predictive distribution via sampling. Specifically we simulate one sample of $\bm{Y^{rep}}$ from the posterior predictive distribution for each posterior sample of $\bm{\theta}$ , and we approximate p-values of Equation 18 by checking the proportion of the posterior predictive samples of $Y_{i}^{rep}$ that exceed the observed value $Y_{i}$ , for each $i$ .

2.5 Hyperparameter and noise parameter selection

In general, the prior hyperparameters $\mu_{i}$ , $\mu_{j,k}$ should be specified through domain knowledge to reflect the modeller’s best estimate of the stock change and flow variables apriori, and $\sigma_{i}^{2}$ , $\sigma_{j,k}^{2}$ chosen to reflect prior uncertainty, the less confident the estimates for $\mu_{i}$ , $\mu_{j,k}$ , the larger $\sigma_{i}^{2}$ , $\sigma_{j,k}^{2}$ .

The noise variance parameters $\tau_{i^{\prime}}^{2}$ should also ideally be chosen to reflect uncertainty in the data, and in the case the mass balance conditions, epistemic uncertainty in the system definition. We recommend using posterior predictive checking to help select suitable noise parameter values, especially if no knowledge of the degree of data uncertainty is available. One can start with a small choice of standard deviation parameters (such as $10\%$ of the observed data value, and a small constant for the mass balance conditions), run the model and conduct posterior predictive checks, and identify the data points and mass balance conditions which exhibit extreme Bayesian p-values. The standard deviation parameter for those data points or mass balance conditions should be increased and the model rerun until no extreme Bayesian p-values remain.

Full details of hyper parameters choice in the case studies examined in the Results section can be found in the supporting information. We give examples of how to specify prior hyperparameters in the case where there is weakly informative domain knowledge where some flows are known to the nearest order of magnitude, as well as flows where there is no prior knowledge available.

3 Results

In this section we demonstrate our Bayesian MFA method on an aluminium cycle, containing significant disaggregations of processes and flows. We evaluate our model on the aluminium cycle under a weakly informative prior for two different levels of data availability and compare the results. We also present posterior predictive checks to identify inconsistencies between the model and data, and processes which are not mass balanced by the available data.

3.1 Aluminium cycle

We evaluate our model on the global anthropogenic metallurgical aluminium cycle in 2009 from Liu et al. (2013). The associated system diagram Figure 2 is adapted from Figure 1 of Liu et al. (2013). The aluminium cycle contains significant disaggregations of processes. For example, the ‘Use’ Process contains subprocesses such as ‘Use_BC’ (building and construction) and ‘Use_ME’ (machinery and equipment) representing different use product categories of aluminium. Furthermore, for aggregated processes such as ‘Manufacturing’ and ‘Recycling’, Figure 2 only displays data on aggregated flows and change in stocks. For example, the flow $12.1$ Mt from ‘Manufacturing’ is a combined flow to ‘Use_BC’ and the subprocesses ‘Remelting’ and ‘Refining’ of ‘Recycling’. So it is not clear from Figure 2 alone in what proportion this flow should split among the constituent subprocesses. In tables S5, S6 and S8 of the supplementary information, Liu et al. (2013) provides ratios specifying how certain aggregated flows should split into constituent subflows (known as transfer coefficients), specifically for the aggregate flows from ‘Semi-manufacturing’ to ‘Manufacturing’ and ‘Recycling’, and from ‘Manufacturing’ to ‘Use’ and ‘Recycling’. For the purposes of testing our model, we treat the values presented in Figure 1 of Liu et al. (2013) and transfer coefficients in tables S5,S6,S6 of the supplementary information as data. We also henceforth use ‘reported value’ to refer to any flow and change in stock values in Figure 1 of Liu et al. (2013), as well as values of disaggregated flow and change in stock values calculated through the transfer coefficients in the supplementary material of Liu et al. (2013).

We evaluate the model under two different scenarios. In scenario A, we deliberately withhold the transfer coefficients in the supplementary information of Liu et al. (2013) from the model, and only use the data displayed in Figure 2. Instead, where transfer coefficients are available to calculate the value of the disaggregated flow, we set the prior mode of the disaggregated flow variable to the nearest power of $10$ of the reported value, otherwise an uninformative prior with mode $1.0$ Mt is used. The purpose of scenario A is to see if our Bayesian MFA model can still produce useful estimates of flows and changes in stock under a weakly informative prior with significant amount of missing data. Scenario A also mimics a situation that could be applicable to many MFA problems, where data is available on a aggregated/parent level but not the disaggregated/child level, and a rough estimate like the order of magnitude of the flows and changes in stock on the disaggregated level is obtained from surveying domain experts or approximate calculations, which can be used to construct a weakly informative prior. In scenario B, the same weakly informative prior is used, but the flow ratios in the supplementary information of Liu et al. (2013) are provided to the model as well. Scenario B mimics a situation towards the end of an MFA analysis, when data are available for most of the flow and change in stock variables in the system. In both scenarios, we assume a low degree of epistemic uncertainty in the system diagram and set the standard deviation of mass balance conditions to a constant $0.05$ Mt. This choice leads to well mass balanced posterior means and samples of the flow and change in stock variables, and we provide an analysis of posterior mass balance conditions in the supporting information (section S5) for full detail. Full detail of prior hyperparameters can also be found in the supporting information (section S4).

Figure 3 displays a selection of marginal posterior distributions for flow and change in stock variables of disaggregated/child processes in the aluminium cycle. Only a selection is displayed here for brevity as there are around $180$ flow or change in stock variables in the model. Under scenario A, the marginal posteriors are relatively more biased away from the reported value, which is not unexpected as data on the disaggregated level was withheld from the model in scenario A. Nevertheless, all of the reported values of flows and change in stock (when they are available in the supplement of Liu et al. (2013)) are contained within the $95\%$ posterior marginal highest density intervals (HDI). A small number of reported values are near the edge of HDI like the flow ‘ShapeCasting to Manuf_TAU’. This could be caused by the reported value (around $5.1$ Mt) being close to the middle of the nearest orders of magnitudes ( $1.0$ and $10.0$ Mt), which is more difficult for a prior based on the nearest order of magnitude ( $1.0$ Mt in this case) to capture without actual data on the flow. Overall under scenario A, the posterior estimates and uncertainty quantification for disaggregated flows or change in stock variables obtained under still captures the reported values reasonably well despite not given data on any disaggregated flows. With the addition of transfer coefficient data under scenario B, the posterior marginal distributions generally possess narrower HDI compared to scenario A and centred much more around the reported value, and again the reported values are contained within the $95\%$ posterior HDI. Quantitatively, the average length of the $95\%$ posterior HDI over all flow and change in stock variables under scenario A is $2.15$ Mt, while under scenario B is $1.46$ Mt.

In both scenarios it took the NUTS algorithm around 90 minutes to generate $24000$ posterior samples across two chains on an Intel i5-1145G7 CPU, 2.6GHz. The traceplots and convergence checks suggests the NUTS algorithm has converged and we did not observe any divergent samples. A selection of traceplots and further diagnostics can be found in section S2 of the supporting information. Posterior pairplots illustrating posterior correlation between a selection of flow variables can be found in section S3. In section S6 we include a simulation study on a zinc cycle to examine the estimation accuracy and uncertainty quantification of our model from a frequentist perspective. We demonstrate on a zinc cycle that incorporating even a weakly informative prior can significantly reduce estimation error, and the posterior credible intervals can consistently contain the true value of flows and stock changes and be interpreted as confidence intervals.

3.2 Posterior predictive checks on aluminium model

In this section we present some additional results for the aluminium model. First, we apply the posterior predictive checks described in Section 2.4 on the aluminium model under scenario B. From Figures 4(a), 4(b), 4(c) and 4(d), it can be seen that the $95\%$ HDIs all contain the observed values for all change in stock data, flow data, conservation of mass conditions and ratio data. Moreover, from Figures 4(e), 4(f), 4(g) and 4(h), it can be seen that the posterior predictive p-values are mostly between $0.3$ and $0.7$ and no p-value is smaller than $0.05$ or greater than $0.95$ , suggesting that no p-values are extreme and the model generally fits the data reasonably well. There are a few variables close to extreme values however. In particular, the 27th flow data in Figure 4(f) has a p value of around $0.07$ , which represents the total outflow from ‘Internal remelting’ to ‘Semi-manufacturing’ of $9.3$ Mt. Upon reviewing the data, it was found the total inflow of ‘Internal remelting’ only summed to $7.1$ Mt. The posterior predictive check therefore helpfully highlighted a discrepancy that suggests the data for the process Internal remelting has more sizable mass imbalance. Similarly, the 49th data has a relatively high p-values of around $0.8$ , which is the data representing the outflow from Manuf_OTD of $0.8$ Mt. Upon examining the data it was found the total inflow of Manuf_OTD is $1.0$ .

In MFA, data is expensive to collect, so it is useful to prioritise which flow and change in stock variables to collect more data on. This will likely depend on what questions the modeller is most interested in answering regarding the real system being modelled. Without specific questions however, Bayesian inference provides default strategies for prioritising which data points to collect, by ranking the variables in terms of descending posterior uncertainty. We plot the top $10$ flow and change in stock variables in descending length of their marginal $95\%$ HDI for both scenarios A (Figure 4(i)) and B (Figure 4(j)). In scenario B, the most uncertain variables are mostly flows from ‘Casting’ or ‘Recycling’ to ‘Semi-manufacturing’, which is expected as those are the disaggregated flows where data is not available. For scenario A, the most uncertain variables are more scattered throughout the system, as scenario A has no data on any disaggregated flows.

4 Discussion

This paper presented a novel MFA methodology under the Bayesian framework that addresses existing challenges and expands the applicability of Bayesian inference in MFA. By relaxing the mass balance constraints with a noise term, we improved upon the computational scalability and reliability of posterior samples compared to existing methods, while still retaining well mass balanced posterior estimates of stock changes and flows. We introduced a child and parent parametrisation that can conveniently deal with MFA systems with multiple layers of disaggregation of processes and flows, providing posterior distributions on flows and stock changes on all levels of disaggregation in the system, including lower levels where data is often unavailable. We showed that even an weakly informative prior, specifically a prior based on the nearest order of magnitude of stock changes and flows, is capable of greatly improving the model’s estimation accuracy and quality of its uncertainty quantification, especially during the early stages of the analysis when there is a lack of data, reaffirming the benefit of a Bayesian approach to MFA. We also demonstrated how posterior predictive checks can be used to check if the model is consistent with the data and mass balance conditions, and help identify data inconsistencies and aid in selecting noise parameter values for the data and mass balance conditions.

Like other Bayesian approaches, our method requires prior distributions of all flow and stock change variables of interest to be manually specified, which is an additional requirement compared to traditional MFA. However, we argue that this should be a standard part of any material flow analysis, where domain knowledge should be continuously collated, and the prior distribution offers a principled, mathematical way of incorporating this knowledge into the model. The priors used in this paper are unimodal and only weakly informative at most, in order to reduce requirements on the prior so that it can be applied in a wider range of MFA settings. In principle more or less informative priors can be used based on domain knowledge of the application. In the presence of multiple expert opinions, mixture priors that combine multiple experts similar to the approach of Dong et al. (2023) can be considered in our model framework.

The Bayesian approach inevitably comes with a higher computational cost, as the full posterior distribution of each variable of interest needs to be calculated rather than just a point estimate, and may run into convergence issues Betancourt (2017). The No-U-Turn Sampler HMC algorithm used for our model took around $90$ minutes to generate $24000$ posterior samples across two chains for the aluminium dataset, which we consider acceptable. We also note that most MFA literature do not use very large datasets so we do not anticipate computational cost to be a major issue for most MFA studies. However, for much larger material flow datasets, approximate methods such as variational Bayes can be employed to reduce the computational cost, or minimally just estimating the posterior mode directly (and forgoing uncertainty quantification) can potentially yield better point estimates than a non Bayesian method if informative priors are available.

Acknowledgements

This work was supported by the UKRI Interdisciplinary Circular Economy Centre For Mineral-based Construction Materials under the EPSRC Grant EP/V011820/1. The authors are grateful to Mohit Arora, Nicola Gambaro, Ugo Legendre and Shen Zhenxia for useful discussions.

References

UNE [2020] United Nations Resource Management System: An overview of concepts, objectives and requirements (ECE energy series no. 68), 2020. URL https://unece.org/sustainable-energy/publications/united-nations-resource-management-system-overview-concepts.
WPP [2022] World population prospects - population division, 2022. URL https://population.un.org/wpp/.
Bertram et al. [2009] M. Bertram, K. J. Martchek, and G. Rombach. Material Flow Analysis in the Aluminum Industry. Journal of Industrial Ecology, 13(5):650–654, 2009. doi: https://doi.org/10.1111/j.1530-9290.2009.00158.x. URL https://onlinelibrary.wiley.com/doi/abs/10.1111/j.1530-9290.2009.00158.x.
Betancourt [2017] M. Betancourt. A Conceptual Introduction to Hamiltonian Monte Carlo. 2017. doi: 10.48550/ARXIV.1701.02434. URL https://arxiv.org/abs/1701.02434.
Brunner and Rechberger [2004] P. H. Brunner and H. Rechberger. Practical handbook of material flow analysis. The International Journal of Life Cycle Assessment, 9(5):337–338, Sep 2004. ISSN 1614-7502. doi: 10.1007/BF02979426. URL https://doi.org/10.1007/BF02979426.
Cencic [2016] O. Cencic. Nonlinear data reconciliation in material flow analysis with software stan. Sustainable Environment Research, 26(6):291–298, 2016. ISSN 2468-2039. doi: https://doi.org/10.1016/j.serj.2016.06.002. URL https://www.sciencedirect.com/science/article/pii/S246820391630053X.
Cencic and Frühwirth [2015] O. Cencic and R. Frühwirth. A general framework for data reconciliation—Part I: Linear constraints. Computers & Chemical Engineering, 75:196–208, 2015. ISSN 0098-1354. doi: https://doi.org/10.1016/j.compchemeng.2014.12.004. URL https://www.sciencedirect.com/science/article/pii/S0098135414003330.
Cencic and Frühwirth [2018] O. Cencic and R. Frühwirth. Data reconciliation of nonnormal observations with nonlinear constraints. Journal of Applied Statistics, 45(13):2411–2428, 2018. doi: 10.1080/02664763.2017.1421916. URL https://doi.org/10.1080/02664763.2017.1421916.
Cullen et al. [2012] J. M. Cullen, J. M. Allwood, and M. D. Bambach. Mapping the Global Flow of Steel: From Steelmaking to End-Use Goods. Environmental Science & Technology, 46(24):13048–13055, Dec 2012. ISSN 0013-936X. doi: 10.1021/es302433p. URL https://doi.org/10.1021/es302433p.
da Costa Reis et al. [2019] D. da Costa Reis, Y. Mack-Vergara, and V. M. John. Material flow analysis and material use efficiency of Brazil’s mortar and concrete supply chain. Journal of Industrial Ecology, 23(6):1396–1409, 2019. doi: https://doi.org/10.1111/jiec.12929. URL https://onlinelibrary.wiley.com/doi/abs/10.1111/jiec.12929.
Dong et al. [2023] J. Dong, J. Liao, X. Huan, and D. Cooper. Expert elicitation and data noise learning for material flow analysis using bayesian inference. Journal of Industrial Ecology, 27(4):1105–1122, 2023. doi: 10.1111/jiec.13399.
Fischer-Kowalski et al. [2011] M. Fischer-Kowalski, F. Krausmann, S. Giljum, S. Lutter, A. Mayer, S. Bringezu, Y. Moriguchi, H. Schütz, H. Schandl, and H. Weisz. Methodology and Indicators of Economy-wide Material Flow Accounting. Journal of Industrial Ecology, 15(6):855–876, 2011. doi: https://doi.org/10.1111/j.1530-9290.2011.00366.x. URL https://onlinelibrary.wiley.com/doi/abs/10.1111/j.1530-9290.2011.00366.x.
Gelman et al. [2013] A. Gelman, J. Carlin, H. Stern, D. Dunson, A. Vehtari, and D. Rubin. Bayesian Data Analysis, Third Edition. Chapman and Hall/CRC, 2013. ISBN 9781439840955.
Gottschalk et al. [2010] F. Gottschalk, R. W. Scholz, and B. Nowack. Probabilistic material flow modeling for assessing the environmental exposure to compounds: Methodology and an application to engineered nano-TiO2 particles. Environmental Modelling & Software, 25(3):320–332, 2010. ISSN 1364-8152. doi: https://doi.org/10.1016/j.envsoft.2009.08.011. URL https://www.sciencedirect.com/science/article/pii/S1364815209002394.
Graedel et al. [2005] T. E. Graedel, D. van Beers, M. Bertram, K. Fuse, R. B. Gordon, A. Gritsinin, E. M. Harper, A. Kapur, R. J. Klee, R. Lifset, L. Memon, and S. Spatari. The Multilevel Cycle of Anthropogenic Zinc. Journal of Industrial Ecology, 9(3):67–90, 2005. doi: https://doi.org/10.1162/1088198054821573. URL https://onlinelibrary.wiley.com/doi/abs/10.1162/1088198054821573.
Hastie et al. [2001] T. Hastie, R. Tibshirani, and J. Friedman. The Elements of Statistical Learning. Springer Series in Statistics. Springer New York Inc., New York, NY, USA, 2001.
Hoerl and Kennard [2000] A. E. Hoerl and R. W. Kennard. Ridge Regression: Biased Estimation for Nonorthogonal Problems. Technometrics, 42(1):80–86, 2000. ISSN 00401706. URL http://www.jstor.org/stable/1271436.
Hoffman and Gelman [2014] M. Hoffman and A. Gelman. The No-U-Turn Sampler: Adaptively Setting Path Lengths in Hamiltonian Monte Carlo. Journal of Machine Learning Research, 15:1953–1623, 2014.
Horn and Johnson [1985] R. A. Horn and C. R. Johnson. Matrix Analysis. Cambridge University Press, 1985.
Johnstone [2010] I. M. Johnstone. High dimensional bernstein-von mises: Simple examples. Institute of Mathematical Statistics Collections, page 87–98, 2010. doi: 10.1214/10-imscoll607.
Jowitt [2020] S. M. Jowitt. COVID-19 and the Global Mining Industry. SEG Discovery, (122):33–41, 07 2020. ISSN 1550-297X. doi: 10.5382/SEGnews.2020-122.fea-02. URL https://doi.org/10.5382/SEGnews.2020-122.fea-02.
Jowitt et al. [2020] S. M. Jowitt, G. M. Mudd, and J. F. H. Thompson. Future availability of non-renewable metal resources and the influence of environmental, social, and governance conflicts on metal production. Nature News, Sep 2020. URL https://www.nature.com/articles/s43247-020-0011-0.
Kruschke [2015] J. K. Kruschke. Doing Bayesian Data Analysis: A tutorial with R, Jags, and Stan. Academic Press, 2015.
Lan and Kang [2023] S. Lan and L. Kang. Sampling constrained continuous probability distributions: A review. WIREs Computational Statistics, 15(6):e1608, 2023. doi: https://doi.org/10.1002/wics.1608. URL https://wires.onlinelibrary.wiley.com/doi/abs/10.1002/wics.1608.
Laner et al. [2015] D. Laner, J. Feketitsch, H. Rechberger, and J. Fellner. A novel approach to characterize data uncertainty in material flow analysis and its application to plastics flows in austria. Journal of Industrial Ecology, 20(5):1050–1063, 2015. doi: 10.1111/jiec.12326.
Lederer [2022] J. C. Lederer. Fundamentals of high-dimensional statistics: With exercises and R labs. Springer Texts in Statistics. Springer, 1 edition, 2022.
Liu et al. [2013] G. Liu, C. E. Bangs, and D. B. Müller. Stock dynamics and emission pathways of the global aluminium cycle. Nature Climate Change, 3(4):338–342, Apr 2013. ISSN 1758-6798. doi: 10.1038/nclimate1698. URL https://doi.org/10.1038/nclimate1698.
Lupton and Allwood [2018] R. C. Lupton and J. M. Allwood. Incremental Material Flow Analysis with Bayesian Inference. Journal of Industrial Ecology, 22(6):1352–1364, 2018. doi: https://doi.org/10.1111/jiec.12698. URL https://onlinelibrary.wiley.com/doi/abs/10.1111/jiec.12698.
Mannan and Al-Ghamdi [2020] M. Mannan and S. G. Al-Ghamdi. Environmental impact of water-use in buildings: Latest developments from a life-cycle assessment perspective. Journal of Environmental Management, 261:110198, 2020. ISSN 0301-4797. doi: https://doi.org/10.1016/j.jenvman.2020.110198. URL https://www.sciencedirect.com/science/article/pii/S030147972030133X.
Matsubae-Yokoyama et al. [2009] K. Matsubae-Yokoyama, H. Kubo, K. Nakajima, and T. Nagasaka. A Material Flow Analysis of Phosphorus in Japan. Journal of Industrial Ecology, 13(5):687–705, 2009. doi: https://doi.org/10.1111/j.1530-9290.2009.00162.x. URL https://onlinelibrary.wiley.com/doi/abs/10.1111/j.1530-9290.2009.00162.x.
Miatto et al. [2017] A. Miatto, H. Schandl, T. Fishman, and H. Tanikawa. Global Patterns and Trends for Non-Metallic Minerals used for Construction. Journal of Industrial Ecology, 21(4):924–937, 2017. doi: https://doi.org/10.1111/jiec.12471. URL https://onlinelibrary.wiley.com/doi/abs/10.1111/jiec.12471.
Mirsky [1975] L. Mirsky. A trace inequality of John von Neumann. Monatshefte für Mathematik, 79(4):303–306, Dec 1975. ISSN 1436-5081. doi: 10.1007/BF01647331. URL https://doi.org/10.1007/BF01647331.
Mudd [2021] G. M. Mudd. Assessing the Availability of Global Metals and Minerals for the Sustainable Century: From Aluminium to Zirconium. Sustainability, 13(19), 2021. ISSN 2071-1050. doi: 10.3390/su131910855. URL https://www.mdpi.com/2071-1050/13/19/10855.
Myers et al. [2019a] R. Myers, B. Reck, and T. Graedel. YSTAFDB, a unified database of material stocks and flows for sustainability science. Scientific Data, 6, 2019a. doi: 10.1038/s41597-019-0085-7. URL http://dx.doi.org/10.1038/s41597-019-0085-7.
Myers et al. [2019b] R. J. Myers, T. Fishman, B. K. Reck, and T. E. Graedel. Unified Materials Information System (UMIS): An Integrated Material Stocks and Flows Data Structure. Journal of Industrial Ecology, 23(1):222–240, 2019b. doi: https://doi.org/10.1111/jiec.12730. URL https://onlinelibrary.wiley.com/doi/abs/10.1111/jiec.12730.
Reck et al. [2008] B. K. Reck, D. B. Müller, K. Rostkowski, and T. E. Graedel. Anthropogenic nickel cycle: Insights into use, trade, and recycling. Environmental Science & Technology, 42(9):3394–3400, May 2008. ISSN 0013-936X. doi: 10.1021/es072108l. URL https://doi.org/10.1021/es072108l.
Salvatier et al. [2016] J. Salvatier, T. V. Wiecki, and C. Fonnesbeck. Probabilistic programming in Python using PyMC3. PeerJ Computer Science, 2:e55, apr 2016. doi: 10.7717/peerj-cs.55. URL https://doi.org/10.7717/peerj-cs.55.
Schwab and Rechberger [2017] O. Schwab and H. Rechberger. Information content, complexity, and uncertainty in material flow analysis. Journal of Industrial Ecology, 22(2):263–274, 2017. doi: 10.1111/jiec.12572.
Schwab et al. [2016a] O. Schwab, D. Laner, and H. Rechberger. Quantitative evaluation of data quality in regional material flow analysis. Journal of Industrial Ecology, 21(5):1068–1077, 2016a. doi: 10.1111/jiec.12490.
Schwab et al. [2016b] O. Schwab, O. Zoboli, and H. Rechberger. A data characterization framework for material flow analysis. Journal of Industrial Ecology, 21(1):16–25, 2016b. doi: 10.1111/jiec.12399.
Tibshirani [1996] R. Tibshirani. Regression Shrinkage and Selection via the Lasso. Journal of the Royal Statistical Society. Series B (Methodological), 58(1):267–288, 1996. ISSN 00359246. URL http://www.jstor.org/stable/2346178.
Vaart [1998] A. W. v. d. Vaart. Asymptotic Statistics. Cambridge Series in Statistical and Probabilistic Mathematics. Cambridge University Press, 1998. doi: 10.1017/CBO9780511802256.
Wainwright [2019] M. J. Wainwright. High-Dimensional Statistics: A Non-Asymptotic Viewpoint. Cambridge Series in Statistical and Probabilistic Mathematics. Cambridge University Press, 2019. doi: 10.1017/9781108627771.
Westbroek et al. [2021] C. D. Westbroek, J. Bitting, M. Craglia, J. M. C. Azevedo, and J. M. Cullen. Global material flow analysis of glass: From raw materials to end of life. Journal of Industrial Ecology, 25(2):333–343, 2021. doi: https://doi.org/10.1111/jiec.13112. URL https://onlinelibrary.wiley.com/doi/abs/10.1111/jiec.13112.
Zou and Hastie [2005] H. Zou and T. Hastie. Regularization and Variable Selection via the Elastic Net. Journal of the Royal Statistical Society. Series B (Statistical Methodology), 67(2):301–320, 2005. ISSN 13697412, 14679868. URL http://www.jstor.org/stable/3647580.

Appendix A Supplementary Information

A.1 Parent and child processes

In this section we elaborate on the parent and child process framework used to model disaggregation of processes or flows in MFA systems, and provide a simple illustrative example. Recall that a child process is any process which does not contain subprocesses, and a parent process is any process which does contain subprocesses. This by definition partitions all processes in the system into either child or parent processes, and all parent processes can be broken down into a set of constituent child processes. This partition of parent and child processes is useful because it allows multiple levels of disaggregation to be modelled using just two types of processes with a bottom up approach. In particular, only the child processes need to be modelled, and the parent processes variables are simply the sum of its constituent child processes variables. To illustrate this, consider the (non mass conserved) simple example of Figure 5:

From a typical top down approach, this example system contains 3 levels of disaggregation, since ‘Process B’ contains ‘Process A’, and ‘Process A’ in turn contains ‘Process 1’ and ‘Process 2’. In our framework however, the child processes are ‘Process 1’,‘Process 2’,‘Process 3’,‘Process 4’ and ‘Process 5’ while the parent processes are ‘Process A’, ‘Process B’ and ‘Process C’. The data in Figure 5 are the flows $U_{1,3}=1.7$ , $U_{4,B}=2.3$ , $U_{B,C}=10.4$ and $U_{A,5}=5.8$ , and the change of stock $S_{C}=11.6$ . The change in stock or flow data involving parent or aggregated processes can all be expressed in terms of flows involving child processes:

$\displaystyle S_{C}$	$\displaystyle=S_{4}+S_{5}$	(19)
$\displaystyle U_{4,B}$	$\displaystyle=U_{4,1}+U_{4,2}+U_{4,3}$	(20)
$\displaystyle U_{B,C}$	$\displaystyle=U_{1,4}+U_{2,4}+U_{3,4}+U_{1,5}+U_{2,5}+U_{3,5}$	(21)
$\displaystyle U_{A,5}$	$\displaystyle=U_{1,5}+U_{2,5}$	(22)

The data in this example can therefore be formulated with the following design matrix:

\setcounter{MaxMatrixCols}{12}\begin{pmatrix}0&0&1&0&0&0&0&0&0&0&0&0\\ 1&1&0&0&0&0&0&0&0&0&0&0\\ 0&0&0&0&0&0&0&0&0&1&1&1\\ 0&0&0&1&1&1&1&1&1&0&0&0\\ 0&0&0&0&1&0&1&0&0&0&0&0\end{pmatrix}\begin{pmatrix}S_{4}\\ S_{5}\\ U_{1,3}\\ U_{1,4}\\ U_{1,5}\\ U_{2,4}\\ U_{2,5}\\ U_{3,4}\\ U_{3,5}\\ U_{4,1}\\ U_{4,2}\\ U_{4,3}\end{pmatrix}=\begin{pmatrix}1.7\\ 11.6\\ 2.3\\ 10.4\\ 5.8\end{pmatrix}

(23)

A.2 Model convergence diagnostics

In this section we describe model diagnostic checks used to examine the convergence of the Markov Chains Monte Carlo (MCMC) sampling algorithm (No-U Turn Sampler of Hoffman and Gelman [2014]). We present trace plots for a selection of flow and change in stock variables in the aluminium model for illustration. Using the No-U-Turn Sampler, we sampled 2 independent chains of $12000$ samples, with the first $2000$ samples used for tuning and discarded from the final posterior samples. The purpose of the tuning samples are to give the Markov Chains iterations to converge to the target posterior distribution before extracting samples to approximate the posterior distribution. The traceplots in Figure 6 suggests both chains have converged to the same distribution. Furthermore, for both scenario A and scenario B of the aluminium model, the Gelman-Rubin diagnostic statistic (see for example Chapter 11 of Gelman et al. [2013]) for all flow and change in stock variables in the model have converged to $1.0$ and no divergent samples were reported, indicating the posterior samples are reliable (Betancourt [2017]).

A.3 Posterior pairplots

Here we present a small selection of pairplots to illustrate correlation between the posterior distribution of flow variables. In particular, we display all the outflows of the ‘Foil’ process, which is a subprocess of semi-manufacturing. The outflow processes of ‘Foil’ are ‘InternalRemelting’, ‘Manuf_TAU’, ‘Manuf_TOT’, ‘Manuf_POT’, ‘Manuf_CD’. Here one can see from the pairplots that it’s very uncommon for the total of any two outflows to exceed around $6$ Mt, which is consistent with the data which has the total outflow of ‘Foil’ at $6.4$ Mt.

A.4 Detail of prior parameters for the aluminium model

In this section we give detail on the choice of prior parameters for the model evaluation of the aluminium dataset, A summary of the parameter values described in this section can be found in Table 1. Below we describe and motivate the choices:

The prior mode $\mu_{i}$ or $\mu_{j,k}$ of flow and change in stock variables respectively are chosen to be equal to the nearest power of $10$ of the reported value when it is available. For example, the flow from ‘Mining’ to ‘Refining’ with a reported value $35.4$ Mt is assigned a prior mode of $\mu_{1,2}=10.0$ Mt. If the flow or change in stock variable has no reported value, it is assigned an uninformative prior mode of $1.0$ Mt instead.

The prior standard deviation $\sigma_{i}$ for the change in stock variables are chosen to be $\sigma_{i}=\max(\sqrt{40}|\mu_{i}|,0.1)$ , or $10.0$ if there is no reported value for the flow or change in stock variable. The corresponding parameter $\sigma_{j,k}$ for the flow variables are chosen similarly. For example, $\sigma_{1,2}=\sqrt{40}*10.0=63.25$ for the flow from ‘Mining’ to ‘Refining’. This choice is motivated by when $\mu_{i}$ is specified to the correct order of magnitude, the true value of the variable is at most $4.5|\mu_{i}|$ away, and so $\sigma_{i}$ is chosen on a similar scale but somewhat larger to ensure the prior is not over-confident. The maximum here is to ensure a minimum prior standard deviation of $0.1$ .

Note we are not advocating for this choice of prior distribution to be uniquely correct. The overall rationale of this choice for the prior is to assign a weakly informative prior distribution with a prior mode that captures the order of magnitude of the flow or change in stock, coupled with a relatively large prior variance, in order to reflect a realistic baseline of domain knowledge to aim for when conducting MFA.

For lack of better information, the standard deviation parameters for change in stock data noise $\tau_{i^{\prime}}$ are chosen to be $\tau_{i^{\prime}}=\max(0.1|Y_{i^{\prime}}|,0.1)$ , where $Y_{i^{\prime}}$ are the observed change in stock data. The corresponding parameters $\tau_{j^{\prime}}$ for flow data noise is chosen similarly. In other words, the data standard deviation is equal to $10\%$ of the observed data as in Lupton and Allwood [2018], but with a minimum of $0.1$ to account for a minimum degree of uncertainty from rounding or other measurement errors. For the ratio data noise, the standard deviation parameters $\tau_{k^{\prime}}$ are chosen with a somewhat smaller minimum $\tau_{k^{\prime}}=\max(0.1Y_{k^{\prime}},0.01)$ , where $Y_{k^{\prime}}$ is the observed ratio. While the choice of standard deviation parameters here is somewhat arbitrary, the posterior predictive checks performed on the aluminium model do not exhibit extreme Bayesian p-values (smaller than 0.05 or greater than 0.95), which suggests the standard deviation parameters chosen are reasonable. This also suggests one way to tune standard deviation parameters is to start with a small choice of standard deviation parameters (such as $10\%$ of the observed data value), run the model, identify the data points which exhibit extreme Bayesian p-values in the posterior predictive checks, and increase the standard deviation parameter for those points and rerun the model until no extreme Bayesian p-values remain.

For the mass conservation conditions, a small constant standard deviation of $\tau_{l^{\prime}}=0.05$ is chosen for all mass balance conditions. Ideally these parameters should be chosen based on the modeller’s confidence regarding epistemic uncertainty in the system, giving higher standard deviations to processes where there is more uncertainty in the system definition. Like with the standard deviation parameters for the data, posterior predictive checks can help select suitable parameter values, where the standard deviation parameters for mass balance conditions with extreme Bayesian p-values should be increased until the Bayesian p-values are no longer extreme. Without prior confidence of epistemic uncertainty in the system, we recommend starting with a small constant standard deviation for the mass conservation conditions, but not too small so that the mass conservation conditions are sufficiently relaxed to ensure the NUTS algorithm converges well. In this case it turned out the choice of $\tau_{l^{\prime}}=0.05$ did not produce any extreme Bayesian p-values, produces well converged posterior samples and well mass balanced flows (see subsequent section for more detail).

Parameter	Parameter prior value
$\mu_{i}$	nearest power of $10$ of reported value, or $1.0$ if unavailable
$\mu_{j,k}$	nearest power of $10$ of reported value, or $1.0$ if unavailable
$\sigma_{i}$	$\max(\sqrt{40}\|\mu_{i}\|,0.1)$ , or $10.0$ if reported value is unavailable
$\sigma_{j,k}$	$\max(\sqrt{40}\mu_{j,k},0.1)$ , or $10.0$ if reported value is unavailable
$\tau_{i^{\prime}}$	$\max(0.1\|Y_{i^{\prime}}\|,0.1)$
$\tau_{j^{\prime}}$	$\max(0.1Y_{j^{\prime}},0.1)$
$\tau_{k^{\prime}}$	$\max(0.1Y_{k^{\prime}},0.01)$
$\tau_{l^{\prime}}$	0.05

Table 1: Table of parameters for the aluminium model

A.5 Assessment of posterior mass balance for the aluminium model

In this section we analyse how well the mass balance conditions are satisfied by the aluminium model, given the parameters chosen in the previous section, in particular the standard deviation parameters $\tau_{l^{\prime}}=0.05$ for the mass balance conditions. We examine the posterior distribution of the mass balance conditions to see how well they are centred around $0$ .

From the Figure above, it can be seen that the posterior mean of the mass balance conditions is very close to $0$ in both cases, having a mean of $-0.001$ Mt for both scenario A and scenario B when averaged over all processes, which is orders of magnitudes smaller than the average posterior flow magnitude of around $2.3$ Mt for both scenario A and B. This ensures that the posterior mean of the flow and stock change variables in the system, which is also the point estimate we report, is very well balanced. The individual posterior samples are somewhat less mass balanced, exhibiting a standard deviation of around $0.05$ Mt for all processes. However, this is still a small level of mass imbalance and it is not necessary for every posterior sample to be perfectly mass balanced for the model to produce useful point estimation or uncertainty quantification.

A.6 Zinc cycle simulation study

We conduct a simulation study to investigate the estimation accuracy and uncertainty quantification of our model. For this end we consider the zinc cycle in China, ca. 1994-1998, from Graedel et al. [2005]. The zinc cycle contains no missing data values, so for the purposes of this simulation, we treat the reported data values as the true parameter values of the flow and stock change variables. This allows us to assess our model by withholding some data from it and checking the model output against the true values of the stock change and flows. We examine how our model performs in terms of estimation accuracy and uncertainty quantification under different levels of prior knowledge, as data is incrementally added into the model. The zinc model considered is linear for ease of comparison with other models that require data to be in the form of a design matrix. However, we note that common MFA data types including flows, flow ratios and stock changes can be expressed under a linear model.

To assess estimation accuracy of the model, we calculate how the error of the model evolves as data is randomly added into the model one at a time, starting from no data, until the full dataset becomes available to the model. We consider two types of error, the root mean square error (rmse) and the maximum error, of the model’s predictions compared to the reported values of the flow and change in stocks variables. The posterior mode is taken as the model prediction. To account for the random order in which data is added into the model, we repeat this process 50 times and calculate the average error (for both rmse and the maximum error) over the 50 runs for each possible data size from $0$ to $20$ .

For comparison, we consider several alternate models. First, we consider an alternate Bayesian Gaussian model which has the advantage of a closed form solution, but carries the disadvantage of assigning significant probability to negative values for flow variables in the posterior distribution, which should be nonnegative as negative flows are not physically meaningful. Detail of the Gaussian model is provided in Section A.8. In addition, we also investigate the effect of different levels of prior knowledge has on the error. We utilise two different levels of prior knowledge, the first is an ‘uninformative’ prior which sets the prior mode of all variables to the average absolute value of the full data, and the correct sign for change in stock variables. This intuitively represents a prior that knows the average order of magnitude of the system but has little visibility on the individual magnitude of each flow or change in stock. The second is a ‘weakly informative’ prior which sets the prior mode of each flow and change in stock variable to the nearest power of $10$ of the true value. This intuitively represents a prior that has knowledge of each flow and change in stock variable to the nearest order of magnitude. We also provide results from non Bayesian regression models for comparison, specifically ridge regression and multilayer perceptron (see e.g. Hastie et al. [2001]). For each run, both algorithms are trained on the same datasets as the proposed and Gaussian model, and outputs a prediction on the vector of all flows and change in stock variables. While these models do not have a principled way of incorporating prior information, one can nevertheless train those models on the data and use them to estimate the value of all flow and change in stock variables in a system, which makes a comparison with Bayesian MFA models meaningful.

Figure 10 shows the results of our analysis, where the rmse and maximum error of the model are plotted against the number of data points available to the model, averaged over 50 runs. For ridge regression and multilayer perceptron, the starting rmse at no data is around 5.4, whereas for the uninformative prior it is around 4.0 and for the weakly informative prior around 1.5. As data is added, all models exhibit the trend of monotonically decreasing error, but the proposed and Gaussian models exhibit noticeably lower error until almost all the data becomes available to the model, indicating incorporating prior information leads to more accurate estimates. Notably, it takes around 10 data points for the uninformative prior models and 12 data points for ridge regression to achieve the starting rsme (around 1.5) of the weakly informative prior model. Therefore, one can consider only knowing the order of magnitude of each flow or change in stock to be roughly equivalent to $10$ data points in terms of rmse in this model, demonstrating the utility of being able to incorporate knowledge via a Bayesian prior in such an underdetermined system. For the Gaussian model, theoretical bounds on the mean square error provided in Section A.8 also support the trend exhibited in Figure 10, and the intuition that having an informative prior can greatly increase the model’s accuracy for MFA problems when there is a shortage of data but available expert domain knowledge.

To assess uncertainty quantification of the posterior distribution, we examine whether the $95\%$ posterior marginal highest density intervals (HDIs) of our model have similar level of frequentist coverage, i.e. form correct confidence intervals. Unlike credible intervals which have a designated posterior probability of containing the unknown parameter, confidence intervals are constructed such that upon repeated experiments (i.e. generate samples of data), a designated proportion (known as coverage) of the confidence intervals constructed will contain the true value of the unknown parameter. Bayesian credible intervals are in general not guaranteed to have good coverage, particularly in high-dimensional settings that are typical in MFA, but nevertheless it is desirable for them to have good coverage. Coverage is a property of the model procedure and does not require the repeated experiments to be the same experiment or dataset. It is therefore desirable for uncertainty quantification generated by a model procedure to contain the true value of the unknown parameter most of the time, over many runs for possibly different datasets. We investigate the coverage of our model’s credible intervals empirically in order to understand whether they can be used reliably as confidence sets.

The high dimensional nature of MFA models means the dimension of parameter space $p$ (corresponding to the number of flow and change in stock variables in the model) is usually greater or similar in size compared to the number of available data $n$ , and $n$ is typically not large in magnitude. Therefore, results which guarantees asymptotic frequentist coverage of credible regions, such as the Bernstein-von Mises theorem (see e.g. Vaart [1998]), may not apply in MFA problems. We therefore study in simulations the frequentist coverage of our model’s posterior credible intervals, in an effort to understand whether these can be used as reliable confidence intervals. Specifically, we sample $300$ sets of the zinc dataset $\bm{Y}_{1},\bm{Y}_{2},\dots,\bm{Y}_{300}$ from the model likelihood $p(\bm{Y}|\bm{\theta}_{0})$ , where we take the ‘true values’ $\bm{\theta}_{0}$ to be the originally reported flow and change in stock values reported in Graedel et al. [2005] . For each dataset sampled, we add data to the model in batches of $5$ datapoints at a time (in the order listed from top to bottom in Table 2), and calculate the posterior marginal HDIs to see if they contain the true value for each dataset size $n=5,10,15,20$ . The frequencies at which the posterior marginal HDIs contain the true value over the $300$ runs, known as the coverage probability, are reported in Table 2.

	Coverage probability(%), average width of $95\%$ HDI
	Weakly informative prior				Uninformative prior
Variable Name	5 data	10 data	15 data	20 data	5 data	10 data	15 data	20 data
$\Delta$ Stock: Use	95.3, 3.60	96.3, 3.44	98.0, 3.16	96.3, 3.15	96.0, 3.76	95.3, 3.63	97.7, 3.23	95.7, 3.21
Production to ImportExport	90.7, 3.56	96.3, 3.40	96.7, 3.41	98.0, 2.92	91.3, 3.78	95.7, 3.65	97.7, 3.60	96.0, 3.15
$\Delta$ Stock: Environment	98.0, 3.50	97.0, 3.35	97.3, 3.35	98.7, 3.18	95.0, 3.66	97.3, 3.44	96.0, 3.46	98.3, 3.25
WM to Production	94.7, 3.39	96.0, 3.15	96.7, 3.12	93.0, 3.02	94.7, 3.54	94.0, 3.47	94.7, 3.35	91.3, 3.24
ImportExport to WM	100.0, 0.83	100.0, 0.82	100.0, 0.82	100.0, 0.79	96.0, 2.29	99.3, 2.16	97.0, 2.12	98.7, 1.82
Unknown to WM	100.0, 5.07	99.0, 2.48	99.3, 2.18	100.0, 2.19	100.0, 8.52	99.0, 2.53	98.0, 2.42	97.7, 2.48
Lithosphere to Production	100.0, 7.73	96.3, 3.37	95.3, 3.38	93.7, 2.87	0.00, 11.3	96.0, 3.63	94.7, 3.55	94.7, 2.97
$\Delta$ Stock: Production	100.0, 0.39	100.0, 0.39	100.0, 0.39	100.0, 0.39	100.0, 16.1	95.0, 3.67	95.3, 3.59	96.0, 3.50
Production to F&M	100.0, 7.69	94.7, 3.32	99.0, 3.19	93.7, 3.06	3.33, 8.25	89.0, 3.55	91.7, 3.39	87.0, 3.21
WM to Environment	100.0, 4.19	98.0, 2.69	98.7, 2.50	99.0, 2.55	100.0, 4.88	96.3, 2.98	97.3, 2.72	95.7, 2.72
F&M to Use	100.0, 7.54	99.7, 7.20	96.3, 3.03	93.0, 2.95	100.0, 9.55	100.0, 9.57	96.3, 3.14	93.3, 3.02
$\Delta$ Stock: Unknown	100.0, 6.58	100.0, 4.62	100.0, 2.97	98.7, 2.97	100.0, 13.3	100.0, 7.20	98.0, 3.16	99.3, 3.16
Production to Unknown	100.0, 0.20	100.0, 0.20	100.0, 0.20	100.0, 0.20	100.0, 9.31	97.7, 4.90	93.7, 2.03	96.7, 2.01
F&M to WM	100.0, 6.44	100.0, 6.52	93.3, 3.30	94.3, 3.12	100.0, 10.6	100.0, 11.7	93.3, 3.45	93.7, 3.21
Use to WM	100.0, 5.91	100.0, 5.93	98.3, 2.60	98.0, 2.54	100.0, 7.32	100.0, 7.71	98.0, 2.63	98.7, 2.57
WM to F&M	100.0, 7.79	100.0, 7.29	100.0, 3.79	98.3, 2.63	95.0, 14.1	100.0, 12.4	100.0, 4.16	97.7, 2.74
$\Delta$ Stock: ImportExport	97.0, 5.21	99.3, 5.10	99.7, 5.10	96.0, 3.09	84.0, 9.63	90.3, 7.30	95.3, 6.55	93.0, 3.26
$\Delta$ Stock: Lithosphere	100.0, 8.21	99.0, 4.95	99.3, 4.95	95.3, 3.04	0.00, 12.2	99.0, 5.24	98.7, 5.18	95.7, 3.11
ImportExport to F&M	100.0, 0.86	100.0, 0.85	100.0, 0.82	100.0, 0.79	100.0, 7.19	100.0, 4.39	100.0, 3.17	98.0, 1.80
Production to Environment	100.0, 3.77	100.0, 3.74	99.3, 3.92	95.0, 2.90	100.0, 4.20	99.0, 3.74	99.7, 4.14	96.0, 2.97

Table 2: Table of coverage probability and average width of 95% highest density intervals (HDI) of the marginal posterior distributions of all flow and change in stock variables in the zinc data under different amounts of data and prior information, computed over

300

runs. Data was added in batches of

5

to the model in the order that they are listed in this table (from top to bottom).

From Table 2, it can be seen that the 95% HDI possess similar levels of coverage probability in most cases. The only notable exception being the uninformative prior at $5$ data points, where the coverage probability are close to $0$ for the flow variables ‘Lithosphere to Production’ and ‘Production to F& M’, and the change in stock variable for ‘Lithosphere’. This is likely caused by the first $5$ data points not containing any data on the ‘Lithosphere’ process, as well as containing little data on ‘Production’, so the model was unable deduce the true magnitude of how much material was taken from the ‘Lithosphere’ with an uninformative prior. On the other hand, the weakly informative prior does not possess this issue as the prior has sufficient coverage over the true value. In addition, the average width of the $95\%$ HDI are generally lower for the weakly informative prior until data for that variable becomes available, which is expected from a more informative prior.

A.7 Detail of parameters for the zinc models

In this section we give detail on the choice of model parameters for all models used for the zinc case study, including how the prior parameters were chosen. Recall that we considered two different priors for the zinc model, an uninformative prior and a weakly informative prior. For the uninformative prior, the prior modes $\mu_{i}$ of the change in stock variables are chosen to be $3.6575$ (measured in $10^{5}$ tonnes/Yr), equal to the mean of the absolute value of all the stock change and flow variables in the data, to represent a prior that only has a rough knowledge of the average order of magnitude of the system, but not the order of magnitude of each individual flow or change in stock. For weakly informative prior, $\mu_{i}$ are chosen the same way as the aluminium case, where the prior mode is set to the nearest power of $10$ of the ‘true’ reported value. The prior standard deviation parameters $\sigma_{i}$ are chosen to be a constant $\sqrt{40}$ in the uninformative case, and $\sigma_{i}=\max(\min(4|\mu_{i}|,4),0.1)$ in the weakly informative case. Like in the aluminium model, $\sigma_{i}$ for the weakly informative prior is chosen at a similar scale compared to $4.5|\mu_{i}|$ but somewhat smaller and capped at a maximum of $4.0$ to reflect the fact that the size of flows are relatively smaller in the zinc data (when measured in $10^{5}$ tonnes/Yr, compared to the aluminium data which is measured in Mt). The corresponding prior parameters for the flow variables $\mu_{j,k}$ and $\sigma_{j,k}$ are chosen similarly.

The noise standard deviation parameters of the observed data is chosen to be a constant $\tau=1.0$ throughout for all data, and similarly for the mass conservation conditions. Here the noise parameters are chosen to be somewhat larger than the aluminium model to speed up the computation of the No-U-Turn Sampler MCMC algorithm, as the model needed to be computed $2400$ times over the varying degrees of prior knowledge and available data to generate the coverage probability results in Table 1 of the main text. In particular, small values of the noise parameter induces high curvature which makes the sampling algorithm take longer to explore the posterior distribution. Similarly, for assessing point estimation (root mean squared error and maximum error) of the model on the zinc data, the posterior mode of the model is estimated directly via optimisation for faster computational speed, rather than through approximating the entire posterior distribution using MCMC.

Parameter	Parameter prior value (weakly informative prior)	Parameter prior value (uninformative prior)
$\mu_{i}$	nearest power of $10$ of reported value	$\pm 3.6575$ (sign depending if the change in stock is positive or negative)
$\mu_{j,k}$	nearest power of $10$ of reported value	$3.6575$
$\sigma_{i}$	$\max(\min(4\|\mu_{i}\|,4),0.1)$	$\sqrt{40}$
$\sigma_{j,k}$	$\max(\min(4\mu_{j,k},4),0.1)$	$\sqrt{40}$
$\tau$	1.0	1.0

Table 3: Table of parameters for the zinc model

The ridge regression model was trained with a regularization parameter of $\lambda=1.0/40.0=0.025$ , which was chosen to match the prior variance and noise standard deviation in the uninformative prior case for the Gaussian model. To see this, note the solution of the ridge regression is equal to the posterior mode of a model with prior $\bm{\theta}\sim\mathcal{N}(0,\sigma^{2}I)$ and likelihood $\bm{Y}|\bm{\theta}\sim\mathcal{N}(X\bm{\theta},\tau^{2}I)$ , and the regularization parameter can be shown to be equal to $\tau^{2}/\sigma^{2}$ . So effectively the ridge regression solution is similar to the uninformative prior case, except the prior mean is set to $0$ instead which is arguably less informative.

The multilayer perceptron regressor was trained with the default settings in the sklearn Python library, except with the maximum number of iterations increased to $1000$ . In particular the default loss function is the squared loss and default number of hidden layers equal to $100$ .

A.8 Gaussian model

For model comparison purposes and to gain intuition into how Bayesian models perform for MFA problems, it is useful to examine a model with simplified assumptions. In particular, we consider a Gaussian model, where both the prior and the likelihood function are assumed to be normally distributed, and the data is assumed to be linear in terms of $\bm{\theta}$ (recall from the Methods section in the main text of the paper that the most common forms of MFA data can be expressed linearly in $\bm{\theta}$ ):

	$\displaystyle\bm{\theta}$	$\displaystyle\sim\mathcal{N}(\bm{\mu},\Sigma)$		(24)
	$\displaystyle Y_{i}\|\bm{\theta}$	$\displaystyle\sim\mathcal{N}(\bm{x_{i}}^{\top}\bm{\theta},\tau_{i}^{2})$		(25)

where $\bm{\mu}$ is the prior mean and $\Sigma$ is the (positive definite) prior covariance matrix. This gives a mathematically convenient conjugate posterior that is also normally distributed, with its posterior mean $\bm{\mu}^{n}$ and covariance $\Sigma^{n}$ available in closed form:

	$\displaystyle\bm{\mu}^{n}$	$\displaystyle=\bm{\mu}+\Sigma X^{\top}(X\Sigma X^{\top}+T)^{-1}(\bm{Y}-X\bm{% \mu})$		(26)
	$\displaystyle\Sigma^{n}$	$\displaystyle=\Sigma-\Sigma X^{\top}(X\Sigma X^{\top}+T)^{-1}X\Sigma$		(27)

where $T$ is a $n$ by $n$ diagonal matrix with entries $\tau_{i}^{2}$ , $1\leq i\leq n$ . An advantage of the Gaussian model is that it allows the posterior mean and covariance to be calculated directly without potentially computationally intensive MCMC algorithms. This makes the Gaussian model an easy to compute baseline to compare with more complicated Bayesian models. Furthermore, owing to the closed form of the posterior, we can bound the mean squared error (MSE) between the posterior mean $\bm{\mu}^{n}$ and the true parameter (stock changes and flows) values $\bm{\theta}^{*}$ . A discussion on this is provided below, which gives some intuition why Bayesian methods can perform better than non Bayesian methods in the low data setting $n\ll p$ .

While computationally convenient, an disadvantage with the Gaussian conjugate model is that the posterior distributions for flow variables can contain non negligible probability on negative values when there is insufficient data or uninformative priors, which is not physically meaningful as flow quantities should be strictly positive, making it not fully Bayesian in this sense. While the posterior mean of the Gaussian model can perform adequately in terms of point estimation, we do not recommend it for uncertainty quantification without very informative priors.

Nevertheless, the Gaussian model usefully gives some intuition into why Bayesian methods perform better than non Bayesian methods in the low data setting $n\ll p$ . Below we provide some theoretical analysis for the mean squared error of the Gaussian model.

Owing to the closed form of the posterior of the Gaussian model described in the Methods section of the main text, we can bound the mean squared error (MSE) between the posterior mean $\bm{\mu}^{n}$ and the true parameter (stocks changes and flows) values $\bm{\theta}^{*}$ as follows:

Theorem 1.

Let $\tau_{i}^{2}=\tau^{2}$ , in other words $T=\tau^{2}I$ , $I$ the identity matrix. The mean squared error between the posterior mean of the Gaussian posterior $\bm{\mu}^{n}$ and the true stock change and flow values $\bm{\theta}^{*}$ can be bounded above in the following way:

	$\displaystyle\mathbb{E}(\\|\bm{\theta}^{}-\bm{\mu}^{n}\\|^{2}_{2})=\mathbb{E}(% \sum_{j=1}^{p}(\theta_{j}^{}-\mu_{j}^{n})^{2})$
	$\displaystyle\leq\operatorname{Tr}(\Sigma^{-1/2}(\bm{\theta}^{}-\bm{\mu})(\bm% {\theta}^{}-\bm{\mu})^{\top}\Sigma^{-1/2})\lambda_{1}(p-n+\sum_{j=1}^{n}\frac% {\tau^{4}}{(d_{j}^{2}+\tau^{2})^{2}})+\tau^{2}\sum_{j=1}^{n}\frac{\lambda_{1}d% _{j}^{2}}{(d_{j}^{2}+\tau^{2})^{2}}$		(28)

where $\lambda_{1}\geq\lambda_{2}\geq\dots\geq\lambda_{p}>0$ are the eigenvalues of $\Sigma$ in descending order, and $0\leq d_{1}\leq d_{2}\leq\dots\leq d_{n}$ are the singular values of $X\Sigma^{1/2}$ in ascending order.

The bound in Equation 28 helps to give some insight into how the mean squared error of the model evolves as more data is added. This is perhaps easiest to see in the case when $\tau$ is near $0$ (meaning the data is almost noiseless) and $0\leq\tau\ll d_{1}$ , which leaves

\operatorname{Tr}(\Sigma^{-1/2}(\bm{\theta}^{*}-\bm{\mu})(\bm{\theta}^{*}-\bm{% \mu})^{\top}\Sigma^{-1/2})\lambda_{1}(p-n)

as the dominant term. The $\operatorname{Tr}(\Sigma^{-1/2}(\bm{\theta}^{*}-\bm{\mu})(\bm{\theta}^{*}-\bm{% \mu})^{\top}\Sigma^{-1/2})$ part of this term can be interpreted as a measurement of how close the prior mean is to the true values of the parameters (it is $0$ when $\bm{\theta}^{*}=\bm{\mu}$ ) and does not depend on the data, while the factor $\lambda_{1}(p-n)$ decreases as $n$ increases, meaning as more data is added. This indicates that in a high dimensional setting $n\ll p$ which is inherent in MFA studies, the availability of informative priors is key to obtaining more accurate estimates of the true stock change and flow values.

Furthermore, in the case when $\Sigma=\lambda_{1}I$ , Equation 28 can be directly compared with the ridge regression case. Since ridge regression is equivalent to the case when $\bm{\mu}=\bm{0}$ and $\Sigma=\lambda_{1}I$ , Equation 28 is smaller for the Gaussian model compared to ridge regression whenever $\operatorname{Tr}((\bm{\theta}^{*}-\bm{\mu})(\bm{\theta}^{*}-\bm{\mu})^{\top})% =\|\bm{\theta}^{*}-\bm{\mu}\|^{2}_{2}$ is smaller than $\|\bm{\theta}^{*}\|^{2}_{2}$ . In other words whenever the prior mean $\bm{\mu}$ is closer to the true value in (L2) distance than the zero vector $\bm{0}$ , which is consistent with the simulations presented in Section A.6.

Proof of Theorem 1.

Recall that the posterior mean of the conjugate Gaussian model can be written as:

\displaystyle\bm{\mu}^{n}=\bm{\mu}+\Sigma X^{\top}(X\Sigma X^{\top}+\tau^{2}I)% ^{-1}(X(\bm{\theta}^{*}-\bm{\mu})+\bm{\epsilon})

This can be derived from looking at the joint distribution $[\bm{\theta},X\bm{\theta}+\bm{\epsilon}]^{\top}$ and using standard Gaussian conditioning formulae:

\displaystyle\begin{bmatrix}\bm{\theta}\\ X\bm{\theta}+\bm{\epsilon}\end{bmatrix}\sim\mathcal{N}\left(\begin{bmatrix}\bm% {\mu}\\ X\bm{\mu}\end{bmatrix},\begin{bmatrix}\Sigma&\Sigma X^{\top}\\ X\Sigma&X\Sigma X^{\top}+\tau^{2}I\end{bmatrix}\right)

(29)

Suppose $dim(X)=(n,p)$ , $dim(\Sigma)=(p,p)$ , $dim(\bm{\mu})=dim(\bm{\theta}^{*})=(p,1)$ , $dim(\bm{\epsilon})=(n,1)$ , and we’ll assume $n<p$ , since material flow analysis typically has less data than parameters. Assume $\Sigma$ is symmetric positive definite and $X$ has full rank. The mean squared error (MSE) between the posterior mean $\bm{\mu}^{n}$ and the true value $\bm{\theta}^{*}$ is equal to:

	$\displaystyle MSE$	$\displaystyle=\mathbb{E}\\|\bm{\theta}^{}-\bm{\mu}-\Sigma X^{\top}(X\Sigma X^{% \top}+\tau^{2}I)^{-1}(X(\bm{\theta}^{}-\bm{\mu})+\bm{\epsilon})\\|^{2}_{2}$
		$\displaystyle=\mathbb{E}(\bm{\beta}-\Sigma X^{\top}(X\Sigma X^{\top}+\tau^{2}I% )^{-1}(X\bm{\beta}+\bm{\epsilon}))^{\top}(\bm{\beta}-\Sigma X^{\top}(X\Sigma X% ^{\top}+\tau^{2}I)^{-1}(X\bm{\beta}+\bm{\epsilon}))$
		$\displaystyle=\mathbb{E}(\bm{\beta}^{\top}(I-\Sigma X^{\top}(X\Sigma X^{\top}+% \tau^{2}I)^{-1}X)^{\top}(I-\Sigma X^{\top}(X\Sigma X^{\top}+\tau^{2}I)^{-1}X)% \bm{\beta})$
		$\displaystyle+\mathbb{E}(\bm{\epsilon}^{\top}(\Sigma X^{\top}(X\Sigma X^{\top}% +\tau^{2}I)^{-1})^{\top}\Sigma X^{\top}(X\Sigma X^{\top}+\tau^{2}I)^{-1}\bm{% \epsilon})$

Where $\bm{\beta}=\bm{\theta}^{*}-\bm{\mu}$ . At this point we decompose the MSE into the bias and variance terms. Let

MSE_{1}=\mathbb{E}(\bm{\beta}^{\top}(I-\Sigma X^{\top}(X\Sigma X^{\top}+\tau^{% 2}I)^{-1}X)^{\top}(I-\Sigma X^{\top}(X\Sigma X^{\top}+\tau^{2}I)^{-1}X)\bm{% \beta})

denote the bias term. and

MSE_{2}=\mathbb{E}(\bm{\epsilon}^{\top}(\Sigma X^{\top}(X\Sigma X^{\top}+\tau^% {2}I)^{-1})^{\top}\Sigma X^{\top}(X\Sigma X^{\top}+\tau^{2}I)^{-1}\bm{\epsilon})

denote the variance term.

For the bias term, using the fact that the trace of a scalar is equal to itself and the cyclic property of trace, we have:

	$\displaystyle MSE_{1}=\operatorname{Tr}(\bm{\beta}^{\top}(I-\Sigma X^{\top}(X% \Sigma X^{\top}+\tau^{2}I)^{-1}X)^{\top}(I-\Sigma X^{\top}(X\Sigma X^{\top}+% \tau^{2}I)^{-1}X)\bm{\beta})$
	$\displaystyle=\operatorname{Tr}(\bm{\beta}\bm{\beta}^{\top}(I-\Sigma X^{\top}(% X\Sigma X^{\top}+\tau^{2}I)^{-1}X)^{\top}(I-\Sigma X^{\top}(X\Sigma X^{\top}+% \tau^{2}I)^{-1}X))$
	$\displaystyle=\operatorname{Tr}(\bm{\beta}\bm{\beta}^{\top}\Sigma^{-1/2}\Sigma% ^{1/2}(I-X^{\top}(X\Sigma X^{\top}+\tau^{2}I)^{-1}X\Sigma)\Sigma^{-1/2}\Sigma% \Sigma^{-1/2}(I-\Sigma X^{\top}(X\Sigma X^{\top}+\tau^{2}I)^{-1}X)\Sigma^{1/2}% \Sigma^{-1/2})$
	$\displaystyle=\operatorname{Tr}(\Sigma^{-1/2}\bm{\beta}\bm{\beta}^{\top}\Sigma% ^{-1/2}(I-\Sigma^{1/2}X^{\top}(X\Sigma X^{\top}+\tau^{2}I)^{-1}X\Sigma^{1/2})% \Sigma(I-\Sigma^{1/2}X^{\top}(X\Sigma X^{\top}+\tau^{2}I)^{-1}X\Sigma^{1/2}))$

Notice $\Pi_{\tau}=\Sigma^{1/2}X^{\top}(X\Sigma X^{\top}+\tau^{2}I)^{-1}X\Sigma^{1/2}$ is symmetric. Similarly $I-\Pi_{\tau}$ is symmetric and so $(I-\Pi_{\tau})^{2}$ is positive semidefinite.

Let $X\Sigma^{1/2}=UDV^{\top}$ be the singular value decomposition of $X\Sigma^{1/2}$ , where $d_{1}\leq d_{2}\leq\dots\leq d_{n}$ are the diagonal elements of $D$ (also known as the singular values of $X\Sigma^{1/2}$ ) arranged in ascending order, then we have:

\displaystyle\Pi_{\tau}

\displaystyle=VD^{\top}U^{\top}(UDD^{\top}U^{\top}+\tau^{2}I)^{-1}UDV^{\top}=% VD^{\top}(DD^{\top}+\tau^{2}I)^{-1}DV^{\top}

which has the same eigenvalues as $D^{\top}(DD^{\top}+\tau^{2}I)^{-1}D$ by similarity (Horn and Johnson [1985]), which are $d_{i}^{2}/(d_{i}^{2}+\tau^{2})$ and $0$ (repeated $p-n$ times). Likewise the eigenvalues of $I-\Pi_{\tau}$ are therefore $1$ (repeated $p-n$ times) and $\tau^{2}/(d_{i}^{2}+\tau^{2})$

returning to the bias term, we have:

$\displaystyle MSE_{1}$	$\displaystyle=\operatorname{Tr}(\Sigma^{-1/2}\bm{\beta}\bm{\beta}^{\top}\Sigma% ^{-1/2}(I-\Pi_{\tau})\Sigma(I-\Pi_{\tau}))$
	$\displaystyle\leq\operatorname{Tr}(\Sigma^{-1/2}\bm{\beta}\bm{\beta}^{\top}% \Sigma^{-1/2})\operatorname{Tr}((I-\Pi_{\tau})^{2}\Sigma)$
	$\displaystyle\leq\operatorname{Tr}(\Sigma^{-1/2}\bm{\beta}\bm{\beta}^{\top}% \Sigma^{-1/2})(\sum_{j=1}^{p-n}\lambda_{j}+\sum_{j=1}^{n}\lambda_{p-n+j}\tau^{% 4}/(d_{j}^{2}+\tau^{2})^{2})$
	$\displaystyle\leq\operatorname{Tr}(\Sigma^{-1/2}\bm{\beta}\bm{\beta}^{\top}% \Sigma^{-1/2})\lambda_{1}(p-n+\sum_{j=1}^{n}\tau^{4}/(d_{j}^{2}+\tau^{2})^{2})$	(30)

Where we used Von Neumann’s trace inequality (Mirsky [1975]) in the penultimate line, on the matrices $(I-\Pi_{\tau})^{2}$ and $\Sigma$ . So provided $\tau^{2}$ is small compared to the $d_{j}^{2}$ , $MSE_{1}$ decreases approximately linearly in $n$ , as $n$ increases to $p$ .

For the variance term $MSE_{2}$ , we once again use the singular value decomposition $X\Sigma^{1/2}=UDV^{\top}$ :

	$\displaystyle MSE_{2}$	$\displaystyle=\operatorname{Tr}(\mathbb{E}(\bm{\epsilon}^{\top}(\Sigma X^{\top% }(X\Sigma X^{\top}+\tau^{2}I)^{-1})^{\top}\Sigma X^{\top}(X\Sigma X^{\top}+% \tau^{2}I)^{-1}\bm{\epsilon}))$
		$\displaystyle=\operatorname{Tr}(\mathbb{E}(\bm{\epsilon}\bm{\epsilon}^{\top}(% \Sigma X^{\top}(X\Sigma X^{\top}+\tau^{2}I)^{-1})^{\top}\Sigma X^{\top}(X% \Sigma X^{\top}+\tau^{2}I)^{-1}))$
		$\displaystyle=\tau^{2}\operatorname{Tr}((X\Sigma X^{\top}+\tau^{2}I)^{-2}X% \Sigma^{2}X^{\top})$
		$\displaystyle=\tau^{2}\operatorname{Tr}((UDD^{\top}U^{\top}+\tau^{2}I)^{-2}UDV% ^{\top}\Sigma VD^{\top}U^{\top})$
		$\displaystyle=\tau^{2}\operatorname{Tr}(U^{\top}(UDD^{\top}U^{\top}+\tau^{2}I)% ^{-1}UU^{\top}(UDD^{\top}U^{\top}+\tau^{2}I)^{-1}UDV^{\top}\Sigma VD^{\top})$
		$\displaystyle=\tau^{2}\operatorname{Tr}(VD^{\top}(DD^{\top}+\tau^{2}I)^{-2}DV^% {\top}\Sigma)$
		$\displaystyle\leq\tau^{2}\sum_{j=1}^{n}\lambda_{1}d_{j}^{2}/(d_{j}^{2}+\tau^{2% })^{2}$

Where the last line again follows from Von Neumann’s trace inequality, on the matrices $VD^{\top}(DD^{\top}+\tau^{2}I)^{-2}DV^{\top}$ and $\Sigma$ .

$\blacksquare$

A.9 Discussion of model parametrisation

In this section we discuss some additional limitations with the parametrisation of Gottschalk et al. [2010] which led us to use a mass based parametrisation. These limitations do not apply in all MFA systems but could potentially be problematic in some. In order to discuss them we provide a summary of the parametrisation: let $U_{i,j}$ denote the value of the flow from process $i$ to process $j$ . Let $z_{i}=\sum_{k}U_{i,k}$ denote the total outflow of process $i$ , and $\phi_{i,j}=U_{i,j}/z_{i}$ be the transfer coefficient for the flow from process $i$ to process $j$ , equal to the fraction of the total outflow $z_{i}$ of process $i$ . In addition, an optional external flow $q_{i}$ is allowed for each process $i$ . Applying conservation of mass at process $i$ , we obtain:

q_{i}+\sum_{j}U_{j,i}=\sum_{k}U_{i,k}

(31)

which can be rewritten as

q_{i}+\sum_{j}z_{j}\phi_{j,i}=z_{i}

(32)

the conservation of mass equations 32 can therefore be written in the matrix form

(I-\Phi^{\top})\bm{z}=\bm{q}

(33)

where $\Phi$ is a matrix with elements $\phi_{i,j}$ . Lupton and Allwood [2018] and Gottschalk et al. [2010] then notes that this equation can be inverted to give $(I-\Phi^{\top})^{-1}\bm{q}=\bm{z}$ , the vector of the total outflow of every process, which can then be used to retrieve each individual flow variables $U_{i,j}$ via the relationship $U_{i,j}=\phi_{i,j}z_{i}$ . Using this, Lupton and Allwood [2018] and Gottschalk et al. [2010] assigns priors on the transfer coefficients $\phi_{i,j}$ and the external flow variables $q_{i}$ , which gives an implicit prior over the flow variables $U_{i,j}$ .

The first limitation is that it is unclear if $I-\Phi^{\top}$ is always invertible. From $\phi_{i,j}=U_{i,j}/\sum_{k}U_{i,k}$ we must have $\sum_{k}\phi_{i,k}=1$ , which means the matrix $\Phi$ has each row summing to $1$ (making it a stochastic matrix), which implies it must have eigenvalue $1$ (this can be easily shown by checking the vector with entries all equal to $1$ is an eigenvector). $I-\Phi^{\top}$ therefore must have $0$ as an eigenvalue and is not invertible.

It appears the way this issue is currently handled is to have at least one process in the system that has no outflows (see for example Figure 1 of Dong et al. [2023], which has processes 6,7,8 and 9 without outflows), essentially fixing one or more rows of $\Phi$ to equal to the $\bm{0}$ vector so that it is no longer a stochastic matrix. However, it is unclear if a process that has no outflows always exists (for example Figure 2 of Mannan and Al-Ghamdi [2020]) and such processes must be interpretable as not mass balanced (e.g. a stock or outflow of some kind).

Additionally, if $\bm{q}$ is no longer nonnegative (e.g. if a stock change, external outflow, or mass balance relaxation term is added to the right hand side of 33 ) and assuming $I-\Phi^{\top}$ is invertible, it will cause the total flow output variables $\bm{z}=(I-\Phi^{\top})^{-1}\bm{q}$ and by extension the flow variables $U_{i,j}$ to no longer be nonnegative. However, flow variables should be nonnegative to be physically meaningful. To see this, note that $(I-\Phi^{\top})^{-1}$ has Taylor expansion $\sum_{m=0}^{\infty}(\Phi^{\top})^{m}$ , which is a limit of a sum of matrices with nonnegative entries, so the entries of $(I-\Phi^{\top})^{-1}$ must all be nonnegative. Therefore in general $\bm{z}=(I-\Phi^{\top})^{-1}\bm{q}$ is nonnegative if and only if $\bm{q}$ is nonnegative. This leaves how to incorporate mass balance relaxations (in order to take into account epistemic uncertainty) and other nonnegative terms under Gottschalk et al. [2010]’s parametrisation a possible direction for future work.