Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

Localization of Defects Via Residence Time Measures

2022, SIAM Journal on Applied Mathematics

Localization of defects via residence time measures A. Ciallella E mail: alessandro.ciallella@univaq.it arXiv:2011.08624v1 [cond-mat.stat-mech] 17 Nov 2020 Dipartimento di Ingegneria Civile, Edile – Architettura e Ambientale, Centro di Ricerca M&MoCS Università degli Studi dell’Aquila, via Giovanni Gronchi 18, 67100 L’Aquila, Italy. E.N.M. Cirillo E mail: emilio.cirillo@uniroma.it Dipartimento di Scienze di Base e Applicate per l’Ingegneria, Sapienza Università di Roma, via A. Scarpa 16, 00161, Roma, Italy. B. Vantaggi E mail: barbara.vantaggi@uniroma1.it Dipartimento Metodi e Modelli per l’Economia, il Territorio e la Finanza, Sapienza Università di Roma, via Del Castro Laurenziano 9, 00161, Roma, Italy. Abstract. We show that residence time measure can be used to identify the geometrical and transmission properties of a defect along a path. The model we study is based on a one–dimensional simple random walk. The sites of the lattice are regular, i.e., the jumping probabilities are the same in each site, except for a site, called defect, where the jumping probabilities are different. At each side of the lattice an absorbing site is present. We show that by measuring the fraction of particles crossing the channel and/or the typical time they need to cross it, it is possible to identify the main features of the lattice and of the defect site, namely, the jumping probabilities at regular and at the defect sites and the position of the defect in the lattice. Keywords: residence time, random walk, defect localization 1. Introduction The effect of obstacles in the transport of moving agents has been widely studied in different contexts [1]. Macromolecules, playing the role of obstacles for the diffusing smaller molecules [2–5], tend to slow down the dynamics, while in granular systems [6–9] and in the framework of pedestrian dynamics [10–15], the correct positioning of physical obstacles can accelerate the exit of the agents from a bounded region hindering clogging patterns [16–19]. ccv-defect˙v˙nov.tex – 18 novembre 2020 1 1:49 Geometrical and dynamical properties of obstacles, such as position, shape, and attitude to transfer mass, influence the transport properties of the whole system. We refer the reader to some old and more recent studies where the effect of obstacles has been studied with different techniques and for many different stochastic dynamics, such as the simple exclusion 1D model [20], a Probabilistic Cellular Automata [21] the 1D and 2D symmetric and asymmetric random walk [22, 23], the linear Boltzmann dynamics [24], the zero range process [25–27], and the simple exclusion 2D process [28, 29]. As it is clearly demonstrated by those studies, in the applications it can be of the outmost importance the knowledge of the geometrical and transport properties of obstacles which limit and affect the dynamics of the agents in a region of space, since the currents observed in the system strongly depend on them. On the other hand obstacles can be often not easily accessible to instruments for experimental measures: imagine an obstacle placed inside a biological channel or in a long pipe. Hence, it is rather natural to ask ourselves if it is possible to deduce the properties of the obstacle measuring observable quantities connected to the motion of agents. In this paper we shall consider a model of particles flowing on a lane and the observables that will be used to deduce the properties of the lane and of the obstacle are the fraction of particles crossing the lane and the mean time they take to cross it. We shall consider two different experiments. In experiment 1 we imagine to start a known number of particles at one end point of the lane and compute the number of particles which will exit the lane through the second end point. Thus, in this experiment we have access to the fraction of particles that succeed to cross the lane. In experiment 2 we imagine to start, all at the same time, an unknown number of particles at one end point of the lane and to measure the time that the particles that succeed to cross the lane take to exit it through the second end point. Thus, in this experiment we are not able to measure the fraction of particles that actually cross the lane, but we have access to the mean value of the time that the particles that cross the lane need to cross it. The typical time needed for a particle to cross the channel will be called residence time. We mention that the idea of residence time as average crossing time conditioned to crossing a region was firstly introduced in [28, 29] for a 2D simple exclusion walk: a thorough study of its properties in absence of obstacle is reported in [29], whereas the presence of obstacles was considered in [28]. We notice that in the 1D version of the simple exclusion model with Langmuir kinetics the idea of residence time was exploited in [30]. The problem introduced above is faced in this paper via a one–dimensional model based on a simple random walk. At each side of the lattice an absorbing site is present. We deal with the problem of identify the features of the lattice, once we measured some observable ccv-defect˙v˙nov.tex – 18 novembre 2020 2 1:49 such as the fraction of particles reaching the right absorbing site, and the average time needed for a walker to reach it. More precisely, the model we consider in this paper is a one–dimensional simple random walk on {0, 1, . . . , L} possibly with a defect site. The sites 0 and L ≥ 3 are absorbing, so that when the particle reaches one of these two sites the walk is stopped. All the sites 1, . . . , L − 1 are regular excepted for one site called defect [22, 23]. The defect site is the site d. At each unit of time the walker jumps to a neighboring site according to the following rule: if it is on a regular site, then it jumps to the right with probability p or to the left with probability q = 1 − p, so that it cannot stay still at the site. If it is at the defect site it jumps to the right with probability p0 or to the left with probability q 0 = 1 − p0 . The array 1, . . . , L − 1 will be called the lane. The sites 0 and L will be, respectively, called the left and right exit of the lane. In this framework the residence time is defined by starting the walk at site 1 and computing the typical time that the particle takes to reach the site L provided the walker reaches L before 0. More precisely, we let xt be the position of the walker at time t and denote by P and E the probability associated to the process and the related expected operator for the walk started at x0 = 1. We let τ be the first hitting time to L or crossing time, namely, τ = inf{t > 0 : xt = L} , (1.1) with the convention that τ = ∞ if the trajectory reaches 0 before reaching L. Note that the trajectories cannot be trapped for an infinite time in the lane because L is finite. Note also that, since the particle starts at site 1, the values that τ can assume are of the form L + 2k − 1, for any integer k ≥ 0. We thus define the residence time as R = E(τ |R.E.) , (1.2) where we conditioned to the event R.E. meaning that the particle exits the lane through the right exit in L. In absence of defect, namely, the sole parameters that one wants to deduce is the jumping probability p which accounts for the transport properties of the lane. In case of experiment 1, namely, when the fraction of particles crossing the lane is available, a standard maximum likelihood argument provides a correct estimate for the parameter, which is fully identified. On the other hand, in case of experiment 2, again an estimate is provided by a maximum likelihood argument, but a complete identification of the parameter p is lacking since the residence time is symmetric when p is exchanged with 1 − p. In presence of defects the situation is much more complicated, since one wants to deduce the three parameters p, d, and p0 , which respectively account for the transport properties of ccv-defect˙v˙nov.tex – 18 novembre 2020 3 1:49 the lane (p), the geometry of the defect (its position d), and the transmission property of the obstacle (p0 ). In our discussion we shall assume p to be known and we shall show that both in case of experiment 1 and 2 it will not possible to estimate the values of d and p0 , but it will possible to find an equation connecting the two. In other words, in the parameter space d–p0 , a level curve on which the pair (d, p0 ) has to lay will be identified. If the two experiments, 1 and 2, are performed simultaneously, namely, provided the fraction of particles crossing the lane and the residence time are both measured, then in many case the identifiability problem will be fully solved and an estimate for the parameters d and p0 will be found. The paper is organized as follows: in Section 2 we consider the case in which the lane does not present any defect. The presence of a defect site is then discussed in Section 3. Finally, the Section 4 is devoted to some brief conclusions. 2. Simple random walk: no defect We first consider the simple random walk case, namely, here we assume that p0 = p. For a given p, considering a single particle, the joint probability of the event that it exits through the right side starting from site 1 and the first hitting time to L is n = L + 2k − 1 is P(τ = n, R.E.) = αn p L+n−1 2 q n−(L−1) 2 = αL+2k−1 pL+k−1 q k , (2.3) where αn is the number of different possible paths that starts in 1 and exits the lane reaching the site L for the first time at time n, without hitting the site 0. Moreover, in each of these paths the walker performs (n − (L − 1))/2 steps to the left and (n − (L − 1))/2 + L − 1 steps to the right. We stress that αn is smaller that the total number of paths made of (n − (L − 1))/2 steps to the left and (n − (L − 1))/2 + L − 1 steps to the right, since the constraint that the particles reaches L before touching 0 must be taken into account. It is well known, see, e.g., [22, 31–33], that the probability of the event R.E. is given by the classic gambler’s ruin probability P(R.E.) = 1 − (q/p) 1 if p 6= q and P(R.E..) = if p = q. L 1 − (q/p) L (2.4) Thus, for p 6= q P(τ = n|R.E.) = αn p L−1 L X −1 q L − pL q = αn p k q k = α n pk q k pj q L−1−j , (q/p) − 1 q−p j=0 L−1+k k (q/p) (2.5) where, we recall, the time n is in the form L + 2k − 1. Note the last expression in (2.5) is valid also for p = q, indeed P(τ = n|R.E.) = αn pL+2k−1 L. ccv-defect˙v˙nov.tex – 18 novembre 2020 4 1:49 Now, we consider M independent identically distributed walkers starting at 1 and denote by τi the crossing time of the particle i, for i = 1, . . . , M . The probability that the M particles exit the lane through the right exit with crossing times ni = L + 2ki − 1 is P(τ1 = n1 , τ2 = n2 , . . . , τM = nM , R.E.1 , R.E.2 , . . . , R.E.M ) = M Y αni pL+ki −1 q ki , (2.6) i=1 where with R.E.i we denote the event that the i particle exits the lane through the right exit. Using, as above, the gambler’s ruin result, we get P(τ1 = n1 , τ2 = n2 , . . . , τM = nM |R.E.1 , R.E.2 , . . . , R.E.M ) = ! !M  M  M L−1 L L Y Y X PM PM q − p = αni pki q ki = αni p i=1 ki q i=1 ki pj q L−1−j q − p i=1 i=1 j=0 (2.7) We remark that conditioning to exiting through the right side do not influence the independence of the observed crossing times for given p. Consider, for simplicity, the case M = 2: P(τ1 = n1 , R.E.1 , τ2 = n2 , R.E.2 ) (2.8) P(τ1 = n1 , τ2 = n2 |R.E.1 , R.E.2 ) = P(R.E.1 , R.E.2 ) P(τ1 = n1 , R.E.1 ) P(τ2 = n2 , R.E.2 ) P(τ1 = n1 |R.E.1 )P(τ2 = n2 |R.E.2 ) = · . (2.9) P(R.E.1 ) P(R.E.2 ) Since the crossing times are independent and identically distributed random variables it follows immediately that the left–hand sides of (2.8)–(2.9) are equal. As we discussed in the Introduction, the problem we address in this Section is the estimate of the parameter p: we assume to perform an experiment and after we measure some observables we want to trace back the value of p. We discuss, now, two different experiments: in the first case we measure the fraction of particles which succeed to cross the lane, namely, the fraction of particles which exit the lane through the right exit in L. In the second experiment we shall assume to measure only the average time spent by the particles in the lane before exiting through the right end without having any information on the fraction of successful particles. In these experiments the role of identifiability of models is discussed. In the following a statistical model is said to be identifiable whenever the likelihood has no flat region, so it is theoretically possible to estimate the underlying parameters. This means that different values of the parameters must generate a different value for the likelihood over the observable variables, i.e. L(θ, (x)) 6= L(θ0 , (x)) for any θ 6= θ0 . For non-identifiable models: two or more values of the parameters are observationally equivalent. In these cases, it is relevant to determine the non-identifiable regions. Furthermore a ccv-defect˙v˙nov.tex – 18 novembre 2020 5 1:49 model is said locally identifiable whenever for any θ in the parameter space Θ there exists an open neighborhood Nθ of θ in Θ such that for any θ0 ∈ Nθ it holds L(θ, (x)) 6= L(θ0 , (x)) for any θ 6= θ0 . 2.1. Experiment 1: measuring the fraction of successful particles The knowledge of the fraction of the number of particles which exit through the right side is sufficient to find a maximum likelihood estimate of p using the probability of success in the gambler’s ruin problem (2.4). This probability as a function of p is strictly monotonic, hence, if we prepare N particles at site 1 and denote by m the number of those particles that exits the lane through the right exit in L, the equation in the unknown p m = P(R.E.) N (2.10) has an unique solution p̂m . Note that m/N is the maximum likelihood estimate of P(R.E.). Indeed, the random variables χi taking value 1 if the particle i reaches the right side of the interval and 0 otherwise are i.i.d. Bernoulli random variables, so that we recall here that in P this case the maximum likelihood estimator N i=1 χi /N is not biased. The classical computation follows for completeness: we can estimate p by finding the   N arg max (P(R.E.))m (1 − P(R.E.))N −m . p∈[0,1] m By considering the log–likelihood we write   N `(p) = ln + m ln P(R.E.) + (N − m) ln(1 − P(R.E.)) . m (2.11) To search for its critical point we derive with respect to p: d 1 d 1 d `(P(R.E.)) = m P(R.E.) − (N − m) P(R.E.) = dp P(R.E.) dp 1 − P(R.E.) dp m N −m d P(R.E.)[ − ]=0 = dp P(R.E.) 1 − P(R.E.) (2.12) Since dP(R.E.)/dp is strictly positive, the former has a unique solution P(R.E.) = m/N , as we wrote in (2.10). 2.2. Experiment 2: measuring the crossing time In case the fraction of particles crossing the lattice is not know, it is possible to setup an estimate of p by measuring the average crossing time. Even in this case we will be able to perform a maximum likelihood estimate. Suppose we measure the crossing time ni of m particles, for i = 1, . . . , m, we evaluate the parameter p by comparing the experimental mean ccv-defect˙v˙nov.tex – 18 novembre 2020 6 1:49  Pm crossing time n /m to the theoretical residence time R. The equation providing our i i=1 estimate for p will be found via a maximum likelihood argument. We note that the residence time presents a symmetry in the role of p and q = 1 − p that suggests the presence of identifiability problems. That is to say, if the residence time of walkers is the sole information we have access to, it is not possible to uniquely deduce the jumping parameter p that characterize the walk. Indeed, looking at (2.7), we notice that the crossing time observed for particles that eventually exit through the site L starting from site 1, is symmetric1 in p ∈ [0, 1] with respect to 1/2 (recall that q = 1 − p). The existence of this symmetry is also evident if one looks at the dependence of the residence time on p, see, e.g., Fig. 2.1. 3500 3000 2500 R 2000 1500 1000 500 0 0.0 0.2 0.4 0.6 0.8 1.0 p Figure 2.1: Residence time R as a function of p ∈ [0, 1] for L = 100 in absence of defect. In other words, looking at the symmetry residence time, we can expect that the experimental information on the crossing time won’t be sufficient to distinguish different estimates of p. More precisely, we find two possible estimates p̄ and 1 − p̄ for the parameter p. These two solutions will reduce to a single one in the case in which p̄ = 1 − p̄ = 1/2. This will be, indeed, the sole case in which we will have complete identifiability for the parameter p. Note, however, that, if the value of p is not very close to 1/2, a partial information on the fraction of particles exiting from the right side (e.g., the knowledge of the order of magnitude of the number of particles started from 1) is sufficient to distinguish between the two cases and identify the correct estimate p̄ or 1 − p̄. To construct the estimate of p based on the residence time measure we, now, setup a maximum likelihood argument: we focus on m particles that we know to have reached the site L and observe their crossing times ni . We know, from (2.5) and (2.7) the probability of observing τ1 = n1 , τ2 = n2 ,. . ., τm = nm conditioned to R.E.i , for i = 1, . . . , m. Maximum 1 We refer the interested reader to [33] and [34] where other classical results from Stern and Samuels for walks starting from the middle point are discussed. ccv-defect˙v˙nov.tex – 18 novembre 2020 7 1:49 likelihood gives the estimation pm of p as pm = arg max m Y p∈[0,1] = arg max ! αni p P p P i ki P q i ki i=1 m Y p∈[0,1] ! αni i=1 i ki P q i ki  qL − pL q−p L−1 X m (2.13) !m pj qL−1−j , j=0 where ni = L + 2ki − 1. Notice that the last expression shows that the function is symmetric in the exchange p into 1 − p. The log–likelihood is `(p) = m X ln αni + m X i=1 ki ln p + i=1 m X  ki ln(1 − p) + m ln i=1 (1 − p)L − pL 1 − 2p  (2.14) and, looking for the arg max of `(p), we find the critical points of `(p) as the solutions of the equation ! m X ki 1 − 2p 2 (1 − p)L−1 + pL−1 = + L . (2.15) 2 L − pL m p − p 2p − 1 (1 − p) i=1 To discuss the structure of the solutions of (2.15), due to the invariance under the exchange p in 1 − p, we restrict the discussion to the interval [1/2, 1] and denote by f (p) and g(p) its left–hand and right–hand sides. It is possible to check that the following statements hold: – lim f (p) = 0, lim− f (p) = −∞, lim g(p) = 0, lim− g(p) = −L + 2; p→1/2+ – lim f 0 (p) = −8 p→1/2+ p→1/2+ p→1 m X ki i=1 m p→1 , lim g 0 (p) = −8/3 + 4L − 4L2 /3; p→1/2+ – f (p) is a concave monotonically decreasing function in (1/2, 1). Moreover, we checekd numerically that there exists p∗ ∈ (1/2, 1] that depends on L such that g(p) is convex and strictly monotonically decreasing in (1/2, p∗ ), while it is monotonically P ki 2 increasing in (p∗ , 1). Hence, if −8 m i=1 m ≤ −8/3 + 4L − 4L /3 the sole solution of (2.15) is p = 1/2, and it is a maximum point. Otherwise, there exists one more solution of (2.15) in (1/2, 1). It is easy to see that such a solution is a maximum point, while 1/2 becomes in this case a minimum point. P ki 2 The above remarks yield the following conclusions: If −8 m i=1 m ≤ −8/3+4L−4L /3 the model is globally identifiable and the maximum likelihood estimate is p̄m = 1/2. Otherwise, the model is locally (not globally) identifiable and there exists two maximum likelihood estimates p̄m and 1 − p̄m . ccv-defect˙v˙nov.tex – 18 novembre 2020 8 1:49 4000 150 200 5000 100 2000 100 50 0 0 0 0 -50 -2000 -100 -100 -5000 -150 -4000 0.2 0.3 0.4 0.5 0.6 0.7 0.8 p -200 0.2 0.3 0.4 0.5 0.6 0.7 0.8 p 0.48 0.49 0.50 0.51 0.52 p 0.48 0.49 0.50 0.51 0.52 p Figure 2.2: Graphical solution of equation (2.15) for L = 100: the orange and the blue curves (color online) are, respectively, the plots of the right and left hand sides of the equation. The second and the fourth panel are a magnification of the first and the third, respectively. P The experimental measure i ki /m is equal to 895 in the first two panels and 1623 in the last two. The experimental data used in the first two plot have been obtained by simulating the process with p = 0.48, whereas for the last two plots the simulation has been run with p = 0.5. The two curves intersect each others at 0.4802, 0.5 and 0.5198 in the first two plots and at 0.5 in the last two. We conclude this section testing the above procedure performing a numerical experiment. We first run the model with L = 100 and p = 0.48. We find 1889.17 as numerical estimate P for the residence time, so that the experimental value of the quantity m i=1 ki /m appearing in (2.8) is 895.085. The graphical solution of (2.15) is depicted in the two leftmost panels of Fig. 2.2. The two curves intersect in 1/2 and in the two symmetric points 0.4802 and 0.5198. These two last values are the maximum likelihood estimate for p and, as we discussed above, the information provided by this experiment is not sufficient to identify correctly the parameter. On the other hand, by running the simulation with p = 1/2, we found 3344.65 as numerical estimate for the residence time, so that the experimental value of the quantity Pm i=1 ki /m appearing in (2.8) is 1622.83. The graphical solution of (2.15) is, now, depicted in the two rightmost panels of Fig. 2.2. The two curves intersect only in p = 1/2 which is the maximum likelihood estimate for p in this case. 3. Identification of a defect We consider now the case of presence of a defect in the lane. The defect is the site d, where the probabilities are p0 to jump to the right and q 0 = 1 − p0 to the left. We can have different situations: p, d, p0 can be known or unknown. In this section2 we suppose that p is 2 In case we suppose that the geometric and transport properties of the defect which, in our model, are represented by d and p0 , are known, then we face the problem of estimating p. Suppose we know N and we measure the number m of crossing particles: since for fixed p0 and d the probability of reaching the site L is monotonically increasing as a function of p, as we did in the case of no defect (see eq.(2.10)), the maximum ccv-defect˙v˙nov.tex – 18 novembre 2020 9 1:49 known, namely, the transport properties of the lane are known, and we want to estimate the geometric and transport properties of the defect which, in our model, are represented by d and p0 . Note first of all that this case is more complex than the previous one without the defect site. The probability for a walker to jump out from the right–side exit in a time exactly n, starting from 1 is now P(τ = n, R.E.) = j k X X αi,j,d pL−1+k−i (p0 )i q k−(j−i) (q 0 )j−i , (3.16) j=1 i=1 where j counts the total number of times that the particles jumped on the defect d before reaching the site L, while i is the number of times that the particle starting from d jumped to d + 1. Moreover, αi,j,d is the total number of different paths that the particle can perform in the line for each choice of i, j, d. Calculation of αi,j,d is in principle possible but it is not easy and not necessary for our purpose. 3.1. Measuring the fraction of successful particle in presence of a defect It is possible to produce explicit formulas for the probability of exiting through the right end point even in presence of a defect (see [22]). Indeed, the probability to cross the lane is given by (3.17) and (3.18) and can be considered (for p fixed and known) as a function of p0 and d: p0 (p − q)q P(R.E.)(p0 , d) = , p 6= q; (3.17) p(−p0 q(−1 + (q/p)d ) + pq 0 (−(q/p)L + (q/p)d )) 1 , p = q. (3.18) d + (L − d)q 0 /p0 Using the expressions of the probability to reach the right side (3.17) and (3.18), the observation of how many particles reached the site L allows to estimate (p0 , d) through a maximum likelihood procedure. Indeed, if m of N independent particles started at 1 exit the lane through the right end point, similarly to what we did in absence of defect we can write   N 0 ˆ (p̂ , d) = arg 0 max (P(R.E.))m (1 − P(R.E.))N −m , (3.19) p ∈[0,1],d∈{2,...,L−2} m P(R.E.)(p0 , d) = likelihood estimation gives m/N = P(R.E.). Hence, solving such an equation we get an estimate for p and the parameter p is completely identifiable. On the other hand, in case N is not known, but it is possible to measure the mean crossing time we find a lack of complete identifiability. Indeed, as in the case of no defect, the identification of p as the most likely value producing the observed residence time in general do not allow to distinguish between two different possible values of p, since the residence time as a function of p is monotonically increasing up to a value p∗ and then it decreases (see, for instance, [22, right panel of Fig. 9]). The effect of the presence of the defect is that of modifying the numerical value of the measured residence time and to shift the value of p∗ , that in the case of no defect is 0.5. ccv-defect˙v˙nov.tex – 18 novembre 2020 10 1:49 under the assumption that the Bernoulli random variables taking values 1 if R.E. is true and zero otherwise are independent and identically distributed. We consider d to be a continuous variable, to simplify the evaluation, but we should remember that only discrete values d = {2, 3, . . . , L − 2} for the defect are significant in our problem. Considering the log–likelihood we write   N 0 `(d, p ) = ln + m ln P(R.E.) + (N − m) ln(1 − P(R.E.)) (3.20) m and we derive it with respect to p0 and d to search for critical points   ∂` = m ∂ P(R.E.) − N −m ∂ P(R.E.) =  m − N −m  ∂ P(R.E.) = 0 ∂p0 P(R.E.) ∂p0 1−P(R.E.) ∂p0 P(R.E.) 1−P(R.E.) ∂p0  ∂ m ∂ N −m ∂ m ∂`  = P(R.E.) − P(R.E.) = − N −m P(R.E.) = 0 . ∂d P(R.E.) ∂d 1−P(R.E.) ∂d P(R.E.) 1−P(R.E.) ∂d Hence, all the points (d, p0 ) that verify P(R.E.) = m N (3.21) are critical, where, we recall, P(R.E.) is given in (3.17) and (3.18). To prove that the solutions of (3.21) are the sole critical point for the maximum likelihood function, we show that ∂P(R.E.)/∂p0 and ∂P(R.E.)/∂d are different from zero. We first discuss in detail the case p = 1/2: from (3.18) we have ∂ (L − d)/p02 P(R.E.) =  2 ∂p0 d + (L − d)(1 − p0 )/p0 and ∂ 1 − (1 − p0 )/p0 P(R.E.) =  2 . ∂d d + (L − d)(1 − p0 )/p0 The quantity on the left is always positive for d < L and it is equal to 0 if and only if d = L, while the quantity on the right is positive for p0 > 1/2 and negative for p0 < 1/2, and it is equal to 0 if and only if p0 = 1/2, that is to say in absence of defect (p = q = p0 = 1/2). The case p 6= q can be discussed analogously and, again, we find that the partial derivative with respect to p0 is equal to zero if and only if d = L, while the partial derivative with respect to d is equal to zero if and only if p0 = p, that is to say in the case where no defect is present. Given m and N , the solutions of the equation (3.21) determine a curve in the plane p0 –d. These curves are depicted in Fig. 3.3 for different values of p. The contour plots in Fig. 3.3 show that, given m and N , the geometric locus made of the solutions of (3.21) provides d as a function of p0 which is decreasing if p0 < p, increasing for p0 > p, and constant for p0 = p. Note that equation (3.21) is easily explicitly solvable, finding p0 as a function of d and taking into account only the values of d that produce p0 ∈ [0, 1]. Summarizing, dealing with the experiment 1 we are not able to fully estimate the transport and geometric properties of the obstacle, p0 and d. Indeed, we are able to find only a ccv-defect˙v˙nov.tex – 18 novembre 2020 11 1:49 d 60 40 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 100 80 d 60 40 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 0.00400 0.00300 0.00200 0.00100 0.00076 0.00050 0.00025 0.00010 0.14000 0.13000 0.12000 0.11500 0.11321 0.11000 0.10000 0.09000 80 60 d 80 100 40 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 100 80 60 d 100 40 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 0.030 0.026 0.022 0.018 0.014 0.010 0.006 0.002 0.18400 0.18300 0.18200 0.18182 0.18100 0.18000 0.17900 0.17800 Figure 3.3: Contour plot of P(R.E.) in the plane p0 –d for L = 100. Top left p = 0.49, top right p = 0.5, bottom left p = 0.53, bottom right p = 0.55. relation between the two parameters, that is to say we can determine a curve in the plane p0 –d where the parameters have to lay. Hence, the model is not anymore locally identifiable. 3.2. Residence time in presence of a defect In presence of a defect we have just shown that experiment 1, namely, the measure of the fraction of particle crossing the lane, is not sufficient to achieve an estimate for the model parameters p0 and d. In this section we approach the problem also from the point of view of the residence time estimate: we shall see that using both the fraction and the residence time measure, in some cases, it will be possible to identify completely the two parameters. Due to the complicated structure of the problem, we do not rely on a pure maximum likelihood estimate, but we simply identify the theoretical residence time with the experimental mean crossing time by referring the method of moments. In order to derive a theoretical expression for the residence time we follow the strategy ccv-defect˙v˙nov.tex – 18 novembre 2020 12 1:49 outlined in [22, 31–33] and based on a generating function computation. We let un be the probability that the walk started at 1 exits through the right end after n steps. We construct the generating function of the probability of exiting from the right side starting P from 1, U (s) = n≥0 un sn . The generating function can be explicitly calculated but has not a simple and concise expression; we report the computation in the Appendix A for completeness, see also [22]. Now, we note that P(R.E.) = U (1). Since the derivative of the generating function is P 0 U (s) = n≥1 nun sn−1 , following [32] we have that X X X lim− U 0 (s) = nun = nP(τ = n, R.E.) = nP(τ = n|R.E.)P(R.E.) = R P(R.E.), s→1 n≥1 n≥1 n≥1 (3.22) where R is the residence time as defined in (1.2). We note that R is finite for any fixed p, q, p0 , and q 0 in (0, 1). By deriving two times U (s) we find the following expression, that is useful to calculate the variance of the residence time: " # X X X X lim U 00 (s) = n(n − 1)un = n2 un − nun = P(R.E.) n2 P(τ = n|R.E.) − R s→1− n≥2 n≥1 n≥1 n≥1   = E(τ 2 |R.E.) − R P(R.E.) . Thus, it is possible to calculate the variance of the residence time as Var(τ |R.E.) = E(τ 2 |R.E.) − [E(τ |R.E.)]2 = 1 [U 00 (1) + U 0 (1)] − R2 . P(R.E.) The residence time as a function of p0 and d can be computed using the equations shown above. Data for L = 100 are reported in Fig. 3.4 as a scatter plot. We first notice that the picture in the case p = 0.49 is specular to that one for p = 0.51, and this is always true for any choice of p and 1 − p. In the symmetric case p = 1/2 the residence time has a peculiar behavior: for p0 = p = 1/2 (no real defect is indeed present) and in the case d = L/2 the residence time in presence of the defect is not influenced by the defect, it is the same as the symmetric case with no defect, see the yellow region in the plot (panel top right). These two straight lines divide the rectangle [0, 1] × Z ∩ {2, 3, . . . , L − 2} into four zones symmetric with respect to the center: two specular regions where the residence time is smaller with respect to the case of no defect (blue zone in the plot) and two specular regions where the presence of the defect produce a larger residence time (from red to green in the plot). In particular, a defect with p0 < p = 1/2 close to the entrance favors a faster egress of (a smaller number of) particles, as well as the presence of a defect with p0 > p = 1/2 close to the exit. The defects close to the ccv-defect˙v˙nov.tex – 18 novembre 2020 13 1:49 25 0.5 25 0.5 3000 0.25 0.5 0.75 p' 1 16000 14000 residence time d 50 4000 0 22000 20000 18000 16000 14000 12000 10000 8000 6000 4000 2000 75 5000 2000 0.75 p' 0.25 50 25 75 12000 10000 d 0.25 6000 residence time 50 7000 75 residence time residence time d 75 8000 d 22000 20000 18000 16000 14000 12000 10000 8000 6000 4000 2000 50 8000 6000 4000 25 2000 0 0.75 0.25 p' 0.5 0.75 p' Figure 3.4: Contour plot of the residence time R for L = 100, p = 0.49 (top left), p = 0.50 (top right), p = 0.51 (bottom left), p = 0.53 (bottom right). exit with p0 < p = 1/2 tend to trap particles that reached the second half of the interval, as well as defects with p0 > p = 1/2 close to the entrance that tend to favor a long permanence of the walkers in the first half of the interval. We remark that every observation about the residence time takes into account only the particles that eventually reach the right exit site L, ignoring those that come out of the interval at the 0 site. In the case p > 1/2, the behavior is a perturbation of the case p = 1/2: the four zones are still present if p is close to 1/2, but the two regions in the top of the plot tend to invade the whole rectangle [0, 1] × Z ∩ {2, 3, . . . , L − 2} when p is increased. It is still present a cross made of points (yellow in the plots in Fig. 3.4) where the residence time is the same as in the case of no defect, the vertical arm of the cross is at p0 = p, while the horizontal one moves downward when p is increased (in the plot is visible at about d = 27 for p = 0.51, at about d = 10 for p = 0.53). The regions below the horizontal arm are not the specularly ccv-defect˙v˙nov.tex – 18 novembre 2020 14 1:49 symmetric of the top regions, indeed, the values of the residence time vary more consistently in the top regions. When p is close to 1 in the residence time plot only two regions are left: for p0 < p the residence time is larger than in the no defect case, while for p0 > p it is the opposite. The case p < 1/2 is specular to the case p > 1/2. 3.3. Numerical Experiments We consider, now, a numerical experiment in which we melt both experiments 1 and 2, namely, considered N particles started at site 1 of the lane, we count the number of particles m which exit the lane through the right end and we measure the crossing time ni that each of them takes to reach the exit. Hence, we get both a numerical measure of the crossing P fraction m/N and of the mean crossing time m i=1 ni /m. The question we pose is the following: supposing to know the transport properties of the lane expressed by the parameter p, can we estimate the geometry and the transport property of the obstacle represented respectively by d and p0 ? Recalling the definition of the i.i.d. Bernoulli random variables χi introduced in Section 2.1, we note that m is the value assumed in the experiment by the random variable (χ1 + . . . + χN )/N and remark that, by the law of large numbers, such a variable converges to P(R.E.) as N → ∞. We now estimate the rate of convergence: for any given ε > 0, thanks to the Central Limit Theorem, we have that √  (χ + . . . + χ )    Nε 1 N P − P(R.E.)) ≤ ε ' 2Φ −1 (3.23) N P(R.E.)(1 − P(R.E.)) √ where Φ is the cumulative Gaussian distribution. Given ε > 0, for N ε/[P(R.E.)(1 − P(R.E.))] ≥ 3 we can estimate this probability to be very close to 1 (about 0.9974). Note that we do not know the correct value of P(R.E.) since we do not know (p0 , d) that identify the defect. By equation (3.21) the probability P(R.E.) is estimated by m/N . Considering the unbiased sample variance estimator of the i.i.d. χi ! N  N 2  m 2 X X 1 m N 1 ŝ2 = χi − = χ2i − , N − 1 i=1 N N − 1 N i=1 N we estimate ŝ2 as   m N −m N m m2 − 2 = . N −1 N N N −1 N The standard error of the mean is now computed as the observed realization of the sample standard deviation divided by the square root of the sample size: r ŝ m N −m σ̂χ = √ = . N − 1 N2 N ccv-defect˙v˙nov.tex – 18 novembre 2020 15 1:49 label p p0 d N m rex 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 0.51 0.51 0.51 0.51 0.51 0.51 0.51 0.51 0.53 0.53 0.53 0.53 0.53 0.53 0.55 0.55 0.55 0.130 0.130 0.065 0.130 0.750 0.800 0.800 0.800 0.050 0.800 0.250 0.250 0.250 0.800 0.250 0.100 0.700 19 8 92 70 10 85 85 85 43 50 75 75 75 15 78 43 25 1.0 · 108 2.0 · 108 1.0 · 106 2.0 · 107 2.0 · 107 1.0 · 106 1.0 · 107 1.0 · 108 5.0 · 106 1.0 · 107 1.0 · 107 5.0 · 107 2.0 · 108 1.0 · 107 1.0 · 107 5.0 · 106 5.0 · 106 1069462 1505795 36345 634906 1412009 40614 404194 4040196 507651 1134138 1132454 5660505 22631148 1285723 1817491 908079 914097 2441.266 2460.133 6366.541 5440.954 2930.654 2358.224 2349.55 2349.153 3751.445 1292.222 1701.426 1702.267 1701.527 1361.100 1030.056 1392.074 877.913 Table 1: List of experiments discussed in Section 3.3. A progressive number is associated to each experiment and the parameters p, p0 , d, and N are listed. In the last two columns we list the measured values for m and rex . Given an experimental observation of the fraction of the particles exiting the lane through L, we can consider the interval [m/N − 3σ̂χ , m/N + 3σ̂χ ] where P(R.E.) lies with very high probability. In the same way we study the standard error of the residence time, i.e., of the average crossing time, producing the confidence interval at level 0.9974. Then, we can find the regions of points of the plane p0 –d to which corresponds a residence time in that interval and plot those regions. Note that in this second case the particles producing the experimental estimate of the residence time, and so producing the corresponding confidence interval, are only those m which exit the lane through the right end point at L. We can thus conclude that, given the numerical measures fex = m/N for the fraction Pm of crossing particles and rex = i=1 ni /m for the mean crossing time, the estimate for the parameters d and p0 will be constructed by intersecting the regions of the plane p0 –d corresponding to the confidence intervals found for the crossing fraction and for the crossing ccv-defect˙v˙nov.tex – 18 novembre 2020 16 1:49 time. In the sequel of this section we discuss several experiments performed with different values of the jump probability p = 0.51, 0.53, 0.55 (see the list of the experiments in Table 1). d 60 40 20 80 0.01072548 0.01069462 0.01066376 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 100 80 2444.968 60 2441.266 d 80 100 40 2437.564 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 60 d 100 40 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' Figure 3.5: Experiment 1. Left: region associated to confidence interval for fex . Center: region associated to confidence interval for rex . Right: intersection of the two regions. Beside the real value of the obstacle parameters identified by the cyan point, a second intersection region around (0.1645, 11) is found (orange point). In Fig. 3.5 we show results for experiment 1: the regions associated to the confidence interval for fex and rex are shown together with their intersection. The confidence intervals for fex and rex are, respectively, [0.01066376, 0.01072548] and [2437.564, 2444.968]. The intersection is made by two small regions, from which we select as candidates only pairs (p0 , d) with integer d. The candidates we find are p0 ∈ [0.129445, 0.130208] for integer values of d = 19, and p0 ∈ [0.163927, 0.164804] for d = 11. For these values of the parameters, the residence time lies in the interval [2441.69, 2442.21] when p0 ∈ [0.129445, 0.130208] and d = 19 and it lies in the interval [2440.63, 2441.15] for p0 ∈ [0.163927, 0.164804] and d = 11. Thus, all these candidate pairs (p0 , d) give a residence time in the confidence interval for rex , so it is not possible to reject any of them. Hence, a complete identification is not possible. Nevertheless, it is clear that all candidate pairs (p0 , d) correspond to a defect close to the entrance which tends to obstruct the passage of walkers (p0 < p). No more pairs (p0 , d) with integer d lie on the intersection region. In Fig. 3.6 we show the results of experiment 2. The setting is similar to the experiment 1, but we are here able to identify the defect parameter d and restrict the possible values of p0 to a small interval. The regions associated to the confidence interval for fex and rex are shown together with a magnification of their intersection. The confidence intervals for fex and rex are, respectively, [0.00751064, 0.0754731] and [2456.74, 2465.526]. The intersection is made by two small regions. As we did in the case of experiment 1, we look for the pairs (p0 , d) with integer d which lie in this regions. The first region ccv-defect˙v˙nov.tex – 18 novembre 2020 17 1:49 100 d 60 40 20 80 0.00754731 0.00752898 60 d 80 25 40 0.00751064 20 2460.133 15 10 2456.740 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 2463.526 d 100 5 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 0.08 0.10 0.12 0.14 p' Figure 3.6: Experiment 2. Left: region associated to confidence interval for fex . Center: region associated to confidence interval for rex . Right: magnification of the intersection of the two regions. Points identify the two intersections: the first around the actual defect (in cyan) and a second intersection at about (0.0835, 21.5) (in orange). let us identify as possible parameters of the defect p0 ∈ [0.129606, 0.130195] for the integer value of d = 8. No more admissible pairs can be found. In fact, considering here the two closest integer values (d1 = 21 and d2 = 22) to points in the second region. Taking d = 21, one can verify that the values of p0 that give a crossing probability laying in the confidence interval for fex are p0 ∈ [0.0846366, 0.0850611] but the residence times associated to these pairs are in the interval [2449.79, 2450.06], out of the confidence interval for rex . In the same way, taking d = 22, the candidate pairs associated to the confidence interval for fex are p0 ∈ [0.0817259, 0.0821386] whose associated residence times lie in the interval [2472.9, 2473.15], out of the confidence interval for rex . Hence, we identify the defect to have p0 ∈ [0.129606, 0.130195] and d = 8. 100 40 20 80 0.0369064 0.0363450 0.0357836 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 80 6442.95 60 6366.54 40 6290.13 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 60 d d 60 100 d 80 100 40 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' Figure 3.7: Experiment 3. Left, center and right panel as in Fig. 3.5. The cyan point identifies the defect. ccv-defect˙v˙nov.tex – 18 novembre 2020 18 1:49 In experiments 3 and 4, a defect with p0 < p, close to the exit site, can be identified as shown in Figs. 3.7 and 3.8. The intersection produces a unique connected region whose dimension is related to the number of starting particles N . In experiment 3, see Fig. 3.7, the regions associated to the confidence interval for fex and rex are shown together with their intersection. The confidence intervals for fex and rex are, respectively, [0.0357836, 0.0369064] and [6290.13, 6442.95]. The number of starting particles is N = 106 , that is sufficient to produce an intersection region that clearly identifies some peculiarities of the defect (d is close to L, so the defect is close to the exit site, and p0 is small, so that the defect hinder the passage) but not to identify a single possible value of d. 100 100 80 0.0318629 d 0.0317453 40 0.0316277 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 60 5440.95 d 60 80 5456.87 40 60 d 80 100 5425.03 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 40 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' Figure 3.8: Experiment 4. See caption of Fig. 3.7. In the case of experiment 4, instead, the confidence intervals for fex and rex are, respectively, [0.0316277, 0.0318629] and [5425.03, 5456.87]. The large value of N = 2 · 107 allow to individuate a smaller intersection region, so that the defect is completely identifiable in this case. The only pairs with integer d laying in the intersection region are those with p0 ∈ [0.129184, 0.130592] and d = 70. Defects that favor the passage (p0 > p) and close to the entrance site 1 can produce a lack of complete identifiability as shown in Fig. 3.9. There, experiment 5 is depicted. The intersections between the two regions associated to the confidence interval for fex and rex produce two connected regions. The confidence intervals for fex and rex are, respectively, [0.0704286, 0.0707723] and [2926.287, 2935.022]. In both regions of intersection pairs with integer d are present, so that a complete identification is not possible. The estimated values of the defect parameters are p0 ∈ [0.749253, 0.751496] when d = 10, or p0 ∈ [0.986593, 0.992496] when d = 20. In Fig. 3.10 the results of experiments 6, 7 and 8 are presented. Here, the defect parameters are still the same in the three cases, but the different number of starting particles N allows to find different intersection regions. In this case, it is interesting to show that even ccv-defect˙v˙nov.tex – 18 novembre 2020 19 1:49 d 60 40 20 100 80 0.0707723 60 0.0706004 2930.654 40 0.0704286 2926.287 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 80 2935.022 d 80 100 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 60 d 100 40 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' Figure 3.9: Experiment 5. Left, center and right panel as in Fig. 3.5. Beside the real value of the obstacle parameters identified by the cyan point, a second intersection region around (0.99, 20) is found (orange point). 100 100 2377.54 60 2338.91 60 40 2343.463 40 0.0406140 0.0406062 20 0.0404194 0.0400218 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 60 2351.074 2349.153 2347.232 40 0.0412062 20 80 2349.550 d 80 d 2358.22 d 80 100 2355.637 0.04046103 20 0.0402326 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 0.04040196 0.04034289 Figure 3.10: From left to right: experiments 6 – 7 – 8 respectively. The cyan point identifies the actual defect, that is the same in the three experiments. The number of starting particles is different: N = 106 in experiment 6, N = 107 in experiment 7, N = 108 in experiment 8. for a relatively small number of starting particles (N = 106 , experiment 6) a unique connected region is found. Nevertheless, there are many different possible values of d. When N considered is larger, the intersection region dimension decreases, and the defect is identified to be in the last quarter of the lane, and to favor the passage of the particle (p0 > p), but still a large number N = 108 is not sufficient to identifies exactly the position of the defect. A further increasing of the number of starting particles N is needed to enhance the estimate. It is interesting to notice that in all the considered experiments the knowledge of the numbers of crossing particles n and starting particles N allows to deduce if p0 is larger or smaller than p, but on its own it does not give any clue where the defect is, except for the experiment 5. On the other hand, one can note that the only knowledge of the residence time ccv-defect˙v˙nov.tex – 18 novembre 2020 20 1:49 is generally not sufficient by itself to deduce if the defect is pushing forward or backward the walkers, since two unconnected regions with both possible behaviors are identified by the confidence interval found (see experiments 1–2 and 5–8). The lack of identifiability tends to disappear when p is larger, i.e. a single connected region is usually found as the intersection of the regions associated to the confidence interval for fex and rex . However, it is common that the intersection identifies a unique, connected but large region, so that many different values of d are possible. This happens due to the fact that in presence of a consistent drift (p far enough from 1/2) a single defect do not modify very consistently the probability to cross the interval if it is far from the entrance site 1 or if p0 is very close to p. Hence, this phenomenon was more rare in the case of p = 0.51 previously considered since the drift was sufficiently small, so that the region associated to the confidence interval of fex was usually quite small. The cases in which the defect is close to site 1 while p0 is not close to p are instead the cases in which it is easier to fully identify the defect. Note that in principle, increasing sufficiently the number of starting particle N , it is possible to let the intersection region become as small as wanted, but the required N can be very large. However, the larger the drift is, the more is common to deduce some information about the defect from the knowledge of the residence time alone. For instance, in many of the following experiments we can notice that the residence time allows to individuate if p0 is smaller or larger than p. Indeed, we recall that as shown in Fig. 3.4 the scatter plot of the residence time tends to reduce, when p move away from 1/2, to a first region where p0 < p and a second region (p0 > p), one of them in which the residence time is larger than the case of no defect, and the opposite in the other one. To clarify what we said we present now some experiments in which the different situations are encountered. Let us now consider the case of p = 0.53. 100 80 60 d 0.1015302 40 0.1011252 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 80 3763.11 80 60 3751.44 60 d 0.1019352 100 40 d 100 3739.78 20 40 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 0 0.0 0.2 0.4 0.6 0.8 1.0 p' Figure 3.11: Experiment 9. See caption of Fig. 3.7. ccv-defect˙v˙nov.tex – 18 novembre 2020 21 1:49 The experiment 9 is shown in Fig. 3.11. The actual defect is in the first half of the interval with a value of p0 that is very small. We are now in a case in which it is possible to fully identify the defect. The confidence intervals for fex and rex are, respectively, [0.1011252, 0.1019352] and [3739.78, 3763.11]. The defect is correctly identify to have p0 ∈ [0.049815, 0.050315] and d = 43, that are the pairs (p0 , d) with integer d that lay in the intersection region. 100 80 80 0.1137146 d 0.1134138 40 60 20 1292.222 40 0.1131130 60 40 1290.837 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 80 1293.608 d 60 100 d 100 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 0 0.0 0.2 0.4 0.6 0.8 1.0 p' Figure 3.12: Experiment 10. In Fig. 3.12 we plot the results of experiment 10. Here, as in the case of experiments 11–13, the complete identification is made difficult by the fact that the defect is far from the entrance site (the actual defect is in the middle of the lane), and the parameter p0 is not sufficiently high or low (p0 = 0.8 in the experiment). In this case the information that one can easily get is that p0 > 0.75 from the residence time, and that d > 40. Any refinement of this information would require a consistently larger number of starting particles N . 100 80 80 0.1135454 d 0.1132454 40 0.1129454 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 80 1703.673 60 1701.426 d 60 100 40 1699.179 20 60 d 100 40 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 0 0.0 0.2 0.4 0.6 0.8 1.0 p' Figure 3.13: Experiment 11. See caption of Fig. 3.7. In Figs. 3.13–3.14 we show that if the defect is not so influential on the probability to cross, the full identification of the defect will require very large N . The results of experiments ccv-defect˙v˙nov.tex – 18 novembre 2020 22 1:49 100 100 1703.673 80 80 1701.426 1699.179 40 0.1132454 60 1701.0557 d 40 40 0.1135454 20 1701.5270 1701.259 60 1701.9983 80 1702.267 d d 60 100 1703.274 0.1133445 20 0.1132101 0.1129454 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 0.11322294 20 0.11315574 0.1130757 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 0.11308854 Figure 3.14: From left to right: experiments 11 – 12 – 13 respectively. The cyan point identifies the actual defect, that is the same in the three experiments. The number of starting particles is different: N = 107 in experiment 11, N = 5 · 107 in experiment 12, N = 2 · 108 in experiment 13. 11, 12 and 13 are shown in these plots. The three experiments refer to the case of the same defect, with a different number of starting particles N . It is evident that in this case the increasing of N produces a very slow shrinkage of the intersection region. Nevertheless, it soon appears clear that the defect is in the second half of the interval, and p0 is smaller than p. 100 80 80 0.1288899 d 0.1285723 40 0.1282548 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 80 1362.549 60 1361.100 d 60 100 40 1359.651 20 60 d 100 40 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 0 0.0 0.2 0.4 0.6 0.8 1.0 p' Figure 3.15: Experiment 14. See caption of Fig. 3.7. Instead, in Fig. 3.15 we show that a defect with the same p0 = 0.8 of the case of the experiment 10 is fully identifiable if it is sufficiently close to the starting point (d = 15 in the experiment). Notice that in this experiment the information about the confidence interval of the residence time alone is not sufficient to determine whether the defect is pushing the walker forward or backward, nor where the defect is located. When p increases, the possible different behaviors further reduces. We propose here some ccv-defect˙v˙nov.tex – 18 novembre 2020 23 1:49 experiment where p = 0.55. 100 100 80 60 d 0.1817491 40 0.1813833 20 80 60 1030.8688 40 1030.0559 d 0.1821149 40 1029.2430 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 60 d 80 100 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 0 0.0 0.2 0.4 0.6 0.8 1.0 p' Figure 3.16: Experiment 15. See caption of Fig. 3.7. 100 80 80 0.1821318 d 0.1816158 40 60 20 1392.074 40 0.1810998 60 40 1389.923 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 80 1394.226 d 60 100 d 100 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 0 0.0 0.2 0.4 0.6 0.8 1.0 p' Figure 3.17: Experiment 16. See caption of Fig. 3.7. 100 80 80 0.1833384 d 0.1828194 40 0.1823004 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 80 878.7650 60 877.9133 d 60 100 40 877.0617 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' 60 d 100 40 20 0 0.0 0.2 0.4 0.6 0.8 1.0 p' Figure 3.18: Experiment 17. See caption of Fig. 3.7. In the experiments 15, 16 and 17 illustrated by Figs. 3.16, 3.17, 3.18 one can notice that the study of the region associated to the confidence interval of the experimental residence ccv-defect˙v˙nov.tex – 18 novembre 2020 24 1:49 time is always useful to identify whether the defect is favoring the walker to pass or not. In experiments 15 and 16, the situation is similar to that one of experiments 10–13. The defect is not sufficiently close to the starting site 1, so that the study of the confidence interval associated to the experimental fraction of particles crossing is not sufficient to fully identify the parameters of the defect, since many different couples (p0 , d) are still in the intersection region. The experiment 17 shows instead that a defect located at d = 25 (with p0 = 0.7 in the experiment) produces an intersection region that restrict the possible values of the estimated d to a small number, with a quite accurate estimate of the actual value of p0 . 4. Conclusions In the framework of the 1D simple random walk, we have studied the effect of obstacles in the transport of moving agents on a lane. We have shown how to setup experiments to measure the transport properties of a lane and the geometrical and transport properties of possibly present obstacles. In absence of obstacles, we have shown that measuring the fraction of particles that cross the lane (experiment 1) or the average time taken by the crossing particles to cross the lane (experiment 2) is sufficient to deduce the transport properties of the system, that in the case of study is the probability for a particle to jump to its right. To be precise, in the case of experiment 2, the problem is not completely identifiably, since the two symmetric estimates p and 1 − p are found. In presence of defects, supposing to know p, we are interested to estimate the geometric and transport properties of the obstacles, that is to say, its position d and the probability p0 that a particle sitting on the obstacle jumps to its right. We have shown that performing just experiment 1 or 2 is not sufficient to do the job, but knowing the results of both experiments allows to solve the problem and find an estimate of the two parameters d and p0 , apart from some particular case in which identifiability issues arise. We have discussed experiments for different values of the total number of moving particles N and we have shown that the identifiability problems are weakened when N is increased. A. Generating function For the sake of completeness, we report here the construction of the generating function U introduced in Section 3.2 following [22] (and references therein) and adapting, where necessary, the notation to the present setting. We call un,i the probability to exit the walk through the right side, starting from i and after n steps. We construct the generating function of the probability of exiting through the ccv-defect˙v˙nov.tex – 18 novembre 2020 25 1:49 P right side starting from i, Ui (s) = n≥0 un,i sn , see [31–33]. We define ti = Pi (R.E.), where the index i denotes that the walk is started at the site i. Note that ti = Ui (1). Therefore, the series defining Ui (s) is totally and thus uniformly convergent for s ∈ [0, 1]. Since the P derivative of the generating function is Ui0 (s) = n≥1 nun,i sn−1 , following [32] we have that lim− Ui0 (s) = s→1 X nun,i = ti Ri , (A.24) n≥1 where Ri is the conditional expectation of the duration of the game given that the random walk ends in L and it is finite for any fixed p, q, p0 , and q 0 . Following the approach of [31, 33], we find the generating function as the solution of a system combining the equations for the generating function in the bulks (regular sites) and on the singular site. Recalling that on the regular sites p and q are, respectively, the probabilities to jump to the right and to the left, we find that un+1,i = pun,i+1 + qun,i−1 (A.25) on the regular sites, namely, for i = 1, . . . , d − 1 and i = d + 1, . . . , L − 1, and boundary values un,0 = un,L = 0 when n ≥ 1, u0,L = 1, u0,i = 0 when i ≤ L. Thus, multiplying (A.25) by sn+1 and summing for n = 0, 1, 2, . . . we find the following equation in the bulk, i.e., i = 1, . . . , d − 1 and i = d + 1, . . . , L − 1, Ui (s) = psUi+1 (s) + qsUi−1 (s), (A.26) to be solved with boundary conditions UL (s) = 1 and U0 (s) = 0. The equation for Ud (s) on the defect site is Ud (s) = p0 sUd+1 (s) + q 0 sUd−1 (s), (A.27) since the probabilities on the singular site are p0 to jump to the right and q 0 to the left. It is known [31] that in the bulk of regular sites the generating function Ui for fixed s can be searched in the form λi (s), except for the case p = q and s = 1 at the same moment, where the generating function is linear, see later. Substituting in (A.26) it is found p 1 ± 1 − 4pqs2 . (A.28) λ± (s) = 2ps The generating function in the bulk can be written as a linear combination of terms in the form λi− and λi+ for i = 0, . . . , L. We consider two different linear combinations in the ccv-defect˙v˙nov.tex – 18 novembre 2020 26 1:49 bulk on the left and on the right of the defect site, namely, we introduce two coefficients on the left and two (possibly) different coefficients on the right. Thus we find two different representation of Ud (s). Therefore, we will be able to find the unknown coefficients by requiring that these two representation are equal for i = d, that is to say the equation at the defect (A.27) and the boundary conditions are satisfied. More precisely, the generating function reads Ui (s) = G(s)λi+ (s) + H(s)λi− (s), Uj (s) = U (s)λj+ (s) +V (s)λj− (s), i = 0, 1, . . . , d; (A.29) j = d, d + 1, . . . , L, and the coefficients G(s), H(s), U (s), and V (s) solve the system   Gλd+ + Hλd− = U λd+ + V λd−      Gλd + Hλd = q 0 s(Gλd−1 + Hλd−1 ) + p0 s(U λd+1 + V λd+1 ) + − + − + −  0=G+H      1 = U λL+ + V λL− . (A.30) The system (A.30) has a unique solution (G, H, U, V ) that can be explicitly expressed in terms of jump probabilities, λ± and s. Thus, by substituting λ± given by the (A.28) in the formulas (A.29), we find the explicit expressions of the generating function Ui (s). However, due to the length and complexity of these expressions, we prefer not to report here the solutions of the system in the general case. Note that in the easiest case, i.e., if the site d is regular, namely p0 = p and q 0 = q, the solutions are G = U and H = V which reduce to the classical ones in Feller [31, equation (4.10) in Paragraph XIV.4]. The conditional expectation Ri of the duration of the game starting from i and ending in L can be computed using equation (A.24). The limit allows us to include in this formula even the symmetric case p = q, where the generating function has not the form of a combination of powers λi anymore for s = 1. References [1] E. Cristiani and D. Peri. Applied Mathematical Modelling, 45:285 – 302, 2017. [2] M.J. Saxton. Biophysical Journal, 66:394–401, Feb 1994. [3] F. Höfling and T. Franosch. Reports on Progress in Physics, 76(4), 2013. [4] M.A. Mourão, J.B. Hakim, and S. Schnell. Biophysical Journal, 107:2761–2766, Jun 2017. ccv-defect˙v˙nov.tex – 18 novembre 2020 27 1:49 [5] A.J. Ellery, M.J. Simpson, S.W. McCue, and R.E. Baker. The Journal of Chemical Physics, 140(5):054108, 2014. [6] K. To, P. Lai, and H.K. Pak. Phys. Rev. Lett., 86:71–74, Jan 2001. [7] I. Zuriguel, A. Garcimartı́n, D. Maza, L.A. Pugnaloni, and J.M. Pastor. Phys. Rev. E, 71:051303, May 2005. [8] F. Alonso–Marroquin, S.I. Azeezullah, S.A. Galindo–Torres, and L.M. Olsen-Kettle. Phys. Rev. E, 85:020301, Feb 2012. [9] I. Zuriguel, A. Janda, A. Garcimartı́n, C. Lozano, R. Arévalo, and D. Maza. Phys. Rev. Lett., 107:278001, Dec 2011. [10] D. Helbing. Rev. Mod. Phys., 73:1067–1141, Dec 2001. [11] D. Helbing, I. Farkas, P. Molnàr, and T. Vicsek. In M. Schreckenberg and S. D. Sharma, editors, Pedestrian and Evacuation Dynamics, pages 21–58, Berlin, 2002. Springer. [12] E.N.M. Cirillo and A. Muntean. Physica A: Statistical Mechanics and its Applications, 392(17):3578 – 3588, 2013. [13] A. Muntean, E.N.M. Cirillo, O. Krehel, M. Bohm, In “Collective Dynamics from Bacteria to Crowds”, An Excursion Through Modeling, Analysis and Simulation Series: CISM International Centre for Mechanical Sciences, Vol. 553 Muntean, Adrian, Toschi, Federico (Eds.) 2014, VII, 177 p. 29 illus, Springer, 2014. [14] E.N.M. Cirillo, A. Muntean, Comptes Rendus Macanique 340, 626–628, 2012. [15] A. Ciallella, E.N.M. Cirillo, P.L. Curseu, A. Muntean, Mathematical Models and Methods in Applied Sciences, https://doi.org/10.1142/S0218202518400079. [16] G. Albi, M. Bongini, E. Cristiani, and D. Kalise. SIAM Journal on Applied Mathematics, 76(4):1683–1710, 2016. [17] D. Helbing, I. Farkas, and T. Vicsek. Nature, 407:487–490, Sep 2000. [18] D. Helbing, L. Buzna, A. Johansson, and T. Werner. Transportation Science, 39(1):1–24, 2005. ccv-defect˙v˙nov.tex – 18 novembre 2020 28 1:49 [19] R. Escobar and A. De La Rosa. In W. Banzhaf, J. Ziegler, T. Christaller, P. Dittrich, and J.T. Kim, editors, Advances in Artificial Life, Proceedings of the 7th European Conference, ECAL, 2003, Dortmund, germany, September 14–17, 2003, Proceedings. Lecture Notes in Computer Science, vol. 2801., pages 97–106, Berlin, 2003. Springer. [20] S.A. Janowsky, J.L. Lebowitz, J. Stat. Phys. 77, 35–51 (1994). [21] B. Scoppola, C. Lancia, R. Mariani, J. Stat. Phys. 161, 843–858 (2015). [22] A. Ciallella, E.N.M. Cirillo, Conditional expectation of the duration of the classical gambler problem with defects European Physical Journal Special Topics 228 111-128, 2019 [23] A. Ciallella, E.N.M. Cirillo, J. Sohier. Physical Review E, 97:052116, 2018. [24] A. Ciallella and E.N.M. Cirillo. Kinetic & Related Models 11, 1475–1501 (2018). [25] E.N.M. Cirillo, M. Colangeli, and A. Muntean. Physical Review E 94, 042116 (2016). [26] E.N.M. Cirillo, M. Colangeli, Phys. Rev. E 96, 052137, 2017. [27] E.N.M. Cirillo, M. Colangeli, A. Muntean, Physica A 488, 30–38 (2017) [28] E.N.M. Cirillo, O. Krehel, A. Muntean, and R. van Santen. Phys. Rev. E, 94:042115, Oct 2016. [29] E.N.M. Cirillo, O. Krehel, A. Muntean, R. van Santen, and Aditya S. Physica A: Statistical Mechanics and its Applications, 442:436 – 457, 2016. [30] J. Messelink, R. Rens, M. Vahabi, F.C. MacKintosh, A. Sharma, Physical Review E 93, 01211 (2016) [31] W. Feller. An Introduction to Probability Theory and its Applications, volume 1. John wiley & Sons, Inc, New York – London – Sidney, 1968. [32] F. Stern, Math. Mag. 48, 200–203 (1975). [33] T. Lengyel, Applied Mathematics Letters 22, 351–355 (2009). [34] W.A. Beyer, M.S. Waterman, Symmetries for conditioned ruin problems, Math. Mag. 50 (1) (1977) 42–45. ccv-defect˙v˙nov.tex – 18 novembre 2020 29 1:49