
Distance-Restricted Explanations: Theoretical Underpinnings & Efficient Implementation

Authors: Yacine Izza, Xuanxiang Huang, Antonio Morgado, Jordi Planes, Alexey Ignatiev, Joao Marques-Silva

Abstract: The uses of machine learning (ML) have snowballed in recent years. In many cases, ML models are highly complex, and their operation is beyond the understanding of human decision-makers. Nevertheless, some uses of ML models involve high-stakes and safety-critical applications. Explainable artificial intelligence (XAI) aims to help human decision-makers in understanding the operation of such complex ML models, thus eliciting trust in their operation. Unfortunately, the majority of past XAI work is based on informal approaches that offer no guarantees of rigor. Unsurprisingly, there exists comprehensive experimental and theoretical evidence confirming that informal methods of XAI can provide human decision-makers with erroneous information. Logic-based XAI represents a rigorous approach to explainability; it is model-based and offers the strongest guarantees of rigor of computed explanations. However, a well-known drawback of logic-based XAI is the complexity of logic reasoning, especially for highly complex ML models. Recent work proposed distance-restricted explanations, i.e. explanations that are rigorous provided the distance to a given input is small enough. Distance-restricted explainability is tightly related with adversarial robustness, and it has been shown to scale for moderately complex ML models, but the number of inputs still represents a key limiting factor. This paper investigates novel algorithms for scaling up the performance of logic-based explainers when computing and enumerating ML model explanations with a large number of inputs.

Submitted 13 May, 2024; originally announced May 2024.

arXiv:2404.03840 [pdf]

FarView: An In-Situ Manufactured Lunar Far Side Radio Array Concept for 21-cm Dark Ages Cosmology

Authors: Ronald S. Polidan, Jack O. Burns, Alex Ignatiev, Alex Hegedus, Jonathan Pober, Nivedita Mahesh, Tzu-Ching Chang, Gregg Hallinan, Yuhong Ning, Judd Bowman

Abstract: FarView is an early-stage concept for a large, low-frequency radio observatory, manufactured in-situ on the lunar far side using metals extracted from the lunar regolith. It consists of 100,000 dipole antennas in compact subarrays distributed over a large area but with empty space between subarrays in a core-halo structure. FarView covers a total area of ~200 km², has a dense core within the inner ~36 km², and a ~power-law falloff of antenna density out to ~14 km from the center. With this design, it is relatively easy to identify multiple viable build sites on the lunar far side. The science case for FarView emphasizes the unique capabilities to probe the unexplored Cosmic Dark Ages - identified by the 2020 Astrophysics Decadal Survey as the discovery area for cosmology. FarView will deliver power spectra and tomographic maps tracing the evolution of the Universe from before the birth of the first stars to the beginning of Cosmic Dawn, and potentially provide unique insights into dark matter, early dark energy, neutrino masses, and the physics of inflation. What makes FarView feasible and affordable in the timeframe of the 2030s is that it is manufactured in-situ, utilizing space industrial technologies. This in-situ manufacturing architecture utilizes Earth-built equipment that is transported to the lunar surface to extract metals from the regolith and will use those metals to manufacture most of the array components: dipole antennas, power lines, and silicon solar cell power systems. This approach also enables a long functional lifetime, by permitting servicing and repair of the observatory. The full 100,000 dipole FarView observatory will take 4 - 8 years to build, depending on the realized performance of the manufacturing elements and the lunar delivery scenario.

Submitted 4 April, 2024; originally announced April 2024.

Comments: 26 pages, 7 figures, 2 tables

arXiv:2312.06973 [pdf, other]

Anytime Approximate Formal Feature Attribution

Authors: Jinqiang Yu, Graham Farr, Alexey Ignatiev, Peter J. Stuckey

Abstract: Widespread use of artificial intelligence (AI) algorithms and machine learning (ML) models on the one hand and a number of crucial issues pertaining to them warrant the need for explainable artificial intelligence (XAI). A key explainability question is: given this decision was made, what are the input features which contributed to the decision? Although a range of XAI approaches exist to tackle this problem, most of them have significant limitations. Heuristic XAI approaches suffer from the lack of quality guarantees, and often try to approximate Shapley values, which is not the same as explaining which features contribute to a decision. A recent alternative is so-called formal feature attribution (FFA), which defines feature importance as the fraction of formal abductive explanations (AXp's) containing the given feature. This measures feature importance from the view of formally reasoning about the model's behavior. It is challenging to compute FFA using its definition because that involves counting AXp's, although one can approximate it. Based on these results, this paper makes several contributions. First, it gives compelling evidence that computing FFA is intractable, even if the set of contrastive formal explanations (CXp's) is provided, by proving that the problem is #P-hard. Second, by using the duality between AXp's and CXp's, it proposes an efficient heuristic to switch from CXp enumeration to AXp enumeration on-the-fly resulting in an adaptive explanation enumeration algorithm effectively approximating FFA in an anytime fashion. Finally, experimental results obtained on a range of widely used datasets demonstrate the effectiveness of the proposed FFA approximation approach in terms of the error of FFA approximation as well as the number of explanations computed and their diversity given a fixed time limit.

Submitted 11 December, 2023; originally announced December 2023.
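
The FFA definition quoted in the abstract above (the fraction of AXp's containing a given feature) can be illustrated with a minimal sketch. It assumes the AXp's have already been enumerated and are given as sets of feature indices; the function name and the toy explanations are hypothetical and are not taken from the paper, which is concerned with approximating this quantity when full enumeration is infeasible.

```python
from fractions import Fraction


def formal_feature_attribution(axps, features):
    """Return, for each feature, the share of the given AXp's that contain it.

    `axps` is assumed to be a complete, already-enumerated list of abductive
    explanations, each represented as a set of feature indices.
    """
    if not axps:
        raise ValueError("at least one explanation is required")
    total = len(axps)
    return {
        f: Fraction(sum(1 for axp in axps if f in axp), total)
        for f in features
    }


# Hypothetical toy example: three AXp's over features 0..3.
axps = [{0, 1}, {0, 2}, {1, 2, 3}]
ffa = formal_feature_attribution(axps, range(4))
for feature, score in ffa.items():
    print(feature, score)
```

Exact fractions are used so that the scores can be compared without floating-point noise.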

arXiv:2307.03380 [pdf, other]

On Formal Feature Attribution and Its Approximation

Authors: Jinqiang Yu, Alexey Ignatiev, Peter J. Stuckey

Abstract: Recent years have witnessed the widespread use of artificial intelligence (AI) algorithms and machine learning (ML) models. Despite their tremendous success, a number of vital problems like ML model brittleness, their fairness, and the lack of interpretability warrant the need for the active developments in explainable artificial intelligence (XAI) and formal ML model verification. The two major lines of work in XAI include feature selection methods, e.g. Anchors, and feature attribution techniques, e.g. LIME and SHAP. Despite their promise, most of the existing feature selection and attribution approaches are susceptible to a range of critical issues, including explanation unsoundness and out-of-distribution sampling. A recent formal approach to XAI (FXAI), although serving as an alternative to the above and free of these issues, suffers from a few other limitations. For instance, and besides the scalability limitation, the formal approach is unable to tackle the feature attribution problem. Additionally, a formal explanation, despite being formally sound, is typically quite large, which hampers its applicability in practical settings. Motivated by the above, this paper proposes a way to apply the apparatus of formal XAI to the case of feature attribution based on formal explanation enumeration. Formal feature attribution (FFA) is argued to be advantageous over the existing methods, both formal and non-formal. Given the practical complexity of the problem, the paper then proposes an efficient technique for approximating exact FFA. Finally, it offers experimental evidence of the effectiveness of the proposed approximate FFA in comparison to the existing feature attribution algorithms not only in terms of feature importance but also in terms of their relative order.

Submitted 28 August, 2023; v1 submitted 7 July, 2023; originally announced July 2023.

arXiv:2306.15272 [pdf, ps, other]

doi 10.1609/aaai.v38i11.29170

Delivering Inflated Explanations

Authors: Yacine Izza, Alexey Ignatiev, Peter Stuckey, Joao Marques-Silva

Abstract: In the quest for Explainable Artificial Intelligence (XAI) one of the questions that frequently arises given a decision made by an AI system is, "why was the decision made in this way?" Formal approaches to explainability build a formal model of the AI system and use this to reason about the properties of the system. Given a set of feature values for an instance to be explained, and a resulting decision, a formal abductive explanation is a set of features such that, if they take the given values, the same decision always results. This explanation is useful: it shows that only some features were used in making the final decision. But it is narrow: it only shows that if the selected features take their given values the decision is unchanged. It's possible that some features may change values and still lead to the same decision. In this paper we formally define an inflated explanation, which is a set of features and, for each feature, a set of values (always including the value of the instance being explained), such that the decision will remain unchanged. Inflated explanations are more informative than abductive explanations since, e.g., they allow us to see if the exact value of a feature is important, or it could be any nearby value. Overall they allow us to better understand the role of each feature in the decision. We show that we can compute inflated explanations at a cost not much greater than that of abductive explanations, and that we can extend duality results for abductive explanations also to inflated explanations.

Submitted 27 June, 2023; originally announced June 2023.

arXiv:2212.05990 [pdf, other]

On Computing Probabilistic Abductive Explanations

Authors: Yacine Izza, Xuanxiang Huang, Alexey Ignatiev, Nina Narodytska, Martin C. Cooper, Joao Marques-Silva

Abstract: The most widely studied explainable AI (XAI) approaches are unsound. This is the case with well-known model-agnostic explanation approaches, and it is also the case with approaches based on saliency maps. One solution is to consider intrinsic interpretability, which does not exhibit the drawback of unsoundness. Unfortunately, intrinsic interpretability can display unwieldy explanation redundancy. Formal explainability represents the alternative to these non-rigorous approaches, with one example being PI-explanations. Unfortunately, PI-explanations also exhibit important drawbacks, the most visible of which is arguably their size. Recently, it has been observed that the (absolute) rigor of PI-explanations can be traded off for a smaller explanation size, by computing the so-called relevant sets. Given some positive δ, a set S of features is δ-relevant if, when the features in S are fixed, the probability of getting the target class exceeds δ. However, even for very simple classifiers, the complexity of computing relevant sets of features is prohibitive, with the decision problem being NP^PP-complete for circuit-based classifiers. In contrast with earlier negative results, this paper investigates practical approaches for computing relevant sets for a number of widely used classifiers that include Decision Trees (DTs), Naive Bayes Classifiers (NBCs), and several families of classifiers obtained from propositional languages. Moreover, the paper shows that, in practice, and for these families of classifiers, relevant sets are easy to compute. Furthermore, the experiments confirm that succinct sets of relevant features can be obtained for the families of classifiers considered.

Submitted 12 December, 2022; originally announced December 2022.

Comments: arXiv admin note: text overlap with arXiv:2207.04748, arXiv:2205.09569
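
The δ-relevance condition stated in the abstract above can be checked by brute force on a tiny example. The sketch below makes assumptions that are not part of the listing: features are Boolean, the free features are uniformly distributed, and the classifier is a hypothetical toy function. It enumerates every completion of the free features, so it is only an illustration of the definition and not one of the practical algorithms the paper investigates.

```python
from itertools import product


def precision(classifier, instance, fixed):
    """Probability of keeping the prediction when only `fixed` features are set.

    Assumes Boolean features and a uniform distribution over the free ones.
    """
    target = classifier(instance)
    free = [i for i in range(len(instance)) if i not in fixed]
    hits = 0
    for values in product([0, 1], repeat=len(free)):
        point = list(instance)
        for index, value in zip(free, values):
            point[index] = value
        if classifier(tuple(point)) == target:
            hits += 1
    return hits / 2 ** len(free)


def is_delta_relevant(classifier, instance, fixed, delta):
    # Strict comparison, following the abstract's wording ("exceeds delta").
    return precision(classifier, instance, fixed) > delta


# Hypothetical toy classifier: x0 AND (x1 OR x2).
def toy(x):
    return int(x[0] and (x[1] or x[2]))


instance = (1, 1, 1)
print(precision(toy, instance, {0}))
print(is_delta_relevant(toy, instance, {0}, 0.7))
```

Fixing more features can only be tested, not assumed, to raise the precision, which is why the check recomputes it for each candidate set.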

arXiv:2206.09551 [pdf, other]

Eliminating The Impossible, Whatever Remains Must Be True

Authors: Jinqiang Yu, Alexey Ignatiev, Peter J. Stuckey, Nina Narodytska, Joao Marques-Silva

Abstract: The rise of AI methods to make predictions and decisions has led to a pressing need for more explainable artificial intelligence (XAI) methods. One common approach for XAI is to produce a post-hoc explanation, explaining why a black box ML model made a certain prediction. Formal approaches to post-hoc explanations provide succinct reasons for why a prediction was made, as well as why not another prediction was made. But these approaches assume that features are independent and uniformly distributed. While this means that "why" explanations are correct, they may be longer than required. It also means the "why not" explanations may be suspect as the counterexamples they rely on may not be meaningful. In this paper, we show how one can apply background knowledge to give more succinct "why" formal explanations, that are presumably easier to interpret by humans, and give more accurate "why not" explanations. In addition, we show how to use existing rule induction techniques to efficiently extract background information from a dataset, and also how to report which background information was used to make an explanation, allowing a human to examine it if they doubt the correctness of the explanation.

Submitted 30 November, 2022; v1 submitted 19 June, 2022; originally announced June 2022.

arXiv:2205.09971 [pdf, ps, other]

doi 10.1613/jair.1.13575

On Tackling Explanation Redundancy in Decision Trees

Authors: Yacine Izza, Alexey Ignatiev, Joao Marques-Silva

Abstract: Decision trees (DTs) epitomize the ideal of interpretability of machine learning (ML) models. The interpretability of decision trees motivates explainability approaches by so-called intrinsic interpretability, and it is at the core of recent proposals for applying interpretable ML models in high-risk applications. The belief in DT interpretability is justified by the fact that explanations for DT predictions are generally expected to be succinct. Indeed, in the case of DTs, explanations correspond to DT paths. Since decision trees are ideally shallow, and so paths contain far fewer features than the total number of features, explanations in DTs are expected to be succinct, and hence interpretable. This paper offers both theoretical and experimental arguments demonstrating that, as long as interpretability of decision trees equates with succinctness of explanations, then decision trees ought not be deemed interpretable. The paper introduces logically rigorous path explanations and path explanation redundancy, and proves that there exist functions for which decision trees must exhibit paths with arbitrarily large explanation redundancy. The paper also proves that only a very restricted class of functions can be represented with DTs that exhibit no explanation redundancy. In addition, the paper includes experimental results substantiating that path explanation redundancy is observed ubiquitously in decision trees, including those obtained using different tree learning algorithms, but also in a wide range of publicly available decision trees. The paper also proposes polynomial-time algorithms for eliminating path explanation redundancy, which in practice require negligible time to compute. Thus, these algorithms serve to indirectly attain irreducible, and so succinct, explanations for decision trees.

Submitted 30 September, 2022; v1 submitted 20 May, 2022; originally announced May 2022.

ACM Class: I.2.4; I.2.6

Journal ref: J. Artif. Intell. Res. Vol. 75 (2022)

arXiv:2205.09569 [pdf, ps, other]

Provably Precise, Succinct and Efficient Explanations for Decision Trees

Authors: Yacine Izza, Alexey Ignatiev, Nina Narodytska, Martin C. Cooper, Joao Marques-Silva

Abstract: Decision trees (DTs) embody interpretable classifiers. DTs have been advocated for deployment in high-risk applications, but also for explaining other complex classifiers. Nevertheless, recent work has demonstrated that predictions in DTs ought to be explained with rigorous approaches. Although rigorous explanations can be computed in polynomial time for DTs, their size may be beyond the cognitive limits of human decision makers. This paper investigates the computation of δ-relevant sets for DTs. δ-relevant sets denote explanations that are succinct and provably precise. These sets represent generalizations of rigorous explanations, which are precise with probability one, and so they enable trading off explanation size for precision. The paper proposes two logic encodings for computing smallest δ-relevant sets for DTs. The paper further devises a polynomial-time algorithm for computing δ-relevant sets which are not guaranteed to be subset-minimal, but for which the experiments show to be most often subset-minimal in practice. The experimental results also demonstrate the practical efficiency of computing smallest δ-relevant sets.

Submitted 19 May, 2022; originally announced May 2022.

arXiv:2112.03329 [pdf, other]

Inconsistent Planning: When in doubt, toss a coin!

Authors: Yuriy Dementiev, Fedor V. Fomin, Artur Ignatiev

Abstract: One of the most widespread human behavioral biases is the present bias -- the tendency to overestimate current costs by a bias factor. Kleinberg and Oren (2014) introduced an elegant graph-theoretical model of inconsistent planning capturing the behavior of a present-biased agent accomplishing a set of actions. The essential measure of the system introduced by Kleinberg and Oren is the cost of irrationality -- the ratio of the total cost of the actions performed by the present-biased agent to the optimal cost. This measure is vital for a task designer to estimate the aftermaths of human behavior related to time-inconsistent planning, including procrastination and abandonment. As we prove in this paper, the cost of irrationality is highly susceptible to the agent's choices when faced with a few possible actions of equal estimated costs. To address this issue, we propose a modification of Kleinberg-Oren's model of inconsistent planning. In our model, when an agent selects from several options of minimum prescribed cost, he uses a randomized procedure. We explore the algorithmic complexity of computing and estimating the cost of irrationality in the new model.

Submitted 6 December, 2021; originally announced December 2021.

arXiv:2107.01654 [pdf, other]

Efficient Explanations for Knowledge Compilation Languages

Authors: Xuanxiang Huang, Yacine Izza, Alexey Ignatiev, Martin C. Cooper, Nicholas Asher, Joao Marques-Silva

Abstract: Knowledge compilation (KC) languages find a growing number of practical uses, including in Constraint Programming (CP) and in Machine Learning (ML). In most applications, one natural question is how to explain the decisions made by models represented by a KC language. This paper shows that for many of the best known KC languages, well-known classes of explanations can be computed in polynomial time. These classes include deterministic decomposable negation normal form (d-DNNF), and so any KC language that is strictly less succinct than d-DNNF. Furthermore, the paper also investigates the conditions under which polynomial time computation of explanations can be extended to KC languages more succinct than d-DNNF.

Submitted 8 July, 2021; v1 submitted 4 July, 2021; originally announced July 2021.

arXiv:2106.01350 [pdf, ps, other]

On Efficiently Explaining Graph-Based Classifiers

Authors: Xuanxiang Huang, Yacine Izza, Alexey Ignatiev, Joao Marques-Silva

Abstract: Recent work has not only shown that decision trees (DTs) may not be interpretable but also proposed a polynomial-time algorithm for computing one PI-explanation of a DT. This paper shows that for a wide range of classifiers, globally referred to as decision graphs, and which include decision trees and binary decision diagrams, but also their multi-valued variants, there exist polynomial-time algorithms for computing one PI-explanation. In addition, the paper also proposes a polynomial-time algorithm for computing one contrastive explanation. These novel algorithms build on explanation graphs (XpG's). XpG's denote a graph representation that enables both theoretical and practically efficient computation of explanations for decision graphs. Furthermore, the paper proposes a practically efficient solution for the enumeration of explanations, and studies the complexity of deciding whether a given feature is included in some explanation. For the concrete case of decision trees, the paper shows that the set of all contrastive explanations can be enumerated in polynomial time. Finally, the experimental results validate the practical applicability of the algorithms proposed in the paper on a wide range of publicly available benchmarks.

Submitted 3 June, 2021; v1 submitted 2 June, 2021; originally announced June 2021.

arXiv:2106.00546 [pdf, ps, other]

Efficient Explanations With Relevant Sets

Authors: Yacine Izza, Alexey Ignatiev, Nina Narodytska, Martin C. Cooper, Joao Marques-Silva

Abstract: Recent work proposed $δ$-relevant inputs (or sets) as a probabilistic explanation for the predictions made by a classifier on a given input. $δ$-relevant sets are significant because they serve to relate (model-agnostic) Anchors with (model-accurate) PI-explanations, among other explanation approaches. Unfortunately, the computation of smallest size $δ$-relevant sets is complete for ${NP}^{PP}$, rendering their computation largely infeasible in practice. This paper investigates solutions for tackling the practical limitations of $δ$-relevant sets. First, the paper alternatively considers the computation of subset-minimal sets. Second, the paper studies concrete families of classifiers, including decision trees among others. For these cases, the paper shows that the computation of subset-minimal $δ$-relevant sets is in NP, and can be solved with a polynomial number of calls to an NP oracle. The experimental evaluation compares the proposed approach with heuristic explainers for the concrete case of the classifiers studied in the paper, and confirms the advantage of the proposed solution over the state of the art.

Submitted 1 June, 2021; originally announced June 2021.

arXiv:2106.00154 [pdf, ps, other]

Explanations for Monotonic Classifiers

Authors: Joao Marques-Silva, Thomas Gerspacher, Martin Cooper, Alexey Ignatiev, Nina Narodytska

Abstract: In many classification tasks there is a requirement of monotonicity. Concretely, if all else remains constant, increasing (resp. decreasing) the value of one or more features must not decrease (resp. increase) the value of the prediction. Despite comprehensive efforts on learning monotonic classifiers, dedicated approaches for explaining monotonic classifiers are scarce and classifier-specific. This paper describes novel algorithms for the computation of one formal explanation of a (black-box) monotonic classifier. These novel algorithms are polynomial in the run time complexity of the classifier and the number of features. Furthermore, the paper presents a practically efficient model-agnostic algorithm for enumerating formal explanations.

Submitted 31 May, 2021; originally announced June 2021.

arXiv:2105.06782 [pdf, other]

SAT-Based Rigorous Explanations for Decision Lists

Authors: Alexey Ignatiev, Joao Marques-Silva

Abstract: Decision lists (DLs) find a wide range of uses for classification problems in Machine Learning (ML), being implemented in a number of ML frameworks. DLs are often perceived as interpretable. However, building on recent results for decision trees (DTs), we argue that interpretability is an elusive goal for some DLs. As a result, for some uses of DLs, it will be important to compute (rigorous) explanations. Unfortunately, and in clear contrast with the case of DTs, this paper shows that computing explanations for DLs is computationally hard. Motivated by this result, the paper proposes propositional encodings for computing abductive explanations (AXps) and contrastive explanations (CXps) of DLs. Furthermore, the paper investigates the practical efficiency of a MARCO-like approach for enumerating explanations. The experimental results demonstrate that, for DLs used in practical settings, the use of SAT oracles offers a very efficient solution, and that complete enumeration of explanations is most often feasible.

Submitted 14 May, 2021; originally announced May 2021.

arXiv:2102.01904 [pdf, other]

A Scalable Two Stage Approach to Computing Optimal Decision Sets

Authors: Alexey Ignatiev, Edward Lam, Peter J. Stuckey, Joao Marques-Silva

Abstract: Machine learning (ML) is ubiquitous in modern life. Since it is being deployed in technologies that affect our privacy and safety, it is often crucial to understand the reasoning behind its decisions, warranting the need for explainable AI. Rule-based models, such as decision trees, decision lists, and decision sets, are conventionally deemed to be the most interpretable. Recent work uses propositional satisfiability (SAT) solving (and its optimization variants) to generate minimum-size decision sets. Motivated by limited practical scalability of these earlier methods, this paper proposes a novel approach to learn minimum-size decision sets by enumerating individual rules of the target decision set independently of each other, and then solving a set cover problem to select a subset of rules. The approach makes use of modern maximum satisfiability and integer linear programming technologies. Experiments on a wide range of publicly available datasets demonstrate the advantage of the new approach over the state of the art in SAT-based decision set learning.

Submitted 3 February, 2021; originally announced February 2021.

arXiv:2012.11067 [pdf, other]

On Relating 'Why?' and 'Why Not?' Explanations

Authors: Alexey Ignatiev, Nina Narodytska, Nicholas Asher, Joao Marques-Silva

Abstract: Explanations of Machine Learning (ML) models often address a 'Why?' question. Such explanations can be related with selecting feature-value pairs which are sufficient for the prediction. Recent work has investigated explanations that address a 'Why Not?' question, i.e. finding a change of feature values that guarantees a change of prediction. Given their goals, these two forms of explaining predictions of ML models appear to be mostly unrelated. However, this paper demonstrates otherwise, and establishes a rigorous formal relationship between 'Why?' and 'Why Not?' explanations. Concretely, the paper proves that, for any given instance, 'Why?' explanations are minimal hitting sets of 'Why Not?' explanations and vice-versa. Furthermore, the paper devises novel algorithms for extracting and enumerating both forms of explanations.

Submitted 20 December, 2020; originally announced December 2020.
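
The hitting-set duality stated in the abstract above can be checked by brute force on a tiny example. The sketch below is only an illustration, not the algorithms devised in the paper: the classifier `predict`, the helper names `is_sufficient` and `minimal_sets`, and the chosen instance are all assumptions made for this example, and the exhaustive enumeration is only practical for a handful of features.

```python
from itertools import combinations, product

def predict(x):
    # Hypothetical toy classifier over three binary features.
    return int((x[0] and x[1]) or x[2])

def is_sufficient(fixed, instance, n):
    # True if fixing the features in 'fixed' to their values in 'instance'
    # forces the same prediction for every assignment to the other features.
    target = predict(instance)
    free = [i for i in range(n) if i not in fixed]
    for values in product([0, 1], repeat=len(free)):
        x = list(instance)
        for i, v in zip(free, values):
            x[i] = v
        if predict(x) != target:
            return False
    return True

def minimal_sets(n, holds):
    # Subset-minimal feature sets satisfying 'holds'. Sets are visited by
    # increasing size, so a set is minimal iff no kept set is a proper subset.
    # This relies on 'holds' being monotone (closed under supersets).
    found = []
    for k in range(n + 1):
        for combo in combinations(range(n), k):
            s = frozenset(combo)
            if holds(s) and not any(m < s for m in found):
                found.append(s)
    return found

def hits(family):
    # Predicate: does a set intersect every member of 'family'?
    return lambda s: all(s & f for f in family)

n = 3
instance = (1, 1, 1)
everything = frozenset(range(n))

# 'Why?' explanations: minimal sets of features sufficient for the prediction.
axps = minimal_sets(n, lambda s: is_sufficient(s, instance, n))
# 'Why Not?' explanations: minimal sets of features whose change can flip it.
cxps = minimal_sets(n, lambda s: not is_sufficient(everything - s, instance, n))

print("Why?     :", [sorted(s) for s in axps])
print("Why Not? :", [sorted(s) for s in cxps])
print("duality holds:",
      set(minimal_sets(n, hits(cxps))) == set(axps)
      and set(minimal_sets(n, hits(axps))) == set(cxps))
```

For this toy classifier and instance, each family of explanations is recovered as the minimal hitting sets of the other, which is the relationship the abstract describes.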

arXiv:2010.11034 [pdf, ps, other]

On Explaining Decision Trees

Authors: Yacine Izza, Alexey Ignatiev, Joao Marques-Silva

Abstract: Decision trees (DTs) epitomize what has come to be known as interpretable machine learning (ML) models. This is informally motivated by paths in DTs being often much smaller than the total number of features. This paper shows that in some settings DTs can hardly be deemed interpretable, with paths in a DT being arbitrarily larger than a PI-explanation, i.e. a subset-minimal set of feature values that entails the prediction. As a result, the paper proposes a novel model for computing PI-explanations of DTs, which enables computing one PI-explanation in polynomial time. Moreover, it is shown that enumeration of PI-explanations can be reduced to the enumeration of minimal hitting sets. Experimental results were obtained on a wide range of publicly available datasets with well-known DT-learning tools, and confirm that in most cases DTs have paths that are proper supersets of PI-explanations.

Submitted 21 October, 2020; originally announced October 2020.

arXiv:2010.09919 [pdf, other]

Optimal Decision Lists using SAT

Authors: Jinqiang Yu, Alexey Ignatiev, Pierre Le Bodic, Peter J. Stuckey

Abstract: Decision lists are one of the most easily explainable machine learning models. Given the renewed emphasis on explainable machine learning decisions, this machine learning model is increasingly attractive, combining small size and clear explainability. In this paper, we show for the first time how to construct optimal "perfect" decision lists which are perfectly accurate on the training data, and minimal in size, making use of modern SAT solving technology. We also give a new method for determining optimal sparse decision lists, which trade off size and accuracy. We contrast the size and test accuracy of optimal decision lists versus optimal decision sets, as well as other state-of-the-art methods for determining optimal decision lists. We also examine the size of average explanations generated by decision sets and decision lists.

Submitted 19 October, 2020; originally announced October 2020.

arXiv:2008.05803 [pdf, other]

Explaining Naive Bayes and Other Linear Classifiers with Polynomial Time and Delay

Authors: Joao Marques-Silva, Thomas Gerspacher, Martin C. Cooper, Alexey Ignatiev, Nina Narodytska

Abstract: Recent work proposed the computation of so-called PI-explanations of Naive Bayes Classifiers (NBCs). PI-explanations are subset-minimal sets of feature-value pairs that are sufficient for the prediction, and have been computed with state-of-the-art exact algorithms that are worst-case exponential in time and space. In contrast, we show that the computation of one PI-explanation for an NBC can be achieved in log-linear time, and that the same result also applies to the more general class of linear classifiers. Furthermore, we show that the enumeration of PI-explanations can be obtained with polynomial delay. Experimental results demonstrate the performance gains of the new algorithms when compared with earlier work. The experimental results also investigate ways to measure the quality of heuristic explanations.

Submitted 4 November, 2020; v1 submitted 13 August, 2020; originally announced August 2020.
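
To make the log-linear claim in the abstract above concrete, here is a minimal sketch of one way to obtain a subset-minimal sufficient set of features for a linear classifier by sorting. It is not taken from the paper: the function name `one_pi_explanation`, the bounded-feature setting, and the decision rule (predict positive when the score is strictly greater than zero) are assumptions made for illustration.

```python
def one_pi_explanation(weights, bias, lows, highs, instance):
    # Assumed model: score = bias + sum(w_i * x_i), positive class iff score > 0,
    # with each feature x_i ranging over [lows[i], highs[i]].
    n = len(weights)
    score = bias + sum(w * v for w, v in zip(weights, instance))
    if score <= 0:
        raise ValueError("this sketch only handles a positive prediction")

    # Largest possible drop in the score if feature i is left unconstrained.
    drops = [w * v - min(w * lo, w * hi)
             for w, v, lo, hi in zip(weights, instance, lows, highs)]

    # Free features in order of increasing drop while the worst-case score
    # stays strictly positive. Sorting dominates the cost: O(n log n).
    slack = score
    free = set()
    for i in sorted(range(n), key=lambda i: drops[i]):
        if drops[i] < slack:
            slack -= drops[i]
            free.add(i)
        else:
            # Every remaining feature has a drop at least this large,
            # so none of them can be freed either.
            break

    # The features that could not be freed form the explanation.
    return sorted(set(range(n)) - free)


# Hypothetical classifier over four features in [0, 1].
explanation = one_pi_explanation(
    weights=[3, -2, 1, 4], bias=-3,
    lows=[0, 0, 0, 0], highs=[1, 1, 1, 1],
    instance=[1, 0, 1, 1],
)
print(explanation)
```

Because features are freed in order of increasing worst-case drop, the loop stops at the first feature that cannot be freed, and no later feature can be freed either, so the returned set is subset-minimal under the stated assumptions.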

arXiv:2007.15140 [pdf, other]

Computing Optimal Decision Sets with SAT

Authors: Jinqiang Yu, Alexey Ignatiev, Peter J. Stuckey, Pierre Le Bodic

Abstract: As machine learning is increasingly used to help make decisions, there is a demand for these decisions to be explainable. Arguably, the most explainable machine learning models use decision rules. This paper focuses on decision sets, a type of model with unordered rules, which explains each prediction with a single rule. In order to be easy for humans to understand, these rules must be concise. Earlier work on generating optimal decision sets first minimizes the number of rules, and then minimizes the number of literals, but the resulting rules can often be very large. Here we consider a better measure, namely the total size of the decision set in terms of literals. So we are not driven to a small set of rules which require a large number of literals. We provide the first approach to determine minimum-size decision sets that achieve minimum empirical risk and then investigate sparse alternatives where we trade accuracy for size. By finding optimal solutions we show we can build decision set classifiers that are almost as accurate as the best heuristic methods, but far more concise, and hence more explainable.

Submitted 29 July, 2020; originally announced July 2020.

arXiv:1907.02509 [pdf, other]

On Validating, Repairing and Refining Heuristic ML Explanations

Authors: Alexey Ignatiev, Nina Narodytska, Joao Marques-Silva

Abstract: Recent years have witnessed a fast-growing interest in computing explanations for the predictions of Machine Learning (ML) models. For non-interpretable ML models, the most commonly used approaches for computing explanations are heuristic in nature. In contrast, recent work proposed rigorous approaches for computing explanations, which hold for a given ML model and prediction over the entire instance space. This paper extends earlier work to the case of boosted trees and assesses the quality of explanations obtained with state-of-the-art heuristic approaches. On most of the datasets considered, and for the vast majority of instances, the explanations obtained with heuristic approaches are shown to be inadequate when the entire instance space is (implicitly) considered.

Submitted 4 July, 2019; originally announced July 2019.

arXiv:1811.10656 [pdf, other]

Abduction-Based Explanations for Machine Learning Models

Authors: Alexey Ignatiev, Nina Narodytska, Joao Marques-Silva

Abstract: The growing range of applications of Machine Learning (ML) in a multitude of settings motivates the ability of computing small explanations for predictions made. Small explanations are generally accepted as easier for human decision makers to understand. Most earlier work on computing explanations is based on heuristic approaches, providing no guarantees of quality, in terms of how close such solutions are from cardinality- or subset-minimal explanations. This paper develops a constraint-agnostic solution for computing explanations for any ML model. The proposed solution exploits abductive reasoning, and imposes the requirement that the ML model can be represented as sets of constraints using some target constraint reasoning system for which the decision problem can be answered with some oracle. The experimental results, obtained on well-known datasets, validate the scalability of the proposed approach as well as the quality of the computed solutions.

Submitted 26 November, 2018; originally announced November 2018.

arXiv:1803.04646 [pdf, ps, other]

On Cryptographic Attacks Using Backdoors for SAT

Authors: Alexander Semenov, Oleg Zaikin, Ilya Otpuschennikov, Stepan Kochemazov, Alexey Ignatiev

Abstract: Propositional satisfiability (SAT) is at the nucleus of state-of-the-art approaches to a variety of computationally hard problems, one of which is cryptanalysis. Moreover, a number of practical applications of SAT can only be tackled efficiently by identifying and exploiting a subset of the formula's variables called a backdoor set (or simply backdoors). This paper proposes a new class of backdoor sets for SAT used in the context of cryptographic attacks, namely guess-and-determine attacks. The idea is to identify the best set of backdoor variables subject to a statistically estimated hardness of the guess-and-determine attack using a SAT solver. Experimental results on weakened variants of the renowned encryption algorithms exhibit an advantage of the proposed approach compared to the state of the art in terms of the estimated hardness of the resulting guess-and-determine attacks.

Submitted 13 March, 2018; originally announced March 2018.

arXiv:1707.01972 [pdf, other]

Model Based Diagnosis of Multiple Observations with Implicit Hitting Sets

Authors: Alexey Ignatiev, Antonio Morgado, Joao Marques-Silva

Abstract: Model based diagnosis finds a growing range of practical applications, and significant performance-wise improvements have been achieved in recent years. Some of these improvements result from formulating the problem with maximum satisfiability (MaxSAT). Whereas recent work focuses on analyzing failing observations separately, it is also the case that in practical settings there may exist many failing observations. This paper first investigates the drawbacks of analyzing failing observations separately. It then shows that existing solutions do not scale for large systems. Finally, the paper proposes a novel approach for diagnosing systems with many failing observations. The proposed approach is based on implicit hitting sets and so is tightly related with the original seminal work on model based diagnosis. The experimental results demonstrate not only the importance of analyzing multiple observations simultaneously, but also the significance of the implicit hitting set approach.

Submitted 6 July, 2017; originally announced July 2017.

arXiv:1705.05335 [pdf, other]

Horn Maximum Satisfiability: Reductions, Algorithms & Applications

Authors: Joao Marques-Silva, Alexey Ignatiev, Antonio Morgado

Abstract: Recent years have witnessed remarkable performance improvements in maximum satisfiability (MaxSAT) solvers. In practice, MaxSAT algorithms often target the most generic MaxSAT formulation, whereas dedicated solvers, which address specific subclasses of MaxSAT, have not been investigated. This paper shows that a wide range of optimization and decision problems are either naturally formulated as MaxSAT over Horn formulas, or permit simple encodings using Horn MaxSAT. Furthermore, the paper also shows how linear time decision procedures for Horn formulas can be used for developing novel algorithms for the Horn MaxSAT problem.

Submitted 15 May, 2017; originally announced May 2017.

arXiv:1705.01477 [pdf, other]

On Tackling the Limits of Resolution in SAT Solving

Authors: Alexey Ignatiev, Antonio Morgado, Joao Marques-Silva

Abstract: The practical success of Boolean Satisfiability (SAT) solvers stems from the CDCL (Conflict-Driven Clause Learning) approach to SAT solving. However, from a propositional proof complexity perspective, CDCL is no more powerful than the resolution proof system, for which many hard examples exist. This paper proposes a new problem transformation, which enables reducing the decision problem for formulas in conjunctive normal form (CNF) to the problem of solving maximum satisfiability over Horn formulas. Given the new transformation, the paper proves a polynomial bound on the number of MaxSAT resolution steps for pigeonhole formulas. This result is in clear contrast with earlier results on the length of proofs of MaxSAT resolution for pigeonhole formulas. The paper also establishes the same polynomial bound in the case of modern core-guided MaxSAT solvers. Experimental results, obtained on CNF formulas known to be hard for CDCL SAT solvers, show that these can be efficiently solved with modern MaxSAT solvers.

Submitted 5 July, 2017; v1 submitted 3 May, 2017; originally announced May 2017.

arXiv:1612.07559 [pdf, other]

Approach for modelling quantum-mechanical collapse

Authors: A. Yu. Ignatiev

Abstract: A long-standing quantum-mechanical puzzle is whether the collapse of the wave function is a real physical process or simply an epiphenomenon. This puzzle lies at the heart of the measurement problem. One way to choose between the alternatives is to assume that one or the other is correct and attempt to draw physical, observable consequences which then could be empirically verified or ruled out. As a working hypothesis, we propose simple models of collapse as a real physical process for direct binary symmetric measurements made on one particle. This allows one to construct irreversible unstable Schrödinger equations capable of describing continuously the process of collapse induced by the interaction of the quantum system with the measuring device. Due to unknown initial conditions the collapse outcome remains unpredictable so no contradictions with quantum mechanics arise. Our theoretical framework predicts a finite time-scale of the collapse and links with experiment. Sensitive probes of the collapse dynamics could be done using Bose-Einstein condensates, ultracold neutrons or ultrafast optics. If confirmed, the formulation could be relevant to the transition from quantum fluctuations to classical inhomogeneities in early cosmology and to establishing the ultimate limits on the speed of quantum computation and information processing.

Submitted 22 December, 2016; originally announced December 2016.

Comments: 28 pages, 5 figures, much expanded companion paper to arXiv:1204.3373 [quant-ph]

arXiv:1604.08229 [pdf, other]

Propositional Abduction with Implicit Hitting Sets

Authors: Alexey Ignatiev, Antonio Morgado, Joao Marques-Silva

Abstract: Logic-based abduction finds important applications in artificial intelligence and related areas. One application example is in finding explanations for observed phenomena. Propositional abduction is a restriction of abduction to the propositional domain, and complexity-wise is in the second level of the polynomial hierarchy. Recent work has shown that exploiting implicit hitting sets and propositional satisfiability (SAT) solvers provides an efficient approach for propositional abduction. This paper investigates this earlier work and proposes a number of algorithmic improvements. These improvements are shown to yield exponential reductions in the number of SAT solver calls. More importantly, the experimental results show significant performance improvements compared to the best approaches for propositional abduction.

Submitted 27 April, 2016; originally announced April 2016.

arXiv:1408.3059 [pdf, ps, other]

doi 10.1139/cjp-2014-0164

Testing MOND on Earth

Authors: A. Yu. Ignatiev

Abstract: MOND is one of the most popular alternatives to Dark Matter (DM). While efforts to directly detect DM in laboratories have been steadily pursued over the years, the proposed Earth-based tests of MOND are still in their infancy. Some proposals that recently appeared in the literature are briefly reviewed, and it is argued that collaborative efforts of theorists and experimenters are needed to move forward in this exciting new area. Possible future directions are outlined.

Submitted 13 August, 2014; originally announced August 2014.

Comments: 9 pages, Invited paper to appear in the special issue of the Canadian Journal of Physics

arXiv:1204.3373 [pdf, ps, other]

doi 10.1088/1742-6596/410/1/012153

How fast is the wave function collapse?

Authors: A. Yu. Ignatiev

Abstract: Using complex quantum Hamilton-Jacobi formulation, a new kind of non-linear equations is proposed that have almost classical structure and extend the Schroedinger equation to describe the collapse of the wave function as a finite-time process. Experimental bounds on the collapse time are reported (of order 0.1 ms to 0.1 ps) and its convenient dimensionless measure is introduced. This parameter helps to identify the areas where sensitive probes of the possible collapse dynamics can be done. Examples are experiments with Bose-Einstein condensates, ultracold neutrons or ultrafast optics.

Submitted 22 February, 2013; v1 submitted 16 April, 2012; originally announced April 2012.

Comments: 9 pages; v2: a shorter version to suit the 4 page limit of Proceedings of International Conference on Mathematical Modelling in Physical Sciences, 3-7 September 2012, Budapest, Hungary (IC-MSQUARE 2012)

Journal ref: J. Phys.: Conf. Ser. 410 (2013) 012153

arXiv:1204.2670 [pdf, ps, other]

doi 10.1103/PhysRevB.83.125415

Confined bulk states as a long-range sensor for impurities and a transfer channel for quantum information

Authors: Oleg O. Brovko, Pavel A. Ignatiev, Valeri S. Stepanyuk

Abstract: We show that confinement of bulk electrons can be observed at low-dimensional surface structures and can serve as a long-range sensor for the magnetism and electronic properties of single impurities or as a quantum information transfer channel with large coherence lengths. Our ab initio calculations reveal oscillations of electron density in magnetic chains on metallic surfaces and help to unambiguously identify the electrons involved as bulk electrons. We furthermore discuss the possibility of utilizing bulk state confinement to transfer quantum information, encoded in an atom's species or spin, across distances of several nanometers with high efficiency.

Submitted 12 April, 2012; originally announced April 2012.

Comments: 5 pages, 2 figures

Journal ref: Phys. Rev. B 83, 125415 (2011)

arXiv:1102.5702 [pdf, ps, other]

doi 10.1142/S0217751X11054528

Two photon decay of Z' as a probe of Bose symmetry violation at the CERN LHC

Authors: S. N. Gninenko, A. Yu. Ignatiev, V. A. Matveev

Abstract: The question of whether Bose statistics is broken at the TeV scale is discussed. The decay of a new heavy spin 1 gauge boson Z' into two photons, Z' -> 2 gamma, is forbidden by Bose statistics among other general principles of quantum field theory (Landau-Yang theorem). We point out that the search for this decay can be effectively used to probe Bose symmetry violation at the CERN LHC.

Submitted 16 October, 2011; v1 submitted 28 February, 2011; originally announced February 2011.

Comments: 10 pages, 3 figures. Published version, but with extended introductory discussion

Journal ref: Int.J.Mod.Phys. A26 (2011) 4367-4385

arXiv:0804.3298 [pdf, ps, other]

doi 10.1103/PhysRevLett.101.036809

Tailoring exchange interactions in engineered nanostructures: Ab initio study

Authors: O. O. Brovko, P. A. Ignatiev, V. S. Stepanyuk, P. Bruno

Abstract: We present a novel approach to spin manipulation in atomic-scale nanostructures. Our ab initio calculations clearly demonstrate that it is possible to tune magnetic properties of sub-nanometer structures by adjusting the geometry of the system. By the example of two surface-based systems we demonstrate that (i) the magnetic moment of a single adatom coupled to a buried magnetic Co layer can be stabilized in either a ferromagnetic or an antiferromagnetic configuration depending on the spacer thickness. It is found that a buried Co layer has a profound effect on the exchange interaction between two magnetic impurities on the surface. (ii) The exchange interaction between magnetic adatoms can be manipulated by introducing artificial nonmagnetic Cu chains to link them.

Submitted 21 April, 2008; originally announced April 2008.

Comments: 4 pages, submitted to PRL

arXiv:0802.1599 [pdf, other]

doi 10.1103/PhysRevD.77.102001

Newton's second law versus modified-inertia MOND: a test using the high-latitude effect

Authors: A. Yu. Ignatiev

Abstract: The modified-inertia MOND is an approach that proposes a change in Newton's second law at small accelerations as an alternative to dark matter. Recently it was suggested that this approach can be tested in terrestrial laboratory experiments. One way of doing the test is based on the Static High-Latitude Equinox Modified Inertia (SHLEM) effect: around each equinox date, 2 spots emerge on the Earth where static bodies experience spontaneous displacement due to the violation of Newton's second law required by the modified-inertia MOND. Here, a detailed theory of this effect is developed and estimates of the magnitude of the signal due to the effect are obtained. The expected displacement of a mirror in a gravitational wave interferometer is found to be about 10^{-14} m. Some experimental aspects of the proposal are discussed.

Submitted 12 February, 2008; originally announced February 2008.

Comments: 15 pages, 1 figure

Journal ref: Phys.Rev.D77:102001,2008

arXiv:0709.1631 [pdf, ps, other]

doi 10.1103/PhysRevLett.99.246102

Size-dependent Surface States on Strained Cobalt Nanoislands on Cu(111)

Authors: M. V. Rastei, B. Heinrich, L. Limot, P. A. Ignatiev, V. S. Stepanyuk, P. Bruno, J. P. Bucher

Abstract: Low-temperature scanning tunneling spectroscopy over Co nanoislands on Cu(111) showed that the surface states of the islands vary with their size. Occupied states exhibit a sizeable downward energy shift as the island size decreases. The position of the occupied states also significantly changes across the islands. Atomic-scale simulations and ab initio calculations demonstrate that the driving force for the observed shift is related to size-dependent mesoscopic relaxations in the nanoislands.

Submitted 11 September, 2007; originally announced September 2007.

Comments: 4 pages, 4 figures

arXiv:gr-qc/0612159 [pdf, ps, other]

doi 10.1103/PhysRevLett.98.101101

Is violation of Newton's second law possible?

Authors: A. Yu. Ignatiev

Abstract: Astrophysical observations (usually explained by dark matter) suggest that classical mechanics could break down when the acceleration becomes extremely small (the approach known as modified Newtonian dynamics, or MOND). I present the first analysis of MOND manifestations in terrestrial (rather than astrophysical) settings. A new effect is reported: around each equinox date, 2 spots emerge on the Earth where static bodies experience spontaneous acceleration due to the possible violation of Newton's second law. Preliminary estimates indicate that an experimental search for this effect can be feasible.

Submitted 11 March, 2007; v1 submitted 25 December, 2006; originally announced December 2006.

Comments: 10 pages; minor changes to match the published version

Journal ref: Phys.Rev.Lett.98:101101,2007

arXiv:cond-mat/0602507 [pdf]

doi 10.1103/PhysRevLett.98.146403

Evidance for an Oxygen Diffusion Model for the Electric Pulse Induced Resistance Change Effect in Oxides

Authors: Y. B. Nian, J. Strozier, N. J. Wu, X. Chen, A. Ignatiev

Abstract: Electric pulse induced resistance (EPIR) switching hysteresis loops for Pr0.7Ca0.3MnO3 (PCMO) perovskite oxide films were found to exhibit an additional sharp "shuttle peak" around the negative pulse maximum for films deposited in an oxygen deficient ambient. The device resistance hysteresis loop consists of stable high resistance and low resistance states, and transition regions between them. The resistance relaxation of the "shuttle peak" and its temperature behavior as well as the resistance relaxation in the transition regions were studied, and indicate that the resistance switching relates to oxygen diffusion with activation energy about 0.4 eV. An oxygen diffusion model with the oxygen ions (vacancies) as the active agent is proposed for the non-volatile resistance switching effect in PCMO.

Submitted 22 February, 2006; v1 submitted 21 February, 2006; originally announced February 2006.

Comments: 7 pages, 5 figures

Journal ref: refer to Physical Review Letters, 98, 146403 (2007)

arXiv:cond-mat/0601451 [pdf]

doi 10.1063/1.2236213

Spatially extended nature of resistive switching in perovskite oxide thin films

Authors: Xin Chen, NaiJuan Wu, John Strozier, Alex Ignatiev

Abstract: We report the direct observation of the electric pulse induced resistance-change (EPIR) effect at the nano scale on La1-xSrxMnO3 (LSMO) thin films by the current measurement AFM technique. After a switching voltage of one polarity is applied across the sample by the AFM tip, the conductivity in a local nanometer region around the AFM tip is increased, and after a switching voltage of the opposite polarity is applied, the local conductivity is reduced. This reversible resistance switching effect is observed under both continuous and short pulse voltage switching conditions. It is important for future nanoscale non-volatile memory device applications.

Submitted 19 January, 2006; originally announced January 2006.

Comments: 11 pages, 3 figures

Journal ref: Applied Physics Letters, 89 (2006) 063507

arXiv:hep-ph/0510209 [pdf, ps, other]

doi 10.1016/j.physleta.2006.05.083

Neutrino statistics and non-standard commutation relations

Authors: A. Yu. Ignatiev, V. A. Kuzmin

Abstract: Recently it was suggested that the neutrino may violate the Pauli exclusion principle (PEP). This renews interest in the systematic search for bilinear commutation relations that could describe deviations from PEP. In the context of this search we prove a no-go theorem which forbids a finite occupancy limit for an arbitrary system with a bilinear commutation relation. In other words, either the upper limit on the occupancy number is 1 (the ordinary fermionic case) or there is no upper limit at all. Some examples of the latter class include the usual Bose statistics, as well as non-standard quon statistics and infinite statistics.

Submitted 16 October, 2005; originally announced October 2005.

Comments: 11 pages, RevTeX4

Journal ref: Phys.Lett. A359 (2006) 26-30

arXiv:cond-mat/0510060 [pdf]

doi 10.1143/JJAP.45.1602

Buffer-Enhanced Electrical-Pulse-Induced-Resistive Memory Effect in Thin Film Perovskites

Authors: Xin Chen, Naijuan Wu, Alex Ignatiev, Qing Chen, Yue Zhang

Abstract: A multilayer perovskite thin film resistive memory device has been developed comprised of: a Pr0.7Ca0.3MnO3 (PCMO) perovskite oxide epitaxial layer on a YBCO bottom thin film electrode; a thin yttria stabilized zirconia (YSZ) buffer layer grown on the PCMO layer, and a gold thin film top electrode. The multi-layer thin film lattice structure has been characterized by XRD and TEM analyses showing a high quality heterostructure. I-AFM analysis indicated nano granular conductivity distributed uniformly throughout the PCMO film surface. With the addition of the YSZ buffer layer, the pulse voltage needed to switch the device is significantly reduced and the resistance-switching ratio is increased compared to a non-buffered resistance memory device, which is very important for the device fabrication. The magnetic field effect on the multilayer structure resistance at various temperatures shows CMR behavior for both high and low resistance states implying a bulk material component to the switch behavior.

Submitted 3 October, 2005; originally announced October 2005.

Comments: 16 pages, 4 figures

Journal ref: Japanese Journal of Applied Physics, 45(3A) 1602 (2006)

arXiv:cond-mat/0510059 [pdf]

doi 10.1088/1367-2630/8/10/229

A Study of Apparent Symmetry Breakdown in Perovskite Oxide-based Symmetric RRAM Devices

Authors: X. Chen, J. Strozier, N. J. Wu, A. Ignatiev, Y. B. Nian

Abstract: A new model of a symmetric two-terminal non-volatile RRAM device based on Perovskite oxide thin film materials, specifically Pr1-xCaxMnO3 (PCMO), is proposed and analyzed. The model consists of two identical half-parts, which are completely characterized by the same resistance versus pulse voltage hysteresis loop, connected together in series. Even though the modeled device is physically symmetric with respect to the direction of current, it is found to exhibit switching of the resistance with the application of voltage pulses of sufficient amplitude and of different polarities. The apparent breakdown of parity conservation of the device is attributed to changes in resistance of the active material layer near the electrodes during switching. Thus the switching is history dependent, a feature that can be very useful for the construction of real non-volatile memory devices. An actual symmetric device, not previously reported in the literature and based on the proposed model, is fabricated in the PCMO material system. Measurements of the resistance of this new device generated an experimental hysteresis curve that matches well the calculated hysteresis curve of the model, thus confirming the features predicted by the new symmetric model.

Submitted 3 October, 2005; originally announced October 2005.

Comments: 13 pages, 4 figures

Journal ref: refer to New Journal of Physics, 8 (2006) 229.

arXiv:hep-ph/0509258 [pdf, ps, other]

doi 10.1016/j.radphyschem.2005.10.040

X rays test the Pauli exclusion principle

Authors: A. Yu. Ignatiev

Abstract: Since the publication of the models describing a small violation of the Pauli exclusion principle (PEP) there has been an explosion of worldwide interest in PEP tests and related theories. PEP forbids an atom to have more than 2 electrons in the K-shell. If PEP is slightly violated, a third electron can occasionally join in. This would result in an anomalous X-ray emission. A high-sensitivity experiment places an upper limit of the order of 10^{-26} on the PEP violating parameter. I will outline the main theoretical and experimental ideas in this new exciting area.

Submitted 23 September, 2005; originally announced September 2005.

Comments: 12 pages, RevTeX4, Invited talk at the 20th International Conference on X-ray and Inner-shell Processes (Melbourne, Australia, 4-8 July, 2005)

arXiv:hep-ph/0509255 [pdf, ps, other]

Evolving Fundamental Constants and Metrology

Authors: A. Yu. Ignatiev, B. J. Carson

Abstract: Astrophysical observations suggest that the fine structure constant (alpha) may (or may not) be evolving over the cosmological time scale. This raises a much debated question: is alpha variation due to the variation of the speed of light (c), elementary electric charge (e), or the Planck constant (h)? Previously, we proposed the metrological approach based on the analysis of the relationships between the fundamental units (e.g. of the length and time) and the fundamental constants. Our methodology allows one to find how each of the fundamental constants e, c, h evolves in time and offers a new outlook for this area. Here we give a brief outline of this approach and the main results it produces.

Submitted 23 September, 2005; originally announced September 2005.

Comments: 5 pages, RevTeX4, presented at the 16th National Congress of the Australian Institute of Physics (Canberra, 31 January - 4 February 2005)

Journal ref: Proceedings of the 16th National Congress of the Australian Institute of Physics (Canberra, 31 January - 4 February 2005), p.164-166

arXiv:cond-mat/0507432 [pdf]

doi 10.1063/1.2139843

Resistance profile measurements on a symmetric electrical pulse induced resistance change device

Authors: X. Chen, J. Strozier, N. J. Wu, A. Ignatiev

Abstract: We report the first direct measurements of the micro scale resistance profile between the terminals of a two terminal symmetric thin film Pr0.7Ca0.3MnO3 electrical pulse induced resistance change device composed of a Pr0.7Ca0.3MnO3 active layer. The symmetric device is one in which the electrode shape, size, composition, and deposition processing are identical. We show that under certain limitations of pulse switching voltage, such a symmetric electrical pulse induced resistance change device can exhibit either no net device resistance switching at room temperature, or bipolar switching with the resistance hysteresis curve exhibiting a "table leg" structure. The resistance measurements are made using surface scanning Kelvin probe microscopy, which allows for the measurement of the profile of resistance from one electrode, across the Pr0.7Ca0.3MnO3 material and into the second electrode, both before resistance switching and after switching. The results show that resistance switching in the symmetric device occurs primarily in the interface region within about 1 to 3 micron of the electrical contact surface. Resistance switching is also observed in the bulk Pr0.7Ca0.3MnO3 material although at a lower level. Symmetry considerations for a two terminal symmetric device that can switch resistance are discussed, and the data reported here is consistent with the symmetric model previously developed.

Submitted 19 July, 2005; v1 submitted 18 July, 2005; originally announced July 2005.

Comments: 14 pages, 4 figures

Journal ref: refer to Appl. Phys. Lett. 87, 233506 (2005)

arXiv:hep-ph/0506246 [pdf, ps, other]

doi 10.1016/j.physletb.2005.11.050

Possible new interactions of neutrino and the KATRIN experiment

Authors: A. Yu. Ignatiev, B. H. J. McKellar

Abstract: We analyse the possible role of new interactions of neutrino in the forthcoming tritium beta decay experiment KATRIN aimed at detecting the neutrino mass with the sensitivity of 0.3 - 0.2 eV. It is shown that under certain circumstances the standard procedure of data analysis would have to be modified by the introduction of an extra parameter describing the strength of the new interactions. Our model simulations show that the modified procedure may improve the quality of the fit compared with the standard case. Ignoring the possibility of new interactions may lead to a systematic error in the neutrino mass determination.

Submitted 25 July, 2005; v1 submitted 24 June, 2005; originally announced June 2005.

Comments: 7 pages, 1 figure, revtex 4; corrected typos, a minor stylistic change in conclusions

Journal ref: Phys.Lett. B633 (2006) 89-92

arXiv:hep-ph/0312111 [pdf, ps, other]

doi 10.1016/j.physleta.2004.09.031

Metrological constraints on the variability of the fundamental constants $e$, $\hbar$, and $c$

Authors: A. Yu. Ignatiev, B. J. Carson

Abstract: We set up a framework for a model-independent analysis of the time variation of $e$, $\hbar$, and $c$ individually. It is shown that the time-evolution of each constant can be determined uniquely from the time evolution of the fine structure constant $α$ provided that the choice of basic time-independent units (i.e., the clock and ruler) is fixed. Realistic systems of units are considered as examples and implications for metrology are discussed.

Submitted 16 September, 2004; v1 submitted 8 December, 2003; originally announced December 2003.

Comments: 8 pages, RevTex, revised references and two typos corrected

Journal ref: Phys.Lett. A331 (2004) 361-365

arXiv:hep-ph/0308126 [pdf, ps, other]

doi 10.1142/S0217751X0502495X

Spectator Effects in the Decay B -> K γγ

Authors: A. Yu. Ignatiev, G. C. Joshi, B. H. J. McKellar

Abstract: We report the results of the first computation related to the study of the spectator effects in the rare decay mode $B\to K γγ$ within the framework of the Standard Model. It is found that the account of these effects results in the enhancement factor for the short-distance reducible contribution to the branching ratio.

Submitted 11 August, 2003; originally announced August 2003.

Comments: 5 pages, 5 figures, RevTeX 4

Journal ref: Int.J.Mod.Phys. A20 (2005) 4079-4084

arXiv:hep-ph/0306120 [pdf, ps, other]

Mirror matter

Authors: A. Yu. Ignatiev, R. R. Volkas

Abstract: One of the deepest unsolved puzzles of subatomic physics is why Nature prefers the left particles to the right ones. Mirror matter is an attempt to understand this mystery by assuming the existence of a "parallel" world where this preference is exactly opposite. Thus in the Universe consisting of the ordinary and the mirror matter the symmetry between the left and right is completely restored. Mirror matter is constrained to interact with us only very weakly. Still, its existence can be inferred by using experimental evidence such as the observation of astrophysical objects related to the dark matter (MACHO), neutrino physics and other sources. This talk will focus on several key aspects of mirror matter physics including the possible existence of mirror matter inside the Earth and the suggestion that the recently observed "isolated" planets may in fact be orbiting around mirror stars.

Submitted 13 June, 2003; originally announced June 2003.

Comments: 9 pages, revtex, Talk given by A. Yu. Ignatiev at the 15th Biennial Congress of the Australian Institute of Physics (Sydney, July 2002)

arXiv:hep-ph/0304260 [pdf, ps, other]

doi 10.1103/PhysRevD.68.023518

Mirror dark matter and large scale structure

Authors: A. Yu. Ignatiev, R. R. Volkas

Abstract: Mirror matter is a dark matter candidate. In this paper, we re-examine the linear regime of density perturbation growth in a universe containing mirror dark matter. Taking adiabatic scale-invariant perturbations as the input, we confirm that the resulting processed power spectrum is richer than for the more familiar cases of cold, warm and hot dark matter. The new features include a maximum at a certain scale $λ_{max}$, collisional damping below a smaller characteristic scale $λ'_S$, with oscillatory perturbations between the two. These scales are functions of the fundamental parameters of the theory. In particular, they decrease for decreasing $x$, the ratio of the mirror plasma temperature to that of the ordinary. For $x \sim 0.2$, the scale $λ_{max}$ becomes galactic. Mirror dark matter therefore leads to bottom-up large scale structure formation, similar to conventional cold dark matter, for $x \stackrel{<}{\sim} 0.2$. Indeed, the smaller the value of $x$, the closer mirror dark matter resembles standard cold dark matter during the linear regime. The differences pertain to scales smaller than $λ'_S$ in the linear regime, and generally in the non-linear regime because mirror dark matter is chemically complex and to some extent dissipative. Lyman-$α$ forest data and the early reionisation epoch established by WMAP may hold the key to distinguishing mirror dark matter from WIMP-style cold dark matter.

Submitted 30 April, 2003; v1 submitted 28 April, 2003; originally announced April 2003.

Comments: 17 pages, 4 figures; minor changes, reference added

Journal ref: Phys.Rev.D68:023518,2003

Showing 1–50 of 65 results for author: Ignatiev, A