Keywords

1 Is Trustworthy AI Achievable?

The accelerated enthusiasm of all walks of life for Artificial Intelligence and its applications has been both blessing and preoccupying. The AI solutions, backed by private and public sector interests have opened different and interesting avenues for how to address societal as well as business challenges, which were not thought possible.

As usual for any new technological advances, there have been many lessons learned, as a direct result of application of AI solutions in varied contexts. The lessons have shown us that, AI solutions, much more than any other type of solutions, are affected by the cognitive biases, inherent in the way, their human designers, developers and implementers among others, interact with their end-users. Every solution is biased in that the design choices, underlying its creation, naturally includes and excludes some groups of end-users. In the case of AI solutions, their inherent biased-by-design scope have been much faster detected and criticized, due to their rapid advances, achieved reach and application contexts. As a novel approach, adjectives such as Trustworthy and Ethical have been used in describing what AI solutions must be.

As a step towards this, European Union (EU) commissionā€™s High-level Expert Group on AI (AI HLEG) has formulated a framework for Trustworthy AIĀ [8]. They define Trustworthy AI as follows:

Trustworthy Al has three components: (1) it should be lawful, ensuring compliance with all applicable laws and regulations (2) it should be ethical, demonstrating respect for, and ensure adherence to, ethical principles and values and (3) it should be robust, both from a technical and social perspective, since, even with good intentions, AI systems can cause unintentional harm. Trustworthy Al concerns not only the trustworthiness of the Al system itself but also comprises the trustworthiness of all processes and actors that are part of the systemā€™s life cycle.

As well AI HLEG defines Ethical AI as:

The development, deployment and use of Al that ensures compliance with ethical norms, including fundamental rights as special moral entitlements, ethical principles and related core values. It is the second of the three core elements necessary for achieving Trustworthy Al.

Furthermore, they define the following non-exhaustive list of requirements, needed to ensure Trustworthy AI. The requirements include systemic, individual and societal factors:

  1. 1.

    Human agency and oversight, including fundamental rights, human agency and human oversight.

  2. 2.

    Technical robustness and safety, including resilience to attack and security, fall back plan and general safety, accuracy, reliability and reproducibility.

  3. 3.

    Privacy and data governance, including respect for privacy, quality and integrity of data, and access to data.

  4. 4.

    Transparency, including traceability, explainability and communication.

  5. 5.

    Diversity, non-discrimination and fairness, including the avoidance of unfair bias, accessibility and universal design, and stakeholder participation.

  6. 6.

    Societal and environmental wellbeing, including sustainability and environmental friendliness, social impact, society and democracy.

  7. 7.

    Accountability, including auditability, minimization and reporting of negative impact, trade-offs and redress.

These seven requirements are all interrelated in that they are ā€œall of equal importance, support each other, and should be implemented and evaluated throughout the Al systemā€™s life-cycleā€Ā [8]. This is depicted in Fig.Ā 1. As well, the AI HLEG has presented technical and non-technical methods to realize Trustworthy AI (Fig.Ā 2).

Fig. 1.
figure 1

(adopted fromĀ [8])

The interrelation of the 7 requirements of Trustworthy AI

Fig. 2.
figure 2

(sourceĀ [8])

Realizing Trustworthy AI throughout the systemā€™s entire life cycle

The focus on the design and design choices behind the development of AI solutions is inherent in the way the concept of Trustworthy AI has been handled in AI HLEG guidelines.

In our opinion, here lies the biggest challenge to achieving these guidelinesā€™ goals. The guidelines develop a set of check-lists and principals, which must be observed, whenever any taken design decision is to be examined for its ethical character and trustworthiness. This could be the task of a group of developers or internal ā€œAI auditorsā€. In any case, the dynamics of how such groups act have been studied by pioneers such as Nobel laureate, Kenneth ArrowĀ [1], and philosophers and economists such as Allan GibbardĀ [2,3,4], Mark SatterwaitheĀ [7], and Aanund HyllandĀ [6], as well as Cognitive Scientists such as GƤrdenforsĀ [5] to mention a few.

For instance, if we have, for a specific design factor, \(n\ \)alternatives, \(A_1\) to \(A_n\), where the members of the committee (or development team) are asked to present an ordering \(\succ \) to show their preference, e.g. \(A_1 \succ \ldots \succ A_n\),Ā [1] has shown (Theorem 3 in Sect.Ā 2, below) that, as long as;

  1. 1.

    There are at least 3 possible outcomes

  2. 2.

    If everybody in the group prefers, for instance, an alternative \(A_i\) to another alternative \(A_j\), then the result of the voting would also reflect this unanimity of preferences

  3. 3.

    Each voterā€™s preference between any two alternatives \(A_i\) and \(A_j\), is indifferent to the preferences of other alternatives

Then the voting process would be dictatorial, i.e. among the voters, there would be a dictator, in the sense of a person, whose choices and preferences dictates what the committee would decide, regardless of how other members of the committee are voting.

Even if we change the scheme, so the alternatives are ordered from most preferred to the least one as above, but, now, the alternatives are ranked using an ordinal system, we would still have challenges. Here, the alternatives are ordered, so the most preferred alternative (rank 1) is given the highest point (from an ordinal system), and so forth. Then the points are counted in a voting process to declare a winner. GibbardĀ [2] and SatterwaitheĀ [7] showed (TheoremĀ 2, Sect.Ā 2, below) that as long as there are at least 3 possible outcomes, then the voting process is either manipulable or dictatorial.

A manipulable voting process is one, where the voters can vote tactically. This means that they, realizing that their preferred or sincere ordering of the alternatives would not result in what serves them best, would change their vote (i.e. an insincere vote) so to achieve the result that is closest to their preference.

In above situations, the alternatives are ordered strictly, so the voters cannot order them in a way that there are ties among the alternatives (i.e. not a strict order). A more realistic situation would allow a not-strict ordering of alternatives. Even for these situations, GibbardĀ [2] showed (TheoremĀ 1, Sect.Ā 2, below) that if there are at least 3 outcomes, and the committee does not have a dictator, then the voting is manipulable.

Furthermore, GƤrdenforsĀ [5] has shown (TheoremĀ 4, Sect.Ā 2 below), that any democratic voting process with at least 3 voters is manipulable!

Here a voting process is democratic, if it is anonymous (i.e. the voting process treats every voter the same way), it is neutral (i.e. the voting process treats every alternative the same way), and it satisfies the Condorcet criterion (i.e. majority rules).

The results on the systemic issues of such decision making processes are many. One has even looked at situations where some of the alternatives are dependent of or discovered by chance (e.g.Ā [4] andĀ [6]). In the next section the above mentioned results are described in a more technical terms.

2 Dictators and Tactical Voters in a Decision-Making Context

In the following, we follow the definitions and technical format ofĀ [2].

Definition 1

Game form. A game form is characterized by:

  1. (i)

    A set X,Ā  whose members are called possible outcomes, or simply outcomes. Unless otherwise stated, variables x,Ā y,Ā  and z will range over outcomes.

  2. (ii)

    A positive integer n,Ā  called the number of players. The n players will be denoted by the integers 1 to n, and variables i,Ā j,Ā  and k will range over these integers.

  3. (iii)

    n sets \(S_{i},\) one for each i. For each i the members of \(S_{i}\) are called strategies for i. The word ā€œstrategy,ā€ then, refers here to what in game theory is usually called a ā€œpure strategy.ā€ An n-tuple \((s_{1}, \ldots , s_{n}),\) with \(s_{1} \in S_{1}, \ldots , s_{n} \in S_{n}\) will be called a strategy n-tuple. Strategy n-tuples will be indicated by bold-face small letters on the pattern \(\mathbf{s} =( s_{1}, \ldots , s_{n}), \mathbf{s} ^{\prime }=( s_{1}^{\prime }, \ldots , s_{n}^{\prime }),\) and so forth.

  4. (iv)

    A function g, defined for every strategy n-tuple, whose range is X. Strictly speaking, a game form is simply a function g which can be characterized as above. We can define a game form, then, as a function whose domain is the Cartesian product \(S_{1} \times \ldots \times S_{n}\) of a finite number of finite non-empty sets. Its values are called outcomes, its arguments are called strategy n-tuples, and a member of a set \(S_{i}\) is called a strategy for i.

Definition 2

Orderings

  1. (i)

    An ordering of a set Z is a two-place relation R between members of Z,Ā  (i.e. \(R \subseteq Z\times Z \)) such that for all x,Ā y and z in Z

    • \(\lnot (x R y \ \wedge \ y R x)\)

    • \(\quad x R z \Rightarrow (x R y \ \vee \ y R z)\)

    As such, we can define an ordering R of the set X of outcomes of a game form g.

  2. (ii)

    A preference ordering P is an ordering of X, the set of outcomes of a game form g. The relation xPy then means ā€œx is preferred to y ā€ under ordering P. For distinct x and y in X, we may have that \((x,y) \not \in P\), i.e. neither xPy nor yPx. In this case we say that x and y are indifferent under ordering P. As such P indicates strict preference between 2 elements of X.

  3. (iii)

    For any preference ordering P of set of outcomes X, we can define another binary relation \(R \subseteq X\times X\), such that: \(x R y \Longleftrightarrow \lnot y P x\). For any distinct x and y in X, the relation R indicates their preference or indifference.

  4. (iv)

    For any preference ordering P of set of outcomes X, we can define another binary relation \(I \subseteq X\times X\), such that: \(x I y \Longleftrightarrow (\lnot y P x \wedge \lnot x P y)\). For any distinct x and y in X, the relation I indicates their absolute indifference.

Remark 1

We will use the following notation. For any n-tuple \({\varvec{{s}}}=(s_{1}, \ldots , s_{n}),\) we indicate the result of the altering of its k-th place as follows:

\({\varvec{{s}}}\langle k / t \rangle = (s_{1}, \dots , s_{k-1}, t, s_{k+1}, \dots , s_{n}) \)

In other words, \({{\varvec{s}}}^{\prime }={{\varvec{s}}} \langle k / t \rangle \ \ \) iff \(\ \ \{ (s_{k}^{\prime }=t) \ \wedge \ (\forall i)\left[ i \ne k\ \Rightarrow \ s_{i}^{\prime }=s_{i}\right] \} \ \).

Definition 3

P -dominance. Where P is a preference ordering, a strategy t is P-dominant for k if for every strategy n-tuple \(\mathbf{s} \), we have that \(g(\mathbf{s} \langle k / t \rangle )\ R\ g(\mathbf{s} )\).

In other words, t is P-dominant for k iff no matter what strategies are fixed for everyone else, strategy t for k produces an outcome at least as high in preference ordering P as does any other. As such, the player k, by choosing the strategy t, needs not to think of other playersā€™ strategies. Strategy t serves the player kā€™s interests best.

Definition 4

Straightforward game form. A game form is straightforward if, for every preference ordering P and player k,Ā  there is a strategy which is P-dominant for k.

As such, in a straightforward game form, each player has a strategy, which serves his/her interests best, and it is independent of what any other player chooses as strategy. So the players need not play strategically (i.e. be attentive to what others do). Therefore, a straightforward game form is also called strategy-proof.

Definition 5

Manipulable game form. A manipulable game form is one which is not straightforward. In a manipulable game form, there exists a player k, who, given a preference ordering P, cannot find any strategy t, which is P-dominant for him/her. So given a game form g, a preference ordering P, and a strategy n-tuple \(\mathbf {s} =(s_{1}, \ldots , s_{n}),\) the player k needs to choose a strategy t, where even though \(s_{k} P t\), but by choosing this strategy, he/she would achieve \(g(\mathbf {s} \langle k / t \rangle )\ R\ g(\mathbf {s} )\).

Definition 6

Dictatorial game form. A player k is a dictator for game form g if, for every outcome x,Ā  there is a strategy s(x) for k such that for strategy n-tuple \(\mathbf {s} =(s_{1},\ldots , s_{k} \ldots , s_{n}),\) the \(g(\mathbf {s} )=x\) whenever \(s_{k}=s(x) .\) A game form g is dictatorial if there is a dictator for g.

Having a dictator in a game form means that there exists a player k whose choices are always the outcome of the game, no-matter what other players have chosen (i.e. their strategies).

Theorem 1

(GibbardĀ [2]). Every straightforward game form with at least three possible outcomes is dictatorial.

Definition 7

Voting scheme. A voting scheme is a game form v with set of possible outcomes X, such that for some set Z or set of alternatives with \(X \subseteq Z\), the set \(S_{i}\) of strategies open to each player i is the set of orderings of Z. We call this set \(\varPi _{Z}\). Then a voting scheme is a single valued function from \(\varPi ^{n}_{Z}\) to X, which given a n-tuple \(\mathbf {P} =(P_{1}, \ldots , P_{n}),\) returns a single possible outcome \(x \in X\).

In a voting scheme, an ordering \(P_{i}\) represents the ballet that voter i casts. The orderings can be fixed by an ordinal scheme, so the voter i, in his/her ordering \(P_{i}\), places his/her most preferred alternative as number 1, the second most preferred alternative as number 2, and so forth. Then, the voting scheme simulates a counting mechanisms of the ballets and their orderings, which results in naming an alternative as the winner.

Definition 8

Manipulable voting scheme. A voting scheme v is manipulable if for some voter k,Ā  and for some n-tuple \(\mathbf {P} \in \varPi ^{n}_{Z}\), there exists some ordering \(P^{*} \in \varPi _{Z}\), such that \(v(\mathbf{P} \langle k / P^{*}\rangle )\ P_{k} \ v(\mathbf{P} )\).

This means that for voter k, in the situation, which is represented by n-tuple \(\mathbf {P} =(P_{1}, \ldots , P_{k}, \ldots , P_{n})\), there exists another voting possibility (represented by ordering \(P^{*}\)), such that if the voter k changes \(P_{k}\) to \(P^{*}\), then the result of the voting would represent his/her interest best, in the sense of her original vote \(P_{k}\).

Definition 9

Dictatorial voting scheme. Player k is a dictator for a voting scheme v if, for every possible outcome \(x \in X,\) player k can choose an ordering P(x), so \(v(\mathbf {P} )=x\) whenever \(P_{k}=P(x),\) for any voting \(\mathbf {P} =(P_{1},\ldots , P_{k} \ldots , P_{n}).\) A voting scheme v is dictatorial if there is a dictator for it.

Theorem 2

(GibbardĀ [2], SatterthwaiteĀ [7]). Every voting scheme with at least three outcomes is either dictatorial or manipulable.

Definition 10

Social welfare function. A preference n-tuple over a set X is an n-tuple \(( P_{1}, \ldots , P_{n})\) whose terms are preference orderings of X, the set of possible outcomes. Preference n-tuples will be designated in bold-face type on the pattern \(\mathbf{P}\mathbf =(P_{1}, \ldots , P_{n}), \mathbf {P} ^{\prime }=(P_{1}^{\prime }, \ldots , P_{n}^{\prime })\) and so forth. A social welfare function is a function whose arguments, for some fixed n (or number of voters) and the set of possible outcomes X are all preference n-tuples \(\mathbf {P} \) over X,Ā  and whose values are preference orderings of X.

Given n voters and a set of possible outcomes X, a social welfare function \(f: \varPi ^{n}_{X}\longrightarrow \varPi _{X}\), is a mapping that takes a n-tuple \(\mathbf{P}\mathbf =(P_{1}, \ldots , P_{n}) \in \varPi ^{n}_{X}\) and delivers a preference ordering \(P\in \varPi _{X}\). Here \(\varPi _{X}\) represents the set of preference orderings on X.

Before we go further, we choose a notation that makes it easier to visualize the orderings. Let us use \(\succ \) for orderings of X. For alternatives x and y, the \(x \succ y\) signifies x is preferred to y. So if \(\succ \) is an ordering of \(x_{1}, x_{2}, \ldots , x_{m}\in X\) of m alternatives may look like this: \(x_{1}\succ x_{2}\succ \ldots \succ x_{m}\).

Theorem 3

(ArrowĀ [1]). Every social welfare function defined for a set of possible outcomes X, and n voters, violates one of following Arrow conditions:

  1. (i)

    Scope: X has at least three members.

  2. (ii)

    Unanimity or weak Pareto efficiency: If \(\ \mathbf {P }=(\succ _{1}, \ldots , \succ _{n})\) and \(\ \forall i.\ x \succ _{i} y,\ \) and \(f(\mathbf{P} ) = \ \succ \), then \(\ x \succ y\).

  3. (iii)

    Pairwise Determination: If for \(\ \mathbf {P}=(\succ _{1}, \ldots , \succ _{n})\) and \(\ \mathbf {P}^{\prime }=({\succ }^{\prime }_{1}, \ldots , {\succ }^{\prime }_{n}):\)

    • \(\forall i. \ \left[ x \succ _{i} y\ \Longleftrightarrow \ x \succ _{i}^{\prime } y \right] \ \ \) and \(\ \ \forall i. \ [y \succ _{i} x\ \Longleftrightarrow \ y \succ _{i}^{\prime } x] \)

    • \(f(\mathbf {P})= \ \succ \ \ \) and \(\ \ f\left( \mathbf {P}^{\prime }\right) = {\succ }^{\prime }\)

    then: \(x \succ y \ \Longleftrightarrow \ x \ {\succ }^{\prime } y\)

  4. (iv)

    Non-dictatorship: There is no dictator for f,Ā  where a dictator for f is a voter k such that for every preference ordering \(\succ \), and every x,Ā  \(y \in X\), if \(x \succ _{k} y\) and \(f(\mathbf{P} )=\ \succ ,\) then \(x \succ y\).

The Arrow conditions are a group of logical and fair conditions which can be put on any voting system. The Scope and non-Dictatorship conditions are self-explanatory. The Unanimity or weak Pareto efficiency condition says that if for any two possible outcomes x and y, every voter has the same preference, e.g. x is preferred to y, then the result of voting and the final order of preference of the candidates (i.e. what social welfare function returns) must also reflect that x is preferred to y. At the same time, the Pairwise Determination condition says that the order of preference between two candidates is not dependant of how the order of preference between other pairs of candidates is.

In the case of general game forms and voting schemes, we looked at the result of the game and/or scheme to be a single element x of X,Ā  the set of possible outcomes. In the case of social welfare functions, the result was a preference ordering \(\succ \) on \(X=\lbrace x_1, \ldots , x_m\rbrace \), where, for instance \(x_1 \succ \ldots \succ x_m\).

As such, the Gibbard-Satterwaithe result of TheoremsĀ 2 gives an impression that it is never possible to have a voting mechanism, which is non-manipulable, have more than 2 possible outcomes, and is non-dictatorial. Further investigation shows that the proof of TheoremĀ 2 is, very much dependent on the assumption that the voting process selects a single winnerĀ [5]. But this is not the case in the real world. Many times, we will have situations, where several alternatives are chosen, al par with each-other, as acceptable outcomes (i.e. tied winners). Then the tie is broken and a single winner is chosen by some extra-ordinary process (e.g. in alphabetic order, or by random chance).

In order to work with situations with possibility of tied winners, GƤrdenforsĀ [5] developed \(\gg \) as a new ordering between sets (e.g. \(\left\{ x, y \right\} \gg \left\{ y \right\} \) if \(x \succ y\)), which would be a generalization of the ordering \(\succ \) from above. The definition can be found inĀ [5]. In the following we will use it to find a very strong result (TheoremĀ 4, below) on democratic voting processes. This result is very important, since such democratic voting processes define the ideal processes towards the development of Trustworthy AI.

Definition 11

Social choice function. Let X be the set of possible outcomes, \(i=1, \ldots n\) the set of voters, and \(\varPi _{X}\) the set of preference orderings on X. A social choice function (SCF) is a function \(\mathbf{F} : \varPi _{X}^{n} \longrightarrow \mathbf 2 ^{X}-\emptyset ,\) where \(\mathbf 2 ^{X}\) denotes the set of all subsets of X.

Definition 12

A social choice function \(\mathbf{F} \) is manipulable by voter i at situation \(\mathbf{s} =\left( P_{1}, \ldots , P_{n}\right) \) iff there is an ordering \(P_{i}^{\prime }\) such that \(\mathbf{F} (\mathbf{s} \langle i / P_{i}^{\prime } \rangle ) \gg _{i} \mathbf{F} (\mathbf{s} ),\) where \(\gg _{i}\) is the ordering derived from \(P_{i}\). \(\mathbf{F} \) is non-manipulable or stable iff \(\mathbf{F} \) is nowhere manipulable.

Definition 13

A social choice function \(\mathbf{F} \) is anonymous iff whenever two situations \(\mathbf{s} _1\) and \(\mathbf{s} _2\) are identical except that for some voters i and j, we have \(P_{{i}\vert \mathbf{s _1}}=P_{{j}\vert \mathbf{s _2}}\) and \(P_{{j}\vert \mathbf{s _1}}=P_{{i}\vert \mathbf{s _2}}\) then \(\mathbf{F} (\mathbf{s} _1)=\mathbf{F} (\mathbf{s} _2)\). So a social choice function is annonymous if it treats every voter in the same way. Here \(P_{{i}\vert \mathbf{s }}\) means the preference \(P_{i}\) of voter i in the situation \(\mathbf{s} \).

Definition 14

A social choice function \(\mathbf{F} \) is neutral iff whenever two situations \(\mathbf{s} _1\) and \(\mathbf{s} _2\) are identical except for two alternatives x and y that have changed places everywhere (i.e. in the preferences of each voters), then \(x \in \mathbf{F} (\mathbf{s} _1)\) iff \(y \in \mathbf{F} (\mathbf{s} _2)\) and \(y \in \mathbf{F} (\mathbf{s} _1)\) iff \(x \in \mathbf{F} (\mathbf{s} _2)\). So a social choice function is neutral if it treats every alternative in the same way.

Definition 15

A social choice function \(\mathbf{F} \) satisfies the Condorcet criterion iff whenever there is an alternative x in a situation \(\mathbf{s} \) such that, for every alternative \(y \ne x,\) the number of individuals who strictly prefer x to y is greater than the number of individuals who strictly prefer y ot x, then \(\mathbf{F} (\mathbf{s} )=\{x\}\). Such an alternative is called a majority alternative in the situation \(\mathbf{s} \).

Definition 16

A social choice function \(\mathbf{F} \) which is anonymous and neutral, and satisfies the Condorcet criterion, is a democratic social choice function.

Theorem 4

(GƤrdenforsĀ [5]). Any democratic social choice function which is defined for at least three voters is manipulable.

3 Conclusions

We conclude that any development of AI HLEG proposed guidelines, framework and requirements for Trustworthy and Ethical AI suffers from systemic problems. We would be either in a situation where there are dictator decision-makers, or we will have design choices which are manipulated by those who are responsible for ensuring the trustworthiness of AI solutions. In both cases, the cognitive biases of the decision maker (either the dictator or manipulator) would have significant effect on the result of the designed and developed AI solution. This is the main challenge here.

As mentioned above, it is a fact that most of the voting processes can suffer of either having a dictator or voters who vote strategically to serve their own benefits. The issue is that what if the cognitive biases of the dictator decision-maker, or the ones of the tactical voter is exactly the biases, which we are trying to root-out from the design and architecture of AI solutions. As such, the very challenge of biases, which the EU guidelines are developed to answer, remains unanswered.