
Abstract and Background

The increasing use of artificial intelligence (AI) applications and technologies in our daily lives has had a significant impact on how such technologies are developed and deployed. Since AI systems may be used in sensitive settings to make crucial, life-changing decisions, it is imperative to ensure that the decisions made by these systems do not discriminate against particular individuals or communities. Recent developments in conventional machine learning and deep learning have addressed these problems in several subdomains. Researchers have become increasingly conscious of the biases that some of these applications may carry as a result of their commercialisation and are striving to overcome them. In this study, we examine several real-world applications that have displayed bias in various ways, and we compile a list of possible sources of bias. We then create a classification of the fairness definitions that machine learning researchers have proposed in order to avoid the bias already present in AI systems. Additionally, we review different AI domains and subdomains to highlight what researchers have observed about unfair outcomes in state-of-the-art methods and how they have attempted to remedy them. Many potential future directions and solutions remain for lessening the problem of bias in AI systems. By examining current research in their areas of expertise, we hope that this thesis will motivate researchers to address these problems in the near future.

1. Introduction

Machine learning (ML) algorithms have permeated every area of our lives. Algorithms recommend movies, products to purchase, and even partners. They are increasingly employed in high-stakes settings such as loan decisions and hiring [2]. Algorithmic decision-making has definite benefits; unlike people, machines do not get tired or bored, and they can weigh several orders of magnitude more factors. Like humans, however, algorithms are susceptible to biases that make their decisions "unfair" [4]. Fairness, in this context, is the absence of any bias toward an individual or a group based on their inherent or acquired attributes when making decisions. An algorithm whose decisions are skewed against a particular group of people is said to be unfair.

A standard illustration comes from a tool used by American courts to decide whether to keep a defendant in custody pending trial or to release them. The program, known as Correctional Offender Management Profiling for Alternative Sanctions (COMPAS), estimates the likelihood that a defendant will commit a future offense. Judges use COMPAS to determine whether to keep a defendant in prison or release them. An analysis of the software's bias against African-Americans found that COMPAS has a higher false positive rate for African-American defendants than for Caucasian defendants, wrongly predicting that they are at heightened risk of reoffending (recidivism). Comparable findings have been reported in other domains, including facial recognition software in digital cameras that over-predicts members of some ethnic groups as blinking, and an AI system that judged beauty pageant contestants and was biased against contestants with darker skin. These skewed predictions stem from hidden or neglected biases in the data or in the algorithms.

In this study, we identify two possible sources of unfairness in machine learning outcomes: those caused by biases in the data and those caused by the algorithms. We review research on how biases in data skew what machine learning algorithms learn, as well as subtleties in how the algorithms themselves operate that keep them from making fair decisions, even when the data is unbiased. Additionally, we note that biased algorithmic outcomes may affect user experience, creating a feedback loop among data, algorithms, and users that can reinforce and even exacerbate existing biases. In Part 2, we open the review with several prominent real-world examples of unfair machine learning algorithms producing discriminatory outcomes. We explain the different types and sources of bias that exist in the data-algorithm-user cycle in Part 3. The numerous ways the notion of fairness has been operationally defined and studied is then presented in Part 4, and we discuss the connections between these two ideas. Finally, we concentrate on different families of machine learning methods, how unfairness manifests differently in each of them, and the state of the art for addressing it in Part 5. Part 6 then discusses prospective directions for further research in each of these areas.

2. Real-World Examples of Algorithmic Unfairness

Constraints on safety and fairness have become a serious concern for researchers and engineers as a result of the success of machine learning and artificial intelligence over the past few decades and their rapid proliferation across applications. Courts employ machine learning to estimate the likelihood that a defendant will reoffend. It is used in a variety of medical specialties, in child welfare programs, and in autonomous vehicles. Each of these applications has a direct impact on our daily lives and, if not designed and engineered properly, that is, with fairness taken into account, can harm our society. [5] provides a list of such applications and of how biases in these AI systems affect our daily lives, including bias in AI chatbots, employment matching, travel routing, automated legal aid for immigrants, and search and advertising placement algorithms. [8] provides examples of how real-world bias can seep into AI and robotic systems, including bias in voice-based assistants, web sites, and facial recognition software. Therefore, when modeling an algorithm or a system, researchers and engineers need to be careful about downstream uses and their possible negative repercussions.

2.1 Examples of Discriminatory Systems

An example of a discriminatory system is COMPAS. Another is an algorithm that delivered advertisements for jobs in the STEM sectors (Science, Technology, Engineering, and Math) and displayed biased behaviour [10]. The ad was designed to be delivered in a gender-neutral way; however, fewer women than men ended up seeing it, because younger women are a prized demographic and are therefore more expensive to advertise to. Although the original and innocent intent was gender neutrality, this cost-optimizing delivery system served the ads in a discriminatory manner. Bias in recommender systems and facial recognition systems has also received much study and evaluation, and in many cases these systems have been found to discriminate against particular demographics and subgroups. We must understand the sources of these biases and what we can do to prevent them if we are to solve the problem of bias in these applications. The bias of COMPAS, a widely used commercial risk-assessment tool, has been well documented. Besides being biased, it also performs poorly when measured against human judgment: one study found it to be no more accurate than the assessments of non-expert humans. It is also worth noting that the study participants were given only 7 of the 137 features that COMPAS uses. [1] [2] further show that, for this decision task, COMPAS is no better than a simple logistic regression model. The use of these tools and the decisions that result from them affect people's lives, so we should act responsibly, and fairness constraints must be taken into account when designing and engineering such sensitive tools. In a related study, the authors of [2] compared SAVRY, a risk-assessment tool that involves human judgment in its procedure, with automated ML methods to determine which is more accurate and fairer, while also examining sources of group unfairness. To prevent harm, more studies of this kind should be carried out, ideally before such tools are deployed.

2.2 Assessment Tools

The development of toolkits that can gauge the fairness of a tool or system is an intriguing research direction. For instance, the toolkit Aequitas [3] enables users to evaluate models against various bias and fairness metrics for different demographic subgroups. Data analysts, machine learning experts, and policymakers can use the reports produced by Aequitas to make informed choices and avoid harming particular population subgroups. Another toolkit, AI Fairness 360 (AIF360), was created by IBM to help transfer fairness research algorithms into industrial settings, to establish a common benchmark against which fairness algorithms can be evaluated, and to provide a forum for fairness researchers to exchange ideas [4]. These kinds of toolkits can be beneficial for students, academics, and practitioners in the field as they move away from biased behaviour and toward producing fair machine learning applications.
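As a rough illustration of how such a toolkit might be used, the sketch below audits the protected attribute "sex" in the UCI Adult data with AIF360 and applies its Reweighing pre-processor. It assumes aif360 is installed and that the raw Adult data files have been downloaded to the location its AdultDataset wrapper expects; class and method names reflect one version of the toolkit and may differ in others.

```python
# Minimal AIF360 fairness audit sketch (assumes `pip install aif360` and that
# the raw UCI Adult files have been placed in aif360's data directory).
from aif360.datasets import AdultDataset
from aif360.metrics import BinaryLabelDatasetMetric
from aif360.algorithms.preprocessing import Reweighing

data = AdultDataset()                 # protected attributes include 'sex'
privileged = [{"sex": 1}]             # the wrapper encodes 'Male' as 1
unprivileged = [{"sex": 0}]           # and 'Female' as 0

# Group fairness metrics on the raw data.
metric = BinaryLabelDatasetMetric(
    data, unprivileged_groups=unprivileged, privileged_groups=privileged)
print("disparate impact:", metric.disparate_impact())
print("mean difference :", metric.mean_difference())

# Pre-processing repair: reweigh examples so the favorable label and the
# protected attribute become (approximately) independent in the training set.
rw = Reweighing(unprivileged_groups=unprivileged, privileged_groups=privileged)
repaired = rw.fit_transform(data)
metric_after = BinaryLabelDatasetMetric(
    repaired, unprivileged_groups=unprivileged, privileged_groups=privileged)
print("mean difference after reweighing:", metric_after.mean_difference())
```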

3. Bias in Data, Algorithms, and User Experiences


Most AI systems and algorithms are data driven and require data to train on. Data and the behavior of these methods and systems are therefore tightly coupled. When biases are present in the underlying training data, the algorithms trained on them will learn those biases and reflect them in their predictions. As a result, data biases can influence the algorithms that use the data and lead to skewed outcomes; algorithms can even amplify and perpetuate biases already present in the data. Conversely, even when the data is unbiased, algorithms themselves may behave in biased ways because of particular design choices. The outputs of such biased algorithms can then feed into real systems and influence users' decisions, producing further biased data that is used to train future algorithms. Consider, for instance, a web search engine that ranks some results above others. The top-ranked results typically receive the most user engagement, while results further down receive little attention [6]. The search engine then collects user engagement with items, and this information is used to decide how information should be presented in the future based on user preference and popularity. The top results thus grow in popularity because of the biased interaction and positioning produced by the algorithm, rather than because of their content [7]. Figure 1 depicts this feedback loop involving data bias, algorithmic bias, and user interaction. In the sections below, we classify notions of bias using this loop.


Figure 1. Bias definitions placed in the data, algorithm, and user interaction feedback loop.
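To make this feedback loop concrete, the toy simulation below (our own illustration, not taken from the cited works) ranks two equally good items by their accumulated clicks and applies a position-dependent click probability; the item that happens to reach the top stays there and keeps collecting most of the engagement.

```python
# Toy simulation of the ranking feedback loop: two items of identical quality,
# but the item shown in the top slot is clicked far more often, which keeps it
# on top in every later round.
import random

random.seed(0)
clicks = {"item_a": 0, "item_b": 0}
QUALITY = 0.5                    # both items are equally good
POSITION_CTR = [0.30, 0.05]      # users click the top slot much more often

for _ in range(10_000):
    # Rank purely by past engagement (ties broken by insertion order).
    ranking = sorted(clicks, key=clicks.get, reverse=True)
    for position, item in enumerate(ranking):
        if random.random() < QUALITY * POSITION_CTR[position]:
            clicks[item] += 1

print(clicks)  # the top item ends up with several times the clicks of the other
```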

3.1 Types of Bias

There are many distinct types of bias, some of which can lead to unfairness in different downstream learning tasks. In [8], the authors discuss the biases they identified in supervised machine learning, along with their classifications and descriptions, in order to motivate further research into each type of bias mentioned in the study. In [10], the authors compile a comprehensive list of the different kinds of bias that occur at different stages, from the data's origin to its collection and analysis, along with their descriptions. Incorporating material from other prior studies as well, we highlight below the key sources of bias discussed in these papers. We also present a different classification of these definitions, organized around the data, algorithm, and user interaction cycle.

3.1.1 Data to Algorithm

In this section, we discuss biases in data that, when learned by ML training algorithms, may lead to biased algorithmic outcomes.

(1) Measurement bias. Measurement bias arises from the way we choose, use, and measure particular features [11]. In the recidivism prediction tool COMPAS, prior convictions and friends' or relatives' convictions were used as proxy variables to measure "risk" or "crime", even though on their own they can be viewed as mismeasured approximations. The higher arrest statistics in minority communities are partly explained by the fact that these communities are monitored and policed more heavily. However, because these groups are assessed and controlled differently, one should not conclude that people from minority groups are inherently more dangerous simply because they are arrested more frequently [12].

(2) Omitted variable bias. Omitted variable bias occurs when one or more important variables are left out of the model [50]. An example would be someone who builds a model to predict, with fairly high accuracy, the annual rate at which customers will cancel their subscription to a service, but soon observes that most customers are cancelling without being flagged by the model. Suppose a new competitor has entered the market and offers the same service at half the price, leading customers to cancel their subscriptions. The competitor's presence is something the model could not account for, because it was never included; it is therefore an omitted variable.

(3) Representation bias. Representation bias arises from how we sample from a population during data collection [40]. Non-representative samples lack the diversity of the underlying population, with missing subgroups and other anomalies. Datasets such as ImageNet, which lack geographic coverage (as seen in Figures 3 and 4), clearly exhibit a bias in favor of Western societies.

(4) Aggregation bias. Aggregation bias, related to the ecological fallacy, occurs when incorrect inferences are drawn about individuals from observations of the entire population. Clinical decision-support tools are a prime illustration of this kind of bias. Consider diabetic patients, who show different morbidities depending on their gender and ethnicity. In particular, HbA1c levels, which are widely used to diagnose and monitor diabetes, differ in complex ways across genders and ethnicities. A model that ignores these individual characteristics is therefore unlikely to be suitable for all racial and gender groups in the population [41]. This holds even when the groups are equally represented in the training data; aggregation bias can arise from any false generalization about a population's subgroups.


Figure 2. An example of data bias. The dashed lines show the regressions for each subgroup, and the solid line shows the regression (MLR) fit to the aggregated population. (a) When all subgroups are equally sized, the aggregate regression shows a positive relationship between the independent variable and the outcome. (b) When the data are imbalanced, the aggregate regression shows almost no relationship, while the relationships within each subgroup remain the same.

(a) Simpson's paradox. Simpson's paradox is an example of aggregation bias that can arise in the analysis of large datasets [43]. A relationship observed in aggregated data can vanish or even reverse direction when the same data are broken down into their constituent subgroups (Fig. 2(a)). One of the best-known instances of this paradox arose in a lawsuit alleging gender discrimination in UC Berkeley's graduate admissions [16]. The application data appeared to show bias against women, since fewer women than their male counterparts were admitted to graduate programs. However, when the admissions data were separated and analyzed by department, female applicants were found to be treated equally and, in some departments, even had a small advantage over men.


Figure 3. The fraction of each country, represented by its two-letter ISO code, in the Open Images and ImageNet image collections. In both datasets, the United States and Great Britain are the most represented countries.


Figure 4. The geographic distribution of countries in the Open Images data set. Nearly one-third of the images in the sample came from the US, and 60% of the data came from the six most represented countries, all in Europe and North America.

The paradox arose because women tended to apply to departments with lower overall acceptance rates. Simpson's paradox [44] has been observed in many fields, including biology [41], psychology [42], astronomy [43], and computational social science [45]. A toy numerical sketch of the paradox follows. (b) The Modifiable Areal Unit Problem is a statistical bias that arises in geographical analysis when data are modeled at different levels of spatial aggregation [49]. When data are aggregated at different spatial scales, different trends can emerge.
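The sketch below (our own numerical illustration, not data from the cited studies) reproduces Simpson's paradox with synthetic data: the regression slope is positive inside each subgroup, yet the pooled regression slope is negative because the subgroups have offset baselines.

```python
# Synthetic demonstration of Simpson's paradox with two subgroups.
import numpy as np

rng = np.random.default_rng(42)

# Subgroup A: small x values with a high baseline; subgroup B: large x, low baseline.
x_a = rng.uniform(0, 5, 200);  y_a = 10 + 1.0 * x_a + rng.normal(0, 1, 200)
x_b = rng.uniform(5, 10, 200); y_b = 0 + 1.0 * x_b + rng.normal(0, 1, 200)

slope_a = np.polyfit(x_a, y_a, 1)[0]
slope_b = np.polyfit(x_b, y_b, 1)[0]
slope_pooled = np.polyfit(np.concatenate([x_a, x_b]),
                          np.concatenate([y_a, y_b]), 1)[0]

print(f"subgroup A slope: {slope_a:+.2f}")       # around +1.0
print(f"subgroup B slope: {slope_b:+.2f}")       # around +1.0
print(f"pooled slope    : {slope_pooled:+.2f}")  # negative: the trend reverses
```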

(5) Sampling bias. Sampling bias, which is related to representation bias, results from the non-random sampling of subgroups. Because of sampling bias, the trends estimated for one population may not generalize to data collected from a different population. Consider the illustration in Figure 2. The left-hand plot (Fig. 2(a)) shows data from three subgroups that were sampled equally in the original study. Suppose that, the next time the study is conducted, one particular subgroup is sampled much more heavily than the others (Fig. 2(b)). While the subgroup trends (the dashed colored lines in the right-hand plot) are unchanged, the positive trend found by the aggregate regression in the original study almost completely disappears.

(6) Longitudinal data fallacy. When analyzing temporal data, researchers must use longitudinal analysis to follow cohorts over time and learn about their behavior. However, cross-sectional analysis, which combines several cohorts at a single point in time, is often used instead to model temporal behavior. The mixture of cohorts can skew the cross-sectional analysis, yielding conclusions that differ from those of a longitudinal analysis. For example, an analysis of bulk Reddit data [47] found that, on average, comment length declined over time. The bulk data, however, provided only a cross-sectional snapshot of the population, which contained many cohorts that had joined Reddit at different times. When the data were disaggregated into cohorts, comment length within each cohort was found to increase over time.

(7) Linking bias. Linking bias arises when the network attributes obtained from user connections, activities, or interactions differ from and misrepresent users' true behavior [48]. In [49], the authors show that social network analyses can be biased toward low-degree nodes when only the network's link structure is considered and the content and activity of its users are ignored. Additionally, [15] shows that user interactions differ noticeably from the link structure of the network, depending on factors such as the mode of interaction and the temporal context. Many factors, such as network sampling, can alter network measures and cause such differences and biases.

3.1.2 Algorithm to User

Algorithms modulate user behavior, so algorithmic biases can induce biases in usage patterns. This section discusses biases that arise from algorithmic outcomes and affect user behavior.

(1) Algorithmic bias. Algorithmic bias occurs when the bias is introduced solely by the algorithm and is not present in the input data [30]. Design choices, such as applying regression models to the data as a whole versus to subgroups, the use of particular optimization functions or regularization, and the use of statistically biased estimators, can all lead to biased algorithmic decisions that skew the outcomes of these methods.

(2) User interaction bias. User interaction bias is a type of bias observed on the Web that can be triggered both by the user interface and by users themselves, who impose their own biased behavior and interaction [31]. It can be influenced by other biases, such as presentation and ranking bias.

(a) Presentation bias. The way information is presented can lead to presentation bias [30]. For instance, users can only click on content they actually see on the Web, so only the content that is presented receives clicks, while everything else receives none. In addition, not all of the information on the Web may even be visible to the user [31].

(b) Ranking bias. The notion that the results at the top of a page are the most relevant and important causes those results to attract more clicks than others. This bias affects search engines [20] and crowdsourcing platforms [31].

(3) Popularity bias. Items that are more popular tend to be exposed more. However, popularity metrics can be manipulated, for example by fake reviews or social media bots [21]. This kind of bias can be observed in search engines [20] [21] and recommendation systems, where popular items are shown to users more often. The increased exposure, however, may not be the result of higher quality; it may be driven by other biased factors.

(4) Emergent bias. Emergent bias develops through use and through interaction with real users. It typically emerges some time after a design is completed, as a result of changes in the population, in cultural norms, or in societal knowledge [11]. This type of bias is more likely to appear in user interfaces, since interfaces tend, by design, to reflect the capacities, characteristics, and habits of the expected users [21]. It can be further subdivided into several subtypes, as discussed in detail in [31].

(5) Evaluation bias. Evaluation bias occurs during model evaluation [31]. It includes the use of inappropriate or unrepresentative benchmarks, such as Adience and IJB-A, for evaluating applications. These benchmarks serve as examples of this kind of bias [41] because they were used to evaluate facial recognition systems that were biased with respect to skin color and gender [21].

3.1.3 User to Data

ML models are trained on a variety of user-generated data sources. Users' ingrained biases may be reflected in the data they produce. Moreover, when biased algorithms influence or regulate usage patterns, any biases in those algorithms can feed back into the data generation process. Below, we identify several significant subtypes of these biases.

(1) Historical bias. Even with perfect sampling and feature selection, historical bias, which already exists in socio-technical structures in the world, can seep into the data generation process [42]. An example of this type of bias appeared in a 2018 Google image search for CEOs, which returned fewer images of female CEOs because only 5% of Fortune 500 CEOs were women, leaving the results skewed toward male CEOs [32]. These search results obviously reflected reality, but whether algorithms should mirror such realities is worth debating.

(2) Population bias. Population bias occurs when the platform's user population differs from the target population in its statistics, demographics, representation, or user characteristics [12]. Population bias produces non-representative data. An example of this type of bias is the differing user demographics across social media platforms: women use Pinterest, Facebook, and Instagram more frequently, while men are more active on forums such as Reddit or Twitter. More information about social media use among young adults, broken down by gender, race, ethnicity, and parental education, together with further examples and statistics, can be found in [19].

(3) Self-selection bias. Self-selection bias is a subtype of selection or sampling bias in which the subjects of the research select themselves into the study. An illustration of this kind of bias is an online survey measuring enthusiasm for a presidential candidate: the candidate's most ardent supporters are the ones most inclined to take part.

(4) Social bias. Social bias occurs when other people's actions influence our judgment [18]. An example of this type of bias is a case where we want to give an item a low rating or review but, influenced by other high ratings, are persuaded to rate it more highly ourselves [17].

(5) Behavioral bias. Behavioral bias arises from differences in user behavior across platforms, contexts, or datasets [16]. An example of this kind of bias is described in [34], where the authors show how differences in emoji rendering across platforms can lead people to react and behave differently, sometimes even causing communication breakdowns.

(6) Temporal bias. Temporal bias arises from differences in populations and behaviors over time [15]. On Twitter, for instance, people discussing a particular topic may use a hashtag to draw attention for a short while and then move on to discussing the topic without the hashtag [11].

(7) Content production bias. Content production bias results from structural, lexical, semantic, and syntactic differences in user-generated content [13]. An illustration of this kind of bias is given in [12], which discusses how language use varies by gender and age. Linguistic use also varies across and within countries and cultures.

Existing work tries to classify these bias definitions into categories, for instance those that apply only to data or only to user interaction. Because of the feedback loop phenomenon [36], however, these categories are intertwined, and we need a classification that reflects this situation. Such a feedback loop exists between algorithms and user interaction, as well as between data and algorithms [29]. These publications inspired our categorization of bias definitions, shown in Figure 1. We placed each definition on the arrow of the loop where we thought it fits best. We reiterate that these definitions are interconnected, and one should consider how they affect each other in this loop and act accordingly.

3.2 Data Bias Examples

Bias can enter data in many different ways. For instance, using imbalanced data can lead to biases against disadvantaged populations. [14] examines several instances of bias that may be present in data and algorithms and offers recommendations and ideas for how to mitigate these problems.

3.2.1 Examples of Data Bias in Machine Learning

The authors of [25] show how imbalanced and predominantly light-skinned subjects populate databases such as IJB-A and Adience, with 79.6% and 86.2% light-skinned subjects, respectively. Because darker-skinned groups are underrepresented in the data, analyses based on these datasets can later be biased against them. Another situation in which bias can arise from data use and analysis is when we fail to consider the different subgroups within the data. The authors of [21] further show that providing only male-female groupings is insufficient and that the gender groups must be further divided by skin tone into light-skinned females, light-skinned males, dark-skinned males, and dark-skinned females. Only with this breakdown can the bias against dark-skinned females be clearly seen; otherwise, the results for dark-skinned males compensate for those of dark-skinned females and mask the underlying bias against this subgroup. Widely used machine-learning datasets can be skewed, which can harm the downstream applications that depend on them, since such datasets provide the foundation for most of the algorithms and tools that are built. For instance, ImageNet [15] and Open Images [22] are two popular datasets in machine learning. In [33], researchers showed that these datasets exhibit representation bias and argued that geographic diversity and inclusiveness should be considered when constructing such datasets. In [44], the authors also discuss the representation biases present in many knowledge bases that are frequently used in Natural Language Processing (NLP) applications to solve various common-sense reasoning tasks.
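The snippet below is a minimal sketch of this kind of audit, using toy data and hypothetical column names; it contrasts the marginal representation of single attributes with the intersectional breakdown that exposes an underrepresented subgroup.

```python
# Toy dataset audit: marginal versus intersectional representation.
import pandas as pd

df = pd.DataFrame({
    "gender":    ["female", "male", "male", "female", "male", "male"],
    "skin_tone": ["dark", "light", "light", "light", "dark", "light"],
})

# Marginal representation can look tolerable on its own...
print(df["gender"].value_counts(normalize=True))
print(df["skin_tone"].value_counts(normalize=True))

# ...while the intersectional breakdown reveals that dark-skinned females
# are barely present in this (toy) sample.
print(df.groupby(["gender", "skin_tone"]).size() / len(df))
```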

3.2.2 Examples of Data Bias in Healthcare Applications

In some applications, these data biases can be even more harmful. For example, the data collected and used in the medical field is frequently skewed toward certain demographics, which can have harmful effects on underrepresented communities. The authors of [31] showed how African-Americans were misclassified as a consequence of their underrepresentation in clinical studies, and argued for sequencing the genomes of diverse populations in the data to protect underrepresented communities. In their analysis of the 23andMe genotype dataset, the researchers in [26] found that, of the 2,399 people who had publicly shared their genotypes, 2,098 (87%) were European, whereas only 58 (2%) were Asian and 50 (2%) were African. Another study of this kind, reported in [27], found that UK Biobank, a large and widely used genetic dataset, may not accurately represent the sampling population; the researchers found evidence of a "healthy volunteer" bias. Other studies of bias in the data used in the medical field are reviewed in [29]. [28] examines machine-learning techniques and data used in the medical industry and discusses how not all individuals have benefited equally from machine learning in medicine.

3.3 Discrimination

Discrimination, much like bias, is a source of unfairness. Bias can be viewed as a source of unfairness arising from data collection, sampling, and measurement, whereas discrimination can be viewed as a source of unfairness arising from human prejudice and stereotyping based on sensitive attributes, whether intentional or unintentional. Although bias can also be thought of as unfairness caused by human discrimination and stereotyping, in light of the existing literature on algorithmic fairness it makes more sense to categorize the two concepts in this way. In this study, we focus mainly on concepts relevant to problems of computational fairness. Interested readers can refer to [27] for a more in-depth treatment of discrimination that spans broader interdisciplinary themes from law, economics, and the social sciences such as sociology.

3.3.1 Explainable Discrimination.

In some circumstances, differences in treatment and outcomes across groups can be justified and explained by certain attributes. When these inequalities can be justified and explained, the situation is not regarded as illegal discrimination and is therefore referred to as explainable [35]. For example, the authors of [32] note that in the UCI Adult dataset [33], a popular dataset in the fairness literature, males on average earn a higher annual income than females. However, this is because women, on average, work fewer hours per week than men. Hours worked per week is an attribute that can explain the lower income and must be taken into account. If decisions were made without considering working hours, men would effectively be compensated at a lower rate for the same hours, resulting in reverse discrimination. Discrimination that can be justified by such attributes, like working hours, is therefore considered fair and legal. The authors of [36] describe an approach for quantifying the explainable and illegal portions of discrimination in data. They argue that methods which do not account for the explainable part of discrimination can produce undesirable outcomes, since they introduce a form of reverse discrimination that is equally harmful and unwanted. They describe how to assess and quantify discrimination in a dataset or in a classifier's decisions, explicitly distinguishing explainable from illegitimate discrimination.

3.3.2 Unexplainable Discrimination.

In contrast to explainable discrimination, unexplainable discrimination is discrimination against a group that cannot be justified and is therefore illegal. The authors of [39] propose local techniques for removing all illegal or unexplainable discrimination while leaving only explainable differences in the decisions. These pre-processing techniques modify the training data so that it no longer contains unexplainable discrimination, with the expectation that classifiers trained on the pre-processed data will not exhibit illegal or unexplainable discrimination. Unexplainable discrimination includes both direct and indirect discrimination. (1) Direct discrimination. Direct discrimination occurs when a person's protected attributes explicitly lead to unfavorable outcomes for them [40]. In computer science research, "protected" or "sensitive" attributes are usually those attributes identified by law as ones on which discrimination is prohibited. A list of some of these protected attributes, drawn from the Fair Housing Act and the Equal Credit Opportunity Act (FHA and ECOA), is given in Table 3 [50]. (2) Indirect discrimination. In indirect discrimination, individuals appear to be treated on the basis of seemingly neutral, non-protected attributes; however, protected groups or individuals still end up being treated unjustly through the implicit effects of their protected attributes. For example, a person's residential zip code might be used in decision-making procedures such as loan applications; although zip code appears to be a non-sensitive attribute, it can correlate with race because of the demographic composition of residential areas, and it can therefore still lead to racial discrimination, a practice known as redlining [49].

3.3.3 Sources of Discrimination.

(1) Systemic discrimination. Systemic discrimination refers to policies, customs, or behaviors that are embedded in the culture or structure of an organization and that serve to perpetuate discrimination against certain populations or subgroups. [48] found that employers strongly favored qualified candidates who shared their culture, experiences, and interests. Competent applicants who do not belong to these groups may then be discriminated against when decision-makers are disproportionately drawn from those same groups.

(2) Statistical discrimination. Statistical discrimination occurs when decision-makers judge an individual based on the average statistics of the group the individual belongs to. It typically happens when decision-makers (such as employers or police officers) use an individual's readily observable traits as a proxy for hidden or harder-to-determine traits that may actually be relevant to the outcome [39].

4. Algorithmic Fairness

Efforts to combat discrimination and prejudice have a long history in philosophy and psychology and, more recently, in machine learning. To promote fairness and achieve it, however, one must first understand what fairness is. Long before computer science, philosophy and psychology sought to define the notion of fairness. The fact that there is no single agreed-upon definition of fairness shows how difficult the problem is to resolve [37]. It is hard to arrive at a single definition of fairness that is acceptable to all parties in a given scenario, since different attitudes and perspectives in different cultures favor different ways of assessing fairness. Indeed, even in computer science, where most of the work proposing new fairness constraints for algorithms has originated, there is still no clear consensus on which constraints are most suitable for which problems, despite the fact that many of these papers use the same datasets and tasks to demonstrate how their constraints perform. Broadly speaking, fairness is the absence of bias or preference toward an individual or a group based on their inherent or acquired characteristics in the decision-making context. Although fairness is a value that society prizes highly, it can be surprisingly difficult to achieve in practice. In light of these difficulties, numerous fairness criteria have been proposed to address the various problems of algorithmic bias and discrimination covered above.

4.1 Definitions of Fairness

In [37], the authors attempt to connect notions of fairness from political philosophy to those in machine learning. The authors of [36] studied how fairness definitions have evolved in education and machine learning over the past 50 years. Several of the fairness definitions used in algorithmic classification are presented and discussed by the authors of [35], and the authors of [38] investigated how the general public perceives some of these fairness definitions from the computer science literature. Here, we restate some of the most widely used definitions, along with explanations drawn from [39].

Definition 1. (Equalized Odds). A predictor Ŷ satisfies equalized odds with respect to protected attribute A and outcome Y if Ŷ and A are independent conditional on Y [34]: P(Ŷ = 1 | A = 0, Y = y) = P(Ŷ = 1 | A = 1, Y = y), for y ∈ {0, 1}. This means that the probability that someone in the positive class is correctly assigned a positive outcome, and the probability that someone in the negative class is incorrectly assigned a positive outcome, should both be the same for the protected and unprotected groups [33]. In other words, the equalized odds definition requires the true positive rates and the false positive rates to be equal across the protected and unprotected groups.

Definition 2. (Equal Opportunity). A binary predictor Ŷ satisfies equal opportunity with respect to A and Y if P(Ŷ = 1 | A = 0, Y = 1) = P(Ŷ = 1 | A = 1, Y = 1) [31]. This means that members of the protected and unprotected (e.g., female and male) groups should have an equal probability of being assigned a positive outcome when they truly belong to the positive class [29]. In other words, the equal opportunity definition requires the true positive rates to be equal for the protected and unprotected groups.

Definition 3. (Demographic Parity). Also known as statistical parity. A predictor Ŷ satisfies demographic parity if P(Ŷ | A = 0) = P(Ŷ | A = 1) [38]. The probability of receiving a positive outcome should be the same regardless of whether the person belongs to the protected group (e.g., women) or not.

Definition 4. (Fairness Through Awareness). According to [37], an algorithm is fair if it gives similar predictions to similar individuals. In other words, any two individuals who are similar under a task-specific similarity (inverse distance) metric should receive similar outcomes.

Definition 5. (Fairness Through Unawareness). An algorithm is fair as long as the protected attributes A are not explicitly used in the decision-making process [36].

Definition 6. (Treatment Equality). Treatment equality is achieved when the ratio of false negatives to false positives is the same for both the protected and unprotected groups [34].

Definition 7. (Test Fairness). A score S = S(x) is test fair (well calibrated) if it reflects the same likelihood of recidivism regardless of the individual's group membership R; that is, if P(Y = 1 | S = s, R = b) = P(Y = 1 | S = s, R = w) for all values of s [34]. In other words, the test fairness definition states that, for any predicted probability score S, people in both protected and unprotected groups should have an equal probability of truly belonging to the positive class [25].

Definition 8. (Counterfactual Fairness). A predictor Ŷ is counterfactually fair if, under any context X = x and A = a, P(Ŷ_{A←a}(U) = y | X = x, A = a) = P(Ŷ_{A←a′}(U) = y | X = x, A = a), for all y and for any value a′ attainable by A [22]. The intuition behind counterfactual fairness is that a decision is fair toward an individual if it would have been the same in a counterfactual world in which the individual belonged to a different demographic group.

Definition 9. (Fairness in Relational Domains). A notion of fairness that captures the relational structure of a domain, not only by considering the attributes of individuals but also by taking into account the social, organizational, and other relationships between individuals [23].

Definition 10. (Conditional Statistical Parity). A predictor Ŷ satisfies conditional statistical parity for a set of legitimate factors L if P(Ŷ | L = l, A = 0) = P(Ŷ | L = l, A = 1) [41]. Conditional statistical parity states that, after controlling for a set of legitimate attributes L, people in both protected and unprotected (e.g., female and male) groups should have an equal probability of being assigned a positive outcome [48].
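To illustrate how the group-level definitions above can be checked in practice, the helper below (our own sketch, not code from the cited works) computes per-group selection rates, true positive rates, and false positive rates from binary predictions: matching selection rates corresponds to demographic parity, matching TPRs to equal opportunity, and matching both TPRs and FPRs to equalized odds.

```python
# Per-group rates underlying demographic parity, equal opportunity, and
# equalized odds.
import numpy as np

def group_rates(y_true, y_pred, group):
    """Return selection rate, TPR, and FPR for each value of `group`."""
    y_true, y_pred, group = map(np.asarray, (y_true, y_pred, group))
    rates = {}
    for g in np.unique(group):
        m = group == g
        rates[g] = {
            "selection_rate": y_pred[m].mean(),       # P(Yhat=1 | A=g)
            "TPR": y_pred[m & (y_true == 1)].mean(),  # P(Yhat=1 | A=g, Y=1)
            "FPR": y_pred[m & (y_true == 0)].mean(),  # P(Yhat=1 | A=g, Y=0)
        }
    return rates

# Toy example: A=0 is the protected group.
y_true = [1, 1, 0, 0, 1, 0, 1, 0]
y_pred = [1, 0, 0, 0, 1, 1, 1, 0]
A      = [0, 0, 0, 0, 1, 1, 1, 1]
for g, r in group_rates(y_true, y_pred, A).items():
    print(g, r)
```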

Name | Reference(s) | Group | Subgroup | Individual
Demographic parity | [7][8] | ✓ | |
Conditional statistical parity | [21] | ✓ | |
Equalized odds | [13] | ✓ | |
Equal treatment | [15] | ✓ | |
Subgroup fairness | [14] | ✓ | ✓ |
Fairness through unawareness | [38][39] | | | ✓
Fairness through awareness | [16][17] | | | ✓
Counterfactual fairness | [38][39] | | | ✓

Table 1. Distinct fairness notions and whether each is a group, subgroup, or individual notion.

Fairness definitions fall into the following broad categories:

(1) Individual fairness. Give similar predictions to similar individuals [44].

(2) Group fairness. Treat different groups equally [11].

(3) Subgroup fairness. Subgroup fairness aims to combine the best properties of the group and individual notions of fairness. It differs from both, but uses them to obtain better outcomes: it picks a group fairness constraint, such as equal false positive rates, and asks whether the constraint holds across a large collection of subgroups [12].

It is important to note that, except in highly constrained special cases, it is not possible to satisfy all fairness constraints at once [13]. For example, the authors of [21] show that calibration and balance for the positive and negative classes are two conditions that are inherently incompatible. Because these criteria cannot all be satisfied simultaneously except under special conditions, it is crucial to consider the context and application in which a fairness criterion is to be used and to apply it appropriately [15]. The temporal effects that these criteria may have on individuals or groups should also be taken into account. The authors of [16] show that common fairness criteria do not always help and may fail to improve outcomes for disadvantaged groups over time; in some circumstances, they can even cause harm when analyzed over the long run. They demonstrate the importance of temporal modeling and measurement in evaluating fairness criteria and present a new set of trade-offs and challenges in this area. They also show how measurement error interacts with these fairness criteria. When attempting to address fairness concerns, it is also crucial to consider the types of bias involved and their underlying causes.

5. Methods for Fair Machine Learning

A number of attempts have been made to combat bias in AI and achieve fairness, organized around different AI domains. This section lists these AI domains and the work each community has done to address bias and unfairness in its methods. The areas we concentrate on in this study are listed in Table 2. Although most of this section is domain-specific, a cross-domain perspective can also be helpful. Techniques that aim to correct algorithmic bias typically fall into one of three categories:

(1) Pre-processing. Pre-processing techniques try to transform the data so that the underlying discrimination is removed [25]. Pre-processing is an option if the method is allowed to modify the training data [26].

(2) In-processing. In-processing techniques modify state-of-the-art learning algorithms in order to remove discrimination during the model training phase [34]. In-processing can be used if the training procedure of a machine learning model can be changed, either by imposing a constraint or by incorporating the modification into the objective function [35].

(3) Post-processing. Post-processing is carried out after training, using a holdout set that was not used during the model's training [25]. Post-processing is the only option if the algorithm can treat the learned model only as a black box, with no ability to modify the training data or the learning algorithm. In the post-processing phase, the labels initially assigned by the black-box model are reassigned based on a function [21].
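As a concrete illustration of the post-processing category, the sketch below (a simplification of our own, not the exact method of any cited paper) treats a trained scorer as a black box and picks a separate decision threshold for each group so that the groups' true positive rates roughly match, in the spirit of equal opportunity.

```python
# Post-processing sketch: per-group thresholds chosen to equalize TPR.
import numpy as np

def equal_opportunity_thresholds(scores, y_true, group, target_tpr=0.8):
    """Return, per group, the score cut-off accepting ~target_tpr of true positives."""
    scores, y_true, group = map(np.asarray, (scores, y_true, group))
    thresholds = {}
    for g in np.unique(group):
        pos_scores = scores[(group == g) & (y_true == 1)]
        # The (1 - target_tpr) quantile of positive-class scores is the cut-off.
        thresholds[g] = np.quantile(pos_scores, 1 - target_tpr)
    return thresholds

rng = np.random.default_rng(0)
group = rng.integers(0, 2, 1000)
y_true = rng.integers(0, 2, 1000)
# A biased black-box scorer: systematically lower scores for group 0.
scores = y_true * 0.5 + rng.normal(0, 0.3, 1000) - 0.2 * (group == 0)

for g, t in equal_opportunity_thresholds(scores, y_true, group).items():
    m = (group == g) & (y_true == 1)
    print(f"group {g}: threshold {t:+.3f}, TPR {(scores[m] >= t).mean():.2f}")
```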

Table 4 gives examples of some existing works and their classification into these categories. Owing to the prominence of AI, such techniques have been extended not only to standard machine learning approaches but also to other fields such as deep learning and natural language processing. Fair machine learning methods have been developed for many AI applications and domains, ranging from learning fair representations to learning fair word embeddings. Some of these methods try to remove exclusion bias by ensuring that members of sensitive groups are represented, while others try to prevent sensitive or protected attributes from improperly entering the decision-making process. Some methods also attempt to satisfy one or more of the fairness notions discussed above. For example, disparate learning processes (DLPs) allow the use of protected attributes during training but forbid them at inference time, in an attempt to satisfy both the treatment disparity and impact disparity notions [9]. Table 3 lists such protected or sensitive attributes; it highlights the attributes that by law should not be taken into account when making decisions about credit or housing loans. Some causal methods use causal graphs and block certain paths in the causal graph through which sensitive attributes would affect the decision outcome, while other causal methods treat sensitive attributes as noise in order to ignore their impact on decision-making. Below, we discuss bias-mitigation strategies and methods for various domains, each addressing a specific issue in a particular branch of machine learning. This should broaden the reader's perspective on the ways in which bias can affect a system, and it may help researchers analyze new issues in which discrimination and bias can influence a system's outcomes.

5.1 Unbiasing Data

Every dataset is the product of design choices made by its curator. These choices affect the fairness of the resulting dataset, which in turn affects the methods built on it. Datasheets, which serve as documentation for a dataset and record its creation process, characteristics, motivations, and skews, are one of the general approaches that have been proposed to reduce the impact of bias in data [1]. [2] suggests a comparable strategy for NLP datasets, and similar recommendations have been made for models in [4]. The authors of [3] likewise advocate labels, much like nutrition labels on food, to more accurately characterize each dataset for each task. In contrast to these general strategies, some research has focused on more specific biases. For instance, [5] proposed methods to test for instances of Simpson's paradox in data, while [6] proposed methods to automatically discover Simpson's paradoxes in data. Other work detects direct discrimination in data using causal graphs and prevents it by modifying the data so that the resulting predictions no longer reflect direct discrimination [7]; [8] likewise targets direct and indirect discrimination in data mining simultaneously. Further pre-processing techniques, such as massaging [19], preferential sampling [20], and disparate impact removal [21], are also used to remove bias from data.

Area | Reference(s)
Classification | [8][9][10][11][12][4][19]
Regression | [14][1]
PCA | [17]
Community detection | [27]
Clustering | [6][9]
Graph embedding | [23][34]
Causal inference | [33][50]
Variational autoencoders | [44][34]
Adversarial learning | [13][14][15]
Word embedding | [4]
Coreference resolution | [24]
Language model | [26]
Sentence embedding | [43][2]
Machine translation | [22]
Semantic role labelling | [21]
Named entity recognition | [20]

Table 2. Papers that target bias and fairness in different areas.

Attribute | FHA | ECOA
Race | ✓ | ✓
Colour | ✓ | ✓
National origin | ✓ | ✓
Religion | ✓ | ✓
Gender | ✓ | ✓
Familial status | ✓ |
Disability | ✓ |
Exercised rights under CCPA | | ✓
Marital status | | ✓
Recipient of public assistance | | ✓
Age | | ✓

Table 3. Protected attributes listed under the Fair Housing Act (FHA) and the Equal Credit Opportunity Act (ECOA), taken from [30].

5.2 Fair Machine Learning

A number of methods have been proposed to address this problem; depending on the application, they satisfy some of the fairness definitions above or other, newer definitions.

5.2.1 Fair Classification.

Because classification is a canonical ML problem and is frequently used in areas that directly affect people's lives, it is crucial that classification methods be fair and free of biases that could harm specific populations. Consequently, a number of methods have been developed that satisfy particular notions of fairness in classification [21]. For example, the authors of [18] target subgroup fairness in classification, [11] targets equality of opportunity and equalized odds, [19] targets disparate treatment and disparate impact, and [12] [34] target equalized odds. Other approaches aim to be robust to shifts in the test set while still satisfying certain fairness constraints [50]. The researchers in [44] propose a general framework for training fair classifiers; with this framework, fairness-aware classification with fairness guarantees can be achieved. In another paper [43], the researchers propose three modifications to the standard Naive Bayes classifier for discrimination-free classification. [20] takes a novel approach to fair classification by incorporating fairness constraints into a multi-task learning (MTL) framework. In addition to enforcing fairness during training, this strategy can benefit minority groups by focusing on the average accuracy of each group rather than optimizing overall accuracy without regard for how accuracy is distributed across groups. In related work [21], the authors propose a decoupled classification scheme in which a separate classifier is learned for each group; to address the problem of having too little data for minority groups, they use transfer learning. In [18], the authors use the Wasserstein distance to reduce the dependence of the classification outcome on sensitive attributes and thereby achieve fair classification. The researchers in [19] propose preferential sampling (PS) to build a bias-free training dataset and then train a classifier on this unbiased dataset to obtain a non-discriminatory model. In [12], the authors propose a post-processing bias-mitigation technique for classification that also provides interpretability.
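As a toy illustration of the decoupled classification scheme mentioned above, the sketch below trains a separate scikit-learn logistic regression for each group on synthetic data and routes each individual to their group's model; it is our own simplification and omits the transfer-learning step used in the cited work.

```python
# Decoupled classification sketch: one model per group.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(1)
n = 1000
group = rng.integers(0, 2, n)
X = rng.normal(0, 1, (n, 3))
# The label depends on the features differently in each group.
logits = np.where(group == 1, X[:, 0] - X[:, 1], -X[:, 0] + X[:, 2])
y = (logits + rng.normal(0, 0.5, n) > 0).astype(int)

models = {}
for g in (0, 1):
    m = group == g
    models[g] = LogisticRegression().fit(X[m], y[m])
    print(f"group {g}: training accuracy {models[g].score(X[m], y[m]):.2f}")

def predict(x_row, g):
    """Route an individual to the classifier trained for their group."""
    return models[g].predict(x_row.reshape(1, -1))[0]

print("prediction for the first individual:", predict(X[0], group[0]))
```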

Algorithm | Reference(s)
Community detection | [1][4]
Word embedding | [30]
Optimized pre-processing | [32]
Data pre-processing | [13]
Classification | [12]
Regression | [11]
Adversarial learning | [34]

Table 4. Bias-mitigation algorithms, each of which takes a pre-processing, in-processing, or post-processing approach.

5.2.2 Fair Regression.

To assess accuracy-fairness trade-offs, [11] proposes a fair regression approach and evaluates it using a "price of fairness" (POF) metric. They define the following three fairness penalties:

Individual fairness: following the notion of individual fairness in [14], a model w is penalized for how differently it treats two individuals x and x', weighted by a function of the difference in their labels |y - y'|, where the two individuals are drawn from the two different groups S1 and S2.

Group fairness: on average, the instances in the two groups should receive similar predictions, weighted by the nearness of the instances' labels [9].

Hybrid fairness: hybrid fairness requires that positively labelled and negatively labelled cross-group pairs each be treated similarly, in an average over the two groups [8].

In addition to this work, [1] formulates the fair regularization term in relation to the notions of statistical (demographic) parity and bounded group loss. Beyond classification, [2] uses decision trees to address disparate impact and disparate treatment in regression problems.
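The sketch below is a minimal in-processing example in the spirit of these penalized formulations: a linear regressor trained by gradient descent with an added penalty on the gap between the two groups' mean predictions. The penalty is our own demographic-parity-style simplification, not the exact penalty used in [11], [1], or [2].

```python
# Fair linear regression sketch: least squares plus a group mean-prediction gap penalty.
import numpy as np

rng = np.random.default_rng(0)
n, d, lam, lr = 500, 3, 10.0, 0.01
group = rng.integers(0, 2, n)
X = rng.normal(0, 1, (n, d)) + group[:, None] * 0.8   # the groups differ in their features
y = X @ np.array([1.0, -0.5, 0.3]) + rng.normal(0, 0.2, n)

diff = X[group == 0].mean(axis=0) - X[group == 1].mean(axis=0)
w = np.zeros(d)
for _ in range(5000):
    grad_mse = 2 / n * X.T @ (X @ w - y)
    grad_fair = 2 * lam * (diff @ w) * diff           # gradient of lam * ((mean_0 - mean_1) @ w)^2
    w -= lr * (grad_mse + grad_fair)

pred = X @ w
gap = pred[group == 0].mean() - pred[group == 1].mean()
print("weights:", np.round(w, 3), "| group mean-prediction gap:", round(gap, 3))
```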

5.2.3 Structured Prediction.

According to research published in [30], only 33% of the agent roles in cooking images in the imSitu training set are played by men, with the remaining 67% of cooking images showing women. The authors further observed that models trained on this dataset amplify the existing bias: after training a model on the data, "man" fills only 16% of the agent roles in cooking images. In light of these findings, the researchers in [21] show that structured prediction models risk leveraging and amplifying social bias. They therefore propose the calibration method RBA (reducing bias amplification), which calibrates the predictions of structured prediction models. The goal of RBA is to ensure that the model's predictions follow the distribution observed in the training data. They focus on two scenarios, visual semantic role labeling and multi-label object classification, and demonstrate how these models amplify the bias present in the data.
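A back-of-the-envelope version of this bias amplification measurement (using the percentages quoted above, not the exact RBA metric) can be written as follows.

```python
# Compare how skewed the activity-gender ratio is in training labels versus
# in model predictions; the increase is the amplified bias.
def woman_ratio(pairs, activity):
    """Fraction of `activity` instances whose agent is labeled 'woman'."""
    agents = [g for a, g in pairs if a == activity]
    return sum(g == "woman" for g in agents) / len(agents)

train_pairs = [("cooking", "woman")] * 67 + [("cooking", "man")] * 33
pred_pairs  = [("cooking", "woman")] * 84 + [("cooking", "man")] * 16

r_train = woman_ratio(train_pairs, "cooking")
r_pred = woman_ratio(pred_pairs, "cooking")
print(f"training ratio {r_train:.2f} -> predicted ratio {r_pred:.2f} "
      f"(amplification {r_pred - r_train:+.2f})")
```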

5.2.4 Fair PCA.

The authors of [18] demonstrate how vanilla PCA can produce much higher reconstruction error for one population than for another population of the same size. They therefore propose a fair method for creating representations of similar quality for different populations, rather than simply blurring group differences or hiding dependence on sensitive or protected attributes. They show that in the Labeled Faces in the Wild (LFW) dataset [19], PCA yields noticeably different reconstruction errors for male and female faces, even when the two groups are sampled with equal frequency. Their aim is a dimensionality reduction method that preserves similar fidelity across the dataset's groups and demographics. They therefore formulate a fair dimensionality reduction problem and propose Fair PCA. Assuming m data points in R^n, they define Fair PCA over two subgroups A and B, where U_A and U_B denote the rows of the projection U corresponding to individuals in subgroups A and B, respectively; the objective is to find a low-rank projection that balances the reconstruction loss across the two groups rather than minimizing it only on average.

The two steps of their suggested method are as follows:

(1) Convert the Fair PCA objective into a semidefinite program (SDP) and solve it.

(2) Reduce the rank of the resulting solution by solving an additional optimization (rounding) problem.

5.2.5 Community Detection/ Clustering.

Another place where bias and discrimination can affect populations is in disparities within online communities and social networks. For instance, in online groups, people with fewer friends or followers struggle to be noticed on social media [34]. Current techniques, such as community detection methods, can exacerbate this bias by disregarding these poorly connected individuals or by incorrectly assigning them to small, insignificant groups. The study in [35] illustrates how this kind of bias arises and is reinforced by common community detection methods. To lessen the harm done to marginalized groups within social networks, the authors propose a new attributed community detection method called CLAN.

As shown below, CLAN is a two-stage procedure that addresses this exclusion bias by taking into account both node attributes and network structure (a toy sketch follows the two steps):

(1) Detect communities using modularity (step 1: unsupervised, using only the network structure).

(2) Train a classifier on held-out node attributes and use it to place users from the minor communities into one of the major communities (step 2: supervised, using node attributes).
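A minimal, hypothetical sketch of this two-stage idea in Python, using modularity-based community detection followed by a classifier on node attributes (CLAN's actual algorithm and hyperparameters may differ; the helper names and the size threshold here are illustrative):

import numpy as np
from networkx.algorithms.community import greedy_modularity_communities
from sklearn.linear_model import LogisticRegression

def clan_like_assignment(G, node_features, min_size=5):
    """G: networkx graph; node_features: dict mapping node -> 1-D numpy feature vector."""
    # Stage 1: unsupervised communities from the network structure alone (modularity).
    communities = list(greedy_modularity_communities(G))
    major = [c for c in communities if len(c) >= min_size]
    minor_nodes = [n for c in communities if len(c) < min_size for n in c]

    # Stage 2: supervised step on held-out node attributes: train a classifier on
    # nodes already placed in major communities, then assign the poorly connected
    # nodes from the minor communities to one of the major communities.
    X_train = [node_features[n] for label, comm in enumerate(major) for n in comm]
    y_train = [label for label, comm in enumerate(major) for n in comm]
    clf = LogisticRegression(max_iter=1000).fit(np.array(X_train), y_train)

    assignment = {n: label for label, comm in enumerate(major) for n in comm}
    for n in minor_nodes:
        assignment[n] = int(clf.predict(node_features[n].reshape(1, -1))[0])
    return assignment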

Fair methods have also been put forth in fields related to community detection, such as graph embedding and clustering.

5.2.6 Causal Approaches to Fairness.

Causal models aim to capture not merely correlations but the causal relationships between variables. These causal links between variables (the graph's nodes) can be visualized using causal graphs by drawing directed edges between the nodes. When building systems or policies, such approaches can be used to eliminate unwanted causal dependence of outcomes on sensitive attributes like gender or race [46]. Causal graphs and causal models have been widely employed by researchers to address fairness issues in machine learning. In [34], the authors discuss causality at length and explain how crucial it is when designing fair algorithms. There has been a great deal of work on detecting and removing discrimination that uses hypothesized causal relationships and graphs to make decisions independent of the sensitive characteristics of groups or individuals. As an illustration, the authors of [45] provide a causal-based approach that identifies both direct and indirect discrimination in the data and suggests methods for removing it; [48] is a development of that earlier work. [46] provides a good summary of most of those authors' prior work in this area, examining system-, group-, and individual-level discrimination, addressing each with their earlier methodologies, and covering both direct and indirect discrimination. The researchers in [44] offer a comparable path-specific technique for fair inference using causal graphs, building on and generalizing past work: it can flexibly constrain particular problematic or discriminatory pathways in the causal graph given any set of constraints, provided the path-specific effects along those pathways are identifiable from the observed data.

In [41], researchers extend the counterfactual fairness definition of [40] by introducing path-specific counterfactual fairness, and propose a way to implement it in [42]. In [40], the researchers expanded their earlier formalization of fair inference to the setting of learning optimal policies constrained by fairness definitions. By modifying existing techniques such as Q-learning, value search, and G-estimation according to the fairness constraints, they offer several approaches for learning optimal policies. The authors of [20] target only the detection of discrimination, not its removal: they identify instances similar to a given example and check whether changing the protected attribute would change the decision's outcome; if so, they conclude that discrimination is present. The terms "unresolved discrimination" and "proxy discrimination" are defined as follows in [19]:

Unresolved Discrimination: a variable V in a causal graph exhibits unresolved discrimination if there is a directed path from the protected attribute A to V that is not blocked by a resolving variable, and V itself is non-resolving [18].

Proxy Discrimination: a variable V in a causal graph exhibits potential proxy discrimination if there is a directed path from A to V that is blocked by a proxy variable and V itself is not a proxy [17]. The authors propose ways to prevent and remove both kinds of discrimination. They also show that no purely observational criterion can identify unresolved discrimination in a predictor, which necessitates a causal reasoning framework.

5.3 Learning Fair Representation

5.3.1 Variational Autoencoders

Several research publications have proposed learning fair representations that limit the unfair influence of sensitive features. A well-known instance is the Variational Fair Autoencoder, first proposed in [17]. The authors treat the sensitive variable as a nuisance variable and learn a representation that omits information about it. A maximum mean discrepancy (MMD) regularization term encourages the posterior distribution over the latent variables to be invariant to the sensitive variable; their Variational Fair Autoencoder is obtained by adding this MMD penalty to the lower bound of their VAE objective (a sketch of such a penalty follows). Related work that does not specifically address fairness is presented in [11].
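A minimal sketch of an MMD-style penalty between the latent codes of the two sensitive groups, of the kind added to the variational lower bound; the RBF kernel, its bandwidth, and the variable names are illustrative assumptions:

import torch

def mmd_penalty(z_a, z_b, bandwidth=1.0):
    """z_a, z_b: (n, d) latent codes for the two values of the sensitive variable."""
    def rbf(x, y):
        # Gaussian kernel on pairwise squared distances.
        d2 = torch.cdist(x, y) ** 2
        return torch.exp(-d2 / (2 * bandwidth ** 2))
    # Biased MMD^2 estimate: within-group similarity minus cross-group similarity.
    return rbf(z_a, z_a).mean() + rbf(z_b, z_b).mean() - 2 * rbf(z_a, z_b).mean()

# e.g. total_loss = vae_lower_bound_loss + lam * mmd_penalty(z[s == 0], z[s == 1])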

A debiased VAE architecture known as DB-VAE is proposed by the authors of [43]. This architecture learns the sensitive latent variables that can bias the model (such as skin colour or gender), and the authors build an automated system on top of DB-VAE that uses these learned latent variables to debias tasks such as face detection. The authors of [12] formulate the representation-learning task as an optimization problem that seeks to minimize the information shared between the encoding and the sensitive variable; Equation 1 expresses this objective in its relaxed form, where c is the sensitive variable and z is the encoding of x. They use this formulation to argue that supervision is unnecessary, and occasionally even harmful, when learning fair representations.

The authors of [13] describe flexibly fair representation learning by disentanglement, which disentangles the learned representation from multiple sensitive attributes. Their fair variational autoencoder is flexible not only with respect to the sensitive attributes but also with respect to the downstream task labels. They target the notion of demographic parity, which can be enforced for several sensitive attributes or any subset of them.

5.3.2 Adversarial Learning.

A strategy to reduce bias in models trained on data containing stereotypical associations is presented by the researchers in [12]. They offer a framework that attempts to maximize prediction accuracy on y while simultaneously minimizing the adversary's ability to predict the protected or sensitive (stereotyping) variable z. Figure 6 illustrates the model's two parts, the predictor and the adversary. The predictor is trained to forecast Y given X, learning the weights W by minimizing a loss LP(ŷ, y) with a gradient-based method such as stochastic gradient descent. The predictor's output is passed to a second network, the adversary, which attempts to predict Z. Depending on the desired fairness criterion, the adversary may receive different inputs. For example, to achieve demographic parity the adversary tries to predict the protected variable Z using only the predicted label ŷ, while the predictor is trained to keep the adversary from succeeding; to achieve equality of odds the adversary receives both the true label Y and the predicted label ŷ; and to achieve equality of opportunity for a particular class y, only examples with Y = y are given to the adversary. [23] presents another intriguing approach to fairness problems using adversarial networks, FairGAN, which generates synthetic data that is free of discrimination yet similar to the real data. For training and testing, the newly generated, debiased data from FairGAN are used in place of the real data. In contrast to many recent methods, rather than trying to remove discrimination from the real data, they create new datasets that are comparable to the original but debiased, while maintaining high data utility. Figure 5 depicts the FairGAN architecture; a minimal sketch of the predictor/adversary setup follows the figure.

Figure 5. Structure of FairGAN
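A minimal, hypothetical sketch of the predictor/adversary setup described above for the demographic-parity case, in PyTorch: the adversary sees only the predictor's output and tries to recover the protected attribute, while the predictor is penalized when the adversary succeeds. Alternating updates with a single penalty weight alpha are used here instead of the paper's exact projection-based update; layer sizes are illustrative.

import torch
import torch.nn as nn

predictor = nn.Sequential(nn.Linear(10, 32), nn.ReLU(), nn.Linear(32, 1))
adversary = nn.Sequential(nn.Linear(1, 16), nn.ReLU(), nn.Linear(16, 1))
opt_p = torch.optim.Adam(predictor.parameters(), lr=1e-3)
opt_a = torch.optim.Adam(adversary.parameters(), lr=1e-3)
bce = nn.BCEWithLogitsLoss()
alpha = 1.0  # strength of the fairness term

def train_step(x, y, z):
    """x: (n, 10) features; y, z: (n, 1) float targets (label and protected attribute)."""
    # 1) Update the adversary to predict z from the (detached) predicted label.
    y_hat = predictor(x).detach()
    loss_a = bce(adversary(y_hat), z)
    opt_a.zero_grad(); loss_a.backward(); opt_a.step()

    # 2) Update the predictor to fit y while making the adversary's job harder.
    y_hat = predictor(x)
    loss_p = bce(y_hat, y) - alpha * bce(adversary(y_hat), z)
    opt_p.zero_grad(); loss_p.backward(); opt_p.step()
    return loss_p.item(), loss_a.item()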

5.4 Fair NLP

5.4.1 Word Embedding

In [24], researchers noted that in analogy tests using state-of-the-art word embeddings, "man" was mapped to "computer programmer" while "woman" was mapped to "homemaker". This kind of gender bias led the authors to propose a technique for debiasing word embeddings. Their approach preserves the embeddings of words with a definitional gender while debiasing the embeddings of gender-neutral words, by doing the following (a toy sketch follows the steps below):

(1) Identify the gender subspace: determine the direction(s) of the embedding space that capture the bias [25].

(2) Hard debiasing versus soft debiasing:

(a) Hard debiasing (neutralize and equalize). Neutralize ensures that gender-neutral words have zero component in the gender subspace, removing the bias direction from their embeddings [27].

Equalize then makes the words in each explicitly gendered pair equidistant from all of the neutralized, gender-neutral words [28].

(b) Soft debiasing. This variant minimizes how far the embeddings move, preserving as much similarity to the original embedding as possible while reducing gender bias; a parameter controls this trade-off [29].
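A toy numerical sketch of the neutralize and equalize steps, assuming unit-norm embeddings and a gender direction estimated from a single definitional pair (the real method uses PCA over several pairs; the vocabulary here is hypothetical):

import numpy as np

def neutralize(v, g):
    # Remove the component of v that lies along the gender direction g.
    g = g / np.linalg.norm(g)
    return v - np.dot(v, g) * g

def equalize(v_a, v_b, g):
    # Make a gendered pair (e.g. "he"/"she") symmetric about the gender axis,
    # so both are equidistant from every neutralized word (assumes unit norms).
    g = g / np.linalg.norm(g)
    mu = (v_a + v_b) / 2
    mu_perp = mu - np.dot(mu, g) * g
    scale = np.sqrt(max(1 - np.linalg.norm(mu_perp) ** 2, 0.0))
    sign = np.sign(np.dot(v_a - v_b, g))
    return mu_perp + sign * scale * g, mu_perp - sign * scale * g

rng = np.random.default_rng(0)
emb = {w: rng.normal(size=50) for w in ["he", "she", "engineer"]}
emb = {w: v / np.linalg.norm(v) for w, v in emb.items()}

g_dir = emb["she"] - emb["he"]                        # crude gender direction
emb["engineer"] = neutralize(emb["engineer"], g_dir)  # gender-neutral word
emb["he"], emb["she"] = equalize(emb["he"], emb["she"], g_dir)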

Following in these researchers' footsteps, subsequent work addressed the issue [40] by creating a gender-neutral variant of GloVe called GN-GloVe, which aims to confine gender information to a few of the learned embedding dimensions while ensuring that the remaining dimensions are free of gender influence. Gender is the protected attribute in this method, which uses GloVe as its base model.

A more recent publication [48] argues against such debiasing methods, claiming that many previous studies on the subject have been superficial and that these methods merely mask the bias rather than genuinely remove it. Another recent study [23] changed course and proposed a pre-processing strategy for identifying the problematic documents in the training corpus that contribute bias, then attempted to debias the model by perturbing or removing those documents. Recent research [16] focuses on bias in ELMo's contextualized word representations and tries to understand and reduce the bias present in the learned representations. The authors demonstrate a strong gender bias in the corpus used to train ELMo, with male entities being almost three times more prevalent than female entities, which inevitably leads to gender bias in the pre-trained contextualized representations. They suggest

(1) a train-time data pre-processing technique and

(2) a test-time neutralizing strategy

as two ways to reduce the bias when using the pre-trained contextualized embeddings in a downstream coreference resolution task.

5.4.2 Coreference Resolution

The paper [21] demonstrates gender bias in coreference resolution systems. The authors present WinoBias, a benchmark for measuring gender bias in coreference resolution. Additionally, they combine word2vec debiasing approaches with a data-augmentation strategy that removes the bias from existing state-of-the-art coreference methods.

Their general approach is as follows: they first create an auxiliary dataset using a rule-based method, swapping out all male entities for female ones and vice versa.

Models are then trained on both the original dataset and the auxiliary dataset. For the word embeddings, they combine this approach with word2vec debiasing strategies. They also identify the sources of gender bias in coreference systems and suggest remedies for each: they show that the training data is the primary source of bias, for which the remedy is the auxiliary dataset created by swapping male and female entities; another source is resource bias (the word embeddings are skewed), for which the recommended remedy is to replace GloVe with a debiased embedding; finally, unbalanced gender lists are a further possible source of bias, which can be addressed by balancing the counts in those lists (a minimal sketch of the gender-swap augmentation follows). In a related article [44], researchers demonstrate the existence of gender bias in three state-of-the-art coreference resolution systems, noting that for many occupations these systems resolve pronouns in a biased way, favouring one gender over the other.
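A minimal sketch of the rule-based gender-swapping augmentation described above; the swap table is a small illustrative subset (ambiguous cases such as possessive "her" are ignored here), not the full mapping used in the cited work:

SWAP = {"he": "she", "she": "he", "him": "her", "her": "him",
        "his": "her", "man": "woman", "woman": "man"}

def gender_swap(sentence):
    # Token-level swap; a real implementation would also handle case and morphology.
    return " ".join(SWAP.get(tok, tok) for tok in sentence.lower().split())

augmented = [gender_swap(s) for s in ["The doctor said he was busy ."]]
print(augmented)  # ['the doctor said she was busy .']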

5.4.3 Language Model

In addition to measuring discrimination in the training text directly, the researchers of [43] propose a metric for evaluating gender bias in text generated by a recurrent-neural-network language model trained on a text corpus. Bias is summarized using the mean absolute value and the standard deviation of the proposed score, together with fitting a univariate linear regression over it, and the usefulness of each of these measures for assessing bias is then examined. The score is given by Equation 2, in which w is any word in the corpus, f is a set of gendered words that belong to the female category, such as she, her, and woman, and m is the corresponding male category (a plausible reconstruction of the score is given below).
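A plausible reconstruction of that word-level bias score (the windowed co-occurrence counts and the normalization are assumptions here):

\mathrm{bias}(w) \;=\; \log\frac{P(w \mid f)}{P(w \mid m)},
\qquad
P(w \mid g) \;=\; \frac{c(w, g)}{\sum_{w'} c(w', g)},

where c(w, g) counts how often w co-occurs with the gendered words in set g within a fixed context window; a positive score means the word appears more often in female contexts and a negative score more often in male contexts.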

Additionally, following the soft debiasing method described in [33], they add a regularization loss term to their language model that reduces the projection of the embeddings learned by the encoder onto the embedding of the gender subspace. Finally, they evaluate how well their approach reduces gender bias and conclude that some perplexity must be sacrificed in order to reduce the bias. They also argue that word-level bias measures are preferable to corpus-level measures.

5.4.4 Sentence Encoder

The researchers in [32] extend work on bias detection to sentence encoders. They generalize bias-measuring methods such as the Word Embedding Association Test (WEAT) to the sentence-encoder setting by proposing the Sentence Encoder Association Test (SEAT). Applying SEAT to modern sentence encoding approaches including CBoW, GPT, ELMo, and BERT, they found some evidence of human-like bias in sentence encoders, although more recent techniques such as BERT appear less susceptible. They do not claim that these models are free of bias; rather, they suggest that more sensitive bias-detection methods may be needed in these settings, encouraging further research in this area (a small sketch of the WEAT-style test follows).
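A small sketch of the WEAT-style association test that SEAT generalizes: the association of a target vector with two attribute word sets is the difference of its mean cosine similarities, and the effect size compares two target sets. The vectors would come from any encoder; the function names here are illustrative.

import numpy as np

def cos(u, v):
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

def assoc(w, A, B):
    # Association of target vector w with attribute sets A and B.
    return np.mean([cos(w, a) for a in A]) - np.mean([cos(w, b) for b in B])

def weat_effect_size(X, Y, A, B):
    # X, Y: two sets of target vectors; A, B: two sets of attribute vectors.
    sx = [assoc(x, A, B) for x in X]
    sy = [assoc(y, A, B) for y in Y]
    return (np.mean(sx) - np.mean(sy)) / np.std(sx + sy, ddof=1)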

5.4.5 Machine Translation

The authors of [12] translated the following two sentences from English into Spanish and discovered that, even though the word "friend" should be rendered the same way in both cases, they obtained different results: "She works at a hospital, my friend is a nurse" and "She works at a hospital, my friend is a doctor." In both sentences "friend" should have been translated as "amiga," the feminine form in Spanish; instead, in the second sentence it was translated as "amigo," the masculine form. This reflects the stereotype that doctors are more strongly associated with men and nurses with women, a bias the system reproduced in its output. The researchers of [9] develop a method to address this problem by making use of the word embeddings that the translation system employs: they incorporate debiased word-embedding techniques into the machine translation pipeline. This not only helped reduce the bias present in their system but also improved its performance by one BLEU point. The authors of [22] demonstrate how Google Translate can exhibit gender bias, favouring men for stereotyped fields such as STEM jobs, by translating sentences built from U.S. Bureau of Labor Statistics occupation data into gender-neutral languages such as Yoruba, Hungarian, and Chinese and then back into English. In [10], researchers compiled and analysed the Europarl dataset [11], a large political, multilingual dataset used in machine translation, and found that far more of the data comes from men across all age groups, with the possible exception of the youngest group (20-30), which accounts for only a very small share of the total number of sentences (0.71%). Analysis of the complete dataset showed that 67.39% of the sentences were spoken by men. To reduce gender-related problems and improve morphological agreement in machine translation, a tag indicating the speaker's gender was added to each sentence on the English source side. Although this helped the system most of the time, it did not always, so further research has been suggested to find other ways of incorporating speaker information.

5.4.6 Named Entity Recognition

In [10], the authors investigate a particular bias present in various named entity recognition (NER) systems. Specifically, they observe that female names are more often than male names tagged as non-person entities, or not tagged at all, in contexts where the name should be tagged as a person entity, as in "John is a person" or "John is attending school." To better characterize these findings, the researchers propose six distinct evaluation metrics that quantify the degree of bias across a wide range of NER systems. They apply these metrics to names drawn from U.S. census data that were inserted into template sentences describing human activities.

5.5 Comparison of Different Mitigation Algorithms

Research on computational fairness is still in its early stages, so there is considerable room for improvement. That said, there are existing articles that propose fair AI algorithms and bias mitigation strategies, compare alternative mitigation techniques on various benchmarks, and discuss other fairness-related topics. For example, to build fair approximations, the authors of [12] propose a geometric approach that removes correlation between protected and unprotected attributes. The proposed method uses a tunable parameter to control the trade-off between accuracy and fairness. The researchers assess the effectiveness of their method on several benchmark datasets, including COMPAS, Adult, and German, and compare it to other fair machine learning methods that expose fairness and accuracy parameters [15]. Additionally, IBM's AI Fairness 360 (AIF360) toolkit [13] has implemented many recent fair machine learning algorithms and presents some of the results as demos, which interested readers can use to evaluate different approaches against various fairness metrics, as sketched below.
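As a hedged illustration, the toolkit can be used roughly as follows to compare a fairness metric before and after a pre-processing mitigation; the class names follow the toolkit's documentation, but the exact interfaces (and the requirement that the Adult raw data files be available locally) should be checked against the installed version:

from aif360.datasets import AdultDataset
from aif360.metrics import BinaryLabelDatasetMetric
from aif360.algorithms.preprocessing import Reweighing

priv, unpriv = [{"sex": 1}], [{"sex": 0}]   # privileged / unprivileged groups
data = AdultDataset()                       # assumes the raw Adult files are present

before = BinaryLabelDatasetMetric(data, unprivileged_groups=unpriv, privileged_groups=priv)
print("disparate impact before:", before.disparate_impact())

rw = Reweighing(unprivileged_groups=unpriv, privileged_groups=priv)
data_rw = rw.fit_transform(data)            # reweights examples to balance groups
after = BinaryLabelDatasetMetric(data_rw, unprivileged_groups=unpriv, privileged_groups=priv)
print("disparate impact after:", after.disparate_impact())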

6 Fairness Research Challenges and Opportunities

Fairness has been defined and approached in a variety of ways throughout the literature, but this field of research is far from finished. Many open research questions remain in the areas of fairness and algorithmic bias. In this section, we outline under-researched issues with potential for growth and point to current concerns in fairness research.

6.1 Challenges

(1) Synthesizing a definition of fairness. The fairness literature still has a number of issues to tackle. One of them is agreeing on a definition of fairness. The field has put forward several definitions of what fairness might look like from a machine learning standpoint. These definitions cover a variety of use cases and, as a consequence, embody somewhat different ideas of what is equitable.

As a result, it is very difficult to predict how a given fairness solution would perform under a different understanding of fairness. Synthesizing the various definitions into one remains an open research challenge, as it would make the evaluation of these concepts more unified and comparable. A more unified definition and framework could also help resolve the incompatibility between some of the existing fairness definitions.

(2) From equality to equity. The definitions given in the literature primarily center on equality, ensuring that each individual or group receives an equal share of resources, attention, or outcomes. Equity, the idea that each individual or group is given the resources and opportunities they need to succeed, has by contrast received scant attention [55]. It would be interesting to operationalize this notion and investigate how it complements or conflicts with existing notions of fairness.

(3) Searching for unfairness. Given a definition of fairness, it should be possible to search a given dataset for instances of unfairness. Progress has been made on the related problem of data bias, for example by automatically detecting instances of Simpson's paradox in arbitrary datasets; detecting unfairness more broadly may require more attention because of the variety of definitions and the subtleties involved in identifying each one.

6.2 Opportunities

The current state of research into algorithmic bias and fairness, with a special emphasis on machine learning, has been described in this survey. Even within this single area, the body of work is extensive: there have been attempts to improve fairness in various subfields, including clustering, pattern recognition, and natural language processing. However, the research community has not given every topic the same level of attention. Figure 7 summarizes what has been done to address fairness in different contexts, organized by the nature and scope of the fairness notion. Some topics (such as activity recognition at the subgroup level) have received little attention in the literature and may make excellent subjects for future study.

7 Conclusions

In this survey, we discussed issues that can compromise the fairness of AI technologies and introduce bias. Data and algorithms were largely used as the lenses through which we viewed these problems. We presented cases showing why fairness is a crucial concern and gave examples of the potential negative effects that unfairness can have on society in the real world, including facial recognition software, advertising algorithms, and judicial systems. We then discussed the notions of bias and fairness proposed by researchers. To further pique readers' curiosity, we surveyed work from several fields on tackling the biases that may affect AI systems, spanning various techniques and areas of AI, such as general machine learning, deep learning, and natural language processing. We then divided these fields further to examine each subdomain in more depth, along with the efforts being made to address its fairness limitations. We hope that by broadening readers' horizons, this survey will inspire them to think critically as they develop systems and methods that are less likely to harm or be biased against particular communities. Given the increasing use of AI in modern society, it is critical that researchers take this problem seriously and broaden their expertise in this area. In this study, we organized and taxonomized the work that has been done so far to address the various fairness-related difficulties across many disciplines. Future research can pursue additional approaches to the open issues and biases in AI covered in the preceding sections.

8 References

[1] Alekh Agarwal, Miroslav Dudik, and Zhiwei Steven Wu. 2019. Fair Regression: Quantitative Definitions and Reduction- Based Algorithms. In International Conference on Machine Learning. 120–129.

[2] Sina Aghaei, Mohammad Javad Azizi, and Phebe Vayanos. 2019. Learning optimal and fair decision trees for non- discriminative decision-making. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 1418–1426.

[3] Nazanin Alipourfard, Peter G Fennell, and Kristina Lerman. 2018. Can you Trust the Trend?: Discovering Simpson’s Paradoxes in Social Data. In Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining. ACM, 19–27.

[4] Nazanin Alipourfard, Peter G Fennell, and Kristina Lerman. 2018. Using Simpson’s Paradox to Discover Interesting Patterns in Behavioral Data. In Twelfth International AAAI Conference on Web and Social Media.

[5] Alexander Amini, Ava Soleimany, Wilko Schwarting, Sangeeta Bhatia, and Daniela Rus. 2019. Uncovering and Mitigating Algorithmic Bias through Learned Latent Structure. (2019).

[6] Julia Angwin, Jeff Larson, Surya Mattu, and Lauren Kirchner. 2016. Machine Bias: there’s software used across the country to predict future criminals. And it’s biased against blacks. ProPublica 2016.

[7] A. Asuncion and D.J. Newman. 2007. UCI Machine Learning Repository. http://www.ics.uci.edu/~mlearn/MLRepository.html

[8] Arturs Backurs, Piotr Indyk, Krzysztof Onak, Baruch Schieber, Ali Vakilian, and Tal Wagner. 2019. Scalable Fair Clustering. In Proceedings of the 36th International Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 97), Kamalika Chaudhuri and Ruslan Salakhutdinov (Eds.). PMLR, Long Beach, California, USA, 405–413. http://proceedings.mlr.press/v97/backurs19a.html

[9] Ricardo Baeza-Yates. 2018. Bias on the Web. Commun. ACM 61, 6 (May 2018), 54–61. https://doi.org/10.1145/3209581

[10] Samuel Barbosa, Dan Cosley, Amit Sharma, and Roberto M. Cesar-Jr. 2016. Averaging Gone Wrong: Using Time- Aware Analyses to Better Understand Behavior. (April 2016), 829–841.

[11] Rachel KE Bellamy, Kuntal Dey, Michael Hind, Samuel C Hoffman, Stephanie Houde, Kalapriya Kannan, Pranay Lohia, Jacquelyn Martino, Sameep Mehta, Aleksandra Mojsilovic, et al. 2018. Ai fairness 360: An extensible toolkit for detecting, understanding, and mitigating unwanted algorithmic bias. arXiv preprint arXiv:1810.01943 (2018).

[12] Emily M. Bender and Batya Friedman. 2018. Data Statements for Natural Language Processing: Toward Mitigating System Bias and Enabling Better Science. Transactions of the Association for Computational Linguistics 6 (2018), 587–604. https://doi.org/10.1162/tacl_a_00041

[13] Misha Benjamin, Paul Gagnon, Negar Rostamzadeh, Chris Pal, Yoshua Bengio, and Alex Shee. [n.d.]. Towards Standardization of Data Licenses: The Montreal Data License. ([n.d.]).

[14] Richard Berk, Hoda Heidari, Shahin Jabbari, Matthew Joseph, Michael Kearns, Jamie Morgenstern, Seth Neel, and Aaron Roth. 2017. A Convex Framework for Fair Regression. arXiv:1706.02409 [cs.LG]

[15] Richard Berk, Hoda Heidari, Shahin Jabbari, Michael Kearns, and Aaron Roth. [n.d.]. Fairness in criminal justice risk assessments: The state of the art. Sociological Methods & Research ([n. d.]), 0049124118782533.

[16] Peter J Bickel, Eugene A Hammel, and J William O’Connell. 1975. Sex bias in graduate admissions: Data from Berkeley. Science 187, 4175 (1975), 398–404.

[17] RDP Binns. 2018. Fairness in machine learning: Lessons from political philosophy. Journal of Machine Learning Research (2018).

[18] Colin R Blyth. 1972. On Simpson’s paradox and the sure-thing principle. J. Amer. Statist. Assoc. 67, 338 (1972), 364–366.

[19] Miranda Bogen and Aaron Rieke. 2018. Help wanted: An examination of hiring algorithms, equity, and bias. Technical report, Upturn.

[20] Tolga Bolukbasi, Kai-Wei Chang, James Y Zou, Venkatesh Saligrama, and Adam T Kalai. 2016. Man is to computer programmer as woman is to homemaker? debiasing word embeddings. In Advances in neural information processing systems. 4349–4357.

[21] Shikha Bordia and Samuel Bowman. 2019. Identifying and Reducing Gender Bias in Word-Level Language Models. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop. 7–15.

[22] Avishek Bose and William Hamilton. 2019. Compositional Fairness Constraints for Graph Embeddings. In International Conference on Machine Learning. 715–724.

[23] Marc-Etienne Brunet, Colleen Alkalay-Houlihan, Ashton Anderson, and Richard Zemel. 2019. Understanding the Origins of Bias in Word Embeddings. In Proceedings of the 36th International Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 97), Kamalika Chaudhuri and Ruslan Salakhutdinov (Eds.). PMLR, Long Beach, California, USA, 803–811. http://proceedings.mlr.press/v97/brunet19a.html

[24] Joy Buolamwini and Timnit Gebru. 2018. Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification. In Proceedings of the 1st Conference on Fairness, Accountability and Transparency (Proceedings of Machine Learning Research, Vol. 81), Sorelle A. Friedler and Christo Wilson (Eds.). PMLR, New York, NY, USA, 77–91. http://proceedings.mlr.press/v81/buolamwini18a.html

[25] Toon Calders and Sicco Verwer. 2010. Three naive Bayes approaches for discrimination-free classification. Data Mining and Knowledge Discovery 21, 2 (2010), 277–292.

[26] Aylin Caliskan, Joanna J Bryson, and Arvind Narayanan. 2017. Semantics derived automatically from language corpora contain human-like biases. Science 356, 6334 (2017), 183–186.

[27] Flavio Calmon, Dennis Wei, Bhanukiran Vinzamuri, Karthikeyan Natesan Ramamurthy, and Kush R Varshney. 2017. Optimized Pre-Processing for Discrimination Prevention. In Advances in Neural Information Processing Systems 30, I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (Eds.). Curran Associates, Inc., 3992–4001. http://papers.nips.cc/paper/6988-optimized-pre-processing-for-discrimination-prevention.pdf

[28] Manel Capdevila, Marta Ferrer, and Eulália Luque. 2005. La reincidencia en el delito en la justicia de menores. Centro de estudios jurídicos y formación especializada, Generalitat de Catalunya. Documento no publicado (2005).

[29] Allison JB Chaney, Brandon M Stewart, and Barbara E Engelhardt. 2018. How algorithmic confounding in recommendation systems increases homogeneity and decreases utility. In Proceedings of the 12th ACM Conference on Recommender Systems. ACM, 224–232.

[30] Jiahao Chen, Nathan Kallus, Xiaojie Mao, Geoffry Svacha, and Madeleine Udell. 2019. Fairness Under Unawareness: Assessing Disparity When Protected Class Is Unobserved. In Proceedings of the Conference on Fairness, Accountability, and Transparency. ACM, 339–348.

[31] Xingyu Chen, Brandon Fain, Liang Lyu, and Kamesh Munagala. 2019. Proportionally Fair Clustering. In International Conference on Machine Learning. 1032–1041.

[32] S. Chiappa. 2019. Path-specific Counterfactual Fairness. In Thirty-Third AAAI Conference on Artificial Intelligence. 7801–7808.

[33] S. Chiappa and W. S. Isaac. 2019. A Causal Bayesian Networks Viewpoint on Fairness. In E. Kosta, J. Pierson, D. Slamanig, S. Fischer-Hübner, S. Krenn (eds) Privacy and Identity Management. Fairness, Accountability, and Transparency in the Age of Big Data. Privacy and Identity 2018. IFIP Advances in Information and Communication Technology, Vol. 547. Springer, Cham.

[34] Alexandra Chouldechova. 2017. Fair prediction with disparate impact: A study of bias in recidivism prediction instruments. Big data 5, 2 (2017), 153–163.

[35] Alexandra Chouldechova, Diana Benavides-Prado, Oleksandr Fialko, and Rhema Vaithianathan. 2018. A case study of algorithm-assisted decision making in child maltreatment hotline screening decisions. In Proceedings of the 1st Conference on Fairness, Accountability and Transparency (Proceedings of Machine Learning Research, Vol. 81), Sorelle A. Friedler and Christo Wilson (Eds.). PMLR, New York, NY, USA, 134–148. http://proceedings.mlr.press/v81/chouldechova18a.html

[36] Alexandra Chouldechova and Aaron Roth. 2018. The frontiers of fairness in machine learning. arXiv preprint arXiv:1810.08810 (2018).

[37] John S Chuang, Olivier Rivoire, and Stanislas Leibler. 2009. Simpson’s paradox in a synthetic microbial system. Science 323, 5911 (2009), 272–275.

[38] Kevin A Clarke. 2005. The phantom menace: Omitted variable bias in econometric research. Conflict management and peace science 22, 4 (2005), 341–352.

[39] Lee Cohen, Zachary C. Lipton, and Yishay Mansour. 2019. Efficient candidate screening under multiple tests and implications for fairness. arXiv:1905.11361 [cs.LG]

[40] United States. Equal Employment Opportunity Commission. [n.d.]. EEOC compliance manual. [Washington, D.C.] : U.S. Equal Employment Opportunity Commission, [1992].

[41] Sam Corbett-Davies, Emma Pierson, Avi Feller, Sharad Goel, and Aziz Huq. 2017. Algorithmic decision making and the cost of fairness. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 797–806.

[42] Elliot Creager, David Madras, Joern-Henrik Jacobsen, Marissa Weis, Kevin Swersky, Toniann Pitassi, and Richard Zemel. 2019. Flexibly Fair Representation Learning by Disentanglement. In International Conference on Machine Learning. 1436–1445.

[43] Brian d’Alessandro, Cathy O’Neil, and Tom LaGatta. 2017. Conscientious classification: A data scientist’s guide to discrimination-aware classification. Big data 5, 2 (2017), 120–134.

[44] David Danks and Alex John London. 2017. Algorithmic Bias in Autonomous Systems. In IJCAI. 4691–4697.

[45] Shai Danziger, Jonathan Levav, and Liora Avnaim-Pesso. 2011. Extraneous factors in judicial decisions. Proceedings of the National Academy of Sciences 108, 17 (2011), 6889–6892.

[46] Julia Dressel and Hany Farid. 2018. The accuracy, fairness, and limits of predicting recidivism. Science Advances 4, 1 (2018). https://doi.org/10.1126/sciadv.aao5580 arXiv:https://advances.sciencemag.org/content/4/1/eaao5580.full.pdf

[47] Dheeru Dua and Casey Graff. 2017. UCI Machine Learning Repository. http://archive.ics.uci.edu/ml

[48] Cynthia Dwork, Moritz Hardt, Toniann Pitassi, Omer Reingold, and Richard Zemel. 2012. Fairness Through Awareness. In Proceedings of the 3rd Innovations in Theoretical Computer Science Conference (Cambridge, Massachusetts) (ITCS ’12). ACM, New York, NY, USA, 214–226. https://doi.org/10.1145/2090236.2090255

[49] Cynthia Dwork, Nicole Immorlica, Adam Tauman Kalai, and Max Leiserson. 2018. Decoupled Classifiers for Group-Fair and Efficient Machine Learning. In Proceedings of the 1st Conference on Fairness, Accountability and Transparency (Proceedings of Machine Learning Research, Vol. 81), Sorelle A. Friedler and Christo Wilson (Eds.). PMLR, New York, NY, USA, 119–133. http://proceedings.mlr.press/v81/dwork18a.html

[50] Golnoosh Farnadi, Behrouz Babaki, and Lise Getoor. 2018. Fairness in Relational Domains. In Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society (New Orleans, LA, USA) (AIES ’18). ACM, New York, NY, USA, 108–114. https://doi.org/10.1145/3278721.3278733
