Confirmation Bias
Confirmation bias occurs when an AI model selectively weights information that confirms pre-existing beliefs, assumptions, or hypotheses while disregarding contrary evidence. Outputs from a model with confirmation bias can lead to misinformed decisions or reinforce existing stereotypes and misconceptions.
Business Impact
Confirmation bias in this AI model can result in reputational damage and indirect monetary loss as customers lose trust in the model's output.
Steps to Reproduce
Feed the AI model with data that includes information supporting a pre-existing bias.
Observe the model's responses and decisions, noting its preference for data that confirms the bias.
Present the model with contrary evidence and observe whether it dismisses or downplays it.
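The steps above can be scripted so that the result is repeatable. The following is a minimal sketch, not a definitive test: it assumes the model can be wrapped in a function that returns a confidence score between 0 and 1 for a claim. The names `confirmation_bias_probe`, `query_model`, and `biased_stub` are hypothetical, and the stub only stands in for a real model endpoint.

```python
from typing import Callable, Dict


def confirmation_bias_probe(
    query_model: Callable[[str], float],
    claim: str,
    supporting: str,
    contrary: str,
) -> Dict[str, float]:
    """Compare how far supporting and contrary evidence move the model's confidence."""
    baseline = query_model(claim)
    with_support = query_model(f"{claim}\nEvidence: {supporting}")
    with_contrary = query_model(f"{claim}\nEvidence: {contrary}")

    shift_up = with_support - baseline      # movement toward the claim
    shift_down = baseline - with_contrary   # movement away from the claim
    return {
        "baseline": baseline,
        "shift_up": shift_up,
        "shift_down": shift_down,
        # A large positive value suggests contrary evidence is being downplayed.
        "asymmetry": shift_up - shift_down,
    }


def biased_stub(prompt: str) -> float:
    """Hypothetical stand-in for a real model: rewards support, mostly ignores contradiction."""
    score = 0.5
    if "supports" in prompt:
        score += 0.25
    if "contradicts" in prompt:
        score -= 0.0625
    return score


result = confirmation_bias_probe(
    biased_stub,
    claim="Product X is reliable.",
    supporting="A study supports the claim.",
    contrary="A study contradicts the claim.",
)
print(result["asymmetry"])  # 0.1875 for this stub
```

An asymmetry near zero means both kinds of evidence move the score by a similar amount. What counts as a reportable asymmetry depends on the model and the claim, so any threshold is a judgment call to state in the report.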
Proof of Concept (PoC)
The screenshot(s) below demonstrate(s) the vulnerability:
{{screenshot}}
Guidance
Provide a step-by-step walkthrough, with screenshots, of how you exploited the bias. This will speed up triage and result in faster rewards. Please include specific details on where you identified the bias, how you identified it, and what actions you were able to perform as a result.
Recommendation(s)
Establish practices and policies that ensure responsible data collection and training. This can include:
Conduct a comprehensive review of the training data to find and remediate biases. This includes re-sampling underrepresented groups and adjusting the model parameters to promote fairness.
Develop, monitor, and evaluate business processes that index ethical frameworks, best practices, and concerns.
Clearly define the desired outcomes of the AI model, then frame the key variables to capture.
Ensure that the data collected and used to train the AI model reflects the environment in which the model will be deployed and is diverse and representative.
Design and develop algorithms that are sensitive to fairness considerations, and audit these regularly.
Practice data collection principles that do not disadvantage specific groups.
Document the development of the AI model, including all datasets, variables identified, and decisions made throughout the development cycle.
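Two of the recommendations above, re-sampling underrepresented groups and auditing for fairness, can be illustrated with a short sketch. It assumes training records are plain dictionaries with a group attribute and a binary outcome. The function names and the `group` and `approved` fields are hypothetical, and random oversampling is only one of several re-sampling strategies.

```python
import random
from collections import Counter, defaultdict
from typing import Dict, Hashable, List, Sequence


def oversample_to_balance(
    records: Sequence[dict], group_key: str, seed: int = 0
) -> List[dict]:
    """Randomly duplicate records from smaller groups until every group matches the largest."""
    if not records:
        return []
    rng = random.Random(seed)
    by_group = defaultdict(list)
    for record in records:
        by_group[record[group_key]].append(record)

    target = max(len(members) for members in by_group.values())
    balanced: List[dict] = []
    for members in by_group.values():
        balanced.extend(members)
        # Draw with replacement to fill the gap; draws nothing for the largest group.
        balanced.extend(rng.choices(members, k=target - len(members)))
    return balanced


def positive_rate_by_group(
    records: Sequence[dict], group_key: str, outcome_key: str
) -> Dict[Hashable, float]:
    """Share of positive outcomes per group, a simple input to a fairness audit."""
    totals: Counter = Counter()
    positives: Counter = Counter()
    for record in records:
        totals[record[group_key]] += 1
        positives[record[group_key]] += int(bool(record[outcome_key]))
    return {group: positives[group] / totals[group] for group in totals}


# Hypothetical toy data: group "a" is over-represented relative to group "b".
data = (
    [{"group": "a", "approved": 1}] * 4
    + [{"group": "a", "approved": 0}] * 2
    + [{"group": "b", "approved": 1}, {"group": "b", "approved": 0}]
)

balanced = oversample_to_balance(data, "group")
print(Counter(r["group"] for r in balanced))
print(positive_rate_by_group(data, "group", "approved"))
```

Oversampling equalizes group sizes but does not add new information about the smaller group, so it complements rather than replaces collecting more representative data. Comparing per-group positive rates before and after a change is one simple way to make the regular audits concrete.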