PUBLICATIONS

Investigating and mitigating biases in crowdsourced data

Authors

Danula Hettiachchi

Mark Sanderson

Jorge Goncalves

Gabriella Kazai

Publication date

2021

Total citations

Cited by 6

Description

It is common practice for machine learning systems to rely on crowdsourced label data for training and evaluation. It is also well known that biases present in the label data can induce biases in the trained models. Biases may be introduced by the mechanisms used to decide what data should or could be labelled, or by the mechanisms employed to obtain the labels. Various approaches have been proposed to detect and correct biases once the label dataset has been constructed. However, proactively reducing biases during the data labelling phase and ensuring data fairness could be more economical than post-processing bias mitigation approaches. In this workshop, we aim to foster discussion on ongoing research around biases in crowdsourced data and to identify future research directions to detect, quantify and mitigate biases before, during and after the labelling process such that both task …

OptimalAI