Abstract
In this work, we address the problem of predicting dropout risk in
undergraduate studies from the perspective of algorithmic fairness. We develop
a machine learning method to predict the risks of university dropout and
underperformance. The objective is to understand whether such a system can
identify students at risk while avoiding potential discriminatory biases. When modeling
both risks, we obtain prediction models with an Area Under the ROC Curve (AUC)
of 0.77-0.78 based on the data available at enrollment time, before the
first year of studies begins. This data includes the students' demographics,
the high school they attended, and their admission (average) grade. Our models
are calibrated: they produce estimated probabilities for each risk, not mere
scores. We analyze whether this method leads to discriminatory outcomes for
sensitive groups in terms of prediction accuracy (AUC) and error rates
(Generalized False Positive Rate, GFPR, and Generalized False Negative Rate,
GFNR). The models exhibit equity in terms of AUC and GFNR across groups:
similar GFNR values imply a similar probability of failing to detect risk
among students who eventually drop out. The remaining disparities in GFPR are
addressed through a mitigation process that does not affect the calibration
of the models.
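The generalized error rates mentioned above extend the usual false positive and false negative rates to calibrated probability scores: the GFPR is the mean predicted risk among students who did not drop out, and the GFNR is the mean predicted "no-risk" probability among students who did. A minimal sketch of these metrics, together with AUC, is shown below; the function names and toy data are illustrative, not taken from the paper's implementation.

```python
import numpy as np

def auc(y, s):
    """AUC as the probability that a random positive example receives a
    higher score than a random negative one (ties count as half)."""
    pos, neg = s[y == 1], s[y == 0]
    diff = pos[:, None] - neg[None, :]
    return ((diff > 0).sum() + 0.5 * (diff == 0).sum()) / (len(pos) * len(neg))

def gfpr(y, s):
    """Generalized FPR: mean predicted risk among true negatives."""
    return s[y == 0].mean()

def gfnr(y, s):
    """Generalized FNR: mean predicted no-risk probability among true positives."""
    return (1 - s[y == 1]).mean()

# Toy example: y = dropout labels, s = calibrated risk probabilities.
y = np.array([0, 0, 1, 1])
s = np.array([0.1, 0.4, 0.35, 0.8])
print(auc(y, s), gfpr(y, s), gfnr(y, s))
```

A fairness audit of the kind described in the abstract would compute these quantities separately for each sensitive group and compare the resulting values.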