Misc,

Transient-optimised real-bogus classification with Bayesian Convolutional Neural Networks -- sifting the GOTO candidate stream

T. Killestein, J. Lyman, D. Steeghs, K. Ackley, M. Dyer, K. Ulaczyk, R. Cutter, Y. Mong, D. Galloway, V. Dhillon, P. O'Brien, G. Ramsay, S. Poshyachinda, R. Kotak, R. Breton, L. Nuttall, E. Pallé, D. Pollacco, E. Thrane, S. Aukkaravittayapun, S. Awiphan, U. Burhanudin, P. Chote, A. Chrimes, E. Daw, C. Duffy, R. Eyles-Ferris, B. Gompertz, T. Heikkilä, P. Irawati, M. Kennedy, A. Levan, S. Littlefair, L. Makrygianni, D. Sánchez, S. Mattila, J. Maund, J. McCormac, D. Mkrtichian, J. Mullaney, E. Rol, U. Sawangwit, E. Stanway, R. Starling, P. Strøm, S. Tooke, K. Wiersema, and S. Williams.
(2021)cite arxiv:2102.09892Comment: 17 pages, 12 figures, resubmitted to MNRAS following reviewer comments.

Abstract

Large-scale sky surveys have played a transformative role in our understanding of astrophysical transients, only made possible by increasingly powerful machine learning-based filtering to accurately sift through the vast quantities of incoming data generated. In this paper, we present a new real-bogus classifier based on a Bayesian convolutional neural network that provides nuanced, uncertainty-aware classification of transient candidates in difference imaging, and demonstrate its application to the datastream from the GOTO wide-field optical survey. Not only are candidates assigned a well-calibrated probability of being real, but also an associated confidence that can be used to prioritise human vetting efforts and inform future model optimisation via active learning. To fully realise the potential of this architecture, we present a fully-automated training set generation method which requires no human labelling, incorporating a novel data-driven augmentation method to significantly improve the recovery of faint and nuclear transient sources. We achieve competitive classification accuracy (FPR and FNR both below 1%) compared against classifiers trained with fully human-labelled datasets, whilst being significantly quicker and less labour-intensive to build. This data-driven approach is uniquely scalable to the upcoming challenges and data needs of next-generation transient surveys. We make our data generation and model training codes available to the community.

BibTeX key: killestein2021transientoptimised
entry type: misc
year: 2021
url: http://arxiv.org/abs/2102.09892
note: cite arxiv:2102.09892Comment: 17 pages, 12 figures, resubmitted to MNRAS following reviewer comments

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

%0 Generic %1 killestein2021transientoptimised %A Killestein, T. L. %A Lyman, J. %A Steeghs, D. %A Ackley, K. %A Dyer, M. J. %A Ulaczyk, K. %A Cutter, R. %A Mong, Y. L. %A Galloway, D. K. %A Dhillon, V. %A O'Brien, P. %A Ramsay, G. %A Poshyachinda, S. %A Kotak, R. %A Breton, R. P. %A Nuttall, L. K. %A Pallé, E. %A Pollacco, D. %A Thrane, E. %A Aukkaravittayapun, S. %A Awiphan, S. %A Burhanudin, U. %A Chote, P. %A Chrimes, A. %A Daw, E. %A Duffy, C. %A Eyles-Ferris, R. %A Gompertz, B. %A Heikkilä, T. %A Irawati, P. %A Kennedy, M. R. %A Levan, A. %A Littlefair, S. %A Makrygianni, L. %A Sánchez, D. Mata %A Mattila, S. %A Maund, J. %A McCormac, J. %A Mkrtichian, D. %A Mullaney, J. %A Rol, E. %A Sawangwit, U. %A Stanway, E. %A Starling, R. %A Strøm, P. A. %A Tooke, S. %A Wiersema, K. %A Williams, S. C. %D 2021 %K bayesian cnn %T Transient-optimised real-bogus classification with Bayesian Convolutional Neural Networks -- sifting the GOTO candidate stream %U http://arxiv.org/abs/2102.09892 %X Large-scale sky surveys have played a transformative role in our understanding of astrophysical transients, only made possible by increasingly powerful machine learning-based filtering to accurately sift through the vast quantities of incoming data generated. In this paper, we present a new real-bogus classifier based on a Bayesian convolutional neural network that provides nuanced, uncertainty-aware classification of transient candidates in difference imaging, and demonstrate its application to the datastream from the GOTO wide-field optical survey. Not only are candidates assigned a well-calibrated probability of being real, but also an associated confidence that can be used to prioritise human vetting efforts and inform future model optimisation via active learning. To fully realise the potential of this architecture, we present a fully-automated training set generation method which requires no human labelling, incorporating a novel data-driven augmentation method to significantly improve the recovery of faint and nuclear transient sources. We achieve competitive classification accuracy (FPR and FNR both below 1%) compared against classifiers trained with fully human-labelled datasets, whilst being significantly quicker and less labour-intensive to build. This data-driven approach is uniquely scalable to the upcoming challenges and data needs of next-generation transient surveys. We make our data generation and model training codes available to the community.

@misc{killestein2021transientoptimised, abstract = {Large-scale sky surveys have played a transformative role in our understanding of astrophysical transients, only made possible by increasingly powerful machine learning-based filtering to accurately sift through the vast quantities of incoming data generated. In this paper, we present a new real-bogus classifier based on a Bayesian convolutional neural network that provides nuanced, uncertainty-aware classification of transient candidates in difference imaging, and demonstrate its application to the datastream from the GOTO wide-field optical survey. Not only are candidates assigned a well-calibrated probability of being real, but also an associated confidence that can be used to prioritise human vetting efforts and inform future model optimisation via active learning. To fully realise the potential of this architecture, we present a fully-automated training set generation method which requires no human labelling, incorporating a novel data-driven augmentation method to significantly improve the recovery of faint and nuclear transient sources. We achieve competitive classification accuracy (FPR and FNR both below 1%) compared against classifiers trained with fully human-labelled datasets, whilst being significantly quicker and less labour-intensive to build. This data-driven approach is uniquely scalable to the upcoming challenges and data needs of next-generation transient surveys. We make our data generation and model training codes available to the community.}, added-at = {2021-02-22T08:45:31.000+0100}, author = {Killestein, T. L. and Lyman, J. and Steeghs, D. and Ackley, K. and Dyer, M. J. and Ulaczyk, K. and Cutter, R. and Mong, Y. L. and Galloway, D. K. and Dhillon, V. and O'Brien, P. and Ramsay, G. and Poshyachinda, S. and Kotak, R. and Breton, R. P. and Nuttall, L. K. and Pallé, E. and Pollacco, D. and Thrane, E. and Aukkaravittayapun, S. and Awiphan, S. and Burhanudin, U. and Chote, P. and Chrimes, A. and Daw, E. and Duffy, C. and Eyles-Ferris, R. and Gompertz, B. and Heikkilä, T. and Irawati, P. and Kennedy, M. R. and Levan, A. and Littlefair, S. and Makrygianni, L. and Sánchez, D. Mata and Mattila, S. and Maund, J. and McCormac, J. and Mkrtichian, D. and Mullaney, J. and Rol, E. and Sawangwit, U. and Stanway, E. and Starling, R. and Strøm, P. A. and Tooke, S. and Wiersema, K. and Williams, S. C.}, biburl = {https://www.bibsonomy.org/bibtex/279804bef95d7ad92feee2d4610f2bd16/lprudenzi}, description = {Transient-optimised real-bogus classification with Bayesian Convolutional Neural Networks -- sifting the GOTO candidate stream}, interhash = {447c3cb7f7553bd7f8d28b8b07c6bb6d}, intrahash = {79804bef95d7ad92feee2d4610f2bd16}, keywords = {bayesian cnn}, note = {cite arxiv:2102.09892Comment: 17 pages, 12 figures, resubmitted to MNRAS following reviewer comments}, timestamp = {2021-02-22T08:45:31.000+0100}, title = {Transient-optimised real-bogus classification with Bayesian Convolutional Neural Networks -- sifting the GOTO candidate stream}, url = {http://arxiv.org/abs/2102.09892}, year = 2021 }

BibSonomy

Transient-optimised real-bogus classification with Bayesian Convolutional Neural Networks -- sifting the GOTO candidate stream

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on