Ensemble Deep Learning for Real–Bogus Classification with Sky Survey Images
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
The discovery of the fifth gravitational wave, GW170817, and its electromagnetic counterpart, resulting from the merger of neutron stars by the LIGO and Virgo teams, marked a major milestone in astronomy. It was the first time that gravitational waves and light from the same cosmic event were observed simultaneously. The LIGO detectors in the United States recorded the signal for 100 s, longer than in previous detections. The merging of neutron stars emits both gravitational and electromagnetic waves across all frequencies—from radio to gamma rays. However, pinpointing the exact source remains difficult, requiring rapid sky scanning to locate it. To address this challenge, the Gravitational-Wave Optical Transient Observer (GOTO) project was established. It is specifically designed to detect optical light from transient events associated with gravitational waves, enabling faster follow-up observations and a deeper study of these short-lived astronomical phenomena, which appear and disappear quickly in the universe. In astrophysics, it has become more important to find astronomical transient events like supernovae, gamma-ray bursts, and stellar flares because they are linked to extreme cosmic processes. However, finding these short-lived events in huge sky survey datasets, like those from the GOTO project, is very hard for traditional analysis methods. This study suggests a deep learning methodology employing Convolutional Neural Networks (CNNs) to enhance transient classification. CNNs are based on how biological vision systems work and how they are structured. They mimic how animal brains hierarchically process visual information, making it possible to automatically find complex spatial patterns in astronomical images. Transfer learning and fine-tuning on pretrained ImageNet models are utilized to emulate adaptive learning observed in biological organisms, enabling swift adaptation to new tasks with minimal data. Data augmentation methods like rotation, flipping, and noise injection mimic changes in the environment to improve model generalization. Dropout and different batch sizes are used to stop overfitting, which is similar to how biological systems use redundancy and noise tolerance. Ensemble learning strategies, such as Soft Voting and Weighted Voting, draw inspiration from collective intelligence in biological systems, integrating multiple CNN models to enhance decision-making robustness. Our findings indicate that this bio-inspired framework substantially improves the precision and dependability of transient detection, providing a scalable solution for real-time applications in extensive sky surveys such as GOTO.