Discriminating Variable Star Candidates in Large Image Databases from the HiTS Survey Using NMF

https://doi.org/10.1016/j.procs.2015.07.276Get rights and content
Under a Creative Commons license
open access

Abstract

New instruments and technologies are allowing the acquisition of large amounts of data from astronomical surveys. Nowadays there is a pressing need for autonomous methods to discriminate the interesting astronomical objects in the vast sky. The High Cadence Transient Survey (HiTS) project is an astronomical survey that is trying to find a rare transient event that occurs during the first instants of a supernova. In this paper we propose an autonomous method to discriminate stellar variability from the HiTS database, that uses a feature extraction scheme based on Non-negative matrix factorization (NMF). Using NMF, dictionaries of image prototypes that represent the data in a compact way are obtained. The projections of the dataset into these dictionaries are fed into a random forest classifier. NMF is compared with other feature extraction schemes, on a subset of 500,000 transient candidates from the HiTS survey. With NMF a better class separability at feature level is obtained which enhances the classification accuracy significantly. Using the NMF features less than 4% of the true stellar transients are lost, at a manageable false positive rate of 0.1%.

Keywords

Data mining
Astronomy
Supernovae
Machine learning
Non-negative matrix factorization

Cited by (0)

Peer-review under responsibility of International Neural Network Society, (INNS).