Publications of the Astronomical Society of the Pacific

Publications of the Astronomical Society of the Pacific | DeepDyve

We propose a new sequential classification model for astronomical objects based on a recurrent convolutional neural network (RCNN) which uses sequences of images as inputs. This approach avoids the computation of light curves or difference images. This is the first time that sequences of images are used directly for the classification of variable objects in astronomy. The second contribution of this work is the image simulation process. We generate synthetic image sequences which take into account the instrumental and observing conditions, obtaining a realistic, unevenly sampled, and variable noise set of movies for each astronomical object. The simulated data set is used to train our RCNN classifier. This approach allows us to generate data sets to train and test our RCNN model for different astronomical surveys and telescopes. Moreover, using a simulated data set is faster and more adaptable to different surveys and classification tasks. We aim to build a simulated data set whose distribution is close enough to the real data set, so that fine tuning could match the distributions. To test the RCNN classifier trained with the synthetic data set, we used real-world data from the High cadence Transient Survey (HiTS), obtaining an average recall of 85%, improved to 94% after performing fine tuning with 10 real samples per class. We compare the results of our RCNN model with those of a light curve random forest classifier. The proposed RCNN with fine tuning has a similar performance on the HiTS data set compared to the light curve random forest classifier, trained on an augmented training set with 10 real samples per class. The RCNN approach presents several advantages in an alert stream classification scenario, such as a reduction of the data pre-processing, faster online evaluation, and easier performance improvement using a few real data samples. The results obtained encourage us to use the proposed method for astronomical alert broker systems that will process alert streams generated by new telescopes such as the Large Synoptic Survey Telescope.

journal article

LitStream Collection

A Comparison of Photometric Redshift Techniques for Large Radio Surveys

Norris, Ray P.; Salvato, M.; Longo, G.; Brescia, M.; Budavari, T.; Carliles, S.; Cavuoti, S.; Farrah, D.; Geach, J.; Luken, K.; Musaeva, A.; Polsterer, K.; Riccio, G.; Seymour, N.; Smolčić, V.; Vaccari, M.; Zinn, P.

2019 Publications of the Astronomical Society of the Pacific

doi: 10.1088/1538-3873/ab0f7bpmid: N/A

Future radio surveys will generate catalogs of tens of millions of radio sources, for which redshift estimates will be essential to achieve many of the science goals. However, spectroscopic data will be available for only a small fraction of these sources, and in most cases even the optical and infrared photometry will be of limited quality. Furthermore, radio sources tend to be at higher redshift than most optical sources (most radio surveys have a median redshift greater than 1) and so a significant fraction of radio sources hosts differ from those for which most photometric redshift templates are designed. We therefore need to develop new techniques for estimating the redshifts of radio sources. As a starting point in this process, we evaluate a number of machine-learning techniques for estimating redshift, together with a conventional template-fitting technique. We pay special attention to how the performance is affected by the incompleteness of the training sample and by sparseness of the parameter space or by limited availability of ancillary multiwavelength data. As expected, we find that the quality of the photometric-redshift degrades as the quality of the photometry decreases, but that even with the limited quality of photometry available for all-sky-surveys, useful redshift information is available for the majority of sources, particularly at low redshift. We find that a template-fitting technique performs best in the presence of high-quality and almost complete multi-band photometry, especially if radio sources that are also X-ray emitting are treated separately, using specific templates and priors. When we reduced the quality of photometry to match that available for the EMU all-sky radio survey, the quality of the template-fitting degraded and became comparable to some of the machine-learning methods. Machine learning techniques currently perform better at low redshift than at high redshift, because of incompleteness of the currently available training data at high redshifts.

journal article

LitStream Collection

Multiband Galaxy Morphologies for CLASH: A Convolutional Neural Network Transferred from CANDELS

Pérez-Carrasco, M.; Cabrera-Vives, G.; Martinez-Marin, M.; Cerulo, P.; Demarco, R.; Protopapas, P.; Godoy, J.; Huertas-Company, M.

2019 Publications of the Astronomical Society of the Pacific

doi: 10.1088/1538-3873/aaeeb4pmid: N/A

We present visual-like morphologies over 16 photometric bands, from ultraviolet to near-infrared, for 8412 galaxies in the Cluster Lensing And Supernova survey with Hubble (CLASH) obtained using a convolutional neural network (ConvNet) model. Our model follows the Cosmic Assembly Near-IR Deep Extragalactic Legacy Survey (CANDELS) main morphological classification scheme, obtaining the probability for each galaxy at each CLASH band of being spheroid, disk, irregular, point source, or unclassifiable. Our catalog contains morphologies for each galaxy with Hmag < 24.5 in every filter where the galaxy is observed. We trained an initial ConvNet model using approximately 7500 expert eyeball labels from CANDELS. We created eyeball labels for 100 randomly selected galaxies per each of the 16-filter set of CLASH (1600 galaxy images in total), where each image was classified by at least five of us. We use these labels to fine-tune the network to accurately predict labels for the CLASH data and to evaluate the performance of our model. We achieve a root-mean-square error of 0.0991 on the test set. We show that our proposed fine-tuning technique reduces the number of labeled images needed for training, as compared to directly training over the CLASH data, and achieves a better performance. This approach is very useful to minimize eyeball labeling efforts when classifying unlabeled data from new surveys. This will become particularly useful for massive data sets such as those coming from near-future surveys such as EUCLID or the LSST. Our catalog consists of prediction of probabilities for each galaxy by morphology in their different bands and is made publicly available at http://www.inf.udec.cl/~guille/data/Deep-CLASH.csv.

Related Journals: