We use the Southern Photometric Local Universe Survey (S-PLUS) Fourth Data Release (DR4) to identify and classify H{alpha}-excess point source candidates in the Southern Sky. This approach combines photometric data from 12 S-PLUS filters with machine learning techniques to improve source classification and advance our understanding of H{alpha}-related phenomena. Our goal is to enhance the classification of H{alpha}-excess point sources by distinguishing between Galactic and extragalactic objects, particularly those with redshifted emission lines, and to identify sources where the H{alpha} excess is associated with variability phenomena, such as short-period RR Lyrae stars. We selected H{alpha}-excess candidates using the (r-J0660) versus (r-i) colour-colour diagram from the S-PLUS main survey (MS) and Galactic Disk Survey (GDS). Dimensionality reduction was achieved using UMAP, followed by HDBSCAN clustering. We refined this by incorporating infrared data, improving the separation of source types. A Random Forest model was then trained on the clustering results to identify key colour features for the classification of H{alpha}-excess sources. New, effective colour-colour diagrams were constructed by combining data from S-PLUS MS and infrared data. These diagrams, alongside tentative colour criteria, offer a preliminary classification of H{alpha}-excess sources without the need for complex algorithms. Combining multiwavelength photometric data with machine learning techniques significantly improved the classification of H{alpha}-excess sources. We identified 6956 sources with excess in the J0660 filter, and cross-matching with SIMBAD allowed us to explore the types of objects present in our catalog, including emission-line stars, young stellar objects, nebulae, stellar binaries, cataclysmic variables, variable stars, and extragalactic sources such as QSOs, AGNs, and galaxies. The cross-match also revealed X-ray sources, transients, and other peculiar objects. Using S-PLUS colours and machine learning, we successfully separated RR Lyrae stars from both other Galactic stars and extragalactic objects. Additionally, we achieved a clear separation between Galactic and extragalactic sources. However, distinguishing cataclysmic variables from QSOs at specific redshifts remained challenging. Incorporating infrared data refined the classification, enabling us to separate Galactic from extragalactic sources and to distinguish cataclysmic variables from QSOs. The Random Forest model, trained on HDBSCAN results, highlighted key colour features that distinguish the different classes of H{alpha}-excess sources, providing a robust framework for future studies such as follow-up spectroscopy.
Cone search capability for table J/A+A/695/A104/hasms (Main survey H{alpha}-excess sources with UMAP/WISE)
Cone search capability for table J/A+A/695/A104/hasgds (Galactic Disk Survey H{alpha}-excess sources)