Please use this identifier to cite or link to this item:
http://hdl.handle.net/1942/45980
Title: | FLASC: a flare-sensitive clustering algorithm | Authors: | BOT, Daniël M. PEETERS, Jannes LIESENBORGS, Jori AERTS, Jan |
Issue Date: | 2025 | Source: | PeerJ Computer Science, 11 | Abstract: | Exploratory data analysis workflows often use clustering algorithms to find groups of similar data points. The shape of these clusters can provide meaningful information about the data. For example, a Y-shaped cluster might represent an evolving process with two distinct outcomes. This article presents flare-sensitive clustering (FLASC), an algorithm that detects branches within clusters to identify such shape-based subgroups. FLASC builds upon HDBSCAN*---a state-of-the-art density-based clustering algorithm---and detects branches in a post-processing step using within-cluster connectivity. Two algorithm variants are presented, which trade computational cost for noise robustness. We show that both variants scale similarly to HDBSCAN* regarding computational cost and provide similar outputs across repeated runs. In addition, we demonstrate the benefit of branch detection on two real-world data sets. Our implementation is included in the hdbscan Python package and available as a standalone package at https://github.com/vda-lab/pyflasc. | Keywords: | Subjects Algorithms and Analysis of Algorithms;Data Mining and Machine Learning;Data Science Keywords Exploratory data analysis;Density-based clustering;Branch-hierarchy detection;HDBSCAN* | Document URI: | http://hdl.handle.net/1942/45980 | e-ISSN: | 2376-5992 | DOI: | 10.7717/peerj-cs.2792 | ISI #: | 001480533400001 | Datasets of the publication: | 10.5281/zenodo.14888003 10.5281/zenodo.13326222 |
Rights: | Copyright 2025 Bot et al. Distributed under Creative Commons CC-BY 4.0 | Category: | A1 | Type: | Journal Contribution |
Appears in Collections: | Research publications |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
published_paper.pdf | Published version | 4.87 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.