Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/45980
Title: FLASC: a flare-sensitive clustering algorithm
Authors: BOT, Daniël M. 
PEETERS, Jannes 
LIESENBORGS, Jori 
AERTS, Jan 
Issue Date: 2025
Source: PeerJ Computer Science, 11
Abstract: Exploratory data analysis workflows often use clustering algorithms to find groups of similar data points. The shape of these clusters can provide meaningful information about the data. For example, a Y-shaped cluster might represent an evolving process with two distinct outcomes. This article presents flare-sensitive clustering (FLASC), an algorithm that detects branches within clusters to identify such shape-based subgroups. FLASC builds upon HDBSCAN*---a state-of-the-art density-based clustering algorithm---and detects branches in a post-processing step using within-cluster connectivity. Two algorithm variants are presented, which trade computational cost for noise robustness. We show that both variants scale similarly to HDBSCAN* regarding computational cost and provide similar outputs across repeated runs. In addition, we demonstrate the benefit of branch detection on two real-world data sets. Our implementation is included in the hdbscan Python package and available as a standalone package at https://github.com/vda-lab/pyflasc.
Keywords: Subjects Algorithms and Analysis of Algorithms;Data Mining and Machine Learning;Data Science Keywords Exploratory data analysis;Density-based clustering;Branch-hierarchy detection;HDBSCAN*
Document URI: http://hdl.handle.net/1942/45980
e-ISSN: 2376-5992
DOI: 10.7717/peerj-cs.2792
ISI #: 001480533400001
Datasets of the publication: 10.5281/zenodo.14888003
10.5281/zenodo.13326222
Rights: Copyright 2025 Bot et al. Distributed under Creative Commons CC-BY 4.0
Category: A1
Type: Journal Contribution
Appears in Collections:Research publications

Files in This Item:
File Description SizeFormat 
published_paper.pdfPublished version4.87 MBAdobe PDFView/Open
Show full item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.