VelvetFlow: An engineering pipeline for robust multi-density clustering

Publish Year: 1404
Document type: Journal article
Language: English

This paper is 26 pages long and is available for download in PDF format.

National scientific document ID:

JR_JDMA-10-4_003

Indexing date: 8 Azar 1404

Abstract:

Problem. Real-world datasets seldom respect a single density scale: tight blobs, elongated ribbons, and isolated points often coexist. Classical algorithms such as DBSCAN or k-means require domain-specific parameter tuning and provide only ad-hoc support for anomaly detection.

Solution. We introduce VelvetFlow, an engineering pipeline that turns a set of well-understood building blocks into a cohesive, end-to-end workflow for multi-density clustering and principled outlier detection. The pipeline is composed of three reusable stages:

(i) Contextual-density splitting assigns every point to a high- or low-density partition using a single neighbourhood size k.
(ii) Density-aware clustering applies a Jaccard-guided FusedNeighbor+DBSCAN routine to the sparse partition and HDBSCAN to the dense partition, without introducing new hyper-parameters.
(iii) Scaled-MST verification re-examines the complete k-NN graph, flags weakly connected components, and validates them with a k-NN gate; this step recovers small remote clusters while filtering genuine anomalies.
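To make stage (i) more concrete, the following is a minimal sketch of a density split driven by a single neighbourhood size k. The abstract does not state how VelvetFlow scores density or where it places the cut, so the choices here are assumptions made purely for illustration: the density proxy is the mean distance to the k nearest neighbours, the threshold is the median of those scores, and the function names `knn_mean_distance` and `split_by_density` are hypothetical.

```python
import math
from statistics import median


def knn_mean_distance(points, k):
    """Mean Euclidean distance from each point to its k nearest neighbours.

    Brute-force O(n^2 log n); a small score means a locally dense point.
    This density proxy is an assumption, not the paper's definition.
    """
    scores = []
    for i, p in enumerate(points):
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        scores.append(sum(dists[:k]) / k)
    return scores


def split_by_density(points, k):
    """Return (dense_indices, sparse_indices) for a single neighbourhood size k.

    The median threshold is an illustrative assumption; the paper may use
    a different, context-dependent rule.
    """
    if not 1 <= k < len(points):
        raise ValueError("k must satisfy 1 <= k < len(points)")
    scores = knn_mean_distance(points, k)
    threshold = median(scores)
    dense = [i for i, s in enumerate(scores) if s <= threshold]
    sparse = [i for i, s in enumerate(scores) if s > threshold]
    return dense, sparse


# Toy data: a tight blob (indices 0-3) and a widely spaced group (indices 4-7).
tight = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1)]
loose = [(5.0, 5.0), (8.0, 5.0), (5.0, 8.0), (8.0, 8.0)]
dense_idx, sparse_idx = split_by_density(tight + loose, k=2)
print(dense_idx, sparse_idx)  # → [0, 1, 2, 3] [4, 5, 6, 7]
```

In a pipeline shaped like the one described, the two index lists would then be handed to separate clusterers (HDBSCAN for the dense partition, the FusedNeighbor+DBSCAN routine for the sparse one); that hand-off is not shown here because the abstract gives no details of those routines.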

Authors

Hossein Eyvazi

Department of Computer Science, University of Tarbiat Modares, Tehran, I. R. Iran

Mohammad Badzohreh

Department of Computer Science, University of Tarbiat Modares, Tehran, I. R. Iran

Seyed Ali Shahrokhi

Department of Computer Science, University of Tarbiat Modares, Tehran, I. R. Iran