- Title
- An effective density-based clustering and dynamic maintenance framework for evolving medical data streams
- Creator
- Al-Shammari, Ahmed; Zhou, Rui; Naseriparsaa, Mehdi; Liu, Chengfei
- Date
- 2019
- Type
- Text; Journal article
- Identifier
- http://researchonline.federation.edu.au/vital/access/HandleResolver/1959.17/183383
- Identifier
- vital:16264
- Identifier
-
https://doi.org/10.1016/j.ijmedinf.2019.03.016
- Identifier
- ISBN:1386-5056 (ISSN)
- Abstract
- Background: Medical data stream clustering has become an integral part of medical decision systems since it extracts highly-sensitive information from a tremendous flow of medical data. However, clustering and maintaining of medical data streams is still a challenging task. That is because the evolving of medical data streams imposes various challenges for clustering such as the ability to discover the arbitrary shape of a cluster, the ability to group data streams without a predefined number of clusters, and the ability to maintain the data clusters dynamically. Objective: To support the online medical decisions, there is a need to address the clustering challenges. Therefore, in this paper, we propose an effective density-based clustering and dynamic maintenance framework for grouping the patients with similar symptoms into meaningful clusters and monitoring the patients’ status frequently. Methods: For clustering, we generate a set of initial medical data clusters based on the combination of Piece-wise Aggregate Approximation and the density-based spatial clustering of applications with noise called (PAA+DBSCAN) algorithm. For maintenance, when new medical data streams arrive, we maintain the initially generated medical data clusters dynamically. Since the incremental cluster maintenance is time-consuming, we further propose an Advanced Cluster Maintenance (ACM) approach to improve the performance of the dynamic cluster maintenance. Results: The experimental results on real-world medical datasets demonstrate the effectiveness and efficiency of our proposed approaches. The PAA+DBSCAN algorithm is more efficient and effective than the exact DBSCAN algorithm. Moreover, the ACM approach requires less running time in comparison with the Baseline Cluster Maintenance (BCM) approach using different tuning parameter values in all datasets. That is because the BCM approach tracks all the data points in the cluster. Conclusion: The proposed framework is capable of clustering and maintaining the medical data streams effectively by means of grouping the patients who share similar symptoms and tracking the patients status that naturally tends to be changing over time. © 2019 Elsevier B.V.
- Publisher
- Elsevier Ireland Ltd
- Relation
- International Journal of Medical Informatics Vol. 126, no. (2019), p. 176-186
- Rights
- All metadata describing materials held in, or linked to, the repository is freely available under a CC0 licence
- Rights
- Copyright © 2019 Elsevier B.V.
- Subject
- 40 Engineering; 42 Health Sciences; 46 Information and Computing Sciences; Data mining algorithms; Density-based clustering; Dynamic maintenance; Medical data streams
- Reviewed
- Funder
- Ahmed Alshammari is supported by the Iraqi Ministry of Higher Education and Scientific Research . Rui Zhou is supported by the ARC Discovery Projects Grant No. DP170104747 and DP180100212 ; and Chengfei Liu is supported by the ARC Discovery Projects Grant No. DP160102412 and DP170104747.
- Hits: 506
- Visitors: 496
- Downloads: 0