I will make k-means clustering for a segmentation project.But I know that this algorithm is effectable from outliers.Which way should I perform for detecting outliers before doing k-means algorithm? For example should I perform anomaly detection algorithm on the data set to detect outliers and after detecting and excluding outliers ,performing k-means algorithm for more stable clustering? Or Should I detect outliers by using k-means algorithm itself by finding average distances and finding the outliers beyond these distances? What approach should be taken for detecting outliers and making more stable clustering? I need your suggestions.
Best Answer
You could try any of the standard outlier methods, such as kNN, LOF, LOOP, INFLO, etc.
There are also robust k-means variations such as k-means–.
Similar Posts:
- Solved – Machine learning to find an optimal set of parameters for a segmentation algorithm
- Solved – Identifying subsets for outlier detection in local outlier factor
- Solved – Clustering Data Using Gower and Kmeans
- Solved – Anomaly detection based on clustering
- Solved – Clustering given pairwise distances with unknown cluster number