當(dāng)前位置：首頁 > 编程资源 > 编程问答 >内容正文

编程问答

ML之K-means：基于DIY数据集利用K-means算法聚类(测试9种不同聚类中心的模型性能)

發(fā)布時間：2025/3/21 编程问答 11 豆豆

生活随笔收集整理的這篇文章主要介紹了 ML之K-means：基于DIY数据集利用K-means算法聚类(测试9种不同聚类中心的模型性能) 小編覺得挺不錯的,現(xiàn)在分享給大家,幫大家做個參考.

ML之K-means：基于DIY數(shù)據(jù)集利用K-means算法聚類(測試9種不同聚類中心的模型性能)

輸出結(jié)果

設(shè)計思路

實現(xiàn)代碼

輸出結(jié)果

設(shè)計思路

1、使用均勻分布函數(shù)隨機三個簇，每個簇周圍10個數(shù)據(jù)樣本。
2、繪制30個數(shù)據(jù)樣本的分布圖像。
3、測試9種不同聚類中心數(shù)量下，每種情況的聚類質(zhì)量，并作圖。

實現(xiàn)代碼

import numpy as np import matplotlib.pyplot as plt from sklearn.cluster import KMeans from scipy.spatial.distance import cdist#1、使用均勻分布函數(shù)隨機三個簇，每個簇周圍10個數(shù)據(jù)樣本。 cluster1 = np.random.uniform(0.5, 1.5, (2, 10)) cluster2 = np.random.uniform(5.5, 6.5, (2, 10)) cluster3 = np.random.uniform(3.0, 4.0, (2, 10))#2、繪制30個數(shù)據(jù)樣本的分布圖像。 X = np.hstack((cluster1, cluster2, cluster3)).T plt.scatter(X[:,0], X[:, 1]) plt.xlabel('x1') plt.ylabel('x2') plt.title('DIY data:30, Random 3 clusters(10 data samples around each cluster)') plt.show()#3、測試9種不同聚類中心數(shù)量下，每種情況的聚類質(zhì)量，并作圖。 K = range(1, 10) meandistortions = []for k in K:kmeans = KMeans(n_clusters=k)kmeans.fit(X)meandistortions.append(sum(np.min(cdist(X, kmeans.cluster_centers_, 'euclidean'), axis=1))/X.shape[0])plt.plot(K, meandistortions, 'bx-') plt.xlabel('k') plt.ylabel('Average Dispersion') plt.title('K-means: Selecting k with the Elbow Method') plt.show()

總結(jié)

以上是生活随笔為你收集整理的ML之K-means：基于DIY数据集利用K-means算法聚类(测试9种不同聚类中心的模型性能)的全部內(nèi)容，希望文章能夠幫你解決所遇到的問題。

如果覺得生活随笔網(wǎng)站內(nèi)容還不錯，歡迎將生活随笔推薦給好友。

上一篇： Dataset之Handwritten
下一篇： Paper：论文解读《Adaptive