python中使用k-means对鸢尾花数据集聚类

代码和结果:

import matplotlib.pyplot as plt  
import numpy as np  
from sklearn.cluster import KMeans
from sklearn import datasets  
X = iris.data[:, 2:4] ##表示我们只取特征空间中的后两个维度
print(X.shape)
#绘制数据分布图
plt.scatter(X[:, 0], X[:, 1], c = "red", marker=o, label=see)  
plt.xlabel(petal length)  
plt.ylabel(petal width)  
plt.legend(loc=2)  
plt.show()
estimator = KMeans(n_clusters=3)#构造聚类器
estimator.fit(X)#聚类
label_pred = estimator.labels_ #获取聚类标签
#绘制k-means结果
x0 = X[label_pred == 0]
x1 = X[label_pred == 1]
x2 = X[label_pred == 2]
plt.scatter(x0[:, 0], x0[:, 1], c = "red", marker=o, label=label0)  
plt.scatter(x1[:, 0], x1[:, 1], c = "green", marker=*, label=label1)  
plt.scatter(x2[:, 0], x2[:, 1], c = "blue", marker=+, label=label2)  
plt.xlabel(petal length)  
plt.ylabel(petal width)  
plt.legend(loc=2)  
plt.show()

光看这2个特征,那聚类算是非常完美了

经验分享 程序员 微信小程序 职场和发展