DBSCAN算法实现---Python

时间：2018-12-06 01:35:10 阅读：170 评论：0 收藏：0 [点我收藏+]

标签：之间领域算法实现字典 lse return point scan python

生活不易啊，公司考核，初步写出来了，脑阔疼。。。

思路：

　　设定阈值与半径；

　　计算点之间的距离(欧式距离实现)；

　　区分核心点、边界点与离群点；

　　将每个点的领域作为一个类(即将密度可达的点归为一个簇)；

　　找出每个独立的领域；

　　对最后的聚类进行标记；

　　可视化。

代码实现：

　　1、设定eps = 2，MinPts = 3；

　　2、实现点与点欧氏距离的计算

 1 def ecludDist(x,y):
 2     return np.sqrt(sum(np.sqrt(np.array(x) - np.array(y))))
 3 
 4 def euclidean_distance(data):
 5     all_points = []
 6     for i in data:
 7         temp = []
 8         for j in data:
 9             temp.append(ecludDist(i,j))
10         all_points.append(temp)
11     return all_points

　　3、点种类的划分：

 1 def classify(z):   #z为通过欧式距离计算所得的矩阵
 2     pts = []
 3     for row in z:
 4         density = np.sum(z.ix[row] < eps)
 5         pts = 0   
 6         if density > MinPts:
 7             pts = 1      #核心点
 8         elif density > 1:
 9             pts = 2　　　　#边界点
10         else:
11             pts = 0　　　　#离群点
12         pts.append(pts)
13 return pts

　　4、将每个点的领域作为一个类：

 1 def point_type:
 2     cluster = dict()
 3     i = 0
 4     for row in z:
 5         cluster[i] = np.where(z.ix[row] < eps)[0]
 6         i = i+1
 7     for i in range(len(cluster)):
 8         for j in range(len(cluster)):
 9             if len(set(cluster[i]) & set(cluster[j])) > 0 and i!=j:
10                 cluster[i] = cluster[i] | cluster[j]
11                 cluster[j] = []

　　5、找出独立的领域

1 def independent_filed(cluster):
2     j = 0
3     result = dict()
4     for i in range(len(cluster)):
5         if len(cluster[i]) >0:    
6             result[j] = cluster[i]
7             j =j+1
8     return result

　　6、对最后聚类的结果标记

 1 def mark(df):    #传进来的是转换后的数据
 2     for i in range(result)：   #result 为一个字典型结构
 3         for j in result[i]:
 4             df.at[j,‘type‘] = i     #新建一列，并给它把i值添加上去
 5 
 6 
 7 plt.scatter(
 8     df[‘one‘],
 9     df[‘two‘],
10     c=df[‘type‘])
11

最后把这些函数封装一下就Ok啦

DBSCAN算法实现---Python

标签：之间领域算法实现字典 lse return point scan python

原文地址：https://www.cnblogs.com/U940634/p/10074411.html

踩

(0)

评论一句话评论（0）

分享档案

更多>

2021年07月29日 (22)
2021年07月28日 (40)
2021年07月27日 (32)
2021年07月26日 (79)
2021年07月23日 (29)
2021年07月22日 (30)
2021年07月21日 (42)
2021年07月20日 (16)
2021年07月19日 (90)
2021年07月16日 (35)

周排行