pysparnn 模块使用,相似句子召回

时间：2020-02-29 22:32:24 阅读：129 评论：0 收藏：0 [点我收藏+]

标签：tran int fit pytho nsf transform 向量 multi idfv

import pysparnn.cluster_index as ci
from sklearn.feature_extraction.text import TfidfVectorizer

data = [
    "hello world",
    "oh hello there",
    "Play it",
    "Play it again Sam",
]


tv = TfidfVectorizer()
tv.fit(data)
#特征向量
features_vec = tv.transform(data)

#建立搜索索引
cp = ci.MultiClusterIndex(features_vec,data)

#搜索带有索引的
search_data = [
    "oh there",
    "Play it again Frank"
]

search_feature_vec = tv.transform(search_data)

#k是返回的个数，k_clusters代表聚类的个数
print(cp.search(search_feature_vec,k = 1,k_clusters=2,return_distance=False))

[[‘oh hello there‘], [‘Play it again Sam‘]]

标签：tran int fit pytho nsf transform 向量 multi idfv

原文地址：https://www.cnblogs.com/LiuXinyu12378/p/12386172.html

踩

(0)

评论一句话评论（0）

分享档案

更多>

2021年07月29日 (22)
2021年07月28日 (40)
2021年07月27日 (32)
2021年07月26日 (79)
2021年07月23日 (29)
2021年07月22日 (30)
2021年07月21日 (42)
2021年07月20日 (16)
2021年07月19日 (90)
2021年07月16日 (35)

周排行