首页 | 本学科首页   官方微博 | 高级检索  
检索        

基于论文相似网络拓扑结构的聚类方法比较
引用本文:黄鹏,崔雷.基于论文相似网络拓扑结构的聚类方法比较[J].中华医学图书情报杂志,2015,24(10):33-38.
作者姓名:黄鹏  崔雷
作者单位:中国医科大学医学信息学院
摘    要:以R语言中的复杂网络处理包igraph为工具,基于语义相似性算法构建论文相似网络,然后采用四种代表性网络聚类算法(随机游走法、标签传播法、最大模块度法、边介数法)对构建出的网络进行聚类分析。最后结合金标准和网络社团划分评价指标D函数比较四种算法的准确性和稳定性,发现随机游走算法最为卓越,同时明确了复杂网络的预处理也是一个影响聚类效果的重要因素。

关 键 词:社团结构  论文相似网络  随机游走法  标签传播法  最大模块度法  边介数法  Igraph
收稿时间:2015/8/10 0:00:00

Comparison of clustering methods in light of paper similarity network topology
HUANG Peng and CUI Lei.Comparison of clustering methods in light of paper similarity network topology[J].Chinese Journal of Medical Library and Information Science,2015,24(10):33-38.
Authors:HUANG Peng and CUI Lei
Institution:China Medical University Medical Informatics School, Shenyang 110112, Liaoning Province, China and China Medical University Medical Informatics School, Shenyang 110112, Liaoning Province, China
Abstract:The complex network processing package igraph in R was applied to clustering analysis paper similarity network built by semantic similarity algorithms with four representative algorithms (Walktrap algorithm , Label propagation algorithm , BGll algorithm, Girvan-Newman algorithm ). Finally, we compared the stability and precision of clustering algorithms using gold standard and a D function of network community detection index. Found that the walktrap algorithm was the most outstanding, at the same time, the pre-processing of complex networks was also an important factor which can influence clustering effect.
Keywords:Community structure  Paper similarity network  Label propagation algorithm  Walktrap algorithm    Girvan-Newman algorithm    BGll algorithm    Igraph
点击此处可从《中华医学图书情报杂志》浏览原始摘要信息
点击此处可从《中华医学图书情报杂志》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号