首页 | 本学科首页   官方微博 | 高级检索  
检索        

基于SPSS的共现聚类分析参数选择的实例研究
引用本文:隋明爽,崔雷.基于SPSS的共现聚类分析参数选择的实例研究[J].中华医学图书情报杂志,2016,25(1):52-56.
作者姓名:隋明爽  崔雷
作者单位:中国医科大学医学信息学院,辽宁 沈阳110001,中国医科大学医学信息学院,辽宁 沈阳110001
摘    要:以OHSUMED语料库内提供的明确相关提问对为金标准和研究材料,借助BICOMB软件生成主题词-来源文献矩阵和共词矩阵,并获得各种系数的相似(相异)矩阵,对比分析目前国内基于SPSS共现聚类分析过程中主题词-来源文献矩阵与共现矩阵、各种相似性参数和各种类间距离计算方法的聚类效果。结果表明:主题词-来源文献矩阵聚类结果优于共词矩阵,在聚类分析中应优先选择。共词矩阵选择相似系数时应结合实际矩阵数据性质,并注意聚类方法原理上的正确性。

关 键 词:SPSS  聚类分析  共现分析  相关系数
收稿时间:2015/8/24 0:00:00

SPSS-based selection of parameters in co-occurrence clustering analysis: A case study
SUI Min-shuang and CUI Lei.SPSS-based selection of parameters in co-occurrence clustering analysis: A case study[J].Chinese Journal of Medical Library and Information Science,2016,25(1):52-56.
Authors:SUI Min-shuang and CUI Lei
Institution:China Medical University Medical Informatics School, Shenyang 110001, Liaoning Province, China and China Medical University Medical Informatics School, Shenyang 110001, Liaoning Province, China
Abstract:Similar (differential) matrixes of different coefficients were established by generating subject heading-source literature matrix and co-word matrix using the BICOMB software with the OHSUMED-covered related questions as their golden standard and research material. The clustering effects of SPSS-based subject heading-source literature matrix and co-word matrix, similarity parameters and methods of calculating the distance between different matrixes were comparatively analyzed, which showed that the effect of subject heading-source literature matrix is better than that of co-word matrix and should thus be selected in clustering analysis. Similar coefficients should be selected in combination with the practical matrix data properties in co-occurrence analysis.
Keywords:SPSS  Clustering analysis  Co-occurrence analysis  Related coefficient
点击此处可从《中华医学图书情报杂志》浏览原始摘要信息
点击此处可从《中华医学图书情报杂志》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号