首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 578 毫秒
1.
Breast masses due to benign disease and malignant tumors related to breast cancer differ in terms of shape, edge-sharpness, and texture characteristics. In this study, we evaluate a set of 22 features including 5 shape factors, 3 edge-sharpness measures, and 14 texture features computed from 111 regions in mammograms, with 46 regions related to malignant tumors and 65 to benign masses. Feature selection is performed by a genetic algorithm based on several criteria, such as alignment of the kernel with the target function, class separability, and normalized distance. Fisher's linear discriminant analysis, the support vector machine (SVM), and our strict two-surface proximal (S2SP) classifier, as well as their corresponding kernel-based nonlinear versions, are used in the classification task with the selected features. The nonlinear classification performance of kernel Fisher's discriminant analysis, SVM, and S2SP, with the Gaussian kernel, reached 0.95 in terms of the area under the receiver operating characteristics curve. The results indicate that improvement in classification accuracy may be gained by using selected combinations of shape, edge-sharpness, and texture features.  相似文献   

2.
We are developing new computer vision techniques for characterization of breast masses on mammograms. We had previously developed a characterization method based on texture features. The goal of the present work was to improve our characterization method by making use of morphological features. Toward this goal, we have developed a fully automated, three-stage segmentation method that includes clustering, active contour, and spiculation detection stages. After segmentation, morphological features describing the shape of the mass were extracted. Texture features were also extracted from a band of pixels surrounding the mass. Stepwise feature selection and linear discriminant analysis were employed in the morphological, texture, and combined feature spaces for classifier design. The classification accuracy was evaluated using the area Az under the receiver operating characteristic curve. A data set containing 249 films from 102 patients was used. When the leave-one-case-out method was applied to partition the data set into trainers and testers, the average test Az for the task of classifying the mass on a single mammographic view was 0.83 +/- 0.02, 0.84 +/- 0.02, and 0.87 +/- 0.02 in the morphological, texture, and combined feature spaces, respectively. The improvement obtained by supplementing texture features with morphological features in classification was statistically significant (p = 0.04). For classifying a mass as malignant or benign, we combined the leave-one-case-out discriminant scores from different views of a mass to obtain a summary score. In this task, the test Az value using the combined feature space was 0.91 +/- 0.02. Our results indicate that combining texture features with morphological features extracted from automatically segmented mass boundaries will be an effective approach for computer-aided characterization of mammographic masses.  相似文献   

3.
A new classification scheme was developed to classify mammographic masses as malignant and benign by using interval change information. The masses on both the current and the prior mammograms were automatically segmented using an active contour method. From each mass, 20 run length statistics (RLS) texture features, 3 speculation features, and 12 morphological features were extracted. Additionally, 20 difference RLS features were obtained by subtracting the prior RLS features from the corresponding current RLS features. The feature space consisted of the current RLS features, the difference RLS features, the current and prior speculation features, and the current and prior mass sizes. Stepwise feature selection and linear discriminant analysis classification were used to select and merge the most useful features. A leave-one-case-out resampling scheme was used to train and test the classifier using 140 temporal image pairs (85 malignant, 55 benign) obtained from 57 biopsy-proven masses (33 malignant, 24 benign) in 56 patients. An average of 10 features were selected from the 56 training subsets: 4 difference RLS features, 4 RLS features, and 1 speculation feature from the current image, and 1 speculation feature from the prior, were most often chosen. The classifier achieved an average training Az of 0.92 and a test Az of 0.88. For comparison, a classifier was trained and tested using features extracted from the 120 current single images. This classifier achieved an average training Az of 0.90 and a test Az of 0.82. The information on the prior image significantly (p = 0.015) improved the accuracy for classification of the masses.  相似文献   

4.
Mammography is a widely used screening tool and is the gold standard for the early detection of breast cancer. The classification of breast masses into the benign and malignant categories is an important problem in the area of computer-aided diagnosis of breast cancer. A small dataset of 57 breast mass images, each with 22 features computed, was used in this investigation; the same dataset has been previously used in other studies. The extracted features relate to edge-sharpness, shape, and texture. The novelty of this paper is the adaptation and application of the classification technique called genetic programming (GP), which possesses feature selection implicitly. To refine the pool of features available to the GP classifier, we used feature-selection methods, including the introduction of three statistical measures—Student’s t test, Kolmogorov–Smirnov test, and Kullback–Leibler divergence. Both the training and test accuracies obtained were high: above 99.5% for training and typically above 98% for test experiments. A leave-one-out experiment showed 97.3% success in the classification of benign masses and 95.0% success in the classification of malignant tumors. A shape feature known as fractional concavity was found to be the most important among those tested, since it was automatically selected by the GP classifier in almost every experiment.  相似文献   

5.

Objective

Medical data sets are usually small and have very high dimensionality. Too many attributes will make the analysis less efficient and will not necessarily increase accuracy, while too few data will decrease the modeling stability. Consequently, the main objective of this study is to extract the optimal subset of features to increase analytical performance when the data set is small.

Methods

This paper proposes a fuzzy-based non-linear transformation method to extend classification related information from the original data attribute values for a small data set. Based on the new transformed data set, this study applies principal component analysis (PCA) to extract the optimal subset of features. Finally, we use the transformed data with these optimal features as the input data for a learning tool, a support vector machine (SVM). Six medical data sets: Pima Indians’ diabetes, Wisconsin diagnostic breast cancer, Parkinson disease, echocardiogram, BUPA liver disorders dataset, and bladder cancer cases in Taiwan, are employed to illustrate the approach presented in this paper.

Results

This research uses the t-test to evaluate the classification accuracy for a single data set; and uses the Friedman test to show the proposed method is better than other methods over the multiple data sets. The experiment results indicate that the proposed method has better classification performance than either PCA or kernel principal component analysis (KPCA) when the data set is small, and suggest creating new purpose-related information to improve the analysis performance.

Conclusion

This paper has shown that feature extraction is important as a function of feature selection for efficient data analysis. When the data set is small, using the fuzzy-based transformation method presented in this work to increase the information available produces better results than the PCA and KPCA approaches.  相似文献   

6.
7.
Malignant breast tumors and benign masses appear in mammograms with different shape characteristics: the former usually have rough, spiculated, or microlobulated contours, whereas the latter commonly have smooth, round, oval, or macrolobulated contours. Features that characterize shape roughness and complexity can assist in distinguishing between malignant tumors and benign masses. Signatures of contours may be used to analyze their shapes. We propose to use a signature based on the turning angle function of contours of breast masses to derive features that capture the characteristics of shape roughness as described above. We propose methods to derive an index of the presence of convex regions (XR ( TA )), an index of the presence of concave regions (VR ( TA )), an index of convexity (CX ( TA )), and two measures of fractal dimension (FD ( TA ) and FDd ( TA )) from the turning angle function. The methods were tested with a set of 111 contours of 65 benign masses and 46 malignant tumors with different parameters. The best classification accuracies in discriminating between benign masses and malignant tumors, obtained for XR ( TA ), VR ( TA ), CX ( TA ), FD ( TA ), and FDd ( TA ) in terms of the area under the receiver operating characteristics curve, were 0.92, 0.92, 0.93, 0.93, and, 0.92, respectively.  相似文献   

8.
Digital mammogram has become the most effective technique for early breast cancer detection modality. Digital mammogram takes an electronic image of the breast and stores it directly in a computer. The aim of this study is to develop an automated system for assisting the analysis of digital mammograms. Computer image processing techniques will be applied to enhance images and this is followed by segmentation of the region of interest (ROI). Subsequently, the textural features will be extracted from the ROI. The texture features will be used to classify the ROIs as either masses or non-masses. In this study normal breast images and breast image with masses used as the standard input to the proposed system are taken from Mammographic Image Analysis Society (MIAS) digital mammogram database. In MIAS database, masses are grouped into either spiculated, circumscribed or ill-defined. Additional information includes location of masses centres and radius of masses. The extraction of the textural features of ROIs is done by using gray level co-occurrence matrices (GLCM) which is constructed at four different directions for each ROI. The results show that the GLCM at 0o, 45o, 90o and 135o with a block size of 8X8 give significant texture information to identify between masses and non-masses tissues. Analysis of GLCM properties i.e. contrast, energy and homogeneity resulted in receiver operating characteristics (ROC) curve area of Az = 0.84 for Otsu's method, 0.82 for thresholding method and Az = 0.7 for K-mean clustering. ROC curve area of 0.8-0.9 is rated as good results. The authors' proposed method contains no complicated algorithm. The detection is based on a decision tree with five criterions to be analysed. This simplicity leads to less computational time. Thus, this approach is suitable for automated real-time breast cancer diagnosis system.  相似文献   

9.
The effect of pixel resolution on texture features computed using the gray-level co-occurrence matrix (GLCM) was analyzed in the task of discriminating mammographic breast lesions as benign masses or malignant tumors. Regions in mammograms related to 111 breast masses, including 65 benign masses and 46 malignant tumors, were analyzed at pixel sizes of 50, 100, 200, 400, 600, 800, and 1,000 μm. Classification experiments using each texture feature individually provided accuracy, in terms of the area under the receiver operating characteristics curve (AUC), of up to 0.72. Using the Bayesian classifier and the leave-one-out method, the AUC obtained was in the range 0.73 to 0.75 for the pixel resolutions of 200 to 800 μm, with 14 GLCM-based texture features using adaptive ribbons of pixels around the boundaries of the masses. Texture features computed using the ribbons resulted in higher classification accuracy than the same features computed using the corresponding regions within the mass boundaries. The t test was applied to AUC values obtained using 100 repetitions of random splitting of the texture features from the ribbons of masses into the training and testing sets. The texture features computed with the pixel size of 200 μm provided the highest average AUC with statistically highly significant differences as compared to all of the other pixel sizes tested, except 100 μm.  相似文献   

10.
Computerized methods have recently shown a great potential in providing radiologists with a second opinion about the visual diagnosis of the malignancy of mammographic masses. The computer-aided diagnosis (CAD) system we developed for the mass characterization is mainly based on a segmentation algorithm and on the neural classification of several features computed on the segmented mass. Mass-segmentation plays a key role in most computerized systems. Our technique is a gradient-based one, showing the main characteristic that no free parameters have been evaluated on the data set used in this analysis, thus it can directly be applied to data sets acquired in different conditions without any ad hoc modification. A data set of 226 masses (109 malignant and 117 benign) has been used in this study. The segmentation algorithm works with a comparable efficiency both on malignant and benign masses. Sixteen features based on shape, size and intensity of the segmented masses are extracted and analyzed by a multi-layered perceptron neural network trained with the error back-propagation algorithm. The capability of the system in discriminating malignant from benign masses has been evaluated in terms of the receiver-operating characteristic (ROC) analysis. A feature selection procedure has been carried out on the basis of the feature discriminating power and of the linear correlations interplaying among them. The comparison of the areas under the ROC curves obtained by varying the number of features to be classified has shown that 12 selected features out of the 16 computed ones are powerful enough to achieve the best classifier performances. The radiologist assigned the segmented masses to three different categories: correctly-, acceptably- and non-acceptably-segmented masses. We initially estimated the area under ROC curve only on the first category of segmented masses (the 88.5% of the data set), then extending the classification to the second subclass (reaching the 97.8% of the data set) and finally to the whole data set, obtaining A(z)=0.805+/-0.030, 0.787+/-0.024 and 0.780+/-0.023, respectively.  相似文献   

11.
OBJECTIVE: Detection and characterization of microcalcification clusters in mammograms is vital in daily clinical practice. The scope of this work is to present a novel computer-based automated method for the characterization of microcalcification clusters in digitized mammograms. METHODS AND MATERIAL: The proposed method has been implemented in three stages: (a) the cluster detection stage to identify clusters of microcalcifications, (b) the feature extraction stage to compute the important features of each cluster and (c) the classification stage, which provides with the final characterization. In the classification stage, a rule-based system, an artificial neural network (ANN) and a support vector machine (SVM) have been implemented and evaluated using receiver operating characteristic (ROC) analysis. The proposed method was evaluated using the Nijmegen and Mammographic Image Analysis Society (MIAS) mammographic databases. The original feature set was enhanced by the addition of four rule-based features. RESULTS AND CONCLUSIONS: In the case of Nijmegen dataset, the performance of the SVM was Az=0.79 and 0.77 for the original and enhanced feature set, respectively, while for the MIAS dataset the corresponding characterization scores were Az=0.81 and 0.80. Utilizing neural network classification methodology, the corresponding performance for the Nijmegen dataset was Az=0.70 and 0.76 while for the MIAS dataset it was Az=0.73 and 0.78. Although the obtained high classification performance can be successfully applied to microcalcification clusters characterization, further studies must be carried out for the clinical evaluation of the system using larger datasets. The use of additional features originating either from the image itself (such as cluster location and orientation) or from the patient data may further improve the diagnostic value of the system.  相似文献   

12.
The problem of computer-aided classification of benign and malignant breast masses using shape features is addressed. The aim of the study is to look at the exceptions in shapes of masses such as circumscribed malignant tumours and spiculated benign masses which are difficult to classify correctly using common shape analysis methods. The proposed methods of shape analysis treat the object's boundary in terms of local details. The boundaries of masses analysed using the proposed methods were manually drawn on mammographic images by an expert radiologist (JELD). A boundary segmentation method is used to separate major portions of the boundary and to label them as concave or convex segments. To analyse the shape information localised in each segment, features are computed through an iterative procedure for polygonal modelling of the mass boundaries. Features are based on the concavity fraction of a mass boundary and the degree of narrowness of spicules as characterised by a spiculation index. Two features comprising spiculation index (SI) and fractional concavity (fcc) developed in the present study when used in combination with the global shape feature of compactness resulted in a benign/malignant classification accuracy of 82%, with an area (Az) of 0.79 under the receiver operating characteristics (ROC) curve with a database of the boundaries of 28 benign masses and 26 malignant tumours. SI alone resulted in a classification accuracy of 80% with Az of 0.82. The combination of all the three features achieved 91% accuracy of circumscribed versus spiculated classification of masses based on shape.  相似文献   

13.
This work presents the usefulness of texture features in the classification of breast lesions in 5,518 images of regions of interest, which were obtained from the Digital Database for Screening Mammography that included microcalcifications, masses, and normal cases. Sixteen texture features were used, i.e., 13 were based on the spatial gray-level dependence matrix and 3 on the wavelet transform. The nonparametric K-NN classifier was used in the classification stage. The results obtained from receiver operating characteristic analysis indicated that the texture features can be used for separating normal regions and lesions with masses and microcalcifications, yielding the area under the curve (AUC) values of 0.957 and 0.859, respectively. However, the texture features were not very effective for distinguishing between malignant and benign lesions because the AUC was 0.617 for masses and 0.607 for microcalcifications. The study showed that the texture features can be used for the detection of suspicious regions in mammograms.  相似文献   

14.
Computer-aided diagnosis schemes are being developed to assist radiologists in mammographic interpretation. In this study, we investigated whether texture features could be used to distinguish between mass and non-mass regions in clinical mammograms. Forty-five regions of interest (ROIs) containing true masses with various degrees of visibility and 135 ROIs containing normal breast parenchyma were extracted manually from digitized mammograms as case samples. Spatial-grey-level-dependence (SGLD) matrices of each ROI were calculated and eight texture features were calculated from the SGLD matrices. The correlation and class-distance properties of extracted texture features were analysed. Selected texture features were input into a modified decision-tree classification scheme. The performance of the classifier was evaluated for different feature combinations and orders of features on the tree. A classification accuracy of about 89% sensitivity and 76% specificity was obtained for ordered features, sum average, correlation, and energy, during the training procedure. With a leave-one-out method, the test result was about 76% sensitivity and 64% specificity. The results of this preliminary study demonstrate the feasibility of using texture information for classification of mass and normal breast tissue, which will be likely to be useful for classifying true and false detections in computer-aided diagnosis programmes.  相似文献   

15.
为了使临床医生从数字乳房X射线图像中得到更多有用的肿块信息,通过研究增强后的数字乳房X射线图像,结合图像特点,提出利用区域生长方法对图像进行肿块分割的算法.此算法可以有效地分割出图像中的肿块区域,并很好地保持了肿块的边缘信息.  相似文献   

16.
Contourlet-based mammography mass classification using the SVM family   总被引:1,自引:0,他引:1  
This paper is concerned with the design and development of an automatic mass classification of mammograms. The proposed method consists of three stages. In the first stage, preprocessing is performed to remove the pectoral muscles and to segment regions of interest. In the next stage contourlet transform is employed as a feature extractor to obtain the contourlet coefficients. This stage is completed by feature selection based on the genetic algorithm, resulting in a more compact and discriminative texture feature set. This improves the accuracy and robustness of the subsequent classifiers. In the final stage, classification is performed based on successive enhancement learning (SEL) weighted SVM, support vector-based fuzzy neural network (SVFNN), and kernel SVM.The proposed approach is applied to the Mammograms Image Analysis Society dataset (MIAS) and classification accuracies of 96.6%, 91.5% and 82.1% are determined over an efficient computational time by SEL weighted SVM, SVFNN and kernel SVM, respectively. Experimental results illustrate that the contourlet-based feature extraction in conjunction with the state-of-art classifiers construct a powerful, efficient and practical approach for automatic mass classification of mammograms.  相似文献   

17.
The objectives in this study were to design and test a fully automated method for classification of microcalcification clusters into malignant and benign types, and to compare the method's performance with that of radiologists. A novel aspect of the approach is that the relative location and orientation of clusters inside the breast was taken into account for feature calculation. Furthermore, correspondence of location of clusters in mediolateral oblique (MLO) and cranio-caudal (CC) views, was used in feature calculation and in final classification. Initially, microcalcifications were automatically detected by using a statistical method based on Bayesian techniques and a Markov random field model. To determine malignancy or benignancy of a cluster, a method based on two classification steps was developed. In the first step, classification of clusters was performed and in the second step a patient based classification was done. A total of 16 features was used in the study. To identify meaningful features, a feature selection was applied, using the area under the receiver operating characteristic (ROC) curve (Az value) as a criterion. For classification the k-nearest-neighbor method was used in a leave-one-patient-out procedure. A database of 192 mammograms with 280 true positive detected microcalcification clusters was used for evaluation of the method. The set consisted of cases that were selected for diagnostic work up during a 4 year period of screening in the Nijmegen region (The Netherlands). Because of the high positive predictive value in the screening program (50%), this set did not contain obvious benign cases. The method's best patient-based performance on this set corresponded with Az = 0.83, using nine features. A subset of the data set, containing mammograms from 90 patients, was used for comparing the computer results to radiologists' performance. Ten radiologists read these cases on a light-box and assessed the probability of malignancy for each patient. All participants had experience in clinical mammography and participated in our observer study during the last 2 days of a 2-week training session leading to screening mammography certification. Results on the subset showed that the method's performance (Az = 0.83) was considerably higher than that of the radiologists (Az = 0.63).  相似文献   

18.
Computer-aided diagnosis (CAD) for characterization of mammographic masses as malignant or benign has the potential to assist radiologists in reducing the biopsy rate without increasing false negatives. The purpose of this study was to develop an automated method for mammographic mass segmentation and explore new image based features in combination with patient information in order to improve the performance of mass characterization. The authors' previous CAD system, which used the active contour segmentation, and morphological, textural, and spiculation features, has achieved promising results in mass characterization. The new CAD system is based on the level set method and includes two new types of image features related to the presence of microcalcifications with the mass and abruptness of the mass margin, and patient age. A linear discriminant analysis (LDA) classifier with stepwise feature selection was used to merge the extracted features into a classification score. The classification accuracy was evaluated using the area under the receiver operating characteristic curve. The authors' primary data set consisted of 427 biopsy-proven masses (200 malignant and 227 benign) in 909 regions of interest (ROIs) (451 malignant and 458 benign) from multiple mammographic views. Leave-one-case-out resampling was used for training and testing. The new CAD system based on the level set segmentation and the new mammographic feature space achieved a view-based Az value of 0.83 +/- 0.01. The improvement compared to the previous CAD system was statistically significant (p = 0.02). When patient age was included in the new CAD system, view-based and case-based Az values were 0.85 +/- 0.01 and 0.87 +/- 0.02, respectively. The study also demonstrated the consistency of the newly developed CAD system by evaluating the statistics of the weights of the LDA classifiers in leave-one-case-out classification. Finally, an independent test on the publicly available digital database for screening mammography with 132 benign and 197 malignant ROIs containing masses achieved a view-based Az value of 0.84 +/- 0.02.  相似文献   

19.
本研究以灰度共生矩阵描述乳腺钼靶X线影像中结构扭曲的纹理特征.对学习样本(乳腺结构扭曲样本44个,正常样本78个),计算五个反映纹理性质的特征参数,根据相应的Fisher系数,确定最适合作为分类依据的特征参数或特征参数组合.用线性判别分析对测试样本(乳腺结构扭曲样本43个,正常样本78个)进行分类.分类结果表明本研究确定的纹理特征熵(ENT)是识别乳腺结构扭曲的最佳统计参数(分类正确率达78.5%、ROC曲线下的面积为0.786).  相似文献   

20.
Computerized detection of breast masses in digitized mammograms   总被引:1,自引:0,他引:1  
We propose a system to detect malignant masses on mammograms. We investigated the behavior of an iris filter at different scales. After iris filter was applied, suspicious regions were segmented by means of an adaptive threshold. Suspected regions were characterized with features based on the iris filter output and, gray level, texture, contour-related, and morphological features extracted from the image. A backpropagation neural network classifier was trained to reduce the number of false positives. The system was developed and evaluated with two completely independent data sets. Results for a test set of 66 malignant and 49 normal cases, evaluated with free-response receiver operating characteristic analysis, yielded a sensitivity of 88% and 94% at 1.02 false positives per image for lesion-based and case-based evaluation, respectively. Results suggest that the proposed method could help radiologists as a second reader in mammographic screening.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号