Similar literature
1.
We investigate multiclass cancer classification with gene selection from gene expression data. Two multiclass classifiers with gene selection are proposed: a fuzzy support vector machine (FSVM) with gene selection and a binary classification tree based on SVM with gene selection. Using the F test and SVM-based recursive feature elimination (RFE) as gene selection methods, we test three combinations in our experiments: the SVM-based binary classification tree with the F test, the SVM-based binary classification tree with SVM-based RFE, and the FSVM with SVM-based RFE. To accelerate computation, the strongest genes are preselected. The proposed techniques are applied to breast cancer data, small round blue-cell tumor data, and acute leukemia data. Compared with existing multiclass cancer classifiers and with the two binary classification tree variants described in this paper, the FSVM with SVM-based RFE can find the most important genes that affect certain types of cancer with high recognition accuracy.
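The F test used for gene preselection can be illustrated with a minimal sketch. It assumes the one-way ANOVA form of the F statistic, which the abstract does not spell out; the function names and the toy expression values are invented for illustration only.

```python
from statistics import mean

def f_statistic(values, labels):
    """One-way ANOVA F statistic for one gene across the classes in `labels`."""
    groups = {}
    for v, y in zip(values, labels):
        groups.setdefault(y, []).append(v)
    n, k = len(values), len(groups)
    grand = mean(values)
    # Between-class and within-class sums of squares.
    ss_between = sum(len(g) * (mean(g) - grand) ** 2 for g in groups.values())
    ss_within = sum((v - mean(g)) ** 2 for g in groups.values() for v in g)
    if ss_within == 0:
        return float("inf")
    return (ss_between / (k - 1)) / (ss_within / (n - k))

def rank_genes(expression, labels, top=2):
    """Return the indices of the `top` genes with the largest F statistic."""
    scores = [f_statistic(row, labels) for row in expression]
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return order[:top]

# Toy data (made up): rows are genes, columns are samples from three classes.
labels = ["A", "A", "B", "B", "C", "C"]
expression = [
    [1.0, 1.1, 5.0, 5.2, 9.0, 9.1],  # gene 0: separates all three classes
    [3.0, 2.9, 3.1, 3.0, 2.8, 3.2],  # gene 1: uninformative
    [1.0, 1.2, 1.1, 0.9, 6.0, 6.3],  # gene 2: separates class C only
]
selected = rank_genes(expression, labels, top=2)
```

In a pipeline like the one described, the preselected genes would then be passed to the slower SVM-based elimination step; that step is not shown here.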

2.
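To improve the classification accuracy of motor imagery EEG signals, and to address the cumbersome parameter search, heavy workload, and low accuracy of the traditional support vector machine (SVM) method in EEG processing, this study proposes a classification method in which the SVM is optimized by the artificial bee colony (ABC) algorithm. First, regularized common spatial patterns are used to extract features from the EEG signals. Next, the ABC algorithm optimizes the SVM penalty factor and kernel parameter. Finally, the extracted features of two classes of EEG samples (right hand and right foot) are used to train and test the optimized SVM. The experimental results show that the ABC-SVM classifier improves EEG classification accuracy, exceeding the traditional SVM classifier by 2.5%, which demonstrates the feasibility and higher accuracy of the algorithm.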

3.
Translation of electroencephalographic (EEG) recordings into control signals for brain–computer interface (BCI) systems needs to be based on a robust classification of the various types of information. EEG-based BCI features are often noisy and likely to contain outliers. This contribution describes the application of a fuzzy support vector machine (FSVM) with a radial basis function kernel for classifying motor imagery tasks, while the statistical features over the set of the wavelet coefficients were extracted to characterize the time–frequency distribution of EEG signals. In the proposed FSVM classifier, a low fraction of support vectors was used as a criterion for choosing the kernel parameter and the trade-off parameter, together with the membership parameter based solely on training data. FSVM and support vector machine (SVM) classifiers outperformed the winner of the BCI Competition 2003 and other similar studies on the same Graz dataset, in terms of the competition criterion of the mutual information (MI), while the FSVM classifier yielded a better performance than the SVM approach. FSVM and SVM classifiers perform much better than the winner of the BCI Competition 2005 on the same Graz dataset for the subject O3 according to the competition criterion of the maximal MI steepness, while the FSVM classifier outperforms the SVM method. The proposed FSVM model has potential in reducing the effects of noise or outliers in the online classification of EEG signals in BCIs.

4.
Magnetic resonance imaging (MRI) is playing an important role in the classification of breast tumors. MRI can be used to obtain multiparametric (mp) information, such as structural, hemodynamic, and physiological information. Quantitative analysis of mp-MRI data has shown potential in improving the accuracy of breast tumor classification. In general, a large set of quantitative and texture features can be generated depending upon the type of methodology used. A suitable combination of selected quantitative and texture features can further improve the accuracy of tumor classification. Machine learning (ML) classifiers based upon features derived from MRI data have shown potential in tumor classification. There is a need for further research studies on selecting an appropriate combination of features and evaluating the performance of different ML classifiers for accurate classification of breast tumors. The objective of the current study was to develop and optimize an ML framework based upon mp-MRI features for the characterization of breast tumors (malignant vs. benign and low- vs. high-grade). This study included the breast mp-MRI data of 60 female patients with histopathology results. A total of 128 features were extracted from the mp-MRI tumor data followed by features selection. Five ML classifiers were evaluated for tumor classification using 10-fold crossvalidation with 10 repetitions. The support vector machine (SVM) classifier based on optimum features selected using a wrapper method with an adaptive boosting (AdaBoost) technique provided the highest sensitivity (0.96 ± 0.03), specificity (0.92 ± 0.09), and accuracy (94% ± 2.91%) in the classification of malignant versus benign tumors. This method also provided the highest sensitivity (0.94 ± 0.07), specificity (0.80 ± 0.05), and accuracy (90% ± 5.48%) in the classification of low- versus high-grade tumors. 
These findings suggest that the SVM classifier outperformed other ML methods in the binary classification of breast tumors.

5.
In this paper, a novel automatic approach to identifying brain structures in magnetic resonance imaging (MRI) is presented for volumetric measurements. The method is based on active contour models and support vector machine (SVM) classifiers. Its main contributions are effective modifications of brain images for the active contour model and the extraction of simple, useful features for the SVM classifier. The segmentation process starts with a new generation of active contour models, vector field convolution (VFC), applied to modified brain images. The VFC results are brain images with the fewest non-brain regions, which are passed on to the SVM classification. The SVM features are selected according to the structure of the brain tissues: gray matter (GM), white matter (WM), and cerebrospinal fluid (CSF). An SVM classifier is trained for each brain tissue on the set of extracted features. Although the selected features are very simple, they are sufficient and effective for each tissue separately. The method is validated on the gold standard brain MRI data set. Comparison of the results with existing algorithms indicates that the approach is successful.

6.
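The support vector machine (SVM) is a new general-purpose learning method developed on the basis of statistical learning theory, and it handles classification with limited samples well. In early cancer diagnosis, a highly accurate diagnosis is difficult because of the scarcity of cancer cells, patient-specific variation, and noise in the data. Using SVM classification algorithms with different kernel functions, different SVM classifiers were constructed and applied to early cancer diagnosis. The nonlinear SVMs achieved relatively high accuracy, indicating that SVMs have great potential in early cancer diagnosis.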

7.
In this paper, a spike detection method is introduced. The traditional morphological filter is improved for extracting spikes from epileptic EEG signals, and two key problems are addressed: morphological operation design and structure element optimization. An equally weighted average of the open-closing and close-opening operations, which can eliminate the statistical deflection of amplitude, is used to separate background EEG and spikes. Then, according to the characteristics of the spike component, the structure elements are constructed from two parabolas, and a new criterion is put forward to optimize them. The proposed method is evaluated using normal and epileptic EEG data recorded from 12 test subjects. A comparison between the improved morphological filter, the traditional morphological filter, and wavelet analysis with the Mexican hat function indicates that the improved morphological filter is superior in restraining background activities. We demonstrate that the average detection rate of the improved morphological filter is much higher than that of the other two methods, and that the proposed method produces no false detections for normal EEG signals.
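The averaged open-closing and close-opening operation can be sketched on a one-dimensional signal as follows. This is a simplified illustration, not the authors' filter: it assumes a flat structure element of odd width `w` (the abstract uses elements built from two parabolas), takes "open-closing" to mean opening followed by closing, and uses an invented test signal.

```python
def erode(x, w):
    """Flat erosion: sliding minimum over a window of width w (clipped at the edges)."""
    h = w // 2
    return [min(x[max(0, i - h): i + h + 1]) for i in range(len(x))]

def dilate(x, w):
    """Flat dilation: sliding maximum over a window of width w (clipped at the edges)."""
    h = w // 2
    return [max(x[max(0, i - h): i + h + 1]) for i in range(len(x))]

def opening(x, w):
    return dilate(erode(x, w), w)

def closing(x, w):
    return erode(dilate(x, w), w)

def background(x, w):
    """Average of open-closing and close-opening, used as the background estimate."""
    oc = closing(opening(x, w), w)
    co = opening(closing(x, w), w)
    return [(a + b) / 2 for a, b in zip(oc, co)]

# Made-up signal: constant baseline of 1.0 with a one-sample spike at index 10.
signal = [1.0] * 20
signal[10] = 6.0
bg = background(signal, 5)
spikes = [s - b for s, b in zip(signal, bg)]  # residual after removing the background
```

Opening alone biases the estimate downward and closing alone biases it upward, which is why the two composite operations are averaged.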

8.
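Heart sound signals reflect pathological information about the heart and are an important basis for diagnosing cardiac health. This paper first extracts 145 features from heart sound signals, including time-frequency-domain features and Mel-frequency cepstral coefficients, as the input dataset for machine learning. Then, among five classifiers (random forest, LightGBM, XGBoost, GBDT, and SVM), the best-performing classifier is selected and combined with the recursive feature elimination algorithm for data mining, in order to find the important feature set and to compare and analyze its classification performance. Finally, the Stacking model fusion method is used to optimize the model. Compared with feature subsets of the same size, the mined feature subset improved accuracy, recall, precision, and F1 score by 33.51%, 14.54%, 20.61%, and 24.04%, respectively; fusing the LightGBM and SVM models raised the F1 score to 92.6%. This paper proposes an effective heart sound recognition and classification method and identifies the 8 most important heart sound features, providing a reference for clinical diagnosis.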

9.
Various problems with the current state-of-the-art techniques for gated radiotherapy have prevented this new treatment modality from being widely implemented in clinical routine. These problems are caused mainly by applying various external respiratory surrogates. There might be large uncertainties in deriving the tumor position from external respiratory surrogates. While tracking implanted fiducial markers has sufficient accuracy, this procedure may not be widely accepted due to the risk of pneumothorax. Previously, we have developed a technique to generate gating signals from fluoroscopic images without implanted fiducial markers using template matching methods (Berbeco et al 2005 Phys. Med. Biol. 50 4481-90, Cui et al 2007b Phys. Med. Biol. 52 741-55). In this note, our main contribution is to provide a totally different new view of the gating problem by recasting it as a classification problem. Then, we solve this classification problem by a well-studied powerful classification method called a support vector machine (SVM). Note that the goal of an automated gating tool is to decide when to turn the beam ON or OFF. We treat ON and OFF as the two classes in our classification problem. We create our labeled training data during the patient setup session by utilizing the reference gating signal, manually determined by a radiation oncologist. We then pre-process these labeled training images and build our SVM prediction model. During treatment delivery, fluoroscopic images are continuously acquired, pre-processed and sent as an input to the SVM. Finally, our SVM model will output the predicted labels as gating signals. We test the proposed technique on five sequences of fluoroscopic images from five lung cancer patients against the reference gating signal as ground truth. We compare the performance of the SVM to our previous template matching method (Cui et al 2007b Phys. Med. Biol. 52 741-55). 
We find that the SVM is slightly more accurate on average (1-3%) than the template matching method when delivering the target dose, and that the average duty cycle is 4-6% longer. Given the very limited patient dataset, we cannot conclude that the SVM is more accurate and efficient than the template matching method. However, our preliminary results show that the SVM is a potentially precise and efficient algorithm for generating gating signals for radiotherapy. This work demonstrates that the gating problem can be considered as a classification problem and solved accordingly.

10.
A multichannel statistical classifier for detecting prostate cancer was developed and validated by combining information from three different magnetic resonance (MR) methodologies: T2-weighted, T2-mapping, and line scan diffusion imaging (LSDI). From these MR sequences, four different sets of image intensities were obtained: T2-weighted (T2W) from T2-weighted imaging, Apparent Diffusion Coefficient (ADC) from LSDI, and proton density (PD) and T2 (T2 Map) from T2-mapping imaging. Manually segmented tumor labels from a radiologist, which were validated by biopsy results, served as tumor "ground truth." Textural features were extracted from the images using co-occurrence matrix (CM) and discrete cosine transform (DCT). Anatomical location of voxels was described by a cylindrical coordinate system. A statistical jack-knife approach was used to evaluate our classifiers. Single-channel maximum likelihood (ML) classifiers were based on 1 of the 4 basic image intensities. Our multichannel classifiers: support vector machine (SVM) and Fisher linear discriminant (FLD), utilized five different sets of derived features. Each classifier generated a summary statistical map that indicated tumor likelihood in the peripheral zone (PZ) of the prostate gland. To assess classifier accuracy, the average areas under the receiver operator characteristic (ROC) curves over all subjects were compared. Our best FLD classifier achieved an average ROC area of 0.839(+/-0.064), and our best SVM classifier achieved an average ROC area of 0.761(+/-0.043). The T2W ML classifier, our best single-channel classifier, only achieved an average ROC area of 0.599(+/-0.146). Compared to the best single-channel ML classifier, our best multichannel FLD and SVM classifiers have statistically superior ROC performance (P=0.0003 and 0.0017, respectively) from pairwise two-sided t-test. 
By integrating the information from multiple images and capturing the textural and anatomical features in tumor areas, summary statistical maps can potentially aid in image-guided prostate biopsy and assist in guiding and controlling delivery of localized therapy under image guidance.
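The ROC area used above to compare classifiers can be computed without tracing the curve, as the fraction of (tumor, non-tumor) pairs that the classifier ranks correctly. The sketch below assumes binary labels (1 for tumor, 0 otherwise) and uses invented scores; it is not taken from the paper.

```python
def roc_auc(scores, labels):
    """Area under the ROC curve via pairwise ranking; ties count as half a win."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))

# Made-up tumor-likelihood scores for six voxels of one subject.
scores = [0.9, 0.8, 0.7, 0.6, 0.55, 0.4]
labels = [1, 1, 0, 1, 0, 0]
auc = roc_auc(scores, labels)  # 8 of the 9 positive/negative pairs are ordered correctly
```

Averaging this value over subjects would give a per-classifier summary comparable in form to the average ROC areas reported in the abstract.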

11.
Spikes are classified according to their finite differences in various orders. The fundamental idea that makes it work is that finite differences can extract and isolate features from spikes. This method showed better sorting quality and involved less labor than the methods of principal component analysis, original reduced feature set, and wavelet-based spike classifiers.
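A minimal sketch of finite differences of several orders applied to a spike waveform is given below. The abstract does not say how features are derived from the differences, so taking the extreme values of each order is a hypothetical choice made only for illustration, as is the sample waveform.

```python
def finite_difference(x, order):
    """Apply the forward difference x[i+1] - x[i] `order` times."""
    for _ in range(order):
        x = [b - a for a, b in zip(x, x[1:])]
    return x

def fd_features(spike, orders=(1, 2, 3)):
    """Hypothetical feature vector: max and min of each difference order."""
    feats = []
    for k in orders:
        d = finite_difference(spike, k)
        feats.extend([max(d), min(d)])
    return feats

# Made-up symmetric spike waveform.
spike = [0, 1, 4, 9, 4, 1, 0]
features = fd_features(spike)
```

Higher orders emphasize sharp changes in slope, which is one way differences can isolate shape features that the raw amplitudes mix together.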

12.
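Application of support vector machines in blood cell classification   Cited by: 9 (self-citations: 1, citations by others: 9)
The support vector machine (SVM) is a new learning algorithm derived from statistical theory. The algorithm is typically used for binary classification problems; this paper extends it to multiclass problems. A multistage SVM classifier is used to classify blood cells at different maturation stages in bone marrow. The paper first proposes constructing the multistage SVM with a stepwise-decomposition hierarchical clustering algorithm. The optimal control parameters of the SVM at each stage are then determined according to certain criteria. To better understand classification performance and better estimate the classification error rate, 3-fold cross-validation is used to compare the method with traditional classification methods. Experiments show that the SVM classifier neatly avoids the curse of dimensionality, generalizes well, and can improve the accuracy of blood cell classification.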
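The 3-fold cross-validation estimate of the error rate can be sketched as follows. To stay self-contained the example swaps in a nearest-centroid rule on a single scalar feature in place of the SVM; the fold assignment, classifier, and data are all assumptions made for illustration.

```python
def k_fold_indices(n, k=3):
    """Split indices 0..n-1 into k interleaved folds."""
    return [list(range(i, n, k)) for i in range(k)]

def nearest_centroid_predict(train, x):
    """Stand-in classifier (not an SVM): pick the class whose mean feature is closest to x."""
    sums = {}
    for feature, label in train:
        total, count = sums.get(label, (0.0, 0))
        sums[label] = (total + feature, count + 1)
    centroids = {label: total / count for label, (total, count) in sums.items()}
    return min(centroids, key=lambda label: abs(x - centroids[label]))

def cross_val_error(data, k=3):
    """Fraction of samples misclassified when each fold is held out once."""
    errors = 0
    for held_out in k_fold_indices(len(data), k):
        held = set(held_out)
        train = [d for i, d in enumerate(data) if i not in held]
        errors += sum(
            nearest_centroid_predict(train, data[i][0]) != data[i][1]
            for i in held_out
        )
    return errors / len(data)

# Made-up (feature, class) pairs for two cell types.
data = [(1.0, "a"), (5.0, "b"), (1.2, "a"), (5.3, "b"), (0.9, "a"),
        (4.8, "b"), (1.1, "a"), (5.1, "b"), (1.3, "a")]
error_rate = cross_val_error(data, k=3)
```

Every sample is used for testing exactly once, so the error estimate does not depend on a single train/test split.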

13.
Primary and Secondary Polycythemia are diseases of the bone marrow that affect the blood's composition and prohibit patients from becoming blood donors. Since these diseases may become fatal, their early diagnosis is important. In this paper, a classification system for the diagnosis of Primary and Secondary Polycythemia is proposed. The proposed system classifies input data into three classes; Healthy, Primary Polycythemic (PP) and Secondary Polycythemic (SP) and is implemented using two separate binary classification levels. The first level performs the Healthy/non-Healthy classification and the second level the PP/SP classification. To this end, a novel wrapper feature selection algorithm, called the LM–FM algorithm, is presented in order to maximize the classifier's performance. The algorithm is comprised of two stages that are applied sequentially: the Local Maximization (LM) stage and the Floating Maximization (FM) stage. The LM stage finds the best possible subset of a fixed predefined size, which is then used as an input for the next stage. The FM stage uses a floating size technique to search for an even better solution by varying the initially provided subset size. Then, the Support Vector Machine (SVM) classifier is used for the discrimination of the data at each classification level. The proposed classification system is compared with various well-established feature selection techniques such as the Sequential Floating Forward Selection (SFFS) and the Maximum Output Information (MOI) wrapper schemes, and with standalone classification techniques such as the Multilayer Perceptron (MLP) and SVM classifier. The proposed LM–FM feature selection algorithm combined with the SVM classifier increases the overall performance of the classification system, scoring up to 98.9% overall accuracy at the first classification level and up to 96.6% at the second classification level. 
Moreover, it provides excellent robustness regardless of the size of the input feature subset used.

14.
Automatic Removal of Eye-Movement and Blink Artifacts from EEG Signals   Cited by: 1 (self-citations: 0, citations by others: 1)
Frequent occurrence of electrooculography (EOG) artifacts leads to serious problems in interpreting and analyzing the electroencephalogram (EEG). In this paper, a robust method is presented to automatically eliminate eye-movement and eye-blink artifacts from EEG signals. Independent Component Analysis (ICA) is used to decompose EEG signals into independent components. Moreover, the features of topographies and power spectral densities of those components are extracted to identify eye-movement artifact components, and a support vector machine (SVM) classifier is adopted because it has higher performance than several other classifiers. The classification results show that feature-extraction methods are unsuitable for identifying eye-blink artifact components, and then a novel peak detection algorithm of independent component (PDAIC) is proposed to identify eye-blink artifact components. Finally, the artifact removal method proposed here is evaluated by the comparisons of EEG data before and after artifact removal. The results indicate that the method proposed could remove EOG artifacts effectively from EEG signals with little distortion of the underlying brain signals.

15.
OBJECTIVE: Paroxysmal atrial fibrillation (PAF) is a serious arrhythmia associated with morbidity and mortality. We explore the possibility of distant prediction of PAF by analyzing changes in heart rate variability (HRV) dynamics of non-PAF rhythms immediately before a PAF event. We use the resulting model for distant prognosis of PAF onset with artificial intelligence methods. METHODS AND MATERIALS: We analyzed 30-min non-PAF HRV records from 51 subjects immediately before PAF onset and at least 45 min distant from any PAF event. We used spectral and complexity analysis with sample (SmEn) and approximate (ApEn) entropies and their multiscale versions on the extracted HRV data. We used those features to train artificial neural network (ANN) and support vector machine (SVM) classifiers to differentiate the subjects. The trained classifiers were further tested for distant PAF event prognosis on 16 subjects from an independent database, on non-PAF rhythm lasting from 60 to 320 min before PAF onset, classifying the 30-min segments as distant or leading to PAF. RESULTS: We found a statistically significant increase in the VLF, LF, and HF bands and in total power (p<0.0001) in the 30-min non-PAF HRV recordings from 51 subjects before a PAF event compared to PAF-distant ones. The SmEn and ApEn analysis showed a significant decrease in complexity (p<0.0001 and p<0.001) before PAF onset. For training the ANN and SVM classifiers, the data from 51 subjects were randomly split into training, validation, and testing sets. The ANN provided better results in terms of sensitivity (Se), specificity (Sp), and positive predictivity (Pp) than the SVM, which became biased towards the positive case. The validation results achieved with the ANN classifier were Se 76%, Sp 93%, and Pp 94%. Testing the ANN and SVM classifiers on 16 subjects with non-PAF HRV data preceding PAF events, we obtained distant prediction of PAF onset with the SVM classifier in 10 subjects (58+/-18 min in advance). The ANN classifier provided distant prediction of a PAF event in 13 subjects (62+/-21 min in advance). CONCLUSION: From the results of distant PAF prediction we conclude that the ANN and SVM classifiers learned the changes in the HRV dynamics immediately before a PAF event and successfully identified them during distant PAF prognosis on an independent database. This confirms results reported in the literature that corresponding changes in the HRV data occur about 60 min before PAF onset, and demonstrates the possibility of distant PAF prediction with ANN and SVM methods.
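The sample entropy used as a complexity measure can be sketched as below. This is a textbook-style formulation under stated assumptions: the tolerance `r` is an absolute value (in practice it is often set relative to the standard deviation of the series), matches use the Chebyshev distance with a less-than-or-equal comparison, and the input series is invented. The paper's exact parameters are not given in the abstract.

```python
import math

def sample_entropy(x, m=2, r=0.2):
    """Sample entropy: -ln(A / B), where B and A count template pairs of
    length m and m + 1 whose Chebyshev distance is at most r."""
    n = len(x)

    def count_matches(length):
        # Use the first n - m templates for both lengths so the counts are comparable.
        templates = [x[i:i + length] for i in range(n - m)]
        count = 0
        for i in range(len(templates)):
            for j in range(i + 1, len(templates)):
                dist = max(abs(a - b) for a, b in zip(templates[i], templates[j]))
                if dist <= r:
                    count += 1
        return count

    b = count_matches(m)
    a = count_matches(m + 1)
    if a == 0 or b == 0:
        return float("inf")  # undefined for this series; reported here as infinity
    return -math.log(a / b)

# Made-up series: a flat signal with one disturbance.
series = [0, 0, 0, 1, 0, 0, 0, 0]
entropy = sample_entropy(series, m=2, r=0.2)
```

Lower values indicate a more regular signal, which is the direction of change the abstract reports before PAF onset.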

16.
Signal separation of background EEG and spike by using morphological filter   Cited by: 2 (self-citations: 0, citations by others: 2)
A signal separation method for extracting background electroencephalogram (EEG) from EEG containing spikes was proposed. Morphological filters were designed for extracting spike waveforms, and then the background EEG was obtained by subtracting the detected spike waveforms from the EEG with spike. The proposed method was evaluated by using simulated EEG data, which consisted of a summation of EEG without spike and model waveform of typical spike. The background EEG separated by the method was processed by the automatic background EEG interpretation.

17.
This paper presents an effective classification scheme consisting of rough set theory (RST)-based feature selection and a fuzzy least squares support vector machine (LS-SVM) classifier for surface electromyographic (sEMG)-based motion classification. The wavelet packet transform (WPT) is used to decompose the four-class motion EMG signals into non-overlapping sub-bands, and the energy characteristic of each sub-band is adopted to form the original feature set. To reduce computational complexity, the RST is used to obtain a reduced feature set without compromising classification accuracy. In the feature reduction phase, the cluster separation index (CSI) is introduced to evaluate the performance of the proposed algorithm. Subsequently, the fuzzy LS-SVM is constructed for the multi-class classification task. The RST-based feature selection is independent of the classifier design; consequently, the classification performance varies with different classifiers. We compare the proposed classification scheme with a commonly used scheme, namely the combination of principal component analysis (PCA)-based feature selection and a neural network (NN) classifier. The results of comparative experiments show that the diverse motions can be identified with high accuracy by the proposed scheme. Compared with other feature extraction and selection algorithms and classifiers, the superior performance of the proposed classification scheme illustrates the potential of SVM techniques combined with WPT and RST in EMG motion classification.

18.
In this paper, we compare five common classifier families in their ability to categorize six lung tissue patterns in high-resolution computed tomography (HRCT) images of patients affected with interstitial lung diseases (ILD) and with healthy tissue. The evaluated classifiers are naive Bayes, k-nearest neighbor, J48 decision trees, multilayer perceptron, and support vector machines (SVM). The dataset used contains 843 regions of interest (ROI) of healthy and five pathologic lung tissue patterns identified by two radiologists at the University Hospitals of Geneva. Correlation of the feature space composed of 39 texture attributes is studied. A grid search for optimal parameters is carried out for each classifier family. Two complementary metrics are used to characterize the performances of classification. These are based on McNemar’s statistical tests and global accuracy. SVM reached best values for each metric and allowed a mean correct prediction rate of 88.3% with high class-specific precision on testing sets of 423 ROIs.
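McNemar's test compares two classifiers on the same test samples by looking only at the cases where exactly one of them is correct. The sketch below assumes the chi-square form with continuity correction, which may differ from the variant used in the paper; the per-sample correctness vectors are invented.

```python
import math

def mcnemar(correct_a, correct_b):
    """Return (statistic, p_value) for paired per-sample correctness flags.

    Uses the chi-square approximation with continuity correction and
    1 degree of freedom, for which the p-value is erfc(sqrt(stat / 2)).
    """
    only_a = sum(1 for a, b in zip(correct_a, correct_b) if a and not b)
    only_b = sum(1 for a, b in zip(correct_a, correct_b) if b and not a)
    if only_a + only_b == 0:
        return 0.0, 1.0  # the classifiers never disagree
    stat = max(abs(only_a - only_b) - 1, 0) ** 2 / (only_a + only_b)
    return stat, math.erfc(math.sqrt(stat / 2))

# Made-up results on 50 ROIs: A alone is right on 15, B alone on 5, both on 30.
correct_a = [True] * 15 + [False] * 5 + [True] * 30
correct_b = [False] * 15 + [True] * 5 + [True] * 30
stat, p_value = mcnemar(correct_a, correct_b)
```

Because samples on which both classifiers agree are ignored, the test complements global accuracy: two classifiers with similar accuracy can still differ significantly in which samples they get right.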

19.
Patients with obstructive sleep apnoea syndrome (OSAS) are at increased risk of developing hypertension and other cardiovascular diseases. This paper explores the use of support vector machines (SVMs) for automated recognition of patients with OSAS types (±) using features extracted from nocturnal ECG recordings, and compares its performance with other classifiers. Features extracted from wavelet decomposition of heart rate variability (HRV) and ECG-derived respiration (EDR) signals of whole records (30 learning sets from physionet) are presented as inputs to train the SVM classifier to recognize OSAS± subjects. The optimal SVM parameter set is then determined by using a leave-one-out procedure. Independent test results have shown that an SVM using a subset of a selected combination of HRV and EDR features correctly recognized 30/30 of physionet test sets. In comparison, classification performance of K-nearest neighbour, probabilistic neural network, and linear discriminant classifiers on test data was lower. These results, therefore, demonstrate considerable potential in applying SVM in ECG-based screening and can aid sleep specialists in the initial assessment of patients with suspected OSAS.

20.