首页 | 本学科首页   官方微博 | 高级检索  
检索        

基于GPU的矩阵求逆性能测试和分析
引用本文:刘丽,沈杰,李洪林.基于GPU的矩阵求逆性能测试和分析[J].医学教育探索,2010(6):812-817.
作者姓名:刘丽  沈杰  李洪林
作者单位:华东理工大学信息科学与工程学院,上海 200237;华东理工大学药学院,上海 200237;华东理工大学药学院,上海 200237
基金项目:国家“973”计划基金项目(2009CB918501);国家自然科学基金项目(20803022)
摘    要:在CPU串行运算模式下实现大规模矩阵求逆是一个非常耗时的过程。为了解决这一问题,基于NVIDIA公司专为GPU(图形处理器)提供的CUDA(计算统一设备架构),从新的编程角度出发,利用GPU多线程并行处理技术,将矩阵求逆过程中大量的数据实现并行运算,从而获得了较大的加速比。同时,根据程序的执行结果,分析了GPU的单精度与双精度的浮点运算能力及其优、劣势。最后,通过分析数据传输时间对GPU性能的影响,总结出适合GPU的算法特征。

关 键 词:图形处理器(GPU)    计算统一设备架构(CUDA)    CPU    并行运算    矩阵求逆

Performance Testing and Analysis for Matrix Inversion Base on GPU
LIU Li,SHEN Jie and LI Hong-lin.Performance Testing and Analysis for Matrix Inversion Base on GPU[J].Researches in Medical Education,2010(6):812-817.
Authors:LIU Li  SHEN Jie and LI Hong-lin
Institution:School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200237, China;School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China;School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China
Abstract:For the CPU serial operation mode, it is a very time-consuming process to obtain the inverse of large-scale matrix. Aiming at the above shortcoming, this paper proposes a new programming method based on the common platform CUDA for GPU designed by NVIDIA. By using the multi-threaded parallel processing technology of GPU, a large scale of data during solving the inverse matrix are parallelly computed such that a higher speedup may be obtained. Moreover, both the single-precision and the double-precision FLOPS of GPU are analyzed according to the results of this program. Finally, some characteristics of the proposed algorithms are summarized by analyzing the effect of the data transmission time on the performance of GPU.
Keywords:GPU  CUDA  CPU  parallel computation  matrix inversion
点击此处可从《医学教育探索》浏览原始摘要信息
点击此处可从《医学教育探索》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号