首页 | 本学科首页   官方微博 | 高级检索  
     检索      

适用于片上并行计算阵列的超精简处理器架构
引用本文:周韧研,刘雷波,魏少军.适用于片上并行计算阵列的超精简处理器架构[J].电路与系统学报,2012,17(2):1-5.
作者姓名:周韧研  刘雷波  魏少军
作者单位:清华大学移动计算研究中心,北京100084;清华大学微电子学研究所,北京100084;清华信息科学与技术国家实验室,北京100084
基金项目:科技部重大专项(2009ZX03006-004);国家自然科学基金(60803018)
摘    要:提出一种超精简处理单元架构。该处理单元基于运算-跳转式单指令处理器体系。使用指令优化和内部总线上加速器,该处理单元能够执行传统算术运算式单指令处理器难于执行的高效位运算以及执行效率较低的数据转移操作。以该处理单元构成的片上大规模并行计算阵列可用于图像处理等局部性强、实时性要求高的计算任务。包含有该处理单元架构的16 16的原型阵列已经在FPGA上实现,性能达30.7GOPS@120MHz,平均功耗39.5mW。

关 键 词:超精简处理单元  并行计算  图像处理

Ultra-reduced microprocessor architecture in on-chip parallel computing array
ZHOU Ren-yan , LIU Lei-bo , WEI Shao-jun.Ultra-reduced microprocessor architecture in on-chip parallel computing array[J].Journal of Circuits and Systems,2012,17(2):1-5.
Authors:ZHOU Ren-yan  LIU Lei-bo  WEI Shao-jun
Institution:1,2,3(1.Research Center for Mobile Computing,Tsinghua University,Beijing,100084;2.Institute of Microelectronics,Tsinghua University,Beijing,100084;3.Tsinghua National Laboratory for Information Science and Technology,Beijing,100084)
Abstract:A design of ultra-reduced microprocessor architecture and its implementation are proposed in this paper.The architecture is based on one instruction set computer with instructions of arithmetic operation and conditional jump.With instruction optimization and dedicate hardware accelerators on local bus,the hardware architecture has significant execution efficiency on bitwise operations and data transfer operations,compared with traditional one instruction set computers.A parallel computing array incorporating with the proposed microprocessor enables computation tasks,such as most low-level image processing algorithms,that require high streaming throughput with characteristic of local operation.A 16x16 prototype array has been implemented on FPGA,delivering 30.7GOPS@120MHz with a power consumption of 39.5mW.
Keywords:ultra-reduced microprocessor  parallel computing  image processing
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号