Hyperscalar Processor Architecture and the Preliminary Performance Evaluation
スポンサーリンク
概要
- 論文の詳細を見る
This paper describes a novel processor architecture, called hyperscalar processor architecture, which encompasses the advantages of superscalar, VLIW, and vector processor architectures and excludes their disadvantages. In brief hyperscalar is a processor, i) whose instruction size and instruction-fetch bandwidth are the same as those of superscalar, ii) whose datapath is as large as that of VLIW, iii) which provides every independent functional unit with one or more compiler-visible registers, called instruction registers, and iv) which allows the program itself to load the instruction registers with instructions fetched from the memory and to execute them as a subroutine. As compiler techniques for creating an object code placed in the instruction registers, this paper proposes pseudo vector processing and software pipelining, and further discusses several issues on applying software pipeling to hyperscalar processors. This paper evaluates the performance attainable in hyperscalar processors, and then concludes that hyperscalar processors can outperform conventional superscalar, VLIW, and vector processors in terms of cost/performance.
- 九州大学の論文
著者
-
Saitoh Yasuhiko
Department Information Systems Kyushu University
-
Murakami Kazuaki
Department Of Information Systems Kyushu University
-
Murakami Kazuaki
Department Of Computer Science And Communication Engineering Kyushu University
-
Miyajima Hiroshi
Department of Information Systems,Kyushu University
-
Miyajima Hiroshi
Department Of Information Systems Kyushu University
-
Miyajima Hiroshi
Department Of Computer Science And Communication Engineering Kyushu University
関連論文
- Quantitative Evaluation of State-Preserving Leakage Reduction Algorithm for L1 Data Caches
- Improving Performance and Energy Saving in a Reconfigurable Processor via Accelerating Control Data Flow Graphs
- Character Projection Mask Set Optimization for Enhancing Throughput of MCC Projection Systems
- Reliable Cache Architectures and Task Scheduling for Multiprocessor Systems
- Architectural-Level Soft-Error Modeling for Estimating Reliability of Computer Systems(VLSI Design Technology,VLSI Technology toward Frontiers of New Market)
- A Reconfigurable Functional Unit with Conditional Execution for Multi-Exit Custom Instructions
- Temperature-Aware Configurable Cache to Reduce Energy in Embedded Systems
- The potential of temperature-aware configurable cache on energy reduction (計算機アーキテクチャ)
- The potential of temperature-aware configurable cache on energy reduction (集積回路)
- Custom Instructions with Multiple Exits : Generation and Execution
- Custom Instructions with Multiple Exits : Generation and Execution
- A Reconfigurable Functional Unit for Adaptable Custom Instructions
- A Reconfigurable Functional Unit for Adaptable Custom Instructions(集積回路技術とアーキテクチャ技術の協調・融合へ向けた,プロセッサ,並列処理,システムLSIアーキテクチャ及び一般)
- An Adaptive Dynamic Extensible Processor
- Performance Models for MPI Collective Communications with Network Contention
- Instruction Encoding for Reducing Power Consumption of I-ROMs Based on Execution Locality
- Trends in High-Performance, Low-Power Cache Memory Architectures
- Omitting Cache Look-up for High-Performance, Low-Power Microprocessors(Special Issue on High-Performance and Low-Power Microprocessors)
- Flavor Sensing with Polyarginine-Counteranion Complexes in Lipid Bilayers
- Hyperscalar Processor Architecture and the Preliminary Performance Evaluation
- PPRAM (Parallel Processing RAM) : A Merged-DRAM/Logic System-LSI Architecture
- A Message-Pool-Based Parallel Operating System for the Kyushu University Reconfigurable Parallel Processor : Parallel Creation of Multiple Threads
- Optimisations Techniques for the Automatic ISA Customisation Algorithm