Two-Step Physical Register Deallocation for Data Prefetching and Address Pre-Calculation
Abstract
This paper proposes an instruction pre-execution scheme for high-performance processors that reduces load latency and schedules loads early. Our scheme exploits the difference between the amount of instruction-level parallelism available with an unlimited number of physical registers and that available with the actual number of physical registers. We introduce a two-step physical register deallocation scheme. As the first step, it deallocates physical registers at the renaming stage, which eliminates pipeline stalls caused by a shortage of physical registers. As the second step, instructions wait in the instruction window for the final deallocation. While they wait, the scheme allows instructions to be pre-executed, which enables prefetching of load data and early calculation of effective memory addresses. Our evaluation shows that the scheme achieves a 1.26x speedup over a processor without a prefetcher. When combined with a stride prefetcher, it achieves a 1.18x speedup over a processor with a stride prefetcher.
- Paper of the Information and Media Technologies Editorial Board
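The speedup figures quoted in the abstract can be restated as reductions in execution time. The short sketch below does that conversion; it assumes speedup means baseline execution time divided by execution time with the scheme, which the abstract does not state explicitly, and the function name `time_reduction` is purely illustrative.

```python
def time_reduction(speedup: float) -> float:
    """Fraction of execution time saved, assuming speedup = T_baseline / T_new."""
    if speedup <= 0:
        raise ValueError("speedup must be positive")
    return 1.0 - 1.0 / speedup


# Speedups reported in the abstract, keyed by the baseline they are measured against.
reported = {
    "processor without a prefetcher": 1.26,
    "processor with a stride prefetcher": 1.18,
}

for baseline, speedup in reported.items():
    saved = time_reduction(speedup)
    print(f"vs. {baseline}: {speedup:.2f}x speedup, {saved:.1%} less execution time")
```

Under that assumption, the 1.26x and 1.18x speedups correspond to roughly 20.6% and 15.3% shorter execution time, respectively.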
Authors
- Yamamoto Akihiro
  Department of Biochemistry and Applied Biosciences, Faculty of Agriculture, University of Miyazaki
- Ando Hideki
  Department of Biology, Faculty of Science, Okayama University
- Tanaka Yusuke
  Department of Applied Chemistry, Graduate School of Engineering, Kyushu University
- TANAKA Yusuke
  Department of Computational Science and Engineering, Nagoya University
- Shimada Toshio
  Department of Biology, Faculty of Science and High Technology Research Center, Konan University
Related papers
- 4 Colonization and growth promotion characteristics of Herbaspirillum sp. B501 and Enterobacter sp for Brassica oleracea
- Colonization and growth promotion characteristics of Enterobacter sp. and Herbaspirillum sp. on Brassica oleracea(Soil Biology)
- Infection, Multiplication and Evaluation of the Nitrogen-Fixing Ability of Herbaspirillum sp. Strain B501gfp1 in Sugarcane Stems Inoculated by the Vacuum Infiltration Method
- Influence of inoculation technique on the endophytic colonization of rice by Pantoea sp. isolated from sweet potato and by Enterobacter sp. isolated from sugarcane(Soil Biology)
- Intragenomic variation in the internal transcribed spacer regions between 16S-23S rRNA genes among the three copies of Sinorhizobium fredii strains(Soil Biology)
- PJ-249 Clinical Implication of Delayed Contrast Enhancement by Gd-DTPA MRI and Elevated Brain Natriuretic Peptide Hormone in Aortic Stenosis(MRI/MRA-4 (I) PJ42,Poster Session (Japanese),The 70th Anniversary Annual Scientific Meeting of the Japanese Circul
- The Cutting Balloon Blades and Calcified Lesions : Are the Blades Cutting into the Calcification? : An Intravascular Ultrasound Investigation
- Cutting Balloon Angioplasty for the Treatment of Calcified Coronary Lesions : An Intravascular Ultrasound Study
- Flexible Field Emission Device Using Carbon Nanofiber Nanocomposite Sheet
- Estimation of nodulation tendency among Rj-genotype soybeans using the bradyrhizobial community isolated from an Andosol(Soil Biology)
- Estimation of the bacterial community diversity of soybean-nodulating bradyrhizobia isolated from Rj-genotype soybeans(Soil Biology)
- Diversity and distribution of indigenous soybean-nodulating rhizobia in the Okinawa islands, Japan(Soil Biology)
- Functional Immobilization of Recombinant Alkaline Phosphatases Bearing a Glutamyl Donor Substrate Peptide of Microbial Transglutaminase(ENZYMOLOGY, PROTEIN ENGINEERING, AND ENZYME TECHNOLOGY)
- Changes in Population Occupancy of Bradyrhizobia under Different Temperature Regimes
- Detection of Genes Encoding Cholera Toxin (CT), Zonula Occludens Toxin (ZOT), Accessory Cholera Enterotoxin (ACE) and Heat-Stable Enterotoxin (ST) in Vibrio mimicus Clinical Strains
- Hydrodynamic Evolution of Highly Energetic Matter Produced by Cylindrically Symmetric Heavy Ions Collisions
- A Priority Forwarding Scheme for Real-Time Multistage Interconnection Networks and Its Evaluation (Special Issue on Real-Time Processing Systems and Their Applications)
- Diesel Exhaust Particle-Induced Cell Death of Cultured Normal Human Bronchial Epithelial Cells
- Diesel Exhaust Particle-Induced Cell Death of Human Leukemic Promyelocytic Cells HL-60 and Their Variant Cells HL-NR6
- Involvement of Na+/Ca2+ Exchanger in Pentylenetetrazol-Induced Convulsion by Use of Na+/Ca2+ Exchanger Knockout Mice
- Synthesis and Structure of Group 10 Metal Complexes with New Tripodal Tetradentate Ligand Bearing One Phosphine and Three Thioether Moieties
- Extracorporeal Shock Wave Lithotripsy for the Treatment of Staghorn Calculi in 72 Patients
- Nephrogenic adenoma of the bladder in a chronic hemodialysis patient
- Extracorporeal Shock Wave Lithotripsy-induced Renal Laceration
- ESWL Monotherapy for the Treatment of Staghorn Calculi in 74 Patients
- PJ-177 Enhanced expression of V-1, a novel catecholamine biosynthesis regulatory protein, in atrial myocytes of hypertrophic heart of Dahl hypertensive rats(Hypertension, Basic 2 (H) : PJ30)(Poster Session (Japanese))
- Energy-Efficient Pre-Execution Techniques in Two-Step Physical Register Deallocation
- Vanadium-containing Silsesquioxane-catalyzed Photo-assisted Oxidation of Hydrocarbons
- Dielectric Study of Monoclinic RbD_2PO_4 in High-Temperature Phase
- Limits of Thread-Level Parallelism in Non-numerical Programs(System Evaluation)
- Register File Size Reduction through Instruction Pre-Execution Incorporating Value Prediction
- Fabrication of Two-Dimensional J-Aggregates on Au (111) Covered with Self-Assembled Cysteamine Monolayer
- Anterior cingulate activity during pain-avoidance and reward tasks in monkeys
- Neoadjuvant flutamide monotherapy for locally confined prostate cancer
- Three-dimensional flow of liquid crystalline polymers through rectangular channels with abrupt change in geometry
- Alternating Zonal Flows in a Two-Layer Wind-Driven Ocean
- Backward Flow of Mesonic Fluid in Heavy Ions Collision at Ultra High Energy : Particles and Fields
- Direct Asymmetric Aminoxylation Reaction Catalyzed by Axially Chiral Amino Acids
- kra-1, A GENE REQUIRED FOR KETAMINE RESPONSE IN THE NEMATODE Caenorhabditis elegans
- PI-36 Immunohistochemical Localization of PAF-receptor and Cross-talk between PAF- and ACTH-induced aldosterone secretion in Guinea Pig Adrenals
- Delay Evaluation of Issue Queue in Superscalar Processors with Banking Tag RAM and Correct Critical Path Identification
- Catalytic Asymmetric Synthesis of Isoxazoline-N-oxides through Conjugate Addition-Cyclization under Phase-Transfer Conditions
- Two-Step Physical Register Deallocation for Data Prefetching and Address Pre-Calculation
- Limits of Thread-Level Parallelism in Non-numerical Programs
- Mathematical Ecology Analysis of Geographical Distribution of Soybean-Nodulating Bradyrhizobia in Japan
- Synthesis and Crystal Structure of a Chain Complex of Molybdenum(II) Benzoate with 1,2-Bis(4-pyridyl)ethylene Having an N2-Adsorption Property
- Synthesis and Crystal Structure of a Chain Complex of Molybdenum(II) Benzoate and 1,2-Bis(4-pyridyl)ethane with an N2-Adsorption Property