Exact, Fast and Flexible L1 Cache Configuration Simulation for Embedded Systems
Abstract
Because an embedded processor runs only a limited set of target applications, its cache configuration can be optimized in terms of three parameters: the number of sets, the block size, and the associativity. An extremely fast cache configuration simulation method, CRCB (Configuration Reduction approach by the Cache Behavior), has recently been proposed; it calculates cache hit/miss counts accurately for the possible cache configurations obtained by varying these three parameters. The CRCB method assumes an LRU-based (Least Recently Used) cache, but many recent processors use a FIFO-based (First In First Out) or PLRU-based (Pseudo-LRU) cache because of hardware cost. In this paper, we propose exact and fast L1 cache configuration simulation algorithms for embedded applications that use PLRU or FIFO as the cache replacement policy. First, we prove that the CRCB method can be applied not only to LRU but also to other cache replacement policies, including FIFO and PLRU. Second, we prove several properties of FIFO- and PLRU-based caches and propose associated cache simulation algorithms that accurately simulate more than one cache configuration with different associativities simultaneously for FIFO or PLRU. Finally, experimental results demonstrate that our cache configuration simulation algorithms obtain accurate cache hit/miss counts and run up to 249 times faster than a conventional cache simulator.
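For orientation, the following is a minimal sketch of the conventional baseline that the abstract compares against: a trace-driven simulator that evaluates one configuration (number of sets, block size, associativity) at a time under LRU or FIFO. It is not the CRCB method and not the algorithms proposed in the paper. The function name `simulate_cache`, its parameters, and the toy address trace are illustrative assumptions, and PLRU is omitted for brevity.

```python
from collections import OrderedDict


def simulate_cache(trace, num_sets, block_size, assoc, policy="LRU"):
    """Return (hits, misses) for one cache configuration over an address trace.

    Hypothetical helper for illustration only; it simulates a single
    configuration per call, which is what makes exhaustive exploration slow.
    """
    if policy not in ("LRU", "FIFO"):
        raise ValueError("policy must be 'LRU' or 'FIFO'")
    # One ordered dict per set; the first key is always the next victim.
    sets = [OrderedDict() for _ in range(num_sets)]
    hits = misses = 0
    for addr in trace:
        block = addr // block_size   # drop the block offset
        index = block % num_sets     # set index
        tag = block // num_sets      # remaining high-order part
        ways = sets[index]
        if tag in ways:
            hits += 1
            if policy == "LRU":
                ways.move_to_end(tag)  # refresh recency; FIFO keeps insertion order
        else:
            misses += 1
            if len(ways) >= assoc:
                ways.popitem(last=False)  # evict the entry at the front
            ways[tag] = True
    return hits, misses


# Toy trace (assumed): blocks A, B, A, C, A mapped to a single 2-way set.
trace = [0, 16, 0, 32, 0]
lru_result = simulate_cache(trace, num_sets=1, block_size=16, assoc=2, policy="LRU")
fifo_result = simulate_cache(trace, num_sets=1, block_size=16, assoc=2, policy="FIFO")
print(lru_result, fifo_result)  # (2, 3) (1, 4)
```

On this trace the two policies already disagree: the hit on block A refreshes its position under LRU but not under FIFO, so FIFO evicts A when C arrives. Exploring the design space with such a simulator means one full pass over the trace per configuration, which is the cost that simultaneous multi-configuration simulation aims to avoid.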
Authors
- Ohtsuki Tatsuo, Department of Computer Science and Engineering, Waseda University
- Togawa Nozomu, Department of Computer Science and Engineering, Waseda University
- Yanagisawa Masao, Department of Cardiology, Nippon Medical School
- Tawada Masashi, Department of Computer Science and Engineering, Waseda University