Decoupling Address Generation from Loads and Stores to Improve Data Access Energy Efficiency
Level-one data cache (L1 DC) accesses impact energy usage as they frequently occur and use significantly more energy than register file accesses. A memory access instruction consists of an address generation operation calculating the location where the data item resides in memory and the data access operation that loads/stores a value from/to that location. We propose to decouple these two operations into separate machine instructions to reduce energy usage. By associating the data translation lookaside buffer (DTLB) access and level-one data cache (L1 DC) tag check with an address generation instruction, only a single data array in a set-associative L1 DC needs to be accessed during a load instruction when the result of the tag check is known at that point. In addition, many DTLB accesses and L1 DC tag checks are avoided by memoizing the DTLB way and L1 DC way with the register that holds the memory address to be dereferenced. Finally, we are able to often coalesce an ALU operation with a load or store data access using our technique to reduce the number of instructions executed.
Conference DayTue 19 JunDisplayed time zone: Eastern Time (US & Canada) change
14:00 - 15:40
|Adaptive Deep Learning Model Selection on Embedded Systems|
|Optimizing RAID/SSD Controllers with Lifetime Extension for Flash-based SSD Array|
|Verification of Coarse-Grained Reconfigurable Arrays through Random Test Programs|
|Decoupling Address Generation from Loads and Stores to Improve Data Access Energy Efficiency|