Bridging Processor and Memory Performance in ILP Processors via Data-Remapping

dc.contributor.author Rabbah, Rodric Michel en_US
dc.contributor.author Palem, Krishna V.
dc.date.accessioned 2005-06-17T17:43:19Z
dc.date.available 2005-06-17T17:43:19Z
dc.date.issued 2001 en_US
dc.description.abstract Current system design trends continue to magnify the disparity between processor and memory performance. Thus, as microprocessors perform increasingly better than the mem-ory systems supporting them, it is ever more important to bridge the performance gap to help translate the promise of Moore s law into overall performance delivered to the end applica-tions. This gap in performance between the processor and the memory is further exacerbated in the context of modern processors with high-levels of instruction level parallelism (ILP), especially for data-intensive applications. In these processors, increased demands for data delivery lead to concomitant needs for higher memory bandwidth and cache sizes. In this paper we provide a fast compile-time data-remapping technique which helps in bridging the gap between the ILP processor and its memory system, by enhancing the spatial locality of data-access. Our strategy is the first automatic approach applicable to pointer-intensive dy-namic applications for which existing optimizations are mostly inadequate. We demonstrate an average performance improvement of 27% for several data-intensive applications. This is attributed to enhanced data locality, resulting in lowered demand on the bandwidth between cache levels, as well as between the cache subsystem and main memory. We also show that with increasing levels of ILP and fixed memory bandwidth, our remapping technique enables very high levels of performance with smaller cache sizes. For example, as much as a factor of 15 reduction in multi-level caches can be tolerated without a loss in performance. Although we use cycle-accurate simulators to detail the benefits of our remapping, we also measure 24% performance improvements for the Intel Pentium II and III processors, and a 9% yield on the Sun UltraSparc-II. en_US
dc.format.extent 326058 bytes
dc.format.mimetype application/pdf
dc.identifier.uri http://hdl.handle.net/1853/6568
dc.language.iso en_US
dc.publisher Georgia Institute of Technology en_US
dc.relation.ispartofseries CC Technical Report; GIT-CC-01-14 en_US
dc.subject Performance
dc.subject Instruction level parallelism
dc.subject Data locality
dc.title Bridging Processor and Memory Performance in ILP Processors via Data-Remapping en_US
dc.type Text
dc.type.genre Technical Report
dspace.entity.type Publication
local.contributor.corporatename College of Computing
local.relation.ispartofseries College of Computing Technical Report Series
relation.isOrgUnitOfPublication c8892b3c-8db6-4b7b-a33a-1b67f7db2021
relation.isSeriesOfPublication 35c9e8fc-dd67-4201-b1d5-016381ef65b8
Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
318.42 KB
Adobe Portable Document Format