Title:
Model, predict, and mitigate scalability bottlenecks for parallel application on many-core processors

dc.contributor.advisor Prvulovic, Milos
dc.contributor.author Liang, Ching-Kai
dc.contributor.committeeMember Kim, Hyesoon
dc.contributor.committeeMember Qureshi, Moinuddin K.
dc.contributor.committeeMember Yalamanchili, Sudhakar
dc.contributor.committeeMember Hughes, Christopher J.
dc.contributor.department Computer Science
dc.date.accessioned 2019-08-21T13:50:17Z
dc.date.available 2019-08-21T13:50:17Z
dc.date.created 2018-08
dc.date.issued 2018-07-27
dc.date.submitted August 2018
dc.date.updated 2019-08-21T13:50:17Z
dc.description.abstract Ensuring the continuous scaling of parallel applications is challenging on many-core processors, due to the complex relationship of available parallelism in application and the limited shared on-chip resources. Two main bottlenecks that limit the scalability of parallel applications are synchronization and memory bandwidth. With this thesis, I proposed MiSAR, a minimalistic synchronization accelerator (MSA) that supports all three commonly used synchronization (locks, barriers, and condition variables), and a novel overflow management unit (OMU) that dynamically manages its (very) limited hardware synchronization resources. The OMU allows safe and efficient dynamic transitions between using hardware (MSA) and software synchronization implementations. Along with MSA, our proposed hardware synchronization accelerator was able to reduce the impact of synchronization latency on the scaling of parallel applications. In this thesis, we also proposed a new performance model that captures program characteristics of multi-threaded applications, allowing it to use few-threaded runs along with small input sets to predict performance of many-threaded runs with large input sets. Our model considers the effect of increasing memory bandwidth demand and workload imbalance, as well as the increase in lock contention. Results show that our model can accurately predict the parallel speedup of an application with increasing thread count and identify the scalability bottlenecks that are limiting the scaling of an application.
dc.description.degree Ph.D.
dc.format.mimetype application/pdf
dc.identifier.uri http://hdl.handle.net/1853/61667
dc.language.iso en_US
dc.publisher Georgia Institute of Technology
dc.subject Parallel Applications Many-core processor
dc.subject Performance modeling
dc.subject Lock contention
dc.subject Synchronization
dc.subject Synchronization Accelerator
dc.subject Application Scaling
dc.title Model, predict, and mitigate scalability bottlenecks for parallel application on many-core processors
dc.type Text
dc.type.genre Dissertation
dspace.entity.type Publication
local.contributor.advisor Prvulovic, Milos
local.contributor.corporatename College of Computing
relation.isAdvisorOfPublication 2d678067-bb81-43c7-be94-bd87bced736e
relation.isOrgUnitOfPublication c8892b3c-8db6-4b7b-a33a-1b67f7db2021
thesis.degree.level Doctoral
Files
Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
Name:
LIANG-DISSERTATION-2018.pdf
Size:
2.24 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
LICENSE.txt
Size:
3.87 KB
Format:
Plain Text
Description: