Title:
On the Architectural Requirements for Efficient Execution of Graph Algorithms

dc.contributor.author Bader, David A.
dc.contributor.author Cong, Guojing
dc.date.accessioned 2007-05-15T20:28:03Z
dc.date.available 2007-05-15T20:28:03Z
dc.date.issued 2006-02-25
dc.description.abstract Combinatorial problems such as those from graph theory pose serious challenges for parallel machines due to non-contiguous, concurrent accesses to global data structures with low degrees of locality. The hierarchical memory systems of symmetric multiprocessor (SMP) clusters optimize for local, contiguous memory accesses, and so are inefficient platforms for such algorithms. Few parallel graph algorithms outperform their best sequential implementation on SMP clusters due to long memory latencies and high synchronization costs. In this paper, we consider the performance and scalability of two graph algorithms, list ranking and connected components, on two classes of shared-memory computers: symmetric multiprocessors such as the Sun Enterprise servers and multithreaded architectures (MTA) such as the Cray MTA-2. While previous studies have shown that parallel graph algorithms can speedup on SMPs, the systems' reliance on cache microprocessors limits performance. The MTA's latency tolerant processors and hardware support for fine-grain synchronization makes performance a function of parallelism. Since parallel graph algorithms have an abundance of parallelism, they perform and scale significantly better on the MTA. We describe and give a performance model for each architecture. We analyze the performance of the two algorithms and discuss how the features of each architecture affects algorithm development, ease of programming, performance, and scalability. en
dc.description.sponsorship This work was supported in part by NSF Grants CAREER ACI-00-93039, ITR ACI-00-81404, DEB-99- 10123, ITR EIA-01-21377, Biocomplexity DEB-01-20709, DBI-0420513, ITR EF/BIO 03-31654; and DARPA Contract NBCH30390004. en
dc.identifier.uri http://hdl.handle.net/1853/14383
dc.language.iso en_US en
dc.publisher Georgia Institute of Technology en
dc.relation.ispartofseries CSE Technical Reports; GT-CSE-06-04 en
dc.subject Connected components en
dc.subject Graph algorithms en
dc.subject List ranking en
dc.subject Multithreading en
dc.subject Shared memory en
dc.title On the Architectural Requirements for Efficient Execution of Graph Algorithms en
dc.type Text
dc.type.genre Technical Report
dspace.entity.type Publication
local.contributor.corporatename College of Computing
local.contributor.corporatename School of Computational Science and Engineering
local.relation.ispartofseries College of Computing Technical Report Series
local.relation.ispartofseries School of Computational Science and Engineering Technical Report Series
relation.isOrgUnitOfPublication c8892b3c-8db6-4b7b-a33a-1b67f7db2021
relation.isOrgUnitOfPublication 01ab2ef1-c6da-49c9-be98-fbd1d840d2b1
relation.isSeriesOfPublication 35c9e8fc-dd67-4201-b1d5-016381ef65b8
relation.isSeriesOfPublication 5a01f926-96af-453d-a75b-abc3e0f0abb3
Files
Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
Name:
GT-CSE-06-04.pdf
Size:
409.7 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.86 KB
Format:
Item-specific license agreed upon to submission
Description: