Title:
Detecting Communities from Given Seeds in Social Networks

dc.contributor.author Riedy, Jason
dc.contributor.author Bader, David A.
dc.contributor.author Jiang, Karl
dc.contributor.author Pande, Pushkar
dc.contributor.author Sharma, Richa
dc.contributor.corporatename Georgia Institute of Technology. College of Computing
dc.contributor.corporatename Georgia Institute of Technology. School of Computational Science and Engineering
dc.date.accessioned 2011-02-23T23:10:40Z
dc.date.available 2011-02-23T23:10:40Z
dc.date.issued 2011-02-22
dc.description.abstract Analyzing massive social networks challenges both high-performance computers and human under- standing. These massive networks cannot be visualized easily, and their scale makes applying complex analysis methods computationally expensive. We present a region-growing method for finding a smaller, more tractable subgraph, a community, given a few example seed vertices. Unlike existing work, we focus on a small number of seed vertices, from two to a few dozen. We also present the first comparison between five algorithms for expanding a small seed set into a community. Our comparison applies these algorithms to an R-MAT generated graph component with 240 thousand vertices and 32 million edges and evaluates the community size, modularity, Kullback-Leibler divergence, conductance, and clustering coefficient. We find that our new algorithm with a local modularity maximizing heuristic based on Clauset, Newman, and Moore performs very well when the output is limited to 100 or 1000 vertices. When run without a vertex size limit, a heuristic from McCloskey and Bader generates communities containing around 60% of the graph's vertices and having a small conductance and modularity appropriate to the result size. A personalized PageRank algorithm based on Andersen, Lang, and Chung also performs well with respect to our metrics. en_US
dc.identifier.uri http://hdl.handle.net/1853/36980
dc.language.iso en_US en_US
dc.publisher Georgia Institute of Technology en_US
dc.relation.ispartofseries CSE Technical Reports ; GT-CSE-11-01 en_US
dc.subject Conductance en_US
dc.subject Modularity en_US
dc.subject Random walk en_US
dc.subject Region-growing algorithm en_US
dc.subject Seed set en_US
dc.subject Social networks en_US
dc.subject Vertices en_US
dc.title Detecting Communities from Given Seeds in Social Networks en_US
dc.type Text
dc.type.genre Technical Report
dspace.entity.type Publication
local.contributor.corporatename College of Computing
local.contributor.corporatename School of Computational Science and Engineering
local.relation.ispartofseries College of Computing Technical Report Series
local.relation.ispartofseries School of Computational Science and Engineering Technical Report Series
relation.isOrgUnitOfPublication c8892b3c-8db6-4b7b-a33a-1b67f7db2021
relation.isOrgUnitOfPublication 01ab2ef1-c6da-49c9-be98-fbd1d840d2b1
relation.isSeriesOfPublication 35c9e8fc-dd67-4201-b1d5-016381ef65b8
relation.isSeriesOfPublication 5a01f926-96af-453d-a75b-abc3e0f0abb3
Files
Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
Name:
GT-CSE-11-01.pdf
Size:
3.2 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.76 KB
Format:
Item-specific license agreed upon to submission
Description: