Title:
Topk Queries across Multiple Private Databases

dc.contributor.author Xiong, Li
dc.contributor.author Chitti, Subramanyam B.
dc.contributor.author Liu, Ling
dc.date.accessioned 2005-03-08T19:13:05Z
dc.date.available 2005-03-08T19:13:05Z
dc.date.issued 2004
dc.description.abstract Advances in distributed service-oriented computing and global communications have formed a strong technology push for large scale data integration among organizations and enterprises. It is widely observed that multiple organizations in the same market sectors are actively competing as well as collaborating with constantly evolving alliances. Many such organizations want to find out the aggregation statistics about sales in the sector without disclosing sales data in their private databases. Privacy-preserving data sharing is becoming increasingly important for large scale mission-critical data integration applications. In this paper we present a decentralized peer-to-peer protocol for supporting statistics queries over multiple private databases while respecting privacy constraints of participants. Ideally, given a database query spanning multiple private databases, we wish to compute the answer to the query without revealing any additional information of each individual database apart from the query result. In practice, a popular approach is to relax this constraint to allow efficient information integration while minimizing the information disclosure. The paper has a number of unique contributions. First, we formalize the notion of loss of privacy in terms of information revealed and propose a data privacy metric. Second, we propose a novel probabilistic decentralized protocol for privacy preserving top k selection. Third, we perform a formal analysis of the protocol and also experimentally evaluate the protocol in terms of its correctness, efficiency and privacy characteristics. en
dc.format.extent 323863 bytes
dc.format.mimetype application/pdf
dc.identifier.uri http://hdl.handle.net/1853/5307
dc.language.iso en_US
dc.publisher Georgia Institute of Technology en
dc.relation.ispartofseries CERCS;GIT-CERCS-04-32
dc.subject Global communications en
dc.subject Data integration en
dc.subject Privacy-preserving data sharing en
dc.subject Peer-to-peer protocols en
dc.subject Service-oriented computing
dc.subject Distributed computing
dc.subject Top-k selection queries
dc.title Topk Queries across Multiple Private Databases en
dc.type Text
dc.type.genre Technical Report
dspace.entity.type Publication
local.contributor.author Liu, Ling
local.contributor.corporatename Center for Experimental Research in Computer Systems
local.relation.ispartofseries CERCS Technical Report Series
relation.isAuthorOfPublication 96391b98-ac42-4e2c-93ee-79a5e16c2dfb
relation.isOrgUnitOfPublication 1dd858c0-be27-47fd-873d-208407cf0794
relation.isSeriesOfPublication bc21f6b3-4b86-4b92-8b66-d65d59e12c54
Files
Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
Name:
git-cercs-04-32.pdf
Size:
316.27 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.86 KB
Format:
Item-specific license agreed upon to submission
Description: