Title:
Understanding and circumventing censorship on Chinese social media

dc.contributor.advisor Gilbert, Eric
dc.contributor.author Hiruncharoenvate, Chaya
dc.contributor.committeeMember Bruckman, Amy
dc.contributor.committeeMember De Choudhury, Munmun
dc.contributor.committeeMember Edwards, Keith
dc.contributor.committeeMember Mark, Gloria
dc.contributor.department Interactive Computing
dc.date.accessioned 2017-06-07T17:40:56Z
dc.date.available 2017-06-07T17:40:56Z
dc.date.created 2017-05
dc.date.issued 2017-04-05
dc.date.submitted May 2017
dc.date.updated 2017-06-07T17:40:56Z
dc.description.abstract Chinese Internet users not only face the most technologically advanced filtering system in the world, the Great Firewall of China, but also are under the watchful eyes of the repressive government that controls every layer of their communications. Although social networking sites such as Facebook and Twitter are blocked in China, Chinese Internet users have the local replicas such as WeChat and Sina Weibo to communicate with others. However, these sites employ both advanced keyword detection algorithms and human censors to filter any kind of inappropriate content. While previous research has explored the technology behind the censorship mechanisms, little work has focused on the effects of censorship on online and offline behaviors. In this thesis, I bridge this gap by conducting a mixed-method study to gain a deeper understanding of these effects. The results of the mixed-method study show that censorship has strong off-platform effects, which are not detectable from usage logs. Users deliberately self-censor their speech out of caution, because they do not have a clear understanding of what content is being censored and what risks are associated with censorship on Chinese social media. Although on-platform effects of censorship are present on social media usage logs, they wear out over time. Informed by these results, I attempt to provide social media users a better understanding of how the censorship mechanism works and an effective censorship circumvention technique, both of which will lead to greater freedom of expression among social media users. Digital activists have long employed homophones of censored keywords to avoid detection by keyword matching algorithms on Chinese social media. One part of this thesis demonstrates that it is possible to scale this technique up in ways that are costly and difficult to defend against because human censors must manually read through all social media posts. Specifically, I developed a non-deterministic algorithm for generating homophones that creates large numbers of false positives for censors. In experiments, the algorithm allows homophone-transformed posts to remain on Sina Weibo three times longer than their previously censored counterparts without creating any confusion to native Chinese speakers. Extrapolating from this work, I employed this algorithm in the development of CENSE, a real-time system that Chinese social media users can use to easily detect and replace censored keywords with homophones. The results of a formative interview study indicate a welcoming response from Chinese social media users to the concept of a censorship circumvention tool. Overall, the contributions of my research bridge the areas of Internet censorship and censorship circumvention technologies. The mixed-method study provides a better understanding of how censorship affects social media users. Additionally, the homophone transformation algorithm and CENSE, a real-time censorship circumvention tool, aid users in experiencing increased freedom of expression on Chinese social media.
dc.description.degree Ph.D.
dc.format.mimetype application/pdf
dc.identifier.uri http://hdl.handle.net/1853/58263
dc.language.iso en_US
dc.publisher Georgia Institute of Technology
dc.subject Social computing
dc.subject Social media
dc.subject Censorship
dc.subject Internet censorship
dc.subject China
dc.subject Censorship circumvention
dc.title Understanding and circumventing censorship on Chinese social media
dc.type Text
dc.type.genre Dissertation
dspace.entity.type Publication
local.contributor.corporatename College of Computing
local.contributor.corporatename School of Interactive Computing
relation.isOrgUnitOfPublication c8892b3c-8db6-4b7b-a33a-1b67f7db2021
relation.isOrgUnitOfPublication aac3f010-e629-4d08-8276-81143eeaf5cc
thesis.degree.level Doctoral
Files
Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
Name:
HIRUNCHAROENVATE-DISSERTATION-2017.pdf
Size:
8.61 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
LICENSE.txt
Size:
3.87 KB
Format:
Plain Text
Description: