Title:
The laws of "LOL": Computational approaches to sociolinguistic variation in online discussions

dc.contributor.advisor Eisenstein, Jacob
dc.contributor.advisor Yang, Diyi
dc.contributor.author Stewart, Ian Bruce
dc.contributor.committeeMember De Choudhury, Munmun
dc.contributor.committeeMember Riedl, Mark
dc.contributor.committeeMember Jurgens, David
dc.contributor.committeeMember Baldwin, Timothy
dc.contributor.department Computer Science
dc.date.accessioned 2022-01-14T16:02:21Z
dc.date.available 2022-01-14T16:02:21Z
dc.date.created 2020-12
dc.date.issued 2020-08-21
dc.date.submitted December 2020
dc.date.updated 2022-01-14T16:02:22Z
dc.description.abstract When speaking or writing, a person often chooses one form of language over another based on social constraints, including expectations in a conversation, participation in a global change, or expression of underlying attitudes. Sociolinguistic variation (e.g. choosing "going" versus "goin'") can reveal consistent social differences such as dialects and consistent social motivations such as audience design. While traditional sociolinguistics studies variation in spoken communication, computational sociolinguistics investigates written communication on social media. The structured nature of online discussions and the diversity of language patterns allow computational sociolinguists to test highly specific hypotheses about communication, such different configurations of listener "audience." Studying communication choices in online discussions sheds light on long-standing sociolinguistic questions that are hard to tackle, and helps social media platforms anticipate their members' complicated patterns of participation in conversations. To that end, this thesis explores open questions in sociolinguistic research by quantifying language variation patterns in online discussions. I leverage the "birds-eye" view of social media to focus on three major questions in sociolinguistics research relating to authors' participation in online discussions. First, I test the role of conversation expectations in the context of content bans and crisis events, and I show that authors vary their language to adjust to audience expectations in line with community standards and shared knowledge. Next, I investigate language change in online discussions and show that language structure, more than social context, explains word adoption. Lastly, I investigate the expression of social attitudes among multilingual speakers, and I find that such attitudes can explain language choice when the attitudes have a clear social meaning based on the discussion context. This thesis demonstrates the rich opportunities that social media provides for addressing sociolinguistic questions and provides insight into how people adapt to the communication affordances in online platforms.
dc.description.degree Ph.D.
dc.format.mimetype application/pdf
dc.identifier.uri http://hdl.handle.net/1853/65976
dc.language.iso en_US
dc.publisher Georgia Institute of Technology
dc.subject social computing
dc.subject sociolinguistics
dc.subject computational social science
dc.subject natural language processing
dc.title The laws of "LOL": Computational approaches to sociolinguistic variation in online discussions
dc.type Text
dc.type.genre Dissertation
dspace.entity.type Publication
local.contributor.advisor Eisenstein, Jacob
local.contributor.advisor Yang, Diyi
local.contributor.corporatename College of Computing
relation.isAdvisorOfPublication d2334908-9b54-40ce-9a5b-26987819dd65
relation.isAdvisorOfPublication 272fa3af-72ef-4d23-aadb-545c918c571f
relation.isOrgUnitOfPublication c8892b3c-8db6-4b7b-a33a-1b67f7db2021
thesis.degree.level Doctoral
Files
Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
Name:
STEWART-DISSERTATION-2020.pdf
Size:
1.6 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
LICENSE.txt
Size:
3.86 KB
Format:
Plain Text
Description: