Title:
PerfDB + PerlML: Enabling Big Data-Driven Research On Fine-Grained Performance Phenomena

dc.contributor.advisor Pu, Calton
dc.contributor.author Kimball, Joshua M.
dc.contributor.committeeMember Liu, Ling
dc.contributor.committeeMember Navathe, Shamkant
dc.contributor.committeeMember Arulraj, Joy
dc.contributor.committeeMember Wang, Qingyang
dc.contributor.department Computer Science
dc.date.accessioned 2021-09-15T15:42:23Z
dc.date.available 2021-09-15T15:42:23Z
dc.date.created 2021-08
dc.date.issued 2021-07-14
dc.date.submitted August 2021
dc.date.updated 2021-09-15T15:42:24Z
dc.description.abstract The long-tail latency problem is a well-known problem in large-scale system topologies like cloud platforms. Long-tail latency can lead to less predictable system performance, degraded quality of experience and potential economic loss. Previous research has focused on coarse- grained, symptomatic treatments like redundant request executions to mitigate tail latency and its effects. Instead, we propose studying these performance bugs systematically and addressing their underlying root cause. The millibottleneck theory of performance bugs provides a testable hypothesis for explaining at least some requests comprising the latency long tail. The theory posits that transient performance anomalies cause a non-negligible number of requests to complete in seconds, called Very Long Response Time Requests (VLRT), instead of tens of milliseconds like most other requests. In this dissertation, we enable the systematic evaluation of the millibottleneck theory across a big data-scale experimental data collection. First, we present perftables, a performance log parser, that extracts resource monitoring data across a wide variety of hardware and software configurations. Secondly, we use our data management system, PerfDB, to load and integrate fine-grained system performance data from approximately 400 experiments. We conduct the first-generation population study of VLRT, and our data support millibottlenecks inducing VLRT through CTQO (Cross-Tier Queue Overflow). We also enable the study of a second latency class called Less Long Requests (LLRs). Finally, we present our ensemble-based, supervised machine learning system, PerfML, that handles data characterized by heterogenous feature space and hierarchical, imbalanced classes—characteristics inherent to the data needed to study millibottlenecks and latency performance bugs. The analytics results from PerfML demonstrate its ability to isolate different kinds of millibottlenecks across a range of systems and configurations with high recall and acceptable precision.
dc.description.degree Ph.D.
dc.format.mimetype application/pdf
dc.identifier.uri http://hdl.handle.net/1853/65049
dc.publisher Georgia Institute of Technology
dc.subject Big data
dc.subject machine learning
dc.subject data management
dc.subject systems performance
dc.subject anomaly detection
dc.title PerfDB + PerlML: Enabling Big Data-Driven Research On Fine-Grained Performance Phenomena
dc.type Text
dc.type.genre Dissertation
dspace.entity.type Publication
local.contributor.advisor Pu, Calton
local.contributor.corporatename College of Computing
relation.isAdvisorOfPublication fc48a3de-da43-4d32-af59-414047eb7cd7
relation.isOrgUnitOfPublication c8892b3c-8db6-4b7b-a33a-1b67f7db2021
thesis.degree.level Doctoral
Files
Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
Name:
KIMBALL-DISSERTATION-2021.pdf
Size:
5.08 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
LICENSE.txt
Size:
3.87 KB
Format:
Plain Text
Description: