Telemetry and Modeling for Automatic Tuning in Apache Cassandra
January 1, 2019
A Capella Computer Science, 2018-19
Liaison(s): (not listed)
Advisor(s): Beth Trushkowsky
Student(s): Jonathan Cruz (PM-S), Carissa DeRanek, Lilly Liu, Jonathan Raygoza, Ashley Schmit (PM-F)
Databases, especially large-scale databases, are the “reactor core” powering today’s software services. The performance of large databases depends on many interactions between internal parameters and the varying external load they’re asked to handle. Our goal is to improve both the “sensing” and the “control” of Cassandra, a large-scale open-source database, adapting its operation based on changing conditions. The pipeline will use machine learning to guide parameter-tuning for the database, depending on operational and query patterns.