CS Students Publish Reading Skill Study
June 26, 2019Share story
Last summer, computer science professor Julie Medero, along with Alfredo Gomez 鈥21, Alicia Ngo 鈥20 and Ali Otondo 鈥20 (aka The A Team), embarked on a project to develop an iOS application that would help children improve their reading skills. The resulting research paper, 鈥淩eading KiTTY: Pitch Range as an Indicator of Reading Skill,鈥 has been accepted to the Widening Natural Language Processing (WiNLP) workshop, to be held during the Association for Computational Linguistics conference later this summer in Florence, Italy.
鈥淭his paper is about an analysis of the prosody of children鈥檚 oral reading,鈥 says Medero (prosody refers to the patterns of rhythm and sound in text, e.g., pitch, reading speed, emotionality). 鈥淭hat means we鈥檙e looking at how children of different reading levels use their voices as part of reading out loud. Our lab鈥檚 Reading KiTTY project is looking at how elementary-aged children could be guided through the creation of kinetic typography animations that visualize their reading out loud. Kinetic typography is a form of animation that uses size, color and motion of text, along with images, to represent the meaning of a text. It鈥檚 popular in music videos and is also commonly used in videos of famous speeches, but we think it has the potential for interesting applications in literacy education, too.鈥
鈥淭he research focuses on two aspects,鈥 says Gomez, who will make a presentation at the workshop, 鈥渃reating the first iteration of the app and exploring how we could leverage natural language processing and speech processing in order effectively promote creativity. The paper accepted by WiNLP focuses on our work using pitch range as an indicator for reading skill as we apply聽machine learning and other computational linguistics techniques.鈥
Ngo explains how the app works: 鈥淔irst, children read aloud for the app.聽After they finish reading, children can see their words come to life through kinetic typography. For example, if a child reads a question prosodically, the kinetic typography should show a pitch rise at the end of the sentence, perhaps by positioning the high-pitched letters higher (along the vertical axis) than the other letters. Using our results, we intend to provide teachers with unique feedback of each student鈥檚 prosody, and therefore, each student鈥檚 reading comprehension.鈥
Ngo continues, 鈥淎lfredo and I trained a machine-learning model with scikit-learn to predict the presence of a high pitch in a text, similar to developing text-to-speech software. After realizing that was really difficult (Google hasn鈥檛 even figured this one out completely yet), we decided to conduct some more data analyses to understand our dataset. We analyzed and extracted data from 5,000+ audio files of children reading, using Natural Language Toolkit and NLP software (AuToBI and Praat). We found that there is a statistically significant聽difference聽in the average pitch range between聽skilled聽readers and struggling readers as they read聽sentences.鈥
The researchers hope to add more ways to gauge a student鈥檚 reading comprehension to include reading speed, emotionality/enthusiasm and the reader鈥檚 pause lengths between words.
鈥淭his project is important because teachers often don鈥檛 have the time to listen to each individual student read,鈥 says Ngo. 鈥淩eading KiTTY can provide unique feedback for each student, so that teachers can focus on聽improving聽their students鈥 reading comprehension instead of merely聽measuring聽their students鈥 reading comprehension.鈥