Classifying fine-grained smartphone interaction states, such as typing and scrolling, directly from raw sensor streams, and predicting typing metrics from them with deep learning.
Knowing how a person physically interacts with their phone, not just which apps they open, is a useful signal for digital phenotyping. This project builds and benchmarks models that infer interaction states (typing, scrolling, other) from smartphone sensor data, and separately predict continuous typing metrics like keystroke count. It grew out of the behavioral-sensing work I did in wearables and mobile health.
The sampling-rate sweep matters for deployment: a model that is nearly as accurate at 2 Hz as at 60 Hz can run continuously on a phone without draining the battery. By identifying XGBoost as the best classifier and measuring the accuracy versus rate trade-off, the work points toward interaction-aware sensing that is cheap enough to run all day. That is the basis for using everyday phone behavior as a passive health signal.