NEW YORK – Depression is one of the most common mental health issues in the United States, affecting the lives of over 17 million Americans suffering from it with over 800,000 people dying from suicide annually, according to the World Health Organization (WHO). While long held stigmas around mental health are being torn down, dependency on technology to accurately predict depression is on the rise. Recent advances in mobile sensing technologies and machine learning have sparked hope and optimism among scientists that claim predictive modeling could revolutionize the way depression assessments are conducted.
But research from Sandra Matz, the David W. Zalaznick Associate Professor of Business at Columbia Business School, warns that the swift shift to technologies like GPS sensors and machine learning as predictors – without first validating them in the general population – is having a lasting impact on accuracy.
The Study
Depression predictions from GPS-based mobility do not generalize well to large demographically heterogeneous samples, that was led by Sandrine Müller and Xi (Leslie) Chen from Columbia University, analyses over 57 million GPS data points to show that the same procedure that leads to high prediction accuracy (approx. 80%) in homogeneous student samples, leads to much lower accuracies in a U.S.-wide sample using heterogeneous socio-demographics (approx. 60%). Notable, even training the classification model on more homogeneous subsamples -- including similar age, similar city, and daily schedules -- did not substantially improve prediction accuracy either.
Overall, the researchers’ findings challenge the notion of applying mobility-based predictions of depression at scale. While the findings suggest that mobility patterns can indeed reveal information about mental health, they suggest that findings generated on small, homogenous student populations might not easily generalize to the broader population. Without proper validation, the implementation of smartphone-based diagnostic tools could, in fact, cause more harm than good: Inaccurate diagnostic output could both prevent individuals from seeking out the right venues for further diagnostic assessment and treatment as well as unnecessarily bind scarce resources.
To learn more about the cutting-edge research being conducted at Columbia Business School, please visit www.gsb.columbia.edu.
###