Integrating Supermarket Behavioural Data with Established Indicators to Improve Alcohol-Related Mortality Prediction in England

Main Article Content

Raphael Derecki
James Goulding
Elizabeth Dolan
Brian O'Shea

Abstract

Introduction & Background
Within behavioural sciences, there has been a resurgence in not only explaining factors but also predicting them with precision that exceeds classic social science standards. In England (UK), alcohol is a ubiquitously used substance with cultural significance but also many potential issues related to consumption. High levels of alcohol usage are associated with a range of psychosocial issues including addiction, criminality, poverty, and ultimately death.


Objectives & Approach
While our ultimate aim is to examine and predict problematic alcohol usage (i.e., consumption of more than 35 units per week), our preliminary results examine alcohol-related deaths as a simpler dependent variable to assess the robustness of our approach.


Using modern data scientific techniques, we provide a machine learning workflow for predicting alcohol deaths at the lower-tier local authority level using a range of literature-derived variables including psychosocial and environmental demographics, trade sales data, and novel supermarket behavioural customer data (i.e., alcohol purchases). Within this workflow, we discuss the importance of feature creation, selection, and engineering; correct sampling standards (i.e., cross-validation); and model class selection, implementation, and validation.


Relevance to Digital Footprints
This study uses data from a large UK supermarket to quantify alcohol purchasing rates within lower-tier local authority levels.


Conclusions & Implications
Overall, we find that incorporating a range of variables, including novel supermarket behavioural data, enhances the prediction of alcohol deaths. While machine learning methods are often criticized for their potentially low interpretability, particularly with more complex model classes, we implement a range of explainable AI techniques to elucidate variable importance.

Article Details

How to Cite
Derecki, R., Goulding, J., Dolan, E. and O'Shea, B. (2025) “Integrating Supermarket Behavioural Data with Established Indicators to Improve Alcohol-Related Mortality Prediction in England”, International Journal of Population Data Science, 10(5). doi: 10.23889/ijpds.v10i5.3329.

Most read articles by the same author(s)

1 2 3 > >>