Information is increasingly digital, creating opportunities to respond to pressing issues about human populations in near real time using linked datasets that are large, complex, and diverse. The potential social and individual benefits of data-intensive science are large, but realizing them raises three challenges: balancing individual privacy and the public good; building appropriate socio-technical systems to support data-intensive science; and determining whether defining a new field of inquiry might help move those collective interests and activities forward. A combination of expert engagement, literature review, and iterative conversations led to our conclusion that defining the field of Population Data Science (challenge 3) will help address the other two challenges as well. We define Population Data Science succinctly as the science of data about people and note that it is related to but distinct from the fields of data science and informatics. A broader definition names four characteristics: data use for positive impact on citizens and society; bringing together and analyzing data from multiple sources; finding population-level insights; and developing safe, privacy-sensitive, and ethical infrastructure to support research. One implication of these characteristics is that few people possess all of the requisite knowledge and skills of Population Data Science, so this is by nature a multi-disciplinary field. Other implications include the need to advance various aspects of science, such as data linkage technology, various forms of analytics, and methods of public engagement. These implications are the beginnings of a research agenda for Population Data Science, which, if approached as a collective field, can catalyze significant advances in our understanding of trends in society, health, and human behavior.