<?xml version="1.0"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "JATS-journalpublishing1.dtd" [
]>
<article xml:lang="en" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML"
  dtd-version="1.2" article-type="abstract">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">IJPDS</journal-id>
      <journal-title-group>
        <journal-title>International Journal of Population Data Science</journal-title>
        <abbrev-journal-title>IJPDS</abbrev-journal-title>
      </journal-title-group>
      <issn pub-type="epub">2399-4908</issn>
      <publisher>
        <publisher-name>Swansea University</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.23889/ijpds.v9i5.2886</article-id>
      <article-id pub-id-type="publisher-id">9:5:394</article-id>
      <title-group>
        <article-title>The art of research cohort construction using administrative datasets</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Williamson</surname>
            <given-names initials="L">Lee</given-names>
          </name>
          <xref ref-type="aff" rid="affil-1">1</xref>
        </contrib>
      </contrib-group>
      <aff id="affil-1"><label>1</label><institution>University of Edinburgh</institution></aff>
      <pub-date date-type="pub" publication-format="electronic">
        <day>18</day>
        <month>09</month>
        <year>2024</year>
      </pub-date>
      <pub-date date-type="collection" publication-format="electronic">
        <year>2024</year>
      </pub-date>
      <volume>9</volume>
      <issue>5</issue>
      <elocation-id>2886</elocation-id>
      <permissions>
        <license license-type="open-access" xlink:href="https://creativecommons.org/licences/by/4.0/">
          <license-p>This work is licenced under a Creative Commons Attribution 4.0 International License.</license-p>
        </license>
      </permissions>
      <self-uri xlink:href="https://ijpds.org/article/view/2886">This article is available from the IJPDS website at: https://ijpds.org/article/view/2886</self-uri>
    </article-meta>
  </front>
  <body>
    <sec>
      <title>Objectives</title>
      <p>Using research case studies, I will give an overview as to how researchers can experience problems translating great research ideas, grounded in the relevant literature, into a robust research design. Assuming that the question cannot be reliably researched using small but rich sample surveys, I will present ways in which routine admin data can help in this context, along with the additional challenges posed when creating the correct cohort to address the research question.</p>
    </sec>
    <sec>
      <title>Approach</title>
      <p>The examples given are from a study which links together routinely collected administrative data for a representative sample of the population. It includes a wealth of information from census, civil registration data, education data, and with further permission health data can also be linked. The study in question is provided deidentified, and accessed within a data enclave. The size and scope of the study make it an unparalleled resource for analysing a range of socio-economic, demographic and health questions.</p>
    </sec>
    <sec>
      <title>Results</title>
      <p>I will demonstrate how despite the large number of study members, these cohorts must be carefully considered in order to research outcomes (events/results) due to inconsistency in the years that different admin data are available centrally in admin systems (health/education data). Examples include: life-course events for a cohort of women followed up to mid-life, and a complete sub-cohort followed up into later-life.</p>
    </sec>
    <sec>
      <title>Discussion</title>
      <p>To showcase exciting ways in which linked admin data can be operationalised to take a longitudinal/life-course approach to address research questions, and to emphasise cohort construction is complex.</p>
    </sec>
  </body>
</article>