<?xml version="1.0"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "JATS-journalpublishing1.dtd" [
]>
<article xml:lang="en" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
  xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML"
  dtd-version="1.2" article-type="abstract">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">IJPDS</journal-id>
      <journal-title-group>
        <journal-title>International Journal of Population Data Science</journal-title>
        <abbrev-journal-title>IJPDS</abbrev-journal-title>
      </journal-title-group>
      <issn pub-type="epub">2399-4908</issn>
      <publisher>
        <publisher-name>Swansea University</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.23889/ijpds.v10i3.3127</article-id>
      <article-id pub-id-type="publisher-id">10:3:107</article-id>
      <title-group>
        <article-title>Linking administrative and Census 2021 data in Wales, UK: A cross-sectional
          study examining completeness and representativeness for population linkage analytics.</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Johnson</surname>
            <given-names initials="R">Rhodri</given-names>
          </name>
          <xref ref-type="aff" rid="affil-1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Lyons</surname>
            <given-names initials="J">Jane</given-names>
          </name>
          <xref ref-type="aff" rid="affil-1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Edwards</surname>
            <given-names initials="M">Michael</given-names>
          </name>
          <xref ref-type="aff" rid="affil-1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Turner</surname>
            <given-names initials="S">Samantha</given-names>
          </name>
          <xref ref-type="aff" rid="affil-1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Fry</surname>
            <given-names initials="R">Richard</given-names>
          </name>
          <xref ref-type="aff" rid="affil-1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Griffiths</surname>
            <given-names initials="L">Lucy</given-names>
          </name>
          <xref ref-type="aff" rid="affil-1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Lyons</surname>
            <given-names initials="R">Ronan</given-names>
          </name>
          <xref ref-type="aff" rid="affil-1">1</xref>
        </contrib>
      </contrib-group>
      <aff id="affil-1"><label>1</label><institution>Swansea University, Swansea, United Kingdom</institution></aff>
      <pub-date date-type="pub" publication-format="electronic">
        <day>01</day>
        <month>06</month>
        <year>2025</year>
      </pub-date>
      <pub-date date-type="collection" publication-format="electronic">
        <year>2025</year>
      </pub-date>
      <volume>8</volume>
      <issue>4</issue>
      <elocation-id>3127</elocation-id>
      <permissions>
        <license license-type="open-access"
          xlink:href="https://creativecommons.org/licences/by/4.0/">
          <license-p>This work is licenced under a Creative Commons Attribution 4.0 International
            License.</license-p>
        </license>
      </permissions>
      <self-uri xlink:href="https://ijpds.org/article/view/3127">This article is available from the
        IJPDS website at: https://ijpds.org/article/view/3127</self-uri>
    </article-meta>
  </front>
  <body>
    <sec>
      <title>Objectives</title>
      <p> To explore the completeness and representativeness of Census 2021 data linkage within the
        Secure Anonymised Information Linkage (SAIL) Databank for research on the population of
        Wales, UK and, understand which subgroups of the population are disproportionately
        represented in data linkage population-wide studies.</p>
    </sec>
    <sec>
      <title>Methods</title>
      <p>An observational, population-wide cross-sectional comparison study, utilising
        administrative demographic data and decennial survey data held in SAIL. Two linked data
        sources, the Welsh Demographic Service Dataset (WDSD) and Census 2021, were used to create
        and compare two cohorts of the resident population of Wales, UK, on 21st March 2021. </p>
      <p>The two cohorts were linked together to provide understanding on how many individuals from
        Census 2021 can be successfully linked within SAIL and found across both sources. We
        utilised logistic regression models to analyse the variation in the linkability of the
        survey data within SAIL by various demographic and household characteristics. </p>
    </sec>
    <sec>
      <title>Results</title>
      <p>In total, 3,090,976 individuals were present in the WDSD population, 2,965,196 individuals
        in the Census population, 2,440,191 individuals found in both, with 650,785 and 525,005
        individuals found only in WDSD and Census respectively. Focussing on the multivariate
        logistic regression analysis (n= 2,415,260, aged 16+ and non-communal establishment
        resident), being male (OR=1.28 [95%CI 1.28,1.32]), aged 75+ years (OR=1.27 [95%CI
        1.25,1.29]), of Asian ethnicity (OR=1.27 [95%CI 1.24,1.30]), a more recent migrant (arriving
        to UK after 2000) (OR= 1.30 [95%CI 1.28,1.32]), member of the LGBTQ+ community (OR=1.29,
        [95%CI 1.25,1.29]) or not disclosing LGBTQ+ status (OR=1.41 [95%CI 1.39,1.43]), separated,
        divorced or widowed (OR=1.28 [95%CI 1.27,1.29]), or living in rental accommodation (OR=1.47
        [95%CI 1.45,1.48]) were the characteristics associated with the highest odds of not having
        Census linkable data in SAIL.</p>
    </sec>
    <sec>
      <title>Conclusion</title>
      <p>Results show that certain personal characteristics and sub-groups of the population of
        Wales are disproportionately represented when combining population estimates and utilising
        Census data in data linkage population-wide studies in SAIL. This is an important finding
        for researchers to understand when carrying out future linked research on the Welsh
        population.</p>
    </sec>
  </body>
</article>