<?xml version="1.0"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "JATS-journalpublishing1.dtd" [
]>
<article xml:lang="en" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
  xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML"
  dtd-version="1.2" article-type="abstract">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">IJPDS</journal-id>
      <journal-title-group>
        <journal-title>International Journal of Population Data Science</journal-title>
        <abbrev-journal-title>IJPDS</abbrev-journal-title>
      </journal-title-group>
      <issn pub-type="epub">2399-4908</issn>
      <publisher>
        <publisher-name>Swansea University</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.23889/ijpds.v10i3.3247</article-id>
      <article-id pub-id-type="publisher-id">10:3:212</article-id>
      <title-group>
        <article-title>How the RDMF enhances the use of Administrative Data for analysis for the
          public good</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Goode</surname>
            <given-names initials="H">Hannah</given-names>
          </name>
          <xref ref-type="aff" rid="affil-1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Kellow</surname>
            <given-names initials="E">Elias</given-names>
          </name>
          <xref ref-type="aff" rid="affil-1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Baird</surname>
            <given-names initials="C">Charles</given-names>
          </name>
          <xref ref-type="aff" rid="affil-1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Colquitt</surname>
            <given-names initials="R">Rachael</given-names>
          </name>
          <xref ref-type="aff" rid="affil-1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Mavron</surname>
            <given-names initials="N">Nick</given-names>
          </name>
          <xref ref-type="aff" rid="affil-1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Brooker</surname>
            <given-names initials="C">Chris</given-names>
          </name>
          <xref ref-type="aff" rid="affil-1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Cobbledick</surname>
            <given-names initials="D">David</given-names>
          </name>
          <xref ref-type="aff" rid="affil-1">1</xref>
        </contrib>
      </contrib-group>
      <aff id="affil-1"><label>1</label><institution>The Office for National Statistics, Newport,
        United Kingdom</institution></aff>
      <pub-date date-type="pub" publication-format="electronic">
        <day>01</day>
        <month>06</month>
        <year>2025</year>
      </pub-date>
      <pub-date date-type="collection" publication-format="electronic">
        <year>2025</year>
      </pub-date>
      <volume>8</volume>
      <issue>4</issue>
      <elocation-id>3247</elocation-id>
      <permissions>
        <license license-type="open-access"
          xlink:href="https://creativecommons.org/licences/by/4.0/">
          <license-p>This work is licenced under a Creative Commons Attribution 4.0 International
            License.</license-p>
        </license>
      </permissions>
      <self-uri xlink:href="https://ijpds.org/article/view/3247">This article is available from the
        IJPDS website at: https://ijpds.org/article/view/3247</self-uri>
    </article-meta>
  </front>
  <body>
    <sec>
      <title>Objective</title>
      <p>The Reference Data Management Framework (RDMF) connects and unifies linked datasets to
        create unique indexes for individuals, businesses and locations in the UK. It enables
        expanded research via secure data matching services, linking data from different data
        sources, reliable de-identification and other expanded analytical capabilities.</p>
    </sec>
    <sec>
      <title>Method</title>
      <p>The RDMF uses a series of different approaches to linkage. Fully automated linkage is a
        direct match on a unique identifier for example a National Insurance Number. Rule based
        matching applies a set of matching rules which are ordered hierarchically, where the
        subsequent set of rules is used in the next iteration of linkage. Probabilistic matching
        calculates the score or probability that two records are a match based on matching agreement
        weights between variables and calculated through gradient descent. Records are also matched
        clerically when a computer is unable to decide if two records are a match, a human can
        decide.</p>
    </sec>
    <sec>
      <title>Results</title>
      <p>The four indexes created in the RDMF contain one record for each person DI-Demographic
        Index, business BI-Business Index, location LI-Location Index and industrial and
        occupational classification CI-Classification Index.</p>
      <p>The RDMF is a multi-tool that uses those indexes to allow faster, more consistent linkage
        of datasets at scale by indexing the data first and then using the index as a common ID to
        be joined on.</p>
      <p>The indexing process, via a combination of automated linkage and matching services, allows
        for the enrichment of datasets, expansion of their analytical potential and
        de-identification while retaining analytical use. The RDMF is underpinned by a Quality and
        Assurance framework made up of a set of policies, activities, processes and outputs in line
        with the Code of Practice for Statistics.</p>
    </sec>
    <sec>
      <title>Conclusion</title>
      <p>Once a dataset has been indexed, it can link to other indexed datasets through a direct
        join with the same index id, or through cross index association where all four indexes have
        been linked together. This allows for scalable and consistent dataset linkage increasing the
        analytical value of each dataset.</p>
    </sec>
  </body>
</article>