<?xml version="1.0"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "JATS-journalpublishing1.dtd" [
]>
<article xml:lang="en" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
  xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML"
  dtd-version="1.2" article-type="abstract">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">IJPDS</journal-id>
      <journal-title-group>
        <journal-title>International Journal of Population Data Science</journal-title>
        <abbrev-journal-title>IJPDS</abbrev-journal-title>
      </journal-title-group>
      <issn pub-type="epub">2399-4908</issn>
      <publisher>
        <publisher-name>Swansea University</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.23889/ijpds.v10i3.3110</article-id>
      <article-id pub-id-type="publisher-id">10:3:91</article-id>
      <title-group>
        <article-title>Synthetic data and clarifying the quagmire of terminology that surrounds it</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Oliver</surname>
            <given-names initials="E">Emily</given-names>
          </name>
          <xref ref-type="aff" rid="affil-1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Frayling</surname>
            <given-names initials="L">Lora</given-names>
          </name>
          <xref ref-type="aff" rid="affil-2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Bharat</surname>
            <given-names initials="S">Shah Suraj</given-names>
          </name>
          <xref ref-type="aff" rid="affil-1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Pattinson</surname>
            <given-names initials="E">Elizabeth</given-names>
          </name>
          <xref ref-type="aff" rid="affil-1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Stock</surname>
            <given-names initials="J">Joshua</given-names>
          </name>
          <xref ref-type="aff" rid="affil-3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Lugg-Widger</surname>
            <given-names initials="F">Fiona</given-names>
          </name>
          <xref ref-type="aff" rid="affil-3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Gordon</surname>
            <given-names initials="E">Emma</given-names>
          </name>
          <xref ref-type="aff" rid="affil-1">1</xref>
        </contrib>
      </contrib-group>
      <aff id="affil-1"><label>1</label><institution>ADR UK, London, United Kingdom</institution></aff>
      <aff id="affil-2"><label>2</label><institution>HDI CIC, Cambridge, United Kingdom</institution></aff>
      <aff id="affil-3"><label>3</label><institution>Cardiff University, Cardiff, United Kingdom</institution></aff>
      <pub-date date-type="pub" publication-format="electronic">
        <day>01</day>
        <month>06</month>
        <year>2025</year>
      </pub-date>
      <pub-date date-type="collection" publication-format="electronic">
        <year>2025</year>
      </pub-date>
      <volume>8</volume>
      <issue>4</issue>
      <elocation-id>3110</elocation-id>
      <permissions>
        <license license-type="open-access"
          xlink:href="https://creativecommons.org/licences/by/4.0/">
          <license-p>This work is licenced under a Creative Commons Attribution 4.0 International
            License.</license-p>
        </license>
      </permissions>
      <self-uri xlink:href="https://ijpds.org/article/view/3110">This article is available from the
        IJPDS website at: https://ijpds.org/article/view/3110</self-uri>
    </article-meta>
  </front>
  <body>
    <sec>
      <title>Objectives</title>
      <p>Building on existing literature, our objective is to recommend definitions for key
        synthetic data terminology regarding its use cases in privacy preservation. By unpacking the
        words we use, we expose unsettled terminology and propose definitions which offer clarity
        and consistency, thus affording consensus to support increased adoption of synthetic data.</p>
    </sec>
    <sec>
      <title>Methods</title>
      <p>Taking into account the differing interests of creators, guardians and users of synthetic
        data, we applied a principled approach to selecting literature that could reveal insight on:</p>
      <list list-type="bullet">
        <list-item>
          <p> The scope and purpose of synthetic data when used for privacy preservation </p>
        </list-item>
        <list-item>
          <p> Methods for characterisation and evaluation of it </p>
        </list-item>
        <list-item>
          <p> Clarity on quality control and regulations </p>
        </list-item>
        <list-item>
          <p> Guidelines for practitioners generating, distributing and using synthetic data. </p>
        </list-item>
      </list>
      <p>We examined a range of academic and grey literature and, on finding a lack of definitions
        and consistency, dug deeper to explore the differences in terminology depending on type and
        usefulness for different use cases.</p>
    </sec>
    <sec>
      <title>Results</title>
      <p>Understanding the use case is key to creating accurate definitions. The terms ‘utility’ and
        ‘fidelity’ are separate but related concepts and feature commonly in literature, often
        interchangeably. Yet whilst ‘utility’ is dependent on context (is it right for this use
        case?), ‘fidelity’ is an inherent quality of the data. Different levels of fidelity can be
        important for some use cases, but increasing fidelity will not always increase utility.
        Furthermore, structural fidelity is different to statistical fidelity, yet this
        differentiation is rarely acknowledged. As such, terms related to fidelity remain ambiguous,
        while others, such as ‘microdata’ are simply assumed by the author to be understood by the
        reader with no definitions provided.</p>
    </sec>
    <sec>
      <title>Conclusion</title>
      <p>Inconsistent and undefined use of terminology leads to miscommunication, and consequently a
        lack of reliability and efficiency. It is important to solve this through wide stakeholder
        engagement for purposes of good governance and public trust. We recommend a non-static
        glossary where contested terms are recognised and clarity is formed.</p>
    </sec>
  </body>
</article>