<?xml version="1.0"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "JATS-journalpublishing1.dtd"[]>
<article xml:lang="en" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" dtd-version="1.2" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">IJPDS</journal-id>
<journal-title-group>
<journal-title>International Journal of Population Data Science</journal-title>
<abbrev-journal-title>IJPDS</abbrev-journal-title>
</journal-title-group>
<issn pub-type="epub">2399-4908</issn>
<publisher>
<publisher-name>Swansea University</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.23889/ijpds.v11i1.3160</article-id>
<article-id pub-id-type="publisher-id">11:1:22</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Population Data Science</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Enabling the analysis of patient-level data across jurisdictions for research use: a real-world exploration of a federated likelihood approach</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author"><name><surname>Harmon</surname><given-names initials="M">Megan</given-names></name><xref ref-type="aff" rid="affil-1"><sup>1</sup></xref><xref ref-type="aff" rid="affil-2"><sup>2</sup></xref><xref ref-type="aff" rid="affil-3"><sup>3</sup></xref><xref ref-type="corresp" rid="correspondingAurthor">*</xref></contrib>
<contrib contrib-type="author"><name><surname>Li</surname><given-names initials="N">Na</given-names></name><xref ref-type="aff" rid="affil-1"><sup>1</sup></xref><xref ref-type="aff" rid="affil-2"><sup>2</sup></xref></contrib>
<contrib contrib-type="author"><name><surname>Sajobi</surname><given-names initials="T">Tolulope</given-names></name><xref ref-type="aff" rid="affil-1"><sup>1</sup></xref><xref ref-type="aff" rid="affil-4"><sup>4</sup></xref></contrib>
<contrib contrib-type="author"><name><surname>Holodinsky</surname><given-names initials="J">Jessalyn</given-names></name><xref ref-type="aff" rid="affil-1"><sup>1</sup></xref><xref ref-type="aff" rid="affil-2"><sup>2</sup></xref><xref ref-type="aff" rid="affil-3"><sup>3</sup></xref><xref ref-type="aff" rid="affil-4"><sup>4</sup></xref><xref ref-type="aff" rid="affil-5"><sup>5</sup></xref><xref ref-type="aff" rid="affil-6"><sup>6</sup></xref><xref ref-type="aff" rid="affil-7"><sup>7</sup></xref></contrib>
<contrib contrib-type="author"><name><surname>Williamson</surname><given-names initials="T">Tyler</given-names></name><xref ref-type="aff" rid="affil-1"><sup>1</sup></xref><xref ref-type="aff" rid="affil-2"><sup>2</sup></xref><xref ref-type="aff" rid="affil-5"><sup>5</sup></xref></contrib>
<aff id="affil-1"><label>1</label><institution>Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada</institution></aff>
<aff id="affil-2"><label>2</label><institution>Centre for Health Informatics, University of Calgary, Calgary, Alberta, Canada</institution></aff>
<aff id="affil-3"><label>3</label><institution>Department of Emergency Medicine, University of Calgary, Calgary, Alberta, Canada</institution></aff>
<aff id="affil-4"><label>4</label><institution>Department of Clinical Neurosciences, University of Calgary, Calgary, Alberta, Canada</institution></aff>
<aff id="affil-5"><label>5</label><institution>O&#x2019;Brien Institute for Public Health, University of Calgary, Calgary, Alberta, Canada</institution></aff>
<aff id="affil-6"><label>6</label><institution>Hotchkiss Brain Institute, University of Calgary, Calgary, Alberta, Canada</institution></aff>
<aff id="affil-7"><label>7</label><institution>Alberta Children&#x2019;s Hospital Research Institute, University of Calgary, Calgary, Albera, Canada</institution></aff>
</contrib-group>
<author-notes>
<corresp id="correspondingAurthor"><label>*</label>Corresponding author: Megan Harmon, <email>megan.harmon@ucalgary.ca</email></corresp>
<fn fn-type="conflict">
<label>Statement on conflicts of interest</label>
<p>The authors have no conflicts of interest.</p></fn>
</author-notes>
<pub-date date-type="pub" publication-format="electronic"><day>28</day><month>04</month><year>2026</year></pub-date>
<pub-date date-type="collection" publication-format="electronic"><year>2026</year></pub-date>
<volume>11</volume>
<issue>1</issue>
<elocation-id>3160</elocation-id>
<permissions>
<license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by-nc-nd/4.0/">
<license-p>This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.</license-p>
</license>
</permissions>
<self-uri xlink:href="https://ijpds.org/article/view/3160">This article is available from the IJPDS website at: https://ijpds.org/article/view/3160</self-uri>
<abstract>
<sec>
<title>Introduction</title>
<p>The disclosure of health data is governed by strict privacy regulations which significantly restrict the transfer of data across jurisdictions. These limitations restrict the scope of health research, particularly the ability to conduct studies that span multiple jurisdictions (e.g. provinces, states, or countries). One common practice is to use meta-analysis to pool jurisdiction-specific estimates. However, this approach relies on combining aggregate-level data, which may overlook important nuances. Therefore, alternative methods are needed. This study investigates the potential of a federated likelihood approach to analyse health data across jurisdictions while keeping the data inside the jurisdiction.</p>
</sec>
<sec>
<title>Objectives</title>
<p>(1) Evaluate the effectiveness of the meta-analysis technique in pooling jurisdiction-specific results to produce overall estimates (2) assess the utility of a federated likelihood approach for interjurisdictional health data analysis; and (3) compare the performance of a federated likelihood method with the conventional meta-analysis approach.</p>
</sec>
<sec>
<title>Methods</title>
<p>Log-likelihood functions from separate jurisdictions were directly combined to form a single overall log-likelihood function, and overall parameter estimates. We show that these parameter estimates are mathematically equivalent to those obtained from analysing the pooled individual-level data. Using data from the Canadian Primary Care Sentinel Surveillance Network, and using chronic kidney disease as a case study, estimates from five jurisdictions are combined using meta-analysis and the federated likelihood approach. These results are compared to the overall estimates from a global model.</p>
</sec>
<sec>
<title>Result</title>
<p>The likelihood method demonstrated strong performance in analysing federated data, yielding the lowest relative percent absolute bias at 1.34%. This was notably lower than the bias observed with the common effect meta-analysis (4.09%).</p>
</sec>
<sec>
<title>Conclusion</title>
<p>The federated likelihood method offers a promising approach to interjurisdictional health data analysis. However, further investigation is needed to assess its feasibility across different model types and model complexity to fully understand the scope of its utility.</p>
</sec>
</abstract>
<kwd-group>
<kwd>likelihood functions</kwd>
<kwd>meta-analysis</kwd>
<kwd>federated analysis</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec>
<title>Introduction</title>
<p>As health systems rely on data to drive decision-making and improve outcomes, the ability to analyse health data distributed across multiple institutions and regions has become essential [<xref ref-type="bibr" rid="ref-1">1</xref>, <xref ref-type="bibr" rid="ref-2">2</xref>]. While some research questions can be addressed within a single organisation, many critical public health and clinical challenges &#x2013; such as understanding rare events or chronic disease surveillance &#x2013; demand a broader, more integrated approach [<xref ref-type="bibr" rid="ref-3">3</xref>&#x2013;<xref ref-type="bibr" rid="ref-5">5</xref>]. In such cases, no single data source may offer sufficient scale or diversity to produce reliable insights. Federated health data analysis allows researchers to overcome these limitations by synthesising the statistical structure of the data across locations [<xref ref-type="bibr" rid="ref-6">6</xref>], thereby enabling more accurate estimates, detection of rare outcomes, and data-driven public health policies that are both comprehensive and generalisable. Without such collaboration, key patterns and risks may remain undetected, limiting the effectiveness of health interventions and decision-making at scale.</p>
<p>Globally, the disclosure of health data is subject to specific jurisdictional regulations. Generally, these regulations, whether they are organisational policies or government legislation, restrict the disclosure, transfer, and sharing of patient-level data across regional boundaries, particularly for research use. Countries around the world face barriers when conducting cross-jurisdictional health research due to varying privacy regulations. In Canada, provincial legislation like the Alberta Health Information Act and the Ontario Personal Health Information Protection Act governs the collection, use, and disclosure of health information [<xref ref-type="bibr" rid="ref-7">7</xref>, <xref ref-type="bibr" rid="ref-8">8</xref>]. The European Union&#x2019;s General Data Protection Regulation (GDPR) imposes strict limits on the sharing of personal health data between member states [<xref ref-type="bibr" rid="ref-9">9</xref>]. Similarly, in the United States, the Health Insurance Portability and Accountability Act (HIPAA) enforces privacy standards that can make cross-state health data sharing difficult [<xref ref-type="bibr" rid="ref-10">10</xref>].</p>
<p>As cross-border collaboration and data-driven health research become increasingly important, finding secure, ethical, and accurate methods for analysing federated health data remains a pressing international challenge.</p>
<p>Meta-analysis is a well-established method for synthesising findings across independent studies and is particularly attractive in federated data settings due to its minimal data sharing requirements. However, its reliance on summary statistics limits its ability to recover the full statistical structure of the original data [<xref ref-type="bibr" rid="ref-11">11</xref>]. Standard meta-analysis methods do not preserve the likelihood contributions from individual datasets, leading to a loss of granularity and limited analytical flexibility. Additionally, meta-analytic methods perform poorly when regional sample sizes are small, which may yield unstable or biased estimates [<xref ref-type="bibr" rid="ref-12">12</xref>]. This is a common challenge in multi-jurisdictional health research, where sample sizes may vary considerably across regions. Another approach is federated learning which allows multiple institutions to collaboratively and iteratively train machine learning models without sharing raw data. Each institution retains its data locally, training a model on-site and then sharing only model parameters (such as gradients or weights) with a central server, where these parameters are aggregated to form a global model. This method has gained traction in healthcare due to its privacy-preserving nature, as demonstrated by Rieke et al. (2020) and Sheller et al. (2020) [<xref ref-type="bibr" rid="ref-13">13</xref>, <xref ref-type="bibr" rid="ref-14">14</xref>], who have explored its application in medical contexts where data security is critical. While federated learning has been widely adopted for machine learning tasks such as prediction and optimisation, it is increasingly being extended to traditional statistical modelling. Recent work has demonstrated federated implementations of generalised linear regression and Cox proportional hazards regression for clinical and epidemiological research [<xref ref-type="bibr" rid="ref-15">15</xref>&#x2013;<xref ref-type="bibr" rid="ref-18">18</xref>]. These advances show that federated approaches can also support hypothesis testing and estimation of treatment effects, thus broadening their applicability beyond purely predictive contexts.</p>
<p>This work explores the use of a federated likelihood approach to enable the analysis of health information without direct data sharing. Likelihood functions underpin models such as logistic regression, serving as the mathematical tool that determines the parameter values that best explain the observed data. Likelihood functions, therefore, carry all the information contained in the data about a specific model, while being sufficiently aggregated to ensure no personal information can &#x201C;leak through.&#x201D; Conceptually, likelihood functions for various jurisdictional models can be combined to yield an overall likelihood function that is mathematically identical to what would be available if the data were combined for the analysis [<xref ref-type="bibr" rid="ref-19">19</xref>]. Unlike meta-analysis, a federated likelihood approach preserves the complete probabilistic information encoded in the local data, allowing for joint estimation of parameters under richer statistical models and leading to more accurate inference. However, sharing a closed-form solution to the likelihood function isn&#x2019;t practical, since such solutions typically don&#x2019;t exist for complex models and must be approximated numerically. As an alternative, we investigate a brute force strategy in which regional log-likelihood matrices are aggregated, eliminating the need to reconstruct full likelihood functions.</p>
<p>The effectiveness of using a likelihood-based method for the interjurisdictional analysis of health data remains unknown, and it is not yet clear how this approach compares to current meta-analysis techniques. Evaluating the effectiveness of these methodologies is crucial for developing a robust framework for health data analysis that can be applied both nationally and internationally. While theoretical results suggest that combining likelihood functions across regions can produce the same outcomes as a centralised analysis, practical challenges such as computational complexity need to be explored. This work seeks to address these pressing questions, evaluating the potential of the federated likelihood approach in overcoming the challenges associated with interjurisdictional health data analysis.</p>
<p>The objective of this study is threefold. First, it aims to evaluate the effectiveness of the meta-analysis technique in pooling province-specific results to produce national level estimates. Second, it seeks to assess the utility of a federated likelihood approach for interjurisdictional health data analysis. Finally, this study aims to compare the performance of a federated likelihood method with the conventional meta-analysis approach. To achieve these objectives, this study uses the well-established relationship between hypertension and chronic kidney disease (CKD) among patients with diabetes as a case study &#x2013; an ideal choice given the strong, well-documented association between these conditions and their public health relevance [<xref ref-type="bibr" rid="ref-20">20</xref>]. This allows for meaningful benchmarking of the analytical methods&#x2019; accuracy and reliability.</p>
</sec>
<sec>
<title>Methods</title>
<sec>
<title>Case study setting and data source</title>
<p>This study utilised data from the Canadian Primary Care Sentinel Surveillance Network (CPCSSN), a pan-Canadian collection of electronic medical records (EMR) encompassing over 2 million primary care patients spread across 14 regional networks [<xref ref-type="bibr" rid="ref-21">21</xref>]. CPCSSN has pan-Canadian coverage of primary care practices, with participating practices situated in the majority of provinces across Canada. Since its establishment in 2008, CPCSSN continuously receives EMR data extracted by contributing networks from participating primary care practitioners. CPCSSN contains information on consenting patients&#x2019; diagnoses, procedures, laboratory tests, medication prescriptions, and referrals. As CPCSSN data is pooled at the national level, it is one of the few data sources that allows for national level data analysis while also including meaningful differences across jurisdictional boundaries rather than creating artificial boundaries as has been done by others [<xref ref-type="bibr" rid="ref-22">22</xref>]. This allows for the assessment of this novel approach by providing a basis for comparison against the actual national values.</p>
<p>Adult patients (18 years of age and older) with diabetes (n = 42,341) were identified using validated case definitions [<xref ref-type="bibr" rid="ref-23">23</xref>, <xref ref-type="bibr" rid="ref-24">24</xref>]. A unique baseline visit was defined for each patient as their first visit with a participating primary care practitioner between January 1, 2014, and December 31, 2014. In keeping with using CKD and hypertension as a case study, our analyses were restricted to patients without an existing diagnosis of CKD at any time prior to baseline and those who were not marked deceased at their baseline visit.</p>
<p>Diagnoses of CKD were identified using eGFR. Specifically, two or more laboratory values separated by at least 90 days but not more than 1 year apart reporting an eGFR less than 60 mL/min/1.73 m<sup>2</sup>, regardless of age as well as two or more laboratory values separated by at least 90 days but not more than 1 year apart reporting an eGFR less than 75, 60, and 45 mL/min/1.73 m<sup>2</sup> for ages younger than 40, 40 to 64, and 65 years or older, respectively [<xref ref-type="bibr" rid="ref-25">25</xref>].</p>
<p>The variables examined in this study include hypertension, sex, and age. Hypertension is a dichotomous variable based on any diagnosis prior to baseline [<xref ref-type="bibr" rid="ref-23">23</xref>]. Sex is a dichotomous variable based on what was recorded in the EMR. Age is treated as a continuous variable ranging from 18 to 104, without any transformations. Age was calculated as baseline visit year minus birthyear.</p>
</sec>
<sec>
<title>Statistical analysis</title>
<p>All analyses were conducted using R version 4.4.1.</p>
<p>Five logistic regression models were fit separately for British Columbia (n = 1,700), Alberta (n = 8,002), Manitoba (n = 6,089), Ontario (n = 23,946), and the Maritimes (n = 2,604) which includes patients from Newfoundland and Labrador, New Brunswick, Nova Scotia, and Prince Edward Island. These regions were aggregated due to sample size. Additionally, a global logistic regression model was fit which encompasses all individual patient data across the provinces (n = 42,341). In each model, hypertension serves as the primary exposure variable, while sex and age are included as covariates to control for potential confounding effects. The province of Quebec has been excluded from the analyses as there were no recorded patients with the outcome of interest. The province of Saskatchewan and the Canadian territories are not included as CPCSSN does not currently collect data in those regions.</p>
</sec>
<sec>
<title>Meta-analysis</title>
<p>Common effect meta-analysis was employed to pool the coefficients from the five regional models using the R package &#x2018;meta&#x2019;. A separate meta-analysis was conducted for each model parameter to synthesise the provincial estimates of CKD across Canada. The inverse variance method was used for pooling [<xref ref-type="bibr" rid="ref-26">26</xref>]. The heterogeneity across regions was assessed using Cochran&#x2019;s Q, which is a chi-square statistic comparing the effect size from each regional model to the overall pooled effect. Regional heterogeneity was measured using I<sup>2</sup>, which is the proportion of total variation in effects due to true heterogeneity rather than chance.</p>
</sec>
<sec>
<title>Assumptions of the federated likelihood approach</title>
<p>The federated likelihood method is built upon a core set of assumptions that support both the validity and interpretability of inferences drawn from distributed data sources. These assumptions are critical for ensuring the reliability and applicability of the method, and are outlined as follows:</p>
<p>1. Common Effect Assumption:</p>
<p>A central assumption is that of a common effect. The fundamental objective of this method is to estimate a global parameter that is common across all sites. If true heterogeneity is present, more flexible models such as mixed-effects approaches may be warranted which is beyond the scope of this study.</p>
<p>2. Scientific Pooling Assumption (Justifiability of Pooled Analysis):</p>
<p>The primary assumption is that data from all participating jurisdictions or institutions can be justifiably pooled for unified analysis. This requires that the research question, study populations, and measured variables are sufficiently comparable to allow for meaningful inference. This assumption should be critically assessed a priori: if conceptual alignment across jurisdictions is lacking, then pooling &#x2013; no matter how technically feasible &#x2013; is inappropriate.</p>
<p>3. Consistency of Model Structure Across Sites:</p>
<p>To construct a valid joint likelihood, all participating sites must use an identical model structure. This includes consistent specification of the outcome variable, covariates, link function, and distributional assumptions. Structural inconsistencies, such as differences in variable inclusion, outcome definitions, missingness, or data transformations, would undermine compatibility and compromise the validity of joint inference.</p>
<p>4. Independence of Observations Across Jurisdictions (IID Assumption):</p>
<p>Finally, the method assumes that observations are independent and identically distributed (IID) across jurisdictions, with no overlapping or duplicated records. This condition is crucial to avoid double-counting, which can bias the likelihood.</p>
<p>These assumptions form the theoretical backbone of the federated likelihood framework. In practice, each must be carefully assessed within the specific context of the research question and available data. When any assumption is violated, alternative methods, such as mixed-effects modelling or meta-analytic approaches, may offer more suitable solutions.</p>
</sec>
<sec>
<title>Federated likelihood method</title>
<p>As it is often impractical to share a closed-form solution to the likelihood function since such solutions typically don&#x2019;t exist for complex models and must be approximated numerically, we investigate a brute force approach in which regional log-likelihood matrices are constructed and aggregated, eliminating the need to reconstruct full likelihood functions. This brute force approach utilises a grid search strategy to calculate the log-likelihood at every combination of plausible parameter values thereby constructing a matrix of log-likelihood values.</p>
<p>A predefined range of possible parameter values is created for each parameter in the model. These parameter values are drawn from the 95% confidence intervals associated with each of the estimates from the regional multivariable logistic regression models. Using confidence intervals to define this range is beneficial because they reflect the variability in our estimates, allowing us to incorporate regional variation into the grid search for selecting parameter values in our fully combined model. To construct the range of values for a given parameter, the smallest (most conservative) lower bound and the largest (most conservative) upper bound are identified across each of the regional confidence intervals for that parameter. These two values &#x2013; representing the lowest and highest plausible values of the parameter across all models &#x2013; form the endpoints of the parameter space for that vector. Once this inclusive range is established, it is segmented by generating a sequence of values at small, uniform increments. This study will explore the performance across two vector increments: 0.01 and 0.005. This results in a finely resolved vector that effectively captures the full extent of the parameter uncertainty across models. This approach ensures that no plausible values are excluded from the grid search and allows for the construction of a combined model that fully reflects the range of estimates derived from the ensemble of individual models.</p>
<p>Within each region, the log-likelihood was then calculated for each unique combination of values from the parameter vectors. The result of this process is a matrix of log-likelihood values, with each element of the matrix representing the likelihood of the data given a specific combination of parameter values. Technically, this will be a tensor for more than two parameters, but for simplicity we will refer to this as a matrix. This matrix effectively maps out the likelihood landscape for the regional model, allowing for a detailed examination of how the likelihood changes across different parameter combinations. By calculating the log-likelihood across all potential sets of parameter values, each matrix includes the "true" maximum log-likelihood, corresponding to the set of parameter values that best fit the regional data.</p>
<p>This process was then repeated independently for each regional model, generating a matrix of log-likelihood values for each region. The regional log-likelihood matrices were then summed. Summing the log-likelihoods at each intersection of parameter values leverages the additive property of log-likelihoods in independent models, allowing for the combination of information from the regions without directly accessing their raw data. The combined log-likelihood matrix represents the overall likelihood for the global model, effectively synthesising the regional information into a single estimate. An illustration of this process using simulated data is presented in <xref ref-type="fig" rid="fig-1">Figure 1</xref>.</p>
<fig id="fig-1">
<label>Figure 1</label>
<caption><title>Contour plots of simulated data demonstrating the application of the likelihood method for the analysis of data across two distinct regions with two parameters</title></caption>
<graphic xlink:href="ijpds-06-3160-g001.tif"/>
</fig>
<p>In addition, the 14.7% likelihood region was identified, representing the set of parameter values where the likelihood is at least 14.7% of the maximum likelihood. From this region, a 14.7% likelihood interval for each parameter can be derived, defining the range of parameter values around the maximum where the likelihood of the data is highest. The 14.7% cutoff is chosen because it approximately corresponds to the likelihood threshold associated with a 95% confidence interval under likelihood theory [<xref ref-type="bibr" rid="ref-27">27</xref>]. We report both likelihood and confidence intervals.</p>
<p>The parameter values that maximised the combined likelihood were identified and compared to the global model estimates to evaluate the performance of the likelihood-based approach. In addition, the estimates obtained through the likelihood method were directly compared to those generated by meta-analysis. The meta-analysis, which aggregates odds ratios across regions, and the likelihood-based method, which combines regional log-likelihoods, represent two distinct strategies for synthesising regional data into combined estimates. To assess the accuracy of each approach, the common effect meta-analysis estimates, along with the federated likelihood estimates, were compared to the global model by calculating the relative percent absolute bias (RPAB). This was defined as the absolute difference between each pooled estimate and the global estimate, divided by the global estimate. Comparing the RPABs across methods provides a measure of how closely each approach approximates the true global model and highlights potential limitations or biases inherent in either the meta-analysis or federated likelihood method.</p>
</sec>
</sec>
<sec>
<title>Results</title>
<p>A total of 42,341 participants were included in the analysis. The mean age of patients at baseline was 63.38 (SD: 12.22). Among the overall study population, 53% of patients were male and 47% were female. 51% of the study population has hypertension, with Ontario having the lowest proportion at 44% and the Maritimes having the highest proportion of hypertension at 67%. 15% of the overall study population have chronic kidney disease, with Alberta having the lowest proportion at 13% and the Maritimes having the largest proportion at 19%. Among the study population, 4% live in British Columbia, 19% live in Alberta, 14% live in Manitoba, 55% live in Ontario, and 6% live in the Maritimes (<xref ref-type="table" rid="table-1">Table 1</xref>).</p>
<table-wrap id="table-1">
<label>Table 1</label><caption><title>Descriptive statistics of adult patients with diabetes in Canada</title></caption>
<table frame="hsides" rules="groups">
<col width="15%"/>
<col width="15%"/>
<col width="15%"/>
<col width="15%"/>
<col width="15%"/>
<col width="15%"/>
<col width="10%"/>
<tbody>
<tr>
<td align="left" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle"></td>
<td align="center" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle"><italic><bold>Global (N=42,341)</bold></italic></td>
<td align="center" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle"><italic><bold>British Columbia (N=1,700)</bold></italic></td>
<td align="center" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle"><italic><bold>Alberta (N=8,002)</bold></italic></td>
<td align="center" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle"><italic><bold>Manitoba (N=6,089)</bold></italic></td>
<td align="center" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle"><italic><bold>Ontario (N=23,946)</bold></italic></td>
<td align="center" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle"><italic><bold>Maritimes (N=2,604)</bold></italic></td>
</tr>
<tr>
<td align="left" valign="top"><italic>Age, mean (SD)</italic></td>
<td align="center" valign="top">63.38 (12.22)</td>
<td align="center" valign="top">63.14 (12.39)</td>
<td align="center" valign="top">62.66 (12.11)</td>
<td align="center" valign="top">63.03 (12.60)</td>
<td align="center" valign="top">63.58 (12.09)</td>
<td align="center" valign="top">64.72 (12.08)</td>
</tr>
<tr>
<td align="left" valign="top"><italic>Age range</italic></td>
<td align="center" valign="top">18&#x2013;104</td>
<td align="center" valign="top">20&#x2013;100</td>
<td align="center" valign="top">18&#x2013;98</td>
<td align="center" valign="top">24&#x2013;100</td>
<td align="center" valign="top">18&#x2013;104</td>
<td align="center" valign="top">20&#x2013;101</td>
</tr>
<tr>
<td align="left" valign="top"><italic>Sex</italic></td>
<td align="left" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
</tr>
<tr>
<td align="left" valign="top">&#x2003;<italic>Male, n (%)</italic></td>
<td align="center" valign="top">22,478 (53%)</td>
<td align="center" valign="top">911 (54%)</td>
<td align="center" valign="top">4,315 (54%)</td>
<td align="center" valign="top">3,320 (55%)</td>
<td align="center" valign="top">12,606 (53%)</td>
<td align="center" valign="top">1,326 (51%)</td>
</tr>
<tr>
<td align="left" valign="top">&#x2003;<italic>Female, n (%)</italic></td>
<td align="center" valign="top">19,863 (47%)</td>
<td align="center" valign="top">789 (46%)</td>
<td align="center" valign="top">3,687 (46%)</td>
<td align="center" valign="top">2,769 (45%)</td>
<td align="center" valign="top">11,340 (47%)</td>
<td align="center" valign="top">1,278 (49%)</td>
</tr>
<tr>
<td align="left" valign="top"><italic>Hypertension</italic></td>
<td align="left" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
</tr>
<tr>
<td align="left" valign="top">&#x2003;<italic>No, n (%)</italic></td>
<td align="center" valign="top">20,828 (49%)</td>
<td align="center" valign="top">700 (41%)</td>
<td align="center" valign="top">3,050 (38%)</td>
<td align="center" valign="top">2,916 (48%)</td>
<td align="center" valign="top">13,292 (56%)</td>
<td align="center" valign="top">870 (33%)</td>
</tr>
<tr>
<td align="left" valign="top">&#x2003;<italic>Yes, n (%)</italic></td>
<td align="center" valign="top">21,513 (51%)</td>
<td align="center" valign="top">1,000 (59%)</td>
<td align="center" valign="top">4,952 (62%)</td>
<td align="center" valign="top">3,173 (52%)</td>
<td align="center" valign="top">10,654 (44%)</td>
<td align="center" valign="top">1,734 (67%)</td>
</tr>
<tr>
<td align="left" valign="top"><italic>CKD</italic></td>
<td align="left" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
</tr>
<tr>
<td align="left" valign="top">&#x2003;<italic>No, n (%)</italic></td>
<td align="center" valign="top">36,193 (85%)</td>
<td align="center" valign="top">1,454 (86%)</td>
<td align="center" valign="top">6,987 (87%)</td>
<td align="center" valign="top">5,235 (86%)</td>
<td align="center" valign="top">20,396 (85%)</td>
<td align="center" valign="top">2,121 (81%)</td>
</tr>
<tr>
<td align="left" valign="top">&#x2003;<italic>Yes, n (%)</italic></td>
<td align="center" valign="top">6,148 (15%)</td>
<td align="center" valign="top">246 (14%)</td>
<td align="center" valign="top">1,015 (13%)</td>
<td align="center" valign="top">854 (14%)</td>
<td align="center" valign="top">3,550 (15%)</td>
<td align="center" valign="top">483 (19%)</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>*CKD=chronic kidney disease. Three missing values for sex were imputed using K-Nearest Neighbours (KNN).</p>
</table-wrap-foot>
</table-wrap>
<p>We evaluated the performance of meta-analysis and a federated likelihood method for estimating the association between hypertension and chronic kidney disease across Canadian provinces. Analyses focused on assessing regional heterogeneity and estimating model accuracy compared to the true global logistic regression model. Estimates from the global and regional models, and results for each method are summarised below.</p>
<p>Across the five provinces studied, model estimates showed both similarities and important regional differences. The effect of hypertension on the estimated log-odds of CKD, common across all ages and both sexes, varied across regions, with the strongest associations observed in British Columbia (0.57) and the Maritimes (0.47), and a noticeably smaller effect size in Ontario (0.16).</p>
<sec>
<title>Meta-analysis results</title>
<p>A common effect meta-analysis was conducted to pool the coefficients from the five regional models. The regional coefficients for hypertension, and their standard errors (shown in <xref ref-type="table" rid="table-2">Table 2</xref>) were used for the meta-analysis. The results are summarised in the forest plot presented in <xref ref-type="fig" rid="fig-2">Figure 2</xref>.</p>
<fig id="fig-2">
<label>Figure 2</label>
<caption><title>Meta-analysis results for the coefficient for hypertension, synthesising model results from British Columbia (BC), Alberta (AB), Manitoba (MB), Ontario (ON), and Maritimes (MAR)</title></caption>
<graphic xlink:href="ijpds-06-3160-g002.tif"/>
</fig>
<table-wrap id="table-2">
<label>Table 2</label><caption><title>Global and regional logistic regression results</title></caption>
<table frame="hsides" rules="groups">
<col width="10%"/>
<col width="15%"/>
<col width="15%"/>
<col width="15%"/>
<col width="15%"/>
<col width="15%"/>
<col width="15%"/>
<tbody>
<tr>
<td align="left" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle"></td>
<td align="center" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle"><italic><bold>Global model</bold></italic></td>
<td align="center" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle"><italic><bold>British Columbia</bold></italic></td>
<td align="center" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle"><italic><bold>Alberta</bold></italic></td>
<td align="center" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle"><italic><bold>Manitoba</bold></italic></td>
<td align="center" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle"><italic><bold>Ontario</bold></italic></td>
<td align="center" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle"><italic><bold>Maritimes</bold></italic></td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td colspan="6" align="center" valign="top">Coefficients (SE) <break/>95% confidence intervals</td>
</tr>
<tr>
<td align="left" valign="top"><italic>Intercept</italic></td>
<td align="center" valign="top">&#x2013;7.0460 (0.1007)</td>
<td align="center" valign="top">&#x2013;7.0588 (0.5060)</td>
<td align="center" valign="top">&#x2013;7.2386 (0.2456)</td>
<td align="center" valign="top">&#x2013;6.6667 (0.2581)</td>
<td align="center" valign="top">&#x2013;7.0716 (0.1338)</td>
<td align="center" valign="top">&#x2013;7.1802 (0.3915)</td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="center" valign="top">&#x2013;7.24, &#x2013;6.85</td>
<td align="center" valign="top">&#x2013;8.08, &#x2013;6.09</td>
<td align="center" valign="top">&#x2013;7.73, &#x2013;6.76</td>
<td align="center" valign="top">&#x2013;7.18, &#x2013;6.17</td>
<td align="center" valign="top">&#x2013;7.34, &#x2013;6.81</td>
<td align="center" valign="top">&#x2013;7.96, &#x2013;6.43</td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
</tr>
<tr>
<td align="left" valign="top"><italic>Hypertension</italic></td>
<td align="center" valign="top">0.2542 (0.0300)</td>
<td align="center" valign="top">0.5660 (0.1661)</td>
<td align="center" valign="top">0.4661 (0.0823)</td>
<td align="center" valign="top">0.3651 (0.0820)</td>
<td align="center" valign="top">0.1634 (0.0387)</td>
<td align="center" valign="top">0.4664 (0.1302)</td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="center" valign="top">0.20, 0.31</td>
<td align="center" valign="top">0.25, 0.90</td>
<td align="center" valign="top">0.31, 0.63</td>
<td align="center" valign="top">0.21, 0.53</td>
<td align="center" valign="top">0.09, 0.24</td>
<td align="center" valign="top">0.21, 0.73</td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
</tr>
<tr>
<td align="left" valign="top"><italic>Sex</italic></td>
<td align="center" valign="top">&#x2013;0.0737 (0.0291)</td>
<td align="center" valign="top">&#x2013;0.1012 (0.1462)</td>
<td align="center" valign="top">&#x2013;0.0442 (0.0708)</td>
<td align="center" valign="top">&#x2013;0.2778 (0.0780)</td>
<td align="center" valign="top">&#x2013;0.0034 (0.0384)</td>
<td align="center" valign="top">&#x2013;0.2726 (0.1084)</td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="center" valign="top">&#x2013;0.13, &#x2013;0.02</td>
<td align="center" valign="top">&#x2013;0.39, 0.19</td>
<td align="center" valign="top">&#x2013;0.18, 0.09</td>
<td align="center" valign="top">&#x2013;0.43, &#x2013;0.12</td>
<td align="center" valign="top">&#x2013;0.08, 0.07</td>
<td align="center" valign="top">&#x2013;0.49, &#x2013;0.06</td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
<td align="center" valign="top"></td>
</tr>
<tr>
<td align="left" valign="top"><italic>Age</italic></td>
<td align="center" valign="top">0.0770 (0.0014)</td>
<td align="center" valign="top">0.0740 (0.0070)</td>
<td align="center" valign="top">0.0753 (0.0034)</td>
<td align="center" valign="top">0.0714 (0.0036)</td>
<td align="center" valign="top">0.0780 (0.0019)</td>
<td align="center" valign="top">0.0807 (0.0054)</td>
</tr>
<tr>
<td align="left" valign="top"></td>
<td align="center" valign="top">0.07, 0.08</td>
<td align="center" valign="top">0.06, 0.09</td>
<td align="center" valign="top">0.07, 0.08</td>
<td align="center" valign="top">0.06, 0.08</td>
<td align="center" valign="top">0.07, 0.08</td>
<td align="center" valign="top">0.07, 0.09</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>The meta-analysis for hypertension produced pooled estimates that were compared to the known global model estimate of 0.25 (95% CI: 0.20 to 0.31). The common effect meta-analysis yielded an estimated log-odds ratio of 0.26 (95% CI: 0.20 to 0.32), which closely approximated the global estimate and included it within its confidence interval. However, substantial heterogeneity was detected across regions (I<sup>2</sup> = 80%, p = 0.0005), suggesting meaningful variation in the association between hypertension and CKD across regions. This level of heterogeneity suggests that the assumption of a common effect may not hold. In contexts where effect heterogeneity is substantial, estimates may be biased or oversimplified because they have been pooled across jurisdictions without adequately accounting for true between-site variation, therefore alternative approaches such as mixed-effects or random-effects models could be more appropriate. Nonetheless, the primary aim of this study was to evaluate the performance of the federated likelihood method, rather than to identify the most suitable model for inference. As such, the use of a common effect model remains justifiable within the scope of our objectives.</p>
</sec>
<sec>
<title>Likelihood method results</title>
<p>A wide parameter space was generated based on the confidence intervals from the regional multivariable logistic regression models estimating the association between hypertension and CKD. The development of the parameter vectors is illustrated in <xref ref-type="fig" rid="fig-3">Figure 3</xref>.</p>
<fig id="fig-3">
<label>Figure 3</label>
<caption><title>Vectors of possible parameter values. Vectors increased from the minimum value by increments of either 0.01 or 0.005 up to the maximum value</title></caption>
<graphic xlink:href="ijpds-06-3160-g003.tif"/>
</fig>
<p>Within each region, the log-likelihood was calculated for every unique combination of values from the parameter vectors, resulting in 4,438,496 evaluations per region given the threshold of 0.01, and 61,151,895 evaluations per region given the threshold of 0.005. This process produced a matrix of log-likelihood values for each regional model, effectively mapping the likelihood landscape and enabling a detailed examination of how the likelihood varies across different parameter combinations. Each matrix includes the "true" maximum log-likelihood, corresponding to the set of parameter values that best fit the regional data. The regional log-likelihood matrices were then summed to create a combined log-likelihood matrix, which was subsequently maximised to identify the parameter values that best fit the data across regions. The results of this process are presented in <xref ref-type="table" rid="table-3">Table 3</xref>.</p>
<table-wrap id="table-3">
<label>Table 3</label><caption><title>Federated likelihood method (0.01, 0.005) results compared to the global logistic regression results</title></caption>
<table frame="hsides" rules="groups">
<col width="20%"/>
<col width="20%"/>
<col width="12%"/>
<col width="12%"/>
<col width="12%"/>
<col width="12%"/>
<col width="12%"/>
<tbody>
<tr>
<td align="left" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle"></td>
<td align="center" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle"></td>
<td align="left" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle"></td>
<td align="center" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle">Intercept</td>
<td align="center" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle">Hypertension</td>
<td align="center" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle">Sex</td>
<td align="center" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle">Age</td>
</tr>
<tr>
<td align="left" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle"></td>
<td align="center" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle"></td>
<td align="center" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle">Maximum log-likelihood</td>
<td colspan="4" align="center" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle">Coefficients</td>
</tr>
<tr>
<td rowspan="3" align="left" valign="top">Alberta</td>
<td align="center" valign="top">Regional model</td>
<td align="center" valign="top">&#x2013;2702.45</td>
<td align="center" valign="top">&#x2013;7.2386</td>
<td align="center" valign="top">0.4661</td>
<td align="center" valign="top">&#x2013;0.0442</td>
<td align="center" valign="top">0.0753</td>
</tr>
<tr>
<td align="center" valign="top">Likelihood model (0.01)</td>
<td align="center" valign="top">&#x2013;2703.44</td>
<td align="center" valign="top">&#x2013;6.9156</td>
<td align="center" valign="top">0.4776</td>
<td align="center" valign="top">&#x2013;0.0454</td>
<td align="center" valign="top">0.0705</td>
</tr>
<tr>
<td align="center" valign="top"><bold>Likelihood model (0.005)</bold></td>
<td align="center" valign="top"><bold>&#x2013;2702.45</bold></td>
<td align="center" valign="top"><bold>&#x2013;7.2556</bold></td>
<td align="center" valign="top"><bold>0.4676</bold></td>
<td align="center" valign="top"><bold>&#x2013;0.0454</bold></td>
<td align="center" valign="top"><bold>0.0755</bold></td>
</tr>
<tr>
<td rowspan="3" align="left" valign="top">British Columbia</td>
<td align="center" valign="top">Regional model</td>
<td align="center" valign="top">&#x2013;618.57</td>
<td align="center" valign="top">&#x2013;7.0588</td>
<td align="center" valign="top">0.5660</td>
<td align="center" valign="top">&#x2013;0.1012</td>
<td align="center" valign="top">0.0740</td>
</tr>
<tr>
<td align="center" valign="top">Likelihood model (0.01)</td>
<td align="center" valign="top">&#x2013;618.70</td>
<td align="center" valign="top">&#x2013;6.8156</td>
<td align="center" valign="top">0.5776</td>
<td align="center" valign="top">&#x2013;0.1054</td>
<td align="center" valign="top">0.0705</td>
</tr>
<tr>
<td align="center" valign="top"><bold>Likelihood model (0.005)</bold></td>
<td align="center" valign="top"><bold>&#x2013;618.59</bold></td>
<td align="center" valign="top"><bold>&#x2013;7.1606</bold></td>
<td align="center" valign="top"><bold>0.5626</bold></td>
<td align="center" valign="top"><bold>&#x2013;0.1004</bold></td>
<td align="center" valign="top"><bold>0.0755</bold></td>
</tr>
<tr>
<td rowspan="3" align="left" valign="top">Manitoba</td>
<td align="center" valign="top">Regional model</td>
<td align="center" valign="top">&#x2013;2184.38</td>
<td align="center" valign="top">&#x2013;6.6667</td>
<td align="center" valign="top">0.3651</td>
<td align="center" valign="top">&#x2013;0.2778</td>
<td align="center" valign="top">0.0714</td>
</tr>
<tr>
<td align="center" valign="top">Likelihood model (0.01)</td>
<td align="center" valign="top">&#x2013;2184.42</td>
<td align="center" valign="top">&#x2013;6.6056</td>
<td align="center" valign="top">0.3676</td>
<td align="center" valign="top">&#x2013;0.2754</td>
<td align="center" valign="top">0.0705</td>
</tr>
<tr>
<td align="center" valign="top"><bold>Likelihood model (0.005)</bold></td>
<td align="center" valign="top"><bold>&#x2013;2184.41</bold></td>
<td align="center" valign="top"><bold>&#x2013;6.6006</bold></td>
<td align="center" valign="top"><bold>0.3676</bold></td>
<td align="center" valign="top"><bold>&#x2013;0.2804</bold></td>
<td align="center" valign="top"><bold>0.0705</bold></td>
</tr>
<tr>
<td rowspan="3" align="left" valign="top">Ontario</td>
<td align="center" valign="top">Regional model</td>
<td align="center" valign="top">&#x2013;8939.61</td>
<td align="center" valign="top">&#x2013;7.0716</td>
<td align="center" valign="top">0.1634</td>
<td align="center" valign="top">&#x2013;0.0034</td>
<td align="center" valign="top">0.0780</td>
</tr>
<tr>
<td align="center" valign="top">Likelihood model (0.01)</td>
<td align="center" valign="top">&#x2013;8940.51</td>
<td align="center" valign="top">&#x2013;7.2456</td>
<td align="center" valign="top">0.1576</td>
<td align="center" valign="top">&#x2013;0.0054</td>
<td align="center" valign="top">0.0805</td>
</tr>
<tr>
<td align="center" valign="top"><bold>Likelihood model (0.005)</bold></td>
<td align="center" valign="top"><bold>&#x2013;8940.50</bold></td>
<td align="center" valign="top"><bold>&#x2013;7.2456</bold></td>
<td align="center" valign="top"><bold>0.1576</bold></td>
<td align="center" valign="top"><bold>&#x2013;0.0004</bold></td>
<td align="center" valign="top"><bold>0.0805</bold></td>
</tr>
<tr>
<td rowspan="3" align="left" valign="top">Maritimes</td>
<td align="center" valign="top">Regional model</td>
<td align="center" valign="top">&#x2013;1083.57</td>
<td align="center" valign="top">&#x2013;7.1802</td>
<td align="center" valign="top">0.4664</td>
<td align="center" valign="top">&#x2013;0.2726</td>
<td align="center" valign="top">0.0807</td>
</tr>
<tr>
<td align="center" valign="top">Likelihood model (0.01)</td>
<td align="center" valign="top">&#x2013;1083.57</td>
<td align="center" valign="top">&#x2013;7.1656</td>
<td align="center" valign="top">0.4676</td>
<td align="center" valign="top">&#x2013;0.2754</td>
<td align="center" valign="top">0.0805</td>
</tr>
<tr>
<td align="center" valign="top"><bold>Likelihood model (0.005)</bold></td>
<td align="center" valign="top"><bold>&#x2013;1083.57</bold></td>
<td align="center" valign="top"><bold>&#x2013;7.1706</bold></td>
<td align="center" valign="top"><bold>0.4676</bold></td>
<td align="center" valign="top"><bold>&#x2013;0.2704</bold></td>
<td align="center" valign="top"><bold>0.0805</bold></td>
</tr>
<tr>
<td colspan="2" align="left" valign="top"><bold>Global model</bold></td>
<td align="center" valign="top"><bold>&#x2013;15563.98</bold></td>
<td align="center" valign="top"><bold>&#x2013;7.0460</bold></td>
<td align="center" valign="top"><bold>0.2542</bold></td>
<td align="center" valign="top"><bold>&#x2013;0.0737</bold></td>
<td align="center" valign="top"><bold>0.0770</bold></td>
</tr>
<tr>
<td colspan="2" align="left" valign="top">Combined likelihood model (0.01)</td>
<td align="center" valign="top">&#x2013;15567.06</td>
<td align="center" valign="top">&#x2013;7.2856</td>
<td align="center" valign="top">0.2476</td>
<td align="center" valign="top">&#x2013;0.0754</td>
<td align="center" valign="top">0.0805</td>
</tr>
<tr>
<td colspan="2" align="left" valign="top"><bold>Combined likelihood model (0.005)</bold></td>
<td align="center" valign="top"><bold>&#x2013;15564.56</bold></td>
<td align="center" valign="top"><bold>&#x2013;6.9406</bold></td>
<td align="center" valign="top"><bold>0.2576</bold></td>
<td align="center" valign="top"><bold>&#x2013;0.0754</bold></td>
<td align="center" valign="top"><bold>0.0755</bold></td>
</tr>
</tbody>
</table>
</table-wrap>
<p>At the regional level, the estimates obtained using the federated likelihood approach closely approximated the estimates from the regional logistic regression models. Across all regions, the differences in maximum log-likelihood values between the regional and federated likelihood models (both the 0.01 and 0.005 vector increments) were negligible &#x2013; typically less than one unit &#x2013; indicating strong agreement in model fit. Similarly, coefficient estimates from the federated likelihood models remained consistent with those from the regional models, suggesting that the federated estimation procedure preserved site-specific inference.</p>
<p>For the primary exposure of interest (hypertension), the combined federated likelihood model using the 0.01 vector increment produced an estimated log-odds ratio of 0.2476, closely approximating the global estimate of 0.2542. Using a smaller increment of 0.005 further improved alignment with the global model: the log-odds ratio increased to 0.2576, and the maximum log-likelihood (&#x2013;15,564.56) approached that of the global model (&#x2013;15,563.98), outperforming the 0.01 model (&#x2013;15,567.06). These results suggest that tighter increments enhance estimation accuracy.</p>
<p>Additionally, adjustment for potential confounding was maintained across all modelling strategies. By ensuring that the same covariates (sex and age) were included across jurisdictional models, the federated likelihood method was able to control for key sources of confounding.</p>
<p>Overall, the likelihood method effectively estimated the true global model parameters without requiring individual-level data transfer. Although slight deviations were present, the magnitude of the discrepancies in effect size was small and did not impact the overall interpretation of the findings. These results highlight the utility of the federated likelihood approach for interjurisdictional health data analysis. <xref ref-type="table" rid="table-4">Table 4</xref> summarises all of the results previously discussed.</p>
<table-wrap id="table-4">
<label>Table 4</label><caption><title>Summary of the performance of the common effect meta-analysis and the federated likelihood method relative to the global model</title></caption>
<table frame="hsides" rules="groups">
<col width="20%"/>
<col width="20%"/>
<col width="20%"/>
<col width="20%"/>
<col width="20%"/>
<tbody>
<tr>
<td align="left" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle"></td>
<td align="center" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle"><bold>Intercept</bold></td>
<td align="center" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle"><bold>Hypertension</bold></td>
<td align="center" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle"><bold>Sex</bold></td>
<td align="center" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle"><bold>Age</bold></td>
</tr>
<tr>
<td align="left" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle"></td>
<td colspan="4" align="center" style="border-top: solid 1pt; border-bottom: solid 1pt;" valign="middle"><bold>Coefficients</bold></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Common effects (95% CI)</bold></td>
<td align="center" valign="top">&#x2013;7.0131 (&#x2013;7.2644, &#x2013;6.8124)</td>
<td align="center" valign="top">0.2646 (0.2043, 0.3248)</td>
<td align="center" valign="top">&#x2013;0.0721 (&#x2013;0.1294, &#x2013;0.0149)</td>
<td align="center" valign="top">0.0765 (0.0738, 0.0793)</td>
</tr>
<tr>
<td align="left" valign="top"><bold>Likelihood 0.01 (14.7% LI)</bold></td>
<td align="center" valign="top">&#x2013;7.2856 (&#x2013;7.3356, &#x2013;7.2356)</td>
<td align="center" valign="top">0.2476 (0.1876, 0.2976)</td>
<td align="center" valign="top">&#x2013;0.0754 (&#x2013;0.1254, &#x2013;0.0154)</td>
<td align="center" valign="top">0.0805 (0.0805, 0.0805)</td>
</tr>
<tr>
<td align="left" valign="top"><bold>Likelihood 0.005 (14.7% LI)</bold></td>
<td align="center" valign="top">&#x2013;6.9406 (&#x2013;6.9956, &#x2013;6.8906)</td>
<td align="center" valign="top">0.2576 (0.2026, 0.3126)</td>
<td align="center" valign="top">&#x2013;0.0754 (&#x2013;0.1304, &#x2013;0.0204)</td>
<td align="center" valign="top">0.0755 (0.0755, 0.0755)</td>
</tr>
<tr>
<td align="left" valign="top"><bold>Global model (95% CI)</bold></td>
<td align="center" valign="top"><bold>&#x2013;7.0460 (&#x2013;7.2444, &#x2013;6.8495)</bold></td>
<td align="center" valign="top"><bold>0.2542 (0.1955, 0.3130)</bold></td>
<td align="center" valign="top"><bold>&#x2013;0.0737 (&#x2013;0.1308, &#x2013;0.0166)</bold></td>
<td align="center" valign="top"><bold>0.0770 (0.0743, 0.0798)</bold></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>*LI=likelihood interval. A 14.7% likelihood interval includes a range of parameter values where the likelihood is at least 14.7% of the maximum likelihood. The 14.7% likelihood cutoff is chosen because it approximately corresponds to the likelihood values that define a 95% confidence interval under likelihood theory.</p>
</table-wrap-foot>
</table-wrap>
<p>The common effect meta-analysis estimates, along with the federated likelihood estimates, were compared to the global model to evaluate their accuracy. Relative percent absolute bias (RPAB), defined as the absolute difference between each pooled estimate and the global estimate divided by the global estimate, was calculated to quantify the bias in each method. <xref ref-type="fig" rid="fig-4">Figure 4</xref> summarises the RPAB for the coefficients for hypertension across the two approaches compared to the global model. Among all methods, the federated likelihood model with the 0.005 increment achieved the lowest RPAB (1.34%), indicating the closest approximation to the global model. The model with the 0.01 increment also performed well but showed slightly higher bias (2.60%). Both federated likelihood approaches outperformed the common effects meta-analysis, which produced moderately accurate estimates (4.09% RPAB).</p>
<fig id="fig-4">
<label>Figure 4</label>
<caption><title>Bar graph of the relative percent absolute bias comparing the coefficients for hypertension from the common effects meta-analysis, 0.01 federated likelihood, and 0.005 federated likelihood models to the global model</title></caption>
<graphic xlink:href="ijpds-06-3160-g004.tif"/>
</fig>
<p>Overall, the federated likelihood method produced estimates that were highly consistent with those obtained from the common effects meta-analysis approach. Notably, the federated likelihood method provided the most accurate estimate for the primary effect of interest, demonstrating a smaller RPAB compared to the common effect meta-analysis. The federated likelihood method therefore offers a robust alternative that maintains both accuracy and interpretability in federated data analysis.</p>
</sec>
</sec>
<sec>
<title>Discussion</title>
<p>This study introduced a real-world implementation of a federated likelihood approach to data analysis and compared this method to the existing meta-analysis framework. Overall, the federated likelihood method proved to be an effective strategy for analysing distributed data across regions where individual-level data sharing is restricted.</p>
<p>Both the common effect meta-analysis and the federated likelihood method produced similar results across parameters, suggesting that either approach is generally appropriate for federated health data analysis. However, the federated likelihood method with a smaller increment (0.005) consistently outperformed both the 0.01 increment model and the meta-analysis model. The federated likelihood approach with a 0.005 increment produced estimates with lower relative percent absolute bias, resulting in a closer approximation to the global model. These findings indicate that applying a tighter increment enhances the accuracy of pooled estimates. However, if the degree to which the accuracy increases does not meaningfully improve inference, then it may not justify the added computational cost of using a tighter grid. Specifically, for policy-critical decisions such as drug safety studies or resource allocation, the added accuracy from the federated likelihood method may justify the added computational cost. Whereas for exploratory analyses, meta-analysis may be sufficient.</p>
<p>Importantly, our analysis revealed significant heterogeneity in the effect of hypertension across regions, as evidenced by a high I<sup>2</sup> statistic. This observation raises important concerns regarding the assumption of a common effect, which underlies both the federated likelihood and common effect meta-analysis models. In contexts where effect heterogeneity is substantial, this assumption may be violated, leading to biased or oversimplified estimates. In such cases, alternative modelling strategies &#x2013; such as a random-effects meta-analysis or mixed-effect modelling &#x2013; may be more appropriate for capturing the between-region differences in effect size. Nevertheless, the goal of this study was to evaluate the performance of the federated likelihood method rather than to identify the optimal model for inference.</p>
<p>This study has several notable strengths. First, it introduces and evaluates a novel methodological approach for federated health data analysis. Second, the study employs a real-world case using data from the Canadian Primary Care Sentinel Surveillance Network, providing a practical demonstration of the method&#x2019;s utility in a clinically relevant context. This enhances the validity of the findings and underscores the method&#x2019;s potential for direct application in health research. Third, by systematically comparing the federated likelihood approach with the common effect meta-analysis, the study offers a comprehensive performance evaluation. This comparative design enables a clear assessment of the advantages and limitations of each method, which is an important contribution to the methodological literature. Finally, the study directly compares the performance of two federated likelihood vector increments (0.01 and 0.005), demonstrating that a smaller increment improves estimation accuracy. This evaluation of methodological specification provides practical insights into implementation choices that can significantly impact model performance in real-world federated settings.</p>
<p>A limitation of the likelihood approach is its computational intensity. Computing a single regional log-likelihood matrix required roughly 6.5 hours of run-time; across all five regions, this amounted to roughly 32.5 hours of run-time. In contrast, the meta-analysis for hypertension completed in just 0.4 seconds. The computational burden increases substantially as heterogeneity between regions grows, as sample size increases, as the increments of the parameter vectors become smaller, as the number of model parameters increases, and as the number of regions increases. Greater heterogeneity may result in wider confidence intervals across regions, expanding the range of possible parameter values and thus increasing the number of unique combinations to evaluate. Similarly, as sample size increases, we can assume variance decreases, meaning that the use of finer increments (e.g., 0.005 as in this study) would be needed in order to properly capture the variability. However, while the use of finer increments improves approximation accuracy, it also increases the number of calculations required exponentially (the 0.01 increment resulted in 4,438,496 evaluations compared to 61,151,895 evaluations using the 0.005 increment). The addition of more parameters or more regions compound this effect, making the process increasingly demanding. Researchers aiming to implement this method should ensure they have access to high-performance computing resources or cloud-based distributed systems capable of supporting large-scale parallel processing. Additionally, the development of more computationally efficient algorithms and approximations, such as adaptive grid search procedures or stochastic approximation techniques, can support the scaling of the method to larger datasets and more complex models.</p>
<p>Another important limitation of this work lies in the characteristics of the case study. The empirical evaluation was based on a single real-world dataset which does not fully capture the diverse contexts in which the method could be applied. Specifically, all participating regions had relatively large sample sizes and broadly similar distributions. As a result, heterogeneity in data quality or data distributions was not assessed in this study. Differences in variable distributions, such as shifts in age ranges or outcome prevalence should be explored. Moreover, future research should explore applications involving real-world datasets with rare outcomes, where the method&#x2019;s performance may be especially informative. We also acknowledge that the scope of comparisons was restricted to meta-analysis, and broader evaluations including comparisons to other federated approaches could provide deeper insights into relative performance. Addressing these limitations in future studies will help strengthen the evidence for the method&#x2019;s broader utility and robustness.</p>
<p>Future directions for this work include further validation of the likelihood method across a wider range of model types and complexities. Extending the framework to other model types requires only replacing the binomial log-likelihood function with the model-specific likelihood function associated with the outcome distribution. All other components of the federated estimation procedure carry over directly. Testing should extend to alternative model structures such as linear regression, ordinal logistic regression, Poisson regression, and non-linear models. Moreover, the method should be evaluated in settings with increased model complexity, including the incorporation of interaction terms, to fully define the scope and limitations of the approach. In addition, future work should focus on extending the federated likelihood framework to support random-effects or mixed-effects models. This would allow for more flexible modelling of between-site variability, particularly in contexts where the common effect assumption is violated and would enhance the method&#x2019;s applicability to heterogeneous real-world data.</p>
</sec>
<sec>
<title>Conclusion</title>
<p>The likelihood-based method shows strong potential as an innovative approach for federated health data analysis, particularly in environments where stringent privacy regulations restrict the sharing of individual-level information. By enabling the aggregation of statistical evidence across jurisdictions without the need to transfer sensitive data, this method addresses a major obstacle to large-scale, multi-jurisdictional health research. Further investigation is required to fully evaluate its versatility and limitations. Future research should assess its performance across a broader array of model types and complexity. These extensions will be critical for defining the boundaries of the method&#x2019;s utility, ensuring its operational feasibility, and understanding its robustness in more realistic and heterogeneous data environments.</p>
</sec>
</body>
<back>
<ack>
<title>Acknowledgements</title>
<p>This research was funded by the Canadian Institute for Health Research (CIHR).</p>
</ack>
<sec>
<title>Ethics statement</title>
<p>This study was approved by the Conjoint Health Research Ethics Board (CHREB).</p>
</sec>
<sec>
<title>Data availability statement</title>
<p>Data is available upon request from the Canadian Primary Care Sentinel Surveillance Network (CPCSSN).</p>
</sec>
<sec>
<title>AI disclosure statement</title>
<p>The authors used ChatGPT, 2026, OpenAI for grammar and language editing. The output was reviewed and verified by the authors.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="ref-1"><label>1</label><mixed-citation publication-type="journal"><string-name><surname>van Panhuis</surname> <given-names>WG</given-names></string-name>, <string-name><surname>Paul</surname> <given-names>P</given-names></string-name>, <string-name><surname>Emerson</surname> <given-names>C</given-names></string-name>, <string-name><surname>Grefenstette</surname> <given-names>J</given-names></string-name>, <string-name><surname>Wilder</surname> <given-names>R</given-names></string-name>, <string-name><surname>Herbst</surname> <given-names>AJ</given-names></string-name>, <etal>et al</etal>. <article-title>A systematic review of barriers to data sharing in public health</article-title>. <source>BMC Public Health</source>. <year>2014</year> <month>Nov</month> <day>5</day>;<volume>14</volume>(<issue>1</issue>):<fpage>1144</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2458-14-1144</pub-id></mixed-citation></ref>
<ref id="ref-2"><label>2</label><mixed-citation publication-type="website"><article-title>Canada PHA of. Expert Advisory Group Report 3: Toward a world-class health data system</article-title> [Internet]. <year>2022</year> [cited 2025 June 24]. Available from: <uri>https://www.canada.ca/en/public-health/corporate/mandate/about-agency/external-advisory-bodies/list/pan-canadian-health-data-strategy-reports-summaries/expert-advisory-group-report-03-toward-world-class-health-data-system.html</uri>.</mixed-citation></ref>
<ref id="ref-3"><label>3</label><mixed-citation publication-type="journal"><string-name><surname>Dodd</surname> <given-names>C</given-names></string-name>, <string-name><surname>Andrews</surname> <given-names>N</given-names></string-name>, <string-name><surname>Petousis-Harris</surname> <given-names>H</given-names></string-name>, <string-name><surname>Sturkenboom</surname> <given-names>M</given-names></string-name>, <string-name><surname>Omer</surname> <given-names>SB</given-names></string-name>, <string-name><surname>Black</surname> <given-names>S</given-names></string-name>. <article-title>Methodological frontiers in vaccine safety: qualifying available evidence for rare events, use of distributed data networks to monitor vaccine safety issues, and monitoring the safety of pregnancy interventions</article-title>. <source>BMJ Glob Health</source>. <year>2021</year> <month>May</month> <day>19</day>;<volume>6</volume>(<issue>Suppl 2</issue>):<fpage>e003540</fpage>. <pub-id pub-id-type="doi">10.1136/bmjgh-2020-003540</pub-id></mixed-citation></ref>
<ref id="ref-4"><label>4</label><mixed-citation publication-type="journal"><string-name><surname>Butler</surname> <given-names>AL</given-names></string-name>, <string-name><surname>Smith</surname> <given-names>M</given-names></string-name>, <string-name><surname>Jones</surname> <given-names>W</given-names></string-name>, <string-name><surname>Adair</surname> <given-names>CE</given-names></string-name>, <string-name><surname>Vigod</surname> <given-names>SN</given-names></string-name>, <string-name><surname>Lesage</surname> <given-names>A</given-names></string-name>, <etal>et al</etal>. <article-title>Multi-province epidemiological research using linked administrative data: a case study from Canada</article-title>. <source>Int J Popul Data Sci</source>. <volume>3</volume>(<issue>3</issue>):<fpage>443</fpage>. <pub-id pub-id-type="doi">10.23889/ijpds.v3i3.443</pub-id></mixed-citation></ref>
<ref id="ref-5"><label>5</label><mixed-citation publication-type="journal"><string-name><surname>Lix</surname> <given-names>LM</given-names></string-name>, <string-name><surname>Ayles</surname> <given-names>J</given-names></string-name>, <string-name><surname>Bartholomew</surname> <given-names>S</given-names></string-name>, <string-name><surname>Cooke</surname> <given-names>CA</given-names></string-name>, <string-name><surname>Ellison</surname> <given-names>J</given-names></string-name>, <string-name><surname>Emond</surname> <given-names>V</given-names></string-name>, <etal>et al</etal>. <article-title>The Canadian Chronic Disease Surveillance System: A model for collaborative surveillance</article-title>. <source>Int J Popul Data Sci</source>. <year>2018</year> <month>Oct</month> <day>5</day>;<volume>3</volume>(<issue>3</issue>):<fpage>433</fpage>. <pub-id pub-id-type="doi">10.23889/ijpds.v3i3.433</pub-id></mixed-citation></ref>
<ref id="ref-6"><label>6</label><mixed-citation publication-type="journal"><string-name><surname>Casaletto</surname> <given-names>J</given-names></string-name>, <string-name><surname>Bernier</surname> <given-names>A</given-names></string-name>, <string-name><surname>McDougall</surname> <given-names>R</given-names></string-name>, <string-name><surname>Cline</surname> <given-names>MS</given-names></string-name>. <article-title>Federated Analysis for Privacy-Preserving Data Sharing: A Technical and Legal Primer</article-title>. <source>Annu Rev Genomics Hum Genet</source>. <year>2023</year> <month>Aug</month> <day>25</day>;<volume>24</volume>(<issue>Volume 24, 2023</issue>):<fpage>347</fpage>&#x2013;<lpage>68</lpage>. <pub-id pub-id-type="doi">10.1146/annurev-genom-110122-084756</pub-id></mixed-citation></ref>
<ref id="ref-7"><label>7</label><mixed-citation publication-type="website"><article-title>Health Information Act | Alberta.ca [Internet]</article-title>. <year>2025</year> [cited <year>2025</year> <month>June</month> <day>4</day>]. Available from: <uri>https://www.alberta.ca/health-information-act</uri>.</mixed-citation></ref>
<ref id="ref-8"><label>8</label><mixed-citation publication-type="website"><article-title>Personal Health Information Protection Act</article-title>, <year>2004</year>, S.O. 2004, c. 3, Sched. A | ontario.ca [Internet]. [cited <year>2025</year> <month>June</month> <day>4</day>]. Available from: <uri>https://www.ontario.ca/laws/statute/04p03</uri>.</mixed-citation></ref>
<ref id="ref-9"><label>9</label><mixed-citation publication-type="website"><string-name><surname>Chassang</surname> <given-names>G</given-names></string-name>. <article-title>The impact of the EU general data protection regulation on scientific research</article-title>. <source>ecancermedicalscience</source> [Internet]. <year>2017</year> <month>Jan</month> <day>3</day> [cited <year>2024</year> <month>Oct</month> <day>17</day>];<volume>11</volume>. Available from: <uri>http://www.ecancer.org/journal/11/full/709-the-impact-of-the-eu-general-data-protection-regulation-on-scientific-research.php</uri>.</mixed-citation></ref>
<ref id="ref-10"><label>10</label><mixed-citation publication-type="journal"><string-name><surname>Kulynych</surname> <given-names>J</given-names></string-name>, <string-name><surname>Korn</surname> <given-names>D</given-names></string-name>. <article-title>The New HIPAA (Health Insurance Portability and Accountability Act of 1996) Medical Privacy Rule: Help or Hindrance for Clinical Research?</article-title> <issue>oCirculation</issue>. <year>2003</year> <month>Aug</month> <day>26</day>;<volume>108</volume>(<issue>8</issue>):<fpage>912</fpage>&#x2013;<lpage>4</lpage>. <pub-id pub-id-type="doi">10.1161/01.CIR.0000080642.35380.50</pub-id></mixed-citation></ref>
<ref id="ref-11"><label>11</label><mixed-citation publication-type="journal"><string-name><surname>Lemyre</surname> <given-names>FC</given-names></string-name>, <string-name><surname>L&#x00E9;vesque</surname> <given-names>S</given-names></string-name>, <string-name><surname>Domingue</surname> <given-names>MP</given-names></string-name>, <string-name><surname>Herrmann</surname> <given-names>K</given-names></string-name>, <string-name><surname>Ethier</surname> <given-names>JF</given-names></string-name>. <article-title>Distributed Statistical Analyses: A Scoping Review and Examples of Operational Frameworks Adapted to Health Analytics</article-title>. <source>JMIR Med Inform</source>. <year>2024</year> <month>Nov</month> <day>14</day>;<volume>12</volume>(<issue>1</issue>):<fpage>e53622</fpage>. <pub-id pub-id-type="doi">10.2196/53622</pub-id></mixed-citation></ref>
<ref id="ref-12"><label>12</label><mixed-citation publication-type="journal"><string-name><surname>Khellaf</surname> <given-names>R</given-names></string-name>, <string-name><surname>Bellet</surname> <given-names>A</given-names></string-name>, <string-name><surname>Josse</surname> <given-names>J</given-names></string-name>. <article-title>Federated causal inference: Multi-study ate estimation beyond meta-analysis</article-title>. <source>Prepr At</source>. <year>2025</year>;<fpage>10</fpage>.</mixed-citation></ref>
<ref id="ref-13"><label>13</label><mixed-citation publication-type="journal"><string-name><surname>Rieke</surname> <given-names>N</given-names></string-name>, <string-name><surname>Hancox</surname> <given-names>J</given-names></string-name>, <string-name><surname>Li</surname> <given-names>W</given-names></string-name>, <string-name><surname>Milletar&#x00EC;</surname> <given-names>F</given-names></string-name>, <string-name><surname>Roth</surname> <given-names>HR</given-names></string-name>, <string-name><surname>Albarqouni</surname> <given-names>S</given-names></string-name>, <etal>et al</etal>. <article-title>The future of digital health with federated learning</article-title>. <source>Npj Digit Med</source>. <year>2020</year> <month>Sept</month> <day>14</day>;<volume>3</volume>(<issue>1</issue>):<fpage>119</fpage>. <pub-id pub-id-type="doi">10.1038/s41746-020-00323-1</pub-id></mixed-citation></ref>
<ref id="ref-14"><label>14</label><mixed-citation publication-type="book"><string-name><surname>Sheller</surname> <given-names>MJ</given-names></string-name>, <string-name><surname>Reina</surname> <given-names>GA</given-names></string-name>, <string-name><surname>Edwards</surname> <given-names>B</given-names></string-name>, <string-name><surname>Martin</surname> <given-names>J</given-names></string-name>, <string-name><surname>Bakas</surname> <given-names>S</given-names></string-name>. <chapter-title>Multi-institutional Deep Learning Modeling Without Sharing Patient Data: A Feasibility Study on Brain Tumor Segmentation</chapter-title>. In: <string-name><surname>Crimi</surname> <given-names>A</given-names></string-name>, <string-name><surname>Bakas</surname> <given-names>S</given-names></string-name>, <string-name><surname>Kuijf</surname> <given-names>H</given-names></string-name>, <string-name><surname>Keyvan</surname> <given-names>F</given-names></string-name>, <string-name><surname>Reyes</surname> <given-names>M</given-names></string-name>, <string-name><surname>van Walsum</surname> <given-names>T</given-names></string-name>, editors. <source>Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries</source>. <publisher-loc>Cham</publisher-loc>: <publisher-name>Springer International Publishing</publisher-name>; <year>2019</year>. p. <fpage>92</fpage>&#x2013;<lpage>104</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-030-11723-8_9</pub-id></mixed-citation></ref>
<ref id="ref-15"><label>15</label><mixed-citation publication-type="journal"><string-name><surname>Masciocchi</surname> <given-names>C</given-names></string-name>, <string-name><surname>Gottardelli</surname> <given-names>B</given-names></string-name>, <string-name><surname>Savino</surname> <given-names>M</given-names></string-name>, <string-name><surname>Boldrini</surname> <given-names>L</given-names></string-name>, <string-name><surname>Martino</surname> <given-names>A</given-names></string-name>, <string-name><surname>Mazzarella</surname> <given-names>C</given-names></string-name>, <etal>et al</etal>. <article-title>Federated Cox Proportional Hazards Model with multicentric privacy-preserving LASSO feature selection for survival analysis from the perspective of personalized medicine</article-title>. In: <year>2022</year> <source>IEEE 35th International Symposium on Computer-Based Medical Systems (CBMS) [Internet]</source>. <year>2022</year> [cited 2025 Aug 19]. p. <fpage>25</fpage>&#x2013;<lpage>31</lpage>. Available from: <uri>https://ieeexplore.ieee.org/abstract/document/9867090</uri>.</mixed-citation></ref>
<ref id="ref-16"><label>16</label><mixed-citation publication-type="journal"><string-name><surname>Li</surname> <given-names>W</given-names></string-name>, <string-name><surname>Tong</surname> <given-names>J</given-names></string-name>, <string-name><surname>Anjum</surname> <given-names>MdM</given-names></string-name>, <string-name><surname>Mohammed</surname> <given-names>N</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Jiang</surname> <given-names>X</given-names></string-name>. <article-title>Federated learning algorithms for generalized mixed-effects model (GLMM) on horizontally partitioned data from distributed sources</article-title>. <source>BMC Med Inform Decis Mak</source>. <year>2022</year> <month>Oct</month> <day>16</day>;<volume>22</volume>(<issue>1</issue>):<fpage>269</fpage>. <pub-id pub-id-type="doi">10.1186/s12911-022-02014-1</pub-id></mixed-citation></ref>
<ref id="ref-17"><label>17</label><mixed-citation publication-type="journal"><string-name><surname>Cellamare</surname> <given-names>M</given-names></string-name>, <string-name><surname>van Gestel</surname> <given-names>AJ</given-names></string-name>, <string-name><surname>Alradhi</surname> <given-names>H</given-names></string-name>, <string-name><surname>Martin</surname> <given-names>F</given-names></string-name>, <string-name><surname>Moncada-Torres</surname> <given-names>A</given-names></string-name>. <article-title>A Federated Generalized Linear Model for Privacy-Preserving Analysis</article-title>. <source>Algorithms</source>. <year>2022</year> <month>July</month>;<volume>15</volume>(<day>7</day>):<fpage>243</fpage>. <pub-id pub-id-type="doi">10.3390/a15070243</pub-id></mixed-citation></ref>
<ref id="ref-18"><label>18</label><mixed-citation publication-type="journal"><string-name><surname>Westers</surname> <given-names>F</given-names></string-name>, <string-name><surname>Leder</surname> <given-names>S</given-names></string-name>, <string-name><surname>Tealdi</surname> <given-names>L</given-names></string-name>. <article-title>Horizontal federated learning and assessment of Cox models</article-title>. <source>Front Digit Health</source> [Internet]. <year>2025</year> <month>June</month> <day>12</day> [cited <year>2025</year> <month>Aug</month> <day>19</day>];<volume>7</volume>. Available from: <uri>https://www.frontiersin.org/journals/digital-health/articles/10.3389/fdgth.2025.1603630/full</uri>.</mixed-citation></ref>
<ref id="ref-19"><label>19</label><mixed-citation publication-type="journal"><string-name><surname>Bj&#x00F8;rnstad</surname> <given-names>JF</given-names></string-name>. <article-title>On the Generalization of the Likelihood Function and the Likelihood Principle</article-title>. <source>J Am Stat Assoc</source>. <year>1996</year> <month>June</month>;<volume>91</volume>(<issue>434</issue>):<fpage>791</fpage>&#x2013;<lpage>806</lpage>. <pub-id pub-id-type="doi">10.1080/01621459.1996.10476947</pub-id></mixed-citation></ref>
<ref id="ref-20"><label>20</label><mixed-citation publication-type="journal"><string-name><surname>Weldegiorgis</surname> <given-names>M</given-names></string-name>, <string-name><surname>Woodward</surname> <given-names>M</given-names></string-name>. <article-title>The impact of hypertension on chronic kidney disease and end-stage renal disease is greater in men than women: a systematic review and meta-analysis</article-title>. <source>BMC Nephrol</source>. <year>2020</year> <month>Nov</month> <day>25</day>;<volume>21</volume>(<issue>1</issue>):<fpage>506</fpage>. <pub-id pub-id-type="doi">10.1186/s12882-020-02151-7</pub-id></mixed-citation></ref>
<ref id="ref-21"><label>21</label><mixed-citation publication-type="journal"><string-name><surname>Garies</surname> <given-names>S</given-names></string-name>, <string-name><surname>Birtwhistle</surname> <given-names>R</given-names></string-name>, <string-name><surname>Drummond</surname> <given-names>N</given-names></string-name>, <string-name><surname>Queenan</surname> <given-names>J</given-names></string-name>, <string-name><surname>Williamson</surname> <given-names>T</given-names></string-name>. <article-title>Data Resource Profile: National electronic medical record data from the Canadian Primary Care Sentinel Surveillance Network (CPCSSN)</article-title>. <source>Int J Epidemiol</source>. <year>2017</year> <month>Aug</month> <day>1</day>;<volume>46</volume>(<issue>4</issue>):<fpage>1091</fpage>&#x2013;<lpage>1092f</lpage>. <pub-id pub-id-type="doi">10.1093/ije/dyw248</pub-id></mixed-citation></ref>
<ref id="ref-22"><label>22</label><mixed-citation publication-type="journal"><string-name><surname>Duan</surname> <given-names>R</given-names></string-name>, <string-name><surname>Boland</surname> <given-names>MR</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Chang</surname> <given-names>HH</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>H</given-names></string-name>, <etal>et al</etal>. <article-title>Learning from electronic health records across multiple sites: A communication-efficient and privacy-preserving distributed algorithm</article-title>. <source>J Am Med Inform Assoc</source>. <year>2020</year> <month>Mar</month> <day>1</day>;<volume>27</volume>(<issue>3</issue>):<fpage>376</fpage>&#x2013;<lpage>85</lpage>. <pub-id pub-id-type="doi">10.1093/jamia/ocz199</pub-id></mixed-citation></ref>
<ref id="ref-23"><label>23</label><mixed-citation publication-type="journal"><string-name><surname>Williamson</surname> <given-names>T</given-names></string-name>, <string-name><surname>Green</surname> <given-names>ME</given-names></string-name>, <string-name><surname>Birtwhistle</surname> <given-names>R</given-names></string-name>, <string-name><surname>Khan</surname> <given-names>S</given-names></string-name>, <string-name><surname>Garies</surname> <given-names>S</given-names></string-name>, <string-name><surname>Wong</surname> <given-names>ST</given-names></string-name>, <etal>et al</etal>. <article-title>Validating the 8 CPCSSN Case Definitions for Chronic Disease Surveillance in a Primary Care Database of Electronic Health Records</article-title>. <source>Ann Fam Med</source>. <year>2014</year> <month>July</month> <day>1</day>;<volume>12</volume>(<issue>4</issue>):<fpage>367</fpage>&#x2013;<lpage>72</lpage>. <pub-id pub-id-type="doi">10.1370/afm.1644</pub-id></mixed-citation></ref>
<ref id="ref-24"><label>24</label><mixed-citation publication-type="journal"><string-name><surname>Lethebe</surname> <given-names>BC</given-names></string-name>, <string-name><surname>Williamson</surname> <given-names>T</given-names></string-name>, <string-name><surname>Garies</surname> <given-names>S</given-names></string-name>, <string-name><surname>McBrien</surname> <given-names>K</given-names></string-name>, <string-name><surname>Leduc</surname> <given-names>C</given-names></string-name>, <string-name><surname>Butalia</surname> <given-names>S</given-names></string-name>, <etal>et al</etal>. <article-title>Developing a case definition for type 1 diabetes mellitus in a primary care electronic medical record database: an exploratory study</article-title>. <source>CMAJ Open</source>. <year>2019</year> <month>Apr</month>;<day>7</day>(<issue>2</issue>):<fpage>E246</fpage>&#x2013;<lpage>51</lpage>. <pub-id pub-id-type="doi">10.9778/cmajo.20180142</pub-id></mixed-citation></ref>
<ref id="ref-25"><label>25</label><mixed-citation publication-type="journal"><string-name><surname>Liu</surname> <given-names>P</given-names></string-name>, <string-name><surname>Quinn</surname> <given-names>RR</given-names></string-name>, <string-name><surname>Lam</surname> <given-names>NN</given-names></string-name>, <string-name><surname>Elliott</surname> <given-names>MJ</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>Y</given-names></string-name>, <string-name><surname>James</surname> <given-names>MT</given-names></string-name>, <etal>et al</etal>. <article-title>Accounting for Age in the Definition of Chronic Kidney Disease</article-title>. <source>JAMA Intern Med</source>. <year>2021</year> <month>Oct</month> <day>1</day>;<volume>181</volume>(<issue>10</issue>):<fpage>1359</fpage>. <pub-id pub-id-type="doi">10.1001/jamainternmed.2021.4813</pub-id></mixed-citation></ref>
<ref id="ref-26"><label>26</label><mixed-citation publication-type="journal"><string-name><surname>Harrer</surname> <given-names>M</given-names></string-name>, <string-name><surname>Cuijpers</surname> <given-names>P</given-names></string-name>, <string-name><surname>Furukawa</surname> <given-names>TA</given-names></string-name>, <string-name><surname>Ebert</surname> <given-names>DD</given-names></string-name>. <article-title>Chapter 4 Pooling Effect Sizes | Doing Meta-Analysis in R</article-title> [Internet]. [cited <year>2025</year> <month>Apr</month> <day>28</day>]. Available from: <uri>https://bookdown.org/MathiasHarrer/Doing_Meta_Analysis_in_R/pooling-es.html</uri>.</mixed-citation></ref>
<ref id="ref-27"><label>27</label><mixed-citation publication-type="book"><string-name><surname>Kalbfleisch</surname> <given-names>JG</given-names></string-name>. <chapter-title>Probability and statistical inference. 2: Statistical inference</chapter-title>. In: <edition>2</edition>. ed. <publisher-loc>New York</publisher-loc>: <publisher-name>Springer</publisher-name>; <year>1985</year>.</mixed-citation></ref>
</ref-list>
<glossary>
<title>Abbreviations</title>
<array>
<tbody>
<tr>
<td>CKD:</td>
<td>chronic kidney disease</td>
</tr>
<tr>
<td>CPCSSN:</td>
<td>Canadian Primary Care Sentinel Surveillance Network</td>
</tr>
<tr>
<td>RPAB:</td>
<td>relative percent absolute bias</td>
</tr>
</tbody>
</array>
</glossary>
</back>
</article>
