{"id":436754,"date":"2025-09-19T19:03:22","date_gmt":"2025-09-19T19:03:22","guid":{"rendered":"https:\/\/www.europesays.com\/uk\/436754\/"},"modified":"2025-09-19T19:03:22","modified_gmt":"2025-09-19T19:03:22","slug":"area-level-socioeconomic-variables-associated-with-territorial-disparities-in-tuberculosis-notification-rates-in-metropolitan-france-a-bayesian-ecological-analysis-infectious-diseases-of-poverty","status":"publish","type":"post","link":"https:\/\/www.europesays.com\/uk\/436754\/","title":{"rendered":"Area-level socioeconomic variables associated with territorial disparities in tuberculosis notification rates in metropolitan France: a Bayesian ecological analysis | Infectious Diseases of Poverty"},"content":{"rendered":"<p>Design, period and study area<\/p>\n<p>We conducted a retrospective ecological study in metropolitan France for the period of 2008\u20132019. The data were aggregated into two six-year periods and at a quasi-ZIP code geographic level. More precisely, we chose a specific geographic division (called the PMSI21 code), which corresponds to existing ZIP codes with \u2265\u00a01000 inhabitants in 2021 and to aggregations of neighboring ZIP codes with fewer than 1000 inhabitants [<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 14\" title=\"Geographical location of health insurance beneficiaries. &#010;                  https:\/\/documentation-snds.health-data-hub.fr\/snds\/fiches\/localisation_geographique_beneficiaires.html#remarques-preliminaires&#010;                  &#010;                . Accessed 08 Jul 2025.\" href=\"http:\/\/idpjournal.biomedcentral.com\/articles\/10.1186\/s40249-025-01354-0#ref-CR14\" id=\"ref-link-section-d96626534e754\" target=\"_blank\" rel=\"noopener\">14<\/a>]. For clarity and consistency, this territorial division is hereafter designated the \u201cZIP code\u201d.<\/p>\n<p>Data collectionTB cases<\/p>\n<p>TB cases reported in metropolitan France from 2008 to 2019 were extracted from the national notifiable disease surveillance system for analysis. All forms of TB, including both pulmonary and extrapulmonary cases, were retained in the analysis. The case data included the ZIP code of the person\u2019s usual place of residence, age, sex, countries of birth of both the individual and their parents, and whether the person lived in communal housing (yes\/no). If the person lived in communal housing, the type of communal housing was specified as prison, residence for elderly people, collective shelter, or other communal housing. A single, simplified \u2018housing type\u2019 variable was constructed from these two variables: it was categorized as\u00a0\u2018individual housing\u2019\u00a0when cases indicated not living in communal housing, \u2018prison\u2019 when the type of communal housing was \u2018prison\u2019, and \u2018other communal housing\u2019 otherwise. Cases with a missing or incorrect ZIP code were excluded from the analysis.<\/p>\n<p>We classified cases as \u2018immigrant\u2019 or \u2018native French\u2019, adhering as closely as possible to the definition of immigrants by the French National Institute of Statistics and Economic Studies (INSEE): \u201cAn immigrant is a person born as a foreigner abroad and residing in France\u201d [<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 15\" title=\"Immigrant\u2014definition. &#010;                  https:\/\/www.insee.fr\/fr\/metadonnees\/definition\/c1328&#010;                  &#010;                . Accessed 10 Feb 2025.\" href=\"http:\/\/idpjournal.biomedcentral.com\/articles\/10.1186\/s40249-025-01354-0#ref-CR15\" id=\"ref-link-section-d96626534e772\" target=\"_blank\" rel=\"noopener\">15<\/a>]. Given that we only had data on the country of birth, individuals were classified as \u2018immigrants\u2019 if they were born outside of France and either both parents were foreign-born or data on parental countries of birth were unavailable. All other cases were classified as \u2018native French\u2019.<\/p>\n<p>Socioeconomic area-level data<\/p>\n<p>We examined the associations at the ZIP code level between TB notification rates and multiple socioeconomic variables provided by INSEE. These variables were selected based on documented risk factors for TB in low-endemic countries from the literature. Initially, we analyzed the following variables separately: median household income per consumption unit, the proportion of high school graduates in the unschooled population aged 15\u00a0years and older, the proportion of manual workers in the active population aged 15\u201364\u00a0years, and the unemployment rate among the active population aged 15\u201364 years. We then estimated the associations between TB notification rates and a social deprivation indicator that combines these four variables, known as the French Deprivation Index (FDep) [<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 16\" title=\"Rey G, Jougla E, Fouillet A, Hemon D. Ecological association between a deprivation index and mortality in France over the period 1997\u20132001: variations with spatial scale, degree of urbanicity, age, gender and cause of death. BMC Public Health. 2009;9:33.\" href=\"http:\/\/idpjournal.biomedcentral.com\/articles\/10.1186\/s40249-025-01354-0#ref-CR16\" id=\"ref-link-section-d96626534e784\" target=\"_blank\" rel=\"noopener\">16<\/a>]. Additionally, we measured the associations between TB notification rates and the proportion of overcrowded households (see definition in the Supplementary Information) [<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 17\" title=\"Household crowding index calculation method. &#010;                  https:\/\/www.insee.fr\/fr\/metadonnees\/definition\/c1236&#010;                  &#010;                . Accessed 2023-09-12.\" href=\"http:\/\/idpjournal.biomedcentral.com\/articles\/10.1186\/s40249-025-01354-0#ref-CR17\" id=\"ref-link-section-d96626534e787\" target=\"_blank\" rel=\"noopener\">17<\/a>], as well as the population density level (Low, Medium, or High) [<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 18\" title=\"Municipality population density level 2010-2017 based on the 2010 census. &#010;                  https:\/\/www.insee.fr\/fr\/information\/2114627&#010;                  &#010;                . Accessed 2023-09-25.\" href=\"http:\/\/idpjournal.biomedcentral.com\/articles\/10.1186\/s40249-025-01354-0#ref-CR18\" id=\"ref-link-section-d96626534e790\" target=\"_blank\" rel=\"noopener\">18<\/a>].<\/p>\n<p>For the first period, continuous variables were derived from municipal or sub-municipal datasets from the 2009 or 2010 censuses, whereas for the second period, they were obtained from the 2015 and 2016 censuses (see Supplementary Table 1). We primarily utilized population-weighted means to calculate ZIP code-level values for most continuous variables. However, for the median household income per consumption unit, we employed population-weighted medians. The population density level of municipalities was consistent across both periods, as data was solely available from the 2010 census. To establish ZIP code population density levels, we computed the cumulative number of inhabitants in 2010 for each density level within the municipalities corresponding to the same ZIP code [<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 19\" title=\"Categorization of municipality population density in 4 levels: methodology. &#010;                  https:\/\/www.insee.fr\/fr\/statistiques\/fichier\/2114627\/methode-constitution.pdf&#010;                  &#010;                . Accessed 2023-09-25.\" href=\"http:\/\/idpjournal.biomedcentral.com\/articles\/10.1186\/s40249-025-01354-0#ref-CR19\" id=\"ref-link-section-d96626534e796\" target=\"_blank\" rel=\"noopener\">19<\/a>].<\/p>\n<p>Statistical analysisStandardization<\/p>\n<p>We calculated the notified TB case counts in metropolitan France across all strata defined by each of the two studied periods (2008\u20132013 and 2014\u20132019), age group (0\u201314, 15\u201324, 25\u201344, 45\u201364, and 65\u00a0years or older), sex (male, female), immigration status (immigrant, native), and housing category (individual housing, prison, other communal housing). Missing values were present in the last four variables, and were imputed using the k-nearest neighbors (kNN) imputation method implemented in the R package VIM [<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 20\" title=\"Kowarik A, Templ M. Imputation with the R package VIM. J Stat Softw. 2016;74(7):1\u201316.\" href=\"http:\/\/idpjournal.biomedcentral.com\/articles\/10.1186\/s40249-025-01354-0#ref-CR20\" id=\"ref-link-section-d96626534e811\" target=\"_blank\" rel=\"noopener\">20<\/a>]; see the supplementary file for more details. The reference notification rate was calculated in each stratum by dividing the number of notified TB cases by the corresponding population in metropolitan France, derived from the 2010 and 2016 censuses (ad hoc files provided by INSEE).<\/p>\n<p>Finally, the reference notification rates were multiplied by the population of each stratum in each ZIP code (using 2010 and 2016 as reference years for the periods 2008\u20132013 and 2014\u20132019, respectively) and summed to obtain the expected number of TB cases by ZIP code and period.<\/p>\n<p>Modeling<\/p>\n<p>The number of TB cases notified in ZIP code i in period t, \\({n}_{i,t}\\), was assumed to follow a Poisson distribution of mean \\({E}_{i,t}{\\lambda }_{i}\\), with \\({E}_{i,t}\\) being the expected number of TB cases notified in ZIP code i and period t and \\({\\lambda }_{i,t}\\) a relative risk known as the standardized notification rate (SNR).<\/p>\n<p>$${n}_{i,t}\\sim Poisson\\left({{\\lambda }_{i,t}E}_{i,t}\\right)$$<\/p>\n<p>We modeled \\({\\lambda }_{i,t}\\) via \\({\\alpha }_{0}\\), a constant, \\({b}_{i}\\), a spatial random effect, and explanatory variables. The median household income per consumption unit, the proportion of high school graduates in the\u00a0unschooled population aged \u2265 15\u00a0years, the proportion of manual workers in the active population aged 15\u201364\u00a0years, the proportion of unemployment rate among the active population aged 15\u201364\u00a0years,\u00a0and the proportion of overcrowded households were log-transformed to reduce distribution skewness. All the continuous explanatory variables were then standardized by period and entered into the model with smoothed functions:<\/p>\n<p>$$\\textrm{log}\\left( {\\lambda_{i,t} } \\right)\\; = \\,\\alpha_{0} + b_{i} + f_{k} \\left( {X_{i,t}^{k} } \\right),$$<\/p>\n<p>Where \\({X}_{i,t}^{k}\\) is the standardized measure of the \\({k}^{th}\\) explanatory variable in ZIP code i and period t and where \\({f}_{k}\\) is an order 2 random walk function. We opted to standardize the explanatory variables by period due to our primary interest in their spatial contrasts within each period. Additionally, changes in the data collection method occurred between 2010 and 2016, particularly concerning the median household income per consumption unit.<\/p>\n<p>We modeled the effect of the population density level with dummy variables: <\/p>\n<p>$$\\textrm{log}\\left( {\\lambda_{i,t} } \\right)\\; = \\,\\alpha_{0} + b_{i} + \\alpha_{1} I_{i}^{Medium} + \\alpha_{2} I_{i}^{High} ,$$<\/p>\n<p>Where \\({I}_{i}^{Medium}\\) (respectively\\(, {I}_{i}^{High}\\)) was equal to 1 if the population density level was \u2018Medium\u2019 (respectively, \u2018High\u2019) in ZIP code i, 0 otherwise.<\/p>\n<p>The spatial random effect, \\({b}_{i}\\), was employed to model spatial autocorrelation and residual variations in the standardized TB notification rates. These residual variations could be linked, for example, to territorial disparities in the completeness of the reporting system. \\({b}_{i}\\) was assigned a BYM2 distribution [<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 21\" title=\"Riebler A, S\u00f8rbye SH, Simpson D, Rue H. An intuitive Bayesian spatial model for disease mapping that accounts for scaling. Stat Med Res. 2016;25(4):1145\u201365.\" href=\"http:\/\/idpjournal.biomedcentral.com\/articles\/10.1186\/s40249-025-01354-0#ref-CR21\" id=\"ref-link-section-d96626534e1535\" target=\"_blank\" rel=\"noopener\">21<\/a>], which includes a spatially structured component, \\({u}_{i}\\), following a standardized intrinsic conditional autoregressive model, and a random component, \\({v}_{i}\\), following a standard normal distribution: \\({b}_{i}=\\frac{1}{\\sqrt{{\\tau }_{b}}}\\left(\\sqrt{1-\\phi }{v}_{i}+\\sqrt{\\phi }{u}_{i}\\right)\\), with \\(\\frac{1}{{\\tau }_{b}}\\), the marginal variance of \\({b}_{i}\\).<\/p>\n<p>We utilized the deviance information criterion (DIC) to identify the variables most strongly associated with TB notification rates; lower DIC values indicated a better fit. We subsequently tested whether the association between socioeconomic variables and TB notification rates changed between periods by adding a time \u00d7 period interaction to each univariable model. In practice, we modeled the nonlinear effect of the continuous variables using a second-order random walk model specific to each period, with both models sharing the same hyperparameters. The interaction effect of the population density level with the period was modeled using a factor \u00d7 factor interaction term. We assessed the relevance of the interaction terms by graphically examining the overlap of the period-specific effects (and their credible intervals).<\/p>\n<p>We ultimately developed a multivariable model incorporating variables with the lowest DIC values and minimal correlation to avoid multicollinearity:<\/p>\n<p>$$\\textrm{log}\\left({\\lambda }_{i,t}\\right)={\\alpha }_{0}+{b}_{i}+{f}_{1}\\left({X}_{i,t}^{1} \\right)+\\dots +{f}_{K}({X}_{i,t}^{K})+{\\alpha }_{1}{I}_{i}^{Medium}+{\\alpha }_{2}{I}_{i}^{High}$$<\/p>\n<p>Associations with explanatory variables<\/p>\n<p>Based on the multivariable model, we examined the shape of the association between each explanatory variable and the logarithm of the SNR (log-SNR, the linear predictor of the models). To that effect, we plotted the partial contribution of each variable \\({X}^{k}\\) to the linear predictor, \\({\\widehat{f}}_{k}\\left({X}_{i,t}^{k}\\right)\\), along with its 95% credible interval (95% CrI). This approach allowed us to identify which explanatory variables contributed significantly to the predictions.<\/p>\n<p>To assess the extent of the SNR variation across the distribution of each explanatory variable, we calculated the ratio of the SNRs at their 10th and 90th percentiles and labeled it the \u201cinter-decile standardized rate ratio\u201d (IdRR). In this, we adopted an approach proposed by Larsen et al. [<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 22\" title=\"Larsen K, Petersen JH, Budtz-Jorgensen E, Endahl L. Interpreting parameters in the logistic regression model with random effects. Biometrics. 2000;56(3):909\u201314.\" href=\"http:\/\/idpjournal.biomedcentral.com\/articles\/10.1186\/s40249-025-01354-0#ref-CR22\" id=\"ref-link-section-d96626534e1982\" target=\"_blank\" rel=\"noopener\">22<\/a>, <a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 23\" title=\"Larsen K, Merlo J. Appropriate assessment of neighborhood effects on individual health: integrating random and fixed effects in multilevel logistic regression. Am J Epidemiol. 2005;161(1):81\u20138.\" href=\"http:\/\/idpjournal.biomedcentral.com\/articles\/10.1186\/s40249-025-01354-0#ref-CR23\" id=\"ref-link-section-d96626534e1985\" target=\"_blank\" rel=\"noopener\">23<\/a>] in the context of multilevel logistic regression models, where they developed the \u201cinterval odds ratio\u201d to reflect the variation in the odds ratio due to random effects in the linear predictor. Similarly, Chaix et al. developed the \u201cinterquartile spatial odds ratio\u201d, defined as the odds ratio between an individual residing in a location in the first quartile and one from a location in the fourth quartile of spatial risk.<\/p>\n<p>Sensitivity analyses<\/p>\n<p>To assess the robustness of our findings to the chosen distribution of the spatial random effect, we replaced the BYM2 distribution with the Leroux distribution [<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 24\" title=\"Leroux BG, Lei X, Breslow N. Estimation of disease rates in small areas: a new mixed model for spatial dependence. In: Leroux BG, Lei X, Breslow N, editors. Statistical models in epidemiology, the environment, and clinical trials: 2000\/\/2000. New York: Springer; 2000. p. 179\u201391.\" href=\"http:\/\/idpjournal.biomedcentral.com\/articles\/10.1186\/s40249-025-01354-0#ref-CR24\" id=\"ref-link-section-d96626534e1996\" target=\"_blank\" rel=\"noopener\">24<\/a>] and reran the model without explanatory variables as well as the multivariable model. Like the BYM2 model, the Leroux model comprises both structured and unstructured components but differs in their parameterization (see supplementary file section 4.2.1 for further details).<\/p>\n<p>We also evaluated the sensitivity of the analysis to the chosen imputation method (kNN) by imputing missing values using two alternative approaches. Initially, we utilized the random forest imputation algorithm from the R package missForest [<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 25\" title=\"Stekhoven DJ, Buhlmann P. MissForest\u2013non-parametric missing value imputation for mixed-type data. Bioinformatics. 2012;28(1):112\u20138.\" href=\"http:\/\/idpjournal.biomedcentral.com\/articles\/10.1186\/s40249-025-01354-0#ref-CR25\" id=\"ref-link-section-d96626534e2002\" target=\"_blank\" rel=\"noopener\">25<\/a>]. Subsequently, we proportionally distributed the missing values across strata defined by sex, age group, immigration status, and housing type, utilizing information from cases with similar characteristics. Further details are provided in supplementary file section 5.1.<\/p>\n<p>All statistical analyses were performed using R version 4.2.3 [<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 26\" title=\"R Core Team. R: A Language and Environment for Statistical Computing. &#010;                  https:\/\/www.R-project.org\/&#010;                  &#010;                .\" href=\"http:\/\/idpjournal.biomedcentral.com\/articles\/10.1186\/s40249-025-01354-0#ref-CR26\" id=\"ref-link-section-d96626534e2008\" target=\"_blank\" rel=\"noopener\">26<\/a>] and INLA version 22.12.16 [<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 27\" title=\"Rue H, Martino S, Chopin N. Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations. J R Stat Soc Ser B (Statist Methodol). 2009;71(2):319\u201392.\" href=\"http:\/\/idpjournal.biomedcentral.com\/articles\/10.1186\/s40249-025-01354-0#ref-CR27\" id=\"ref-link-section-d96626534e2011\" target=\"_blank\" rel=\"noopener\">27<\/a>, <a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 28\" title=\"R-Inla Team. R-INLA, a package in R that do approximate Bayesian inference for Latent Gaussian Models. &#010;                  www.r-inla.org&#010;                  &#010;                .\" href=\"http:\/\/idpjournal.biomedcentral.com\/articles\/10.1186\/s40249-025-01354-0#ref-CR28\" id=\"ref-link-section-d96626534e2014\" target=\"_blank\" rel=\"noopener\">28<\/a>].<\/p>\n","protected":false},"excerpt":{"rendered":"Design, period and study area We conducted a retrospective ecological study in metropolitan France for the period of&hellip;\n","protected":false},"author":2,"featured_media":436755,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[5309],"tags":[148080,148079,2000,299,36,148078,6552,1093,6555,20676],"class_list":{"0":"post-436754","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-france","8":"tag-area-level-socioeconomic-variables","9":"tag-ecological-study","10":"tag-eu","11":"tag-europe","12":"tag-france","13":"tag-health-inequality","14":"tag-infectious-diseases","15":"tag-public-health","16":"tag-tropical-medicine","17":"tag-tuberculosis"},"share_on_mastodon":{"url":"https:\/\/pubeurope.com\/@uk\/115232519143351108","error":""},"_links":{"self":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts\/436754","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/comments?post=436754"}],"version-history":[{"count":0,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts\/436754\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/media\/436755"}],"wp:attachment":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/media?parent=436754"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/categories?post=436754"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/tags?post=436754"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}