{"id":357561,"date":"2025-08-19T19:43:13","date_gmt":"2025-08-19T19:43:13","guid":{"rendered":"https:\/\/www.europesays.com\/uk\/357561\/"},"modified":"2025-08-19T19:43:13","modified_gmt":"2025-08-19T19:43:13","slug":"generiskcalc-a-web-based-tool-for-genetic-risk-association-analysis-in-case-control-studies-bmc-bioinformatics","status":"publish","type":"post","link":"https:\/\/www.europesays.com\/uk\/357561\/","title":{"rendered":"GeneRiskCalc: a web-based tool for genetic risk association analysis in case\u2013control studies | BMC Bioinformatics"},"content":{"rendered":"<p>Hardy\u2013Weinberg equilibrium<\/p>\n<p>The HWE is a key principle in population genetics, stating that allele and genotype frequencies remain stable across generations in the absence of evolutionary forces like mutation, genetic drift, migration, or selection. In genetic association studies, HWE testing helps identify deviations that may indicate genotyping errors, population stratification, or non-random mating. In case\u2013control studies, HWE is typically assessed in the control group to ensure data validity, as significant deviations may suggest biases that could affect association findings [<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 15\" title=\"Relethford JH. Human population genetics. Hoboken: John Wiley &amp; Sons; 2012.\" href=\"http:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-025-06207-z#ref-CR15\" id=\"ref-link-section-d26896646e763\" target=\"_blank\" rel=\"noopener\">15<\/a>].<\/p>\n<p>Hardy\u2013Weinberg equation<\/p>\n<p>For a gene with two alleles, \u201cA\u201d (dominant) and \u201ca\u201d (recessive), with respective allele frequencies p and q, the sum of allele frequencies is given by: (Eq.\u00a0(<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"equation anchor\" href=\"http:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-025-06207-z#Equ1\" target=\"_blank\" rel=\"noopener\">1<\/a>)):<\/p>\n<p>Using these allele frequencies, the expected genotype frequencies are calculated using Eq. (<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"equation anchor\" href=\"http:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-025-06207-z#Equ2\" target=\"_blank\" rel=\"noopener\">2<\/a>):<\/p>\n<p>$${p}^{2}+ 2pq+ {q}^{2} =1$$<\/p>\n<p>\n                    (2)\n                <\/p>\n<p>where: p2\u2009=\u2009frequency of the \u201cAA\u201d genotype, 2pq\u2009=\u2009frequency of the \u201cAa\u201d genotype, and q2\u2009=\u2009frequency of the \u201caa\u201d genotype.<\/p>\n<p>After applying Eq. (<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"equation anchor\" href=\"http:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-025-06207-z#Equ2\" target=\"_blank\" rel=\"noopener\">2<\/a>), we obtain the expected genotype frequencies. These expected values are then compared to the observed genotype frequencies to determine whether there is a significant difference between them. To assess this difference, a chi-square (\u03c72) test is performed.<\/p>\n<p>Chi-Square (\u03c72) test for HWE<\/p>\n<p>To test for HWE, the chi-square test is commonly applied using the Eq.\u00a0(<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"equation anchor\" href=\"http:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-025-06207-z#Equ3\" target=\"_blank\" rel=\"noopener\">3<\/a>):<\/p>\n<p>$${X}^{2}=\\sum \\frac{(Oi-{Ei)}^{2}}{Ei}$$<\/p>\n<p>\n                    (3)\n                <\/p>\n<p>where: <b>\u03c7<\/b><b>2<\/b>\u2009=\u2009chi-square test statistic, Oi\u2009=\u2009observed genotype counts, and Ei\u2009=\u2009expected genotype counts (calculated from allele frequencies). It is important to note that a p-value greater than 0.05 indicates no significant deviation from HWE, suggesting that the population is in equilibrium. Conversely, a p-value less than 0.05 indicates a significant deviation from HWE, warranting further investigation to assess potential genotyping errors, population stratification, or other confounding factors.<\/p>\n<p>Odds ratio and confidence interval calculation in case\u2013control studies<\/p>\n<p>The OR is a key statistical measure used to assess the strength of association between a genetic variant and a disease in case\u2013control studies. It quantifies how the presence or absence of a particular allele or genotype affects the odds of developing a disease. In genetic association studies, different genetic models such as dominant, recessive, codominant, over-dominant, and allele models are used for the calculation of risk [<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 16\" title=\"Martorell-Marugan J, Toro-Dominguez D, Alarcon-Riquelme ME, Carmona-Saez P. MetaGenyo: A web tool for meta-analysis of genetic association studies. BMC Bioinf. 2017;18(1):563. &#010;                  https:\/\/doi.org\/10.1186\/s12859-017-1990-4&#010;                  &#010;                .\" href=\"http:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-025-06207-z#ref-CR16\" id=\"ref-link-section-d26896646e864\" target=\"_blank\" rel=\"noopener\">16<\/a>]. The OR is typically determined using a 2\u2009\u00d7\u20092 contingency table that organizes allele or genotype counts for cases and controls (Table\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-025-06207-z#Tab1\" target=\"_blank\" rel=\"noopener\">1<\/a>).<\/p>\n<p><b id=\"Tab1\" data-test=\"table-caption\">Table\u00a01 2\u2009\u00d7\u20092 contingency tables allele\/genotype counts<\/b><\/p>\n<p>To calculate the OR, we use the formulae given in Eq.\u00a0(<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"equation anchor\" href=\"http:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-025-06207-z#Equ4\" target=\"_blank\" rel=\"noopener\">4<\/a>):<\/p>\n<p>$$OR=\\frac{a \\times d}{b \\times c}$$<\/p>\n<p>\n                    (4)\n                <\/p>\n<p>where: a\u2009=\u2009Number of cases with the risk allele b\u2009=\u2009Number of controls with the risk allele, c\u2009=\u2009Number of cases without the risk allele, and d\u2009=\u2009Number of controls without the risk allele.<\/p>\n<p>After obtaining the OR, calculating the 95% confidence interval (CI) is crucial for assessing the statistical significance and precision of the estimated association. The CI is computed using the logarithmic method based on the standard error (SE) of the natural logarithm (ln) of the OR.<\/p>\n<p>The SE is calculated using Eq.\u00a0(<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"equation anchor\" href=\"http:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-025-06207-z#Equ5\" target=\"_blank\" rel=\"noopener\">5<\/a>):<\/p>\n<p>$$\\text{SE }(\\text{ln}(\\text{OR}))= \\sqrt{\\frac{1}{a}+}\\frac{1}{b}+\\frac{1}{c}+\\frac{1}{d}$$<\/p>\n<p>\n                    (5)\n                <\/p>\n<p>where a, b, c, and d are the respective cell counts in the 2\u2009\u00d7\u20092 contingency table (Table\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-025-06207-z#Tab1\" target=\"_blank\" rel=\"noopener\">1<\/a>).<\/p>\n<p>Further, the CI is calculated using the Eq.\u00a0(<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"equation anchor\" href=\"http:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-025-06207-z#Equ6\" target=\"_blank\" rel=\"noopener\">6<\/a>):<\/p>\n<p>$$ {\\text{CI}}_{{{95}\\% }} = {\\text{ exp }}[{\\text{ln}}\\left( {{\\text{OR}}} \\right) \\, \\pm {1}.{96 } \\times {\\text{ SE }}\\left( {{\\text{ln}}\\left( {{\\text{OR}}} \\right)} \\right] $$<\/p>\n<p>\n                    (6)\n                <\/p>\n<p>After calculating the values, interpreting the OR and CI is crucial in genetic association studies. An OR of 1 indicates no association between the genetic variant and disease risk, while an OR greater than 1 suggests increased odds of disease, implying the variant may be a risk factor. Conversely, an OR less than 1 indicate decreased odds of disease, suggesting a potential protective effect. The 95% CI provides additional insight into statistical significance; if the interval includes 1, the association is generally considered not statistically significant, meaning the observed effect could be due to chance rather than a true genetic relationship [<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 14\" title=\"Tenny S, Kerndt CC, Hoffman MR. Case control studies. In: StatPearls [Internet]. Treasure Island (FL): StatPearls Publishing; 2023.\" href=\"http:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-025-06207-z#ref-CR14\" id=\"ref-link-section-d26896646e1051\" target=\"_blank\" rel=\"noopener\">14<\/a>].<\/p>\n<p>                           P-value computation<\/p>\n<p>In this study, a p-value threshold of\u2009<\/p>\n<p>[Note: This threshold is commonly used in candidate gene association studies and should not be confused with genome-wide association studies (GWAS), which apply more stringent criteria (e.g., p\u200917]. To address multiple comparisons across seven genetic models, we applied a Bonferroni correction, resulting in an adjusted threshold of p\u2009<\/p>\n<p>Input\/ output specification<\/p>\n<p>GeneRiskCalc requires genotypic counts for Homozygous Dominant (HW), Heterozygous (HT), and Homozygous Recessive (HR) in both case and control groups. These values are used for HWE analysis and OR calculation with a 95% CI and p-value.<\/p>\n<p>Concerning the Output, the results will include genotypic frequency, allele frequency, HWE status based on \u03c72 value, p-value (assessing genetic equilibrium), and OR\/CI with p-value (indicating genetic association strength). For visualization, users can manually input OR and CI values or copy data from Excel into the integrated Forest Plotter to illustrate effect size and direction.<\/p>\n<p>Data visualization<\/p>\n<p>The results are displayed in a structured table, presenting clear and organized information on key statistical measures, including allele and genotype counts, HWE with \u03c72 and p-values, and OR with 95% CI, and p-values. Additionally, Bonferroni correction is applied for multiple genetic models to ensure accuracy.<\/p>\n<p>Validation of GeneRiskCalc<\/p>\n<p>The accuracy and reliability of GeneRiskCalc were rigorously validated through multiple approaches. Benchmark testing compared results with established tools to ensure precision. For HWE analysis, various online software were used such as bio. blog.labs (Seb Carvello\u2014Hardy\u2013Weinberg Equilibrium Calculator), Wpcalc (Online Calculator of Hardy\u2013Weinberg equilibrium), San Mateo (Excel file: Supplementary file-<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"supplementary material anchor\" href=\"http:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-025-06207-z#MOESM1\" target=\"_blank\" rel=\"noopener\">1<\/a>).<\/p>\n<p>Furthermore, to validate and compare the calculated ORs, several freely available online tools were utilized. These included MedCalc\u2019s Odds Ratio Calculator (<a href=\"https:\/\/www.medcalc.org\/calc\/odds_ratio.php\" target=\"_blank\" rel=\"noopener\">https:\/\/www.medcalc.org\/calc\/odds_ratio.php<\/a>), GIGA Calculator\u2019s Odds Ratio tool (<a href=\"https:\/\/www.gigacalculator.com\/calculators\/odds-ratio-calculator.php\" target=\"_blank\" rel=\"noopener\">https:\/\/www.gigacalculator.com\/calculators\/odds-ratio-calculator.php<\/a>), the OpenEpi 2\u2009\u00d7\u20092 Table Statistics tool (OpenEpi\u20132\u2009\u00d7\u20092 Table Statistics), and the Odds Ratio calculator provided by Epitools (<a href=\"https:\/\/epitools.ausvet.com.au\/twobytwotable\" target=\"_blank\" rel=\"noopener\">https:\/\/epitools.ausvet.com.au\/twobytwotable<\/a>). These platforms facilitated cross-verification of OR values and supported a robust interpretation of the statistical results.<\/p>\n<p>To ensure result consistency, simulated datasets with predefined HWE status (Table\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-025-06207-z#Tab2\" target=\"_blank\" rel=\"noopener\">2<\/a>) and OR values were analyzed (Table\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-025-06207-z#Tab3\" target=\"_blank\" rel=\"noopener\">3<\/a>). Additionally, real-world genetic association datasets were tested to validate performance in practical scenarios (Table\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-025-06207-z#Tab4\" target=\"_blank\" rel=\"noopener\">4<\/a>). The tool\u2019s robustness was further assessed by evaluating edge cases, including scenarios with zero genotype counts and extreme OR values (Table\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"table anchor\" href=\"http:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-025-06207-z#Tab2\" target=\"_blank\" rel=\"noopener\">2<\/a>). Finally, user interface testing was conducted to ensure seamless data input, intuitive result interpretation, and overall ease of use. Together, these validation steps confirm that GeneRiskCalc delivers accurate, reliable, and robust results for genetic association analysis. The workflow of GeneRiskCalc, from data input to result visualization, is depicted in Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-025-06207-z#Fig1\" target=\"_blank\" rel=\"noopener\">1<\/a>.<\/p>\n<p><b id=\"Tab2\" data-test=\"table-caption\">Table\u00a02 Comparison of Hardy\u2013Weinberg equilibrium calculations across different tools using simulated data<\/b><b id=\"Tab3\" data-test=\"table-caption\">Table\u00a03 Validation of GeneRiskCalc against other OR calculation tools using randomly selected simulated data<\/b><b id=\"Tab4\" data-test=\"table-caption\">Table\u00a04 Data validated for odds ratio under different genetic models<\/b><b id=\"Fig1\" class=\"c-article-section__figure-caption\" data-test=\"figure-caption-text\">Fig.\u00a01<\/b><a class=\"c-article-section__figure-link\" data-test=\"img-link\" data-track=\"click\" data-track-label=\"image\" data-track-action=\"view figure\" href=\"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-025-06207-z\/figures\/1\" rel=\"nofollow noopener\" target=\"_blank\"><img decoding=\"async\" aria-describedby=\"Fig1\" src=\"https:\/\/www.europesays.com\/uk\/wp-content\/uploads\/2025\/08\/12859_2025_6207_Fig1_HTML.png\" alt=\"figure 1\" loading=\"lazy\" width=\"685\" height=\"1458\"\/><\/a><\/p>\n<p>Workflow diagram of GeneRiskCalc from input to visualization: The process includes user data entry, Hardy\u2013Weinberg Equilibrium testing, Odds Ratio calculation, and optional forest plot generation for result visualization<\/p>\n","protected":false},"excerpt":{"rendered":"Hardy\u2013Weinberg equilibrium The HWE is a key principle in population genetics, stating that allele and genotype frequencies remain&hellip;\n","protected":false},"author":2,"featured_media":357562,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[3846],"tags":[3721,7445,125891,121359,121360,125890,125887,125892,267,125888,87714,125889,70,16,15],"class_list":{"0":"post-357561","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-genetics","8":"tag-algorithms","9":"tag-bioinformatics","10":"tag-case-control-studies","11":"tag-computational-biology-bioinformatics","12":"tag-computer-appl-in-life-sciences","13":"tag-forest-plot","14":"tag-genetic-association-studies","15":"tag-genetic-epidemiology","16":"tag-genetics","17":"tag-hardy-weinberg-equilibrium","18":"tag-microarrays","19":"tag-odds-ratio","20":"tag-science","21":"tag-uk","22":"tag-united-kingdom"},"share_on_mastodon":{"url":"https:\/\/pubeurope.com\/@uk\/115057144890384350","error":""},"_links":{"self":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts\/357561","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/comments?post=357561"}],"version-history":[{"count":0,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/posts\/357561\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/media\/357562"}],"wp:attachment":[{"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/media?parent=357561"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/categories?post=357561"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.europesays.com\/uk\/wp-json\/wp\/v2\/tags?post=357561"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}