{"id":174009,"date":"2025-08-25T09:45:14","date_gmt":"2025-08-25T09:45:14","guid":{"rendered":"https:\/\/www.europesays.com\/us\/174009\/"},"modified":"2025-08-25T09:45:14","modified_gmt":"2025-08-25T09:45:14","slug":"development-and-internal-validation-of-an-ai-based-emergency-triage-model-for-predicting-critical-outcomes-in-emergency-department","status":"publish","type":"post","link":"https:\/\/www.europesays.com\/us\/174009\/","title":{"rendered":"Development and internal validation of an AI-based emergency triage model for predicting critical outcomes in emergency department"},"content":{"rendered":"<p>This study demonstrates the potential of machine learning models to predict ICU admission from the ED more accurately than the conventional CTAS triage system. Our top-performing model, XGBoost, showed superior discrimination, a finding consistent with other studies that highlight the ability of tree-based models to capture complex, non-linear interactions among predictors<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 20\" title=\"Ivanov, O. et al. Improving ED emergency severity index acuity assignment using machine learning and clinical natural Language processing. J. Emerg. Nurs. 47(2), 265&#x2013;278e7. &#010;                  https:\/\/doi.org\/10.1016\/j.jen.2020.11.001&#010;                  &#010;                 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41598-025-17180-1#ref-CR20\" id=\"ref-link-section-d394114025e1547\" target=\"_blank\" rel=\"noopener\">20<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 23\" title=\"Zhang, X. et al. Prediction of emergency department hospital admission based on natural Language processing and neural networks. Methods Inf. Med. 56(05), 377&#x2013;389. &#010;                  https:\/\/doi.org\/10.3414\/ME17-01-0024&#010;                  &#010;                 (2017).\" href=\"http:\/\/www.nature.com\/articles\/s41598-025-17180-1#ref-CR23\" id=\"ref-link-section-d394114025e1550\" target=\"_blank\" rel=\"noopener\">23<\/a>. The improvement in AUROC (0.917 vs. 0.882) is noteworthy, but the substantial improvement in AUPRC (0.629 vs. 0.333) is particularly compelling. AUPRC is more informative than AUROC in settings with class imbalance, such as ICU prediction, and this large gain suggests our model is significantly better at ensuring that patients flagged as high-risk are truly likely to require ICU admission, thereby improving the positive predictive value of the triage assessment. The step-like pattern observed in the CTAS ROC curve (Fig.\u00a0<a data-track=\"click\" data-track-label=\"link\" data-track-action=\"figure anchor\" href=\"http:\/\/www.nature.com\/articles\/s41598-025-17180-1#Fig2\" target=\"_blank\" rel=\"noopener\">2<\/a>) reflects its nature as a categorical scale with five discrete levels, which inherently limits its ability to provide nuanced, continuous risk stratification compared to the machine learning models.<\/p>\n<p>Previous studies have shown that machine learning can improve triage outcomes using both structured and unstructured data, primarily in developed countries<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 20\" title=\"Ivanov, O. et al. Improving ED emergency severity index acuity assignment using machine learning and clinical natural Language processing. J. Emerg. Nurs. 47(2), 265&#x2013;278e7. &#010;                  https:\/\/doi.org\/10.1016\/j.jen.2020.11.001&#010;                  &#010;                 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41598-025-17180-1#ref-CR20\" id=\"ref-link-section-d394114025e1560\" target=\"_blank\" rel=\"noopener\">20<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 21\" title=\"Fernandes, M. et al. Predicting intensive care unit admission among patients presenting to the emergency department using machine learning and natural Language processing. PLoS One. 15(3), e0229331. &#010;                  https:\/\/doi.org\/10.1371\/journal.pone.0229331&#010;                  &#010;                 (2020).\" href=\"http:\/\/www.nature.com\/articles\/s41598-025-17180-1#ref-CR21\" id=\"ref-link-section-d394114025e1563\" target=\"_blank\" rel=\"noopener\">21<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 23\" title=\"Zhang, X. et al. Prediction of emergency department hospital admission based on natural Language processing and neural networks. Methods Inf. Med. 56(05), 377&#x2013;389. &#010;                  https:\/\/doi.org\/10.3414\/ME17-01-0024&#010;                  &#010;                 (2017).\" href=\"http:\/\/www.nature.com\/articles\/s41598-025-17180-1#ref-CR23\" id=\"ref-link-section-d394114025e1566\" target=\"_blank\" rel=\"noopener\">23<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 29\" title=\"Tahayori, B., Chini-Foroush, N. &amp; Akhlaghi, H. Advanced natural Language processing technique to predict patient disposition based on emergency triage notes. Emerg. Med. Australas. 33(3), 480&#x2013;484. &#010;                  https:\/\/doi.org\/10.1111\/1742-6723.13656&#010;                  &#010;                 (2021).\" href=\"http:\/\/www.nature.com\/articles\/s41598-025-17180-1#ref-CR29\" id=\"ref-link-section-d394114025e1569\" target=\"_blank\" rel=\"noopener\">29<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 34\" title=\"Grant, L. et al. Machine learning outperforms the Canadian triage and acuity scale (CTAS) in predicting need for early critical care. Can J. Emerg. Med. &#010;                  https:\/\/doi.org\/10.1007\/s43678-024-00807-z&#010;                  &#010;                 (2024).\" href=\"http:\/\/www.nature.com\/articles\/s41598-025-17180-1#ref-CR34\" id=\"ref-link-section-d394114025e1572\" target=\"_blank\" rel=\"noopener\">34<\/a>. This study builds on those findings and demonstrates that such improvements are also achievable in resource-limited settings. Differences include excluding pain scores as they are subjective and unreliable for determining patient acuity<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 35\" title=\"Ueareekul, S., Changratanakorn, C., Tianwibool, P., Meelarp, N. &amp; Wongtanasarasin, W. Accuracy of pain scales in predicting critical diagnoses in non-traumatic abdominal pain cases; a cross-sectional study. Arch. Acad. Emerg. Med. 11(1), e68. &#010;                  https:\/\/doi.org\/10.22037\/aaem.v11i1.2131&#010;                  &#010;                 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41598-025-17180-1#ref-CR35\" id=\"ref-link-section-d394114025e1576\" target=\"_blank\" rel=\"noopener\">35<\/a>. Furthermore, the integration of NLP to analyze chief complaints in free text enabled the model to interpret textual data. Conventional triage systems, including ESI and CTAS, rely on chief complaints for categorization. The NLP approach can capture the subtle variations in clinical presentations, allowing for a broader categorization of chief complaints. Additionally, the use of multilingual embeddings effectively manages the linguistic diversity of clinical documentation in the local context, allowing the model to interpret text written in Thai with occasional English medical terms. However, reliance on free-text chief complaints introduces variability that could affect model prediction reliability.<\/p>\n<p>Our findings also align with other advanced triage systems. For instance, the TriAge-Go system, a sophisticated software as a medical device (SaMD), also showed improved prediction over standard triage in a recent prospective evaluation<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 36\" title=\"Taylor, R. A. et al. Impact of artificial intelligence&#x2013;based triage decision support on emergency department care. NEJM AI. 2(3), AIoa2400296. &#010;                  https:\/\/doi.org\/10.1056\/AIoa2400296&#010;                  &#010;                 (2025).\" href=\"http:\/\/www.nature.com\/articles\/s41598-025-17180-1#ref-CR36\" id=\"ref-link-section-d394114025e1583\" target=\"_blank\" rel=\"noopener\">36<\/a>. While TriAge-Go represents a highly advanced implementation, our model demonstrates that significant improvements can also be achieved in resource-limited settings using readily available data.<\/p>\n<p>This study shows that data-driven tools can make ED decisions more effective. By providing a real-time risk score, the model can flag high-risk patients for triage nurses, helping to mitigate under-triage and focus attention where it is most needed. It can also aid in resource management by providing more accurate forecasts for ICU bed demand.<\/p>\n<p>Limitations<\/p>\n<p>Several limitations should be acknowledged. First, a significant limitation is our definition of the ground truth. The primary outcome of direct ICU admission from the ED does not account for disposition errors, such as unplanned ICU transfers (UIT) from a general ward within 24\u00a0h of admission. These cases often represent patients who were under-triaged, and their exclusion may mean our model was trained on more clearly identifiable cases of critical illness, potentially overestimating its performance. Future prospective studies should incorporate UIT to create a more robust and clinically accurate composite outcome.<\/p>\n<p>Second, this study is a retrospective, single-center analysis conducted on a predominantly Asian population, which inherently limits its generalizability to other settings and demographic groups. A critical challenge for all predictive models is performance degradation upon real-world, prospective implementation. Studies on models like the Rothman Index and the EPIC Sepsis Model have shown that even models validated on large retrospective datasets can experience a significant drop in performance when deployed, often due to data drift or overfitting<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 37\" title=\"Rothman, M. J., Rothman, S. I. &amp; Beals, J. Development and validation of a continuous measure of patient condition using the electronic medical record. J. Biomed. Inform. 46(5), 837&#x2013;848. &#010;                  https:\/\/doi.org\/10.1016\/j.jbi.2013.06.011&#010;                  &#010;                 (2013).\" href=\"http:\/\/www.nature.com\/articles\/s41598-025-17180-1#ref-CR37\" id=\"ref-link-section-d394114025e1601\" target=\"_blank\" rel=\"noopener\">37<\/a>,<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 38\" title=\"Cull, J., Brevetta, R., Gerac, J., Kothari, S. &amp; Blackhurst, D. Epic sepsis model inpatient predictive analytic tool: A validation study. Crit. Care Explor. 5(7), e0941. &#010;                  https:\/\/doi.org\/10.1097\/CCE.0000000000000941&#010;                  &#010;                 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41598-025-17180-1#ref-CR38\" id=\"ref-link-section-d394114025e1604\" target=\"_blank\" rel=\"noopener\">38<\/a>. Therefore, our model must be considered an early-stage development, and its clinical utility can only be confirmed through rigorous external and prospective validation.<\/p>\n<p>Third, the reliance on free-text chief complaints, while powerful, introduces variability. The quality and detail of documentation can differ between nurses, which could affect model reliability. Future work should prioritize improving the quality of free-text inputs and exploring standardization using systems like SNOMED CT to enhance data consistency<a data-track=\"click\" data-track-action=\"reference anchor\" data-track-label=\"link\" data-test=\"citation-ref\" aria-label=\"Reference 39\" title=\"Vuokko, R., Vakkuri, A. &amp; Palojoki, S. Systematized nomenclature of medicine&#x2013;clinical terminology (SNOMED CT) clinical use cases in the context of electronic health record systems: systematic literature review. JMIR Med. Inf. 11(1), e43750. &#010;                  https:\/\/doi.org\/10.2196\/43750&#010;                  &#010;                 (2023).\" href=\"http:\/\/www.nature.com\/articles\/s41598-025-17180-1#ref-CR39\" id=\"ref-link-section-d394114025e1611\" target=\"_blank\" rel=\"noopener\">39<\/a>.<\/p>\n<p>Forth, the outcome was limited to ICU admissions. Certain conditions, such as anaphylaxis or reactive airway disease, require immediate attention but may not result in ICU admission. In contrast, conditions associated with high mortality, such as unconsciousness, may lead to death in the ED rather than admission to the ICU. Outcomes such as emergency procedures, early mortality, or ED resource utilization could provide a more comprehensive evaluation of patient acuity.<\/p>\n<p>Finally, the absence of detailed patient history as a predictor may have constrained the model\u2019s performance. Incorporating prior medical information could significantly enhance prediction accuracy and help address potential biases.<\/p>\n","protected":false},"excerpt":{"rendered":"This study demonstrates the potential of machine learning models to predict ICU admission from the ED more accurately&hellip;\n","protected":false},"author":3,"featured_media":174010,"comment_status":"","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[21],"tags":[691,738,48694,1141,10046,10047,48695,10542,159,95799,158,67,132,68],"class_list":{"0":"post-174009","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-artificial-intelligence","8":"tag-ai","9":"tag-artificial-intelligence","10":"tag-computational-biology-and-bioinformatics","11":"tag-health-care","12":"tag-humanities-and-social-sciences","13":"tag-multidisciplinary","14":"tag-prognosis","15":"tag-risk-factors","16":"tag-science","17":"tag-signs-and-symptoms","18":"tag-technology","19":"tag-united-states","20":"tag-unitedstates","21":"tag-us"},"share_on_mastodon":{"url":"https:\/\/pubeurope.com\/@us\/115088767237502422","error":""},"_links":{"self":[{"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/posts\/174009","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/comments?post=174009"}],"version-history":[{"count":0,"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/posts\/174009\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/media\/174010"}],"wp:attachment":[{"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/media?parent=174009"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/categories?post=174009"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.europesays.com\/us\/wp-json\/wp\/v2\/tags?post=174009"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}