Evaluating AI-driven characters in extended reality (XR) healthcare simulations: A systematic review
Authors: Dasa, D., Board, M., Rolfe, U., Dolby, T. and Tang, W.
Journal: Artificial Intelligence in Medicine
Volume: 170
eISSN: 1873-2860
ISSN: 0933-3657
DOI: 10.1016/j.artmed.2025.103270
Abstract:AI-driven characters in extended reality (XR) healthcare simulations are increasingly used for clinical training, yet their effectiveness, implementation, and quality assurance remain poorly understood. We conducted a systematic review of 132 studies published between January 2015 and July 2025, including 11 randomized controlled trials (RCTs), sourced from biomedical, computing, and education databases and targeted proceedings. Most studies used virtual reality (62.1%) and focused on effectiveness (n = 71), with fewer examining implementation (n = 45) or quality assurance (n = 44). Meta-analysis of two RCTs found a large effect on knowledge and decision-making (Hedges’ g = 1.31, 95% CI 0.08–2.54, I2 = 85%), while one RCT reported faster task performance with AI-driven characters (g = -0.68, 95% CI -1.32 to -0.04). Certainty of evidence was low due to small samples and high heterogeneity. Implementation success was often associated with phased roll-outs and faculty training, but quality assurance practices (particularly bias audits and transparency measures) were rarely documented. The review proposes the DASEX framework to address these gaps and guide future integration of AI-driven characters in XR training.
Source: Scopus
Evaluating AI-driven characters in extended reality (XR) healthcare simulations: A systematic review.
Authors: Dasa, D., Board, M., Rolfe, U., Dolby, T. and Tang, W.
Journal: Artif Intell Med
Volume: 170
Pages: 103270
eISSN: 1873-2860
DOI: 10.1016/j.artmed.2025.103270
Abstract:AI-driven characters in extended reality (XR) healthcare simulations are increasingly used for clinical training, yet their effectiveness, implementation, and quality assurance remain poorly understood. We conducted a systematic review of 132 studies published between January 2015 and July 2025, including 11 randomized controlled trials (RCTs), sourced from biomedical, computing, and education databases and targeted proceedings. Most studies used virtual reality (62.1%) and focused on effectiveness (n = 71), with fewer examining implementation (n = 45) or quality assurance (n = 44). Meta-analysis of two RCTs found a large effect on knowledge and decision-making (Hedges' g = 1.31, 95% CI 0.08-2.54, I2 = 85%), while one RCT reported faster task performance with AI-driven characters (g = -0.68, 95% CI -1.32 to -0.04). Certainty of evidence was low due to small samples and high heterogeneity. Implementation success was often associated with phased roll-outs and faculty training, but quality assurance practices (particularly bias audits and transparency measures) were rarely documented. The review proposes the DASEX framework to address these gaps and guide future integration of AI-driven characters in XR training.
Source: PubMed
Evaluating AI-driven characters in extended reality (XR) healthcare simulations: A systematic review
Authors: Dasa, D., Board, M., Rolfe, U., Dolby, T. and Tang, W.
Journal: ARTIFICIAL INTELLIGENCE IN MEDICINE
Volume: 170
eISSN: 1873-2860
ISSN: 0933-3657
DOI: 10.1016/j.artmed.2025.103270
Source: Web of Science (Lite)
Evaluating AI-driven characters in extended reality (XR) healthcare simulations: A systematic review.
Authors: Dasa, D., Board, M., Rolfe, U., Dolby, T. and Tang, W.
Journal: Artificial intelligence in medicine
Volume: 170
Pages: 103270
eISSN: 1873-2860
ISSN: 0933-3657
DOI: 10.1016/j.artmed.2025.103270
Abstract:AI-driven characters in extended reality (XR) healthcare simulations are increasingly used for clinical training, yet their effectiveness, implementation, and quality assurance remain poorly understood. We conducted a systematic review of 132 studies published between January 2015 and July 2025, including 11 randomized controlled trials (RCTs), sourced from biomedical, computing, and education databases and targeted proceedings. Most studies used virtual reality (62.1%) and focused on effectiveness (n = 71), with fewer examining implementation (n = 45) or quality assurance (n = 44). Meta-analysis of two RCTs found a large effect on knowledge and decision-making (Hedges' g = 1.31, 95% CI 0.08-2.54, I2 = 85%), while one RCT reported faster task performance with AI-driven characters (g = -0.68, 95% CI -1.32 to -0.04). Certainty of evidence was low due to small samples and high heterogeneity. Implementation success was often associated with phased roll-outs and faculty training, but quality assurance practices (particularly bias audits and transparency measures) were rarely documented. The review proposes the DASEX framework to address these gaps and guide future integration of AI-driven characters in XR training.
Source: Europe PubMed Central