Quality, Safety, and Disparities of AI Chatbots in Managing Chronic Diseases: Experimental Evidence

Cookie settings

Necessary

These necessary cookies are required to enable the core functionality of the website. Opting out of these cookies is not possible.

cb-enable

This cookie stores the user's cookie consent status for the current domain. Expiry: 1 year.

laravel_session

Stores the session ID to recognize the user when the page reloads and to restore their login session. Expiry: 2 hours.

XSRF-TOKEN

Provides CSRF protection for forms. Expiry: 2 hours.

Startseite
Publikationen
IZA Discussion Papers
Quality, Safety, and Disparities of AI Chatbots in Managing Chronic Diseases: Ex...

IZA Discussion Paper No. 18074

August 2025

Quality, Safety, and Disparities of AI Chatbots in Managing Chronic Diseases: Experimental Evidence

Yafei Si, Yurun Meng, Xi Chen, Ruopeng An, Limin Mao, Bingqin Li, Hazel Bateman, Han Zhang, Hongbin Fan, Jiaqi Zu, Shaoqing Gong, Zhongliang Zhou, Yudong Miao

published online in: npj Digital Medicine, 25 September 2025

The rapid development of AI solutions reveals opportunities to address the underdiagnosis and poor management of chronic conditions in developing settings. Using the method of simulated patients and experimental designs, we evaluate the quality, safety, and disparity of medical consultation with ERNIE Bot in China among 384 patient-AI trials. ERNIE Bot reached a diagnostic accuracy of 77.3%, correct drug prescriptions of 94.3%, but prescribed high rates of unnecessary medical tests (91.9%) and unnecessary medications (57.8%). Disparities were observed based on patient age and household economic status, with older and wealthier patients receiving more intensive care. Under standardized conditions, ERNIE Bot, ChatGPT, and DeepSeek demonstrated higher diagnostic accuracy but a greater tendency toward overprescription than human physicians. The results suggest the great potential of ERNIE Bot in empowering quality, accessibility, and affordability of healthcare provision in developing contexts but also highlight critical risks related to safety and amplification of sociodemographic disparities.

Download

Keywords

JEL Codes

C0 I10 I11 C90 C93