TY - RPRT AU - Si, Yafei AU - Meng, Yurun AU - Chen, Xi AU - An, Ruopeng AU - Mao, Limin AU - Li, Bingqin AU - Bateman, Hazel AU - Zhang, Han AU - Fan, Hongbin AU - Zu, Jiaqi AU - Gong, Shaoqing AU - Zhou, Zhongliang AU - Miao, Yudong TI - Quality, Safety, and Disparities of AI Chatbots in Managing Chronic Diseases: Experimental Evidence PY - 2025/Aug/ PB - Institute of Labor Economics (IZA) CY - Bonn T2 - IZA Discussion Paper IS - 18074 UR - https://www.iza.org/index.php/publications/dp18074 AB - The rapid development of AI solutions reveals opportunities to address the underdiagnosis and poor management of chronic conditions in developing settings. Using the method of simulated patients and experimental designs, we evaluate the quality, safety, and disparity of medical consultation with ERNIE Bot in China among 384 patient-AI trials. ERNIE Bot reached a diagnostic accuracy of 77.3%, correct drug prescriptions of 94.3%, but prescribed high rates of unnecessary medical tests (91.9%) and unnecessary medications (57.8%). Disparities were observed based on patient age and household economic status, with older and wealthier patients receiving more intensive care. Under standardized conditions, ERNIE Bot, ChatGPT, and DeepSeek demonstrated higher diagnostic accuracy but a greater tendency toward overprescription than human physicians. The results suggest the great potential of ERNIE Bot in empowering quality, accessibility, and affordability of healthcare provision in developing contexts but also highlight critical risks related to safety and amplification of sociodemographic disparities. ER -