Introduction Clinical documentation is a significant driver of burnout among physicians. Ambient artificial intelligence (AI) scribes, which leverage generative large language models to automate the creation of clinical notes from patient-physician conversations, are rapidly emerging as a potential solution.
While these tools promise to enhance efficiency and reduce administrative tasks, concerns persist about the quality, accuracy and potential biases of the notes they generate. A systematic synthesis of the evidence is now needed to evaluate the impact of these technologies in clinical practice.
This review will assess the effects of ambient AI scribes on physicians' clinical documentation. The specific objectives are to: (1) evaluate the effectiveness of these tools on documentation, including accuracy and completeness; (2) synthesise evidence on the impact on physician efficiency after adoption, including time spent on documentation; and (3) examine physicians' satisfaction with these tools, including physicians' perceived burden.
Methods and analysis A systematic review of quantitative or mixed-methods studies, including preprints, will be conducted.