Testing and evaluation of health care applications of large language models

10.15.24 | JAMA Network

About The Study: Existing evaluations of large language models mostly focus on the accuracy of question answering for medical examinations, without consideration of real patient care data. Dimensions such as fairness, bias, and toxicity, as well as deployment considerations, received limited attention. Future evaluations should adopt standardized applications and metrics, use clinical data, and broaden focus to include a wider range of tasks and specialties.

Corresponding Author: To contact the corresponding author, Nigam H. Shah, MBBS, PhD, email nigam@stanford.edu.

To access the embargoed study: Visit our For The Media website at this link https://media.jamanetwork.com/

(doi:10.1001/jama.2024.21700)

Editor’s Note: Please see the article for additional information, including other authors, author contributions and affiliations, conflict of interest and financial disclosures, and funding and support.

# # #

Embed this link to provide your readers free access to the full-text article. This link will be live at the embargo time: https://jamanetwork.com/journals/jama/fullarticle/10.1001/jama.2024.21700?guestAccessKey=dcffc9b1-79d9-4409-9187-a60ea9c30a51&utm_source=For_The_Media&utm_medium=referral&utm_campaign=ftm_links&utm_content=tfl&utm_term=101524

JAMA

Contact Information

JAMA Network Media Relations
JAMA Network
mediarelations@jamanetwork.org

How to Cite This Article

APA:
JAMA Network. (2024, October 15). Testing and evaluation of health care applications of large language models. Brightsurf News. https://www.brightsurf.com/news/8OMD0PZ1/testing-and-evaluation-of-health-care-applications-of-large-language-models.html
MLA:
"Testing and evaluation of health care applications of large language models." Brightsurf News, Oct. 15 2024, https://www.brightsurf.com/news/8OMD0PZ1/testing-and-evaluation-of-health-care-applications-of-large-language-models.html.