Trustworthy AI Lab
Trustworthy AI Lab
Welcome
News
People
Publications
Opportunities
Ali Soroush
Latest
Grounding Clinical AI Competency in Human Cognition Through the Clinical World Model and Skill-Mix Framework
Across generations, sizes, and types, large language models poorly report self-confidence in gastroenterology clinical reasoning tasks
Large language models versus classical machine learning performance in COVID-19 mortality prediction using high-dimensional tabular data
Evaluating Prompt Engineering Techniques for Accuracy and Confidence Elicitation in Medical LLMs
The challenge of uncertainty quantification of large language models in medicine
Self-Reported Confidence of Large Language Models in Gastroenterology: Analysis of Commercial, Open-Source, and Quantized Models
Cite
×