Trustworthy AI Lab
Trustworthy AI Lab
Welcome
News
People
Publications
Opportunities
Ali Soroush
Latest
Across generations, sizes, and types, large language models poorly report self-confidence in gastroenterology clinical reasoning tasks
Evaluating Prompt Engineering Techniques for Accuracy and Confidence Elicitation in Medical LLMs
The challenge of uncertainty quantification of large language models in medicine
Self-Reported Confidence of Large Language Models in Gastroenterology: Analysis of Commercial, Open-Source, and Quantized Models
Cite
×