Conformal language modeling

In this paper, we propose a novel approach to conformal prediction (CP) that is adapted to generative, large language models (LLMs). Conformal prediction is a popular technique for deriving prediction sets from machine learning models that have rigorous, statistical performance guarantees. We extend conformal techniques to a broad class of language models that sample from a conditional distribution over the combinatorial, unbounded space of possible text outputs, given some input prompt. Specifically, we translate the process of constructing prediction sets into calibrating a "stopping rule", under which we draw diverse samples from our model until we are confident that the growing set of candidate answers includes at least one high-quality response. At the same time, we calibrate a "rejection rule" to selectively discard low-quality or redundant responses to reduce sample noise. Under minimal assumptions, we theoretically prove that our resulting output sets contain at least one high-quality answer with some desired probability that a user can set (such as 90%), while still remaining empirically precise on average. Furthermore, within this set of sampled candidate answers, we show that we can also accurately identify subsets of individual components (e.g., phrases or sentences) that are each independently correct (e.g., that are not "hallucinations"), again with provably high probability. We demonstrate the effectiveness of our approach on multiple types of large language models applied to tasks in open-domain question answering, text summarisation, and radiology report generation.
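The sampling procedure described above can be sketched as a simple loop: draw candidates one at a time, apply the rejection rule (drop low-quality or redundant samples), and stop once the stopping rule fires. The sketch below is purely illustrative and assumes placeholder functions (`sample_fn`, `quality_fn`, `similarity_fn`) and fixed thresholds; in the paper the thresholds are chosen by conformal calibration so that the coverage guarantee holds, which this toy code does not do.

```python
def conformal_sample_loop(sample_fn, quality_fn, similarity_fn,
                          lambda_reject=0.3, lambda_sim=0.9,
                          lambda_stop=0.8, max_samples=20):
    """Illustrative sketch of sampling with a rejection rule and a
    stopping rule. The three lambda thresholds stand in for values
    that conformal calibration would select; all names here are
    hypothetical, not the paper's API."""
    accepted = []
    for _ in range(max_samples):
        y = sample_fn()              # draw one candidate response
        score = quality_fn(y)        # estimated quality of candidate
        # Rejection rule, part 1: discard low-quality candidates.
        if score < lambda_reject:
            continue
        # Rejection rule, part 2: discard candidates that are
        # redundant with something already in the set.
        if any(similarity_fn(y, z) > lambda_sim for z in accepted):
            continue
        accepted.append(y)
        # Stopping rule: stop once we are confident the set contains
        # at least one high-quality response.
        if score >= lambda_stop:
            break
    return accepted
```

For example, with candidates scoring 0.1, 0.5, and 0.85 in turn, the first is rejected as low quality, the second is kept, and the third is kept and triggers the stopping rule.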

Details

author(s)
Regina Barzilay
Tommi Jaakkola
Adam Yala
publication date
16 June 2023
source
arXiv
related programme
MIT Jameel Clinic

