Skip to main content

*CANCELLED* Machine Learning Lunch Meeting

Towards Plurality: Foundations for Learning from Diverse Human Preferences

Event Details

Date
Thursday, March 14, 2024
Time
1 p.m.
Description

*CANCELLED* will resume next week

Everyone is invited to the weekly machine learning lunch meetings, where our faculty members from Computer Science, Statistics, ECE, and other departments will discuss their latest groundbreaking research in machine learning. This is an opportunity to network with faculty and fellow researchers while learning about the cutting-edge research being conducted at our university. See https://sites.google.com/view/wiscmllm/home for more information.

Speaker: Ramya Vinayak (ECE)

Abstract: Large pre-trained models trained on internet-scale data are often not ready for safe deployment out of the box. They are heavily fine-tuned and aligned using large quantities of human preference data. When we want to align an AI/ML model to human preference or values,  it is worthwhile to ask whose preference and values we are aligning it to? Recently, the limitations of current approaches due to their inherent uniformity assumption have been highlighted and the need for plurality – capturing the diversity in human preferences and values – is getting recognized as an important challenge to address. While alignment from human preferences has currently become a very active area of research, it is worthwhile to note that there is a rich literature on learning preferences from human judgments using comparison queries. It plays a crucial role in several applications ranging from cognitive and behavioral psychology, crowdsourcing democracy, surveys in social science applications, and recommendation systems. However, the models in the literature often focus on learning average preference over the population due to the limitations on the amount of data available per individual and also assume the knowledge of the metric or way humans judge similarity and dissimilarity.

In this talk, I will discuss some recent results that focus on how we can reliably capture diversity in preferences while pooling together data from individuals. In particular, I will talk about fundamental questions in two directions: (1) Simultaneous metric and preference learning where the goal is to learn an unknown but shared metric from preference queries while the preferences are diverse and also unknown. (2) Learning distribution of preferences over a population with a single comparison query per individual.

Cost
Free

Tags