Talk: Solving Real-World Tasks with AI Agents

Shuyan Zhou: PhD Candidate, Language Technologies Institute, CMU

Date

Monday, March 11, 2024

Time

12-1 p.m.

Location

1240 Computer Sciences

Description

LIVE STREAM: https://uwmadison.zoom.us/j/94411416574?pwd=K3dHQVR4L2NiY1FXaVU5TnpnVFY4dz09

Abstract: For years, my dream has been to create autonomous AI agents capable of carrying out tedious procedural tasks (e.g., arranging conference travel), allowing me to focus on more creative and exciting tasks. Modern AI models, especially large language models (LLMs) like ChatGPT, have suddenly brought us much closer to achieving such AI agents. But, has my dream already come true? In this talk, I will answer this question by delving into my systematic evaluation of AI agents in realistic tasks. The evaluation uncovers many critical limitations of AI agents, such as tool use, abstract reasoning, and knowledge cutoff. It suggests that LLMs are crucial yet early steps towards AI autonomy. To address these challenges, I will introduce my research of a more suitable “language” for AIs, which overcomes the inherent limitations of using natural language for task solving. Then, I will discuss my work on teaching AI agents to learn new tools by reading the tool documentation rather than direct demonstrations. Finally, I will discuss my future plans for comprehensive AI agent evaluations, agent foundations, and the application of AI agents to critical sectors in the real world.

Bio: Shuyan Zhou is a final-year PhD student at the Language Technologies Institute at CMU, advised by Graham Neubig. Her research in NLP and AI focuses on creating AI agents for real-world tasks, such as using computers and generating code. Her work has been recognized at top natural language processing and machine learning conferences and journals such as ICLR, ICML, ACL, EMNLP, and TACL. You can find more about her at https://shuyanzhou.com

Cost

Free

Calendar

Click a date to see events on that day.

		July
S	M	T	W	T	F	S
			1	2	3	4
5	6	7	8	9	10	11
12	13	14	15	16	17	18
19	20	21	22	23	24	25
26	27	28	29	30	31

Talk: Solving Real-World Tasks with AI Agents

Tags

Calendar

Search

Categories

Browse events by tag

Talk: Solving Real-World Tasks with AI Agents

Event Details

Tags

Calendar

View events by date

Search

Search for events

Categories

Browse events by tag