Talk: Learning Spatio-Temporal Representations for Video Understanding

Du Tran: Research Scientist, Facebook AI; Ph.D, Computer Science, Dartmouth College; M.S, Computer Science, University of Illinois at Urbana-Champaign

Date

Thursday, April 8, 2021

Time

4-5 p.m.

Location

Online
Also offered online

Description

Abstract: Video understanding is one of the fundamental problems in computer vision with various applications, including autonomous vehicles, robot learning, and visual perception. Compared with traditional image understanding, video understanding: (i) has higher model complexity and requires learning from a much larger amount of data; (ii) requires more expensive annotations; (iii) and sometimes demands multimodal modeling, e.g., audiovisual modeling instead of visual only. In this talk, I will present some of our approaches addressing these challenges, such as efficient and scalable spatiotemporal learning, cross-modal self-supervised learning of video and audio representations, and multimodal learning. Finally, I will outline several potential future research directions in this area.

Bio: Du Tran is a staff research scientist at Facebook AI. He graduated with a Ph.D. in computer science from Dartmouth College and an M.S. in computer science from the University of Illinois at Urbana-Champaign, receiving the Dartmouth Presidential Fellowship and the Vietnam Education Fellowship. His research interests are in computer vision, machine learning, and computer graphics, with specific interests in video understanding, representation learning, and multimodal modeling. His work on C3D was instrumental in steering the field towards the widespread adoption of 3D CNNs as the model of choice for video analysis. His video understanding architectures have been deployed in production at Facebook, to process hundreds of millions of videos daily for various tasks, including video classification, violence prediction, and advertisement ranking.

Cost

Free

Calendar

Click a date to see events on that day.

		August
S	M	T	W	T	F	S
						1
2	3	4	5	6	7	8
9	10	11	12	13	14	15
16	17	18	19	20	21	22
23	24	25	26	27	28	29
30	31

Talk: Learning Spatio-Temporal Representations for Video Understanding

Tags

Calendar

Search

Categories

Browse events by tag

Talk: Learning Spatio-Temporal Representations for Video Understanding

Event Details

Tags

Calendar

View events by date

Search

Search for events

Categories

Browse events by tag