Where's Waldo: Matching People in Images of Crowds
Rahul Garg
Deva Ramanan
Steven M. Seitz
Noah Snavely
Proc. IEEE Conf. on Computer Vision and Pattern Recognition (2011), pp. 1793-1800
Abstract
Given a community-contributed set of photos of a
crowded public event, this paper addresses the problem of
finding all images of each person in the scene. This problem
is very challenging due to large changes in camera viewpoints, severe occlusions, low resolution and photos from
tens or hundreds of different photographers. Despite these
challenges, the problem is made tractable by exploiting a
variety of visual and contextual cues – appearance, timestamps, camera pose and co-occurrence of people. This paper demonstrates an approach that integrates these cues to
enable high quality person matching in community photo
collections downloaded from Flickr.com
crowded public event, this paper addresses the problem of
finding all images of each person in the scene. This problem
is very challenging due to large changes in camera viewpoints, severe occlusions, low resolution and photos from
tens or hundreds of different photographers. Despite these
challenges, the problem is made tractable by exploiting a
variety of visual and contextual cues – appearance, timestamps, camera pose and co-occurrence of people. This paper demonstrates an approach that integrates these cues to
enable high quality person matching in community photo
collections downloaded from Flickr.com
Research Areas
Learn more about how we conduct our research
We maintain a portfolio of research projects, providing individuals and teams the freedom to emphasize specific types of work.
Our research philosophy
Philosophy-light-banner