Changes

534 bytes added ,  16:13, 15 April 2013
Line 147: Line 147:     
Additionally, we present a method for extracting the geo-spatial trajectory of a moving camera in a city from videos in the wild such as typical YouTube clips. First, we divide the video into smaller segments and localize each one individually. Then, we fuse the information from different segments utilizing a Bayesian formulation to have a temporally consistent trajectory. Lastly, we perform a post processing by a novel non-model-based trajectory reconstruction method based on Minimum Spanning trees; we argue that such post processing is essential for addressing the problems that the basic Bayesian formulation faces due to having a predefined underlying motion model, while the motion of camera in wild videos does not necessary follow any pattern.
 
Additionally, we present a method for extracting the geo-spatial trajectory of a moving camera in a city from videos in the wild such as typical YouTube clips. First, we divide the video into smaller segments and localize each one individually. Then, we fuse the information from different segments utilizing a Bayesian formulation to have a temporally consistent trajectory. Lastly, we perform a post processing by a novel non-model-based trajectory reconstruction method based on Minimum Spanning trees; we argue that such post processing is essential for addressing the problems that the basic Bayesian formulation faces due to having a predefined underlying motion model, while the motion of camera in wild videos does not necessary follow any pattern.
 +
 +
===Analyzing 3D objects in cluttered images===
 +
Speaker: [http://www.umiacs.umd.edu/~sshekha/ Sumit Shekhar] : April 18, 2013
 +
Data-driven dictionaries have produced state-of-the-art results in various classification tasks. However, when the target data has a different distribution than the source data, the learned sparse representation may not be optimal. In this talk, I will discuss a technique to learn a joint dictionary which can work well for the target data as well, and present some results on face and object recognition.
 +
    
===Analyzing 3D objects in cluttered images===
 
===Analyzing 3D objects in cluttered images===
 
Speaker: [http://www.ics.uci.edu/~shejrati/ Mohsen Hejrati] : April 25, 2013
 
Speaker: [http://www.ics.uci.edu/~shejrati/ Mohsen Hejrati] : April 25, 2013
   
We present an approach to detecting and analyzing the 3D configuration of objects in real-world images with heavy occlusion and clutter. We focus on the application of finding and analyzing cars. We do so with a two-stage framework; the first stage reasons about 2D shape and appearance variation due to within-class variation (station wagons look different than sedans) and changes in viewpoint. Rather than using a view-based model, we describe a compositional representation that models a large number of effective views and shapes using a small number of local view-based templates. We use this model to propose candidate detections and 2D estimates of shape. These estimates are then refined by our second stage, using an explicit 3D model of shape and viewpoint. We use a morphable model to capture 3D within-class variation, and use a weak-perspective camera model to capture viewpoint. We learn all model parameters from 2D annotations. We demonstrate state-of-the-art accuracy for detection, viewpoint estimation, and 3D shape reconstruction on challenging images from the PASCAL VOC 2011 dataset.
 
We present an approach to detecting and analyzing the 3D configuration of objects in real-world images with heavy occlusion and clutter. We focus on the application of finding and analyzing cars. We do so with a two-stage framework; the first stage reasons about 2D shape and appearance variation due to within-class variation (station wagons look different than sedans) and changes in viewpoint. Rather than using a view-based model, we describe a compositional representation that models a large number of effective views and shapes using a small number of local view-based templates. We use this model to propose candidate detections and 2D estimates of shape. These estimates are then refined by our second stage, using an explicit 3D model of shape and viewpoint. We use a morphable model to capture 3D within-class variation, and use a weak-perspective camera model to capture viewpoint. We learn all model parameters from 2D annotations. We demonstrate state-of-the-art accuracy for detection, viewpoint estimation, and 3D shape reconstruction on challenging images from the PASCAL VOC 2011 dataset.
  
50

edits