Changes

Main Page (view source)

Revision as of 16:33, 3 September 2012

1,059 bytes added , 16:33, 3 September 2012

no edit summary

Line 102: Line 102:

Abstract from the paper:

We present a very efﬁcient, highly accurate, “Explicit Shape Regression” approach for face alignment. Unlike previous regression-based approaches, we directly learn a vectorial regression function to infer the whole facial shape (a set of facial landmarks) from the image and explicitly minimize the alignment errors over the training data. The inherent shape constraint is naturally encoded into the regressor in a cascaded learning framework and applied from coarse to ﬁne during the test, without using a ﬁxed parametric shape model as in most previous methods. To make the regression more effective and efﬁcient, we design a two-level boosted regression, shape-indexed features and a correlation-based feature selection method. This combination enables us to learn accurate models from large training data in a short time (20 minutes for 2,000 training images), and run regression extremely fast in test (15 ms for a 87 landmarks shape). Experiments on challenging data show that our approach signiﬁcantly outperforms the state-of-the-art in terms of both accuracy and efﬁciency.

+

===Combining Per-Frame and Per-Track Cues for Multi-Person Action Recognition===

+

Speaker: [http://www.umiacs.umd.edu/~sameh/ Sameh Khamis] -- Date: September 13, 2012

+

We propose a model to combine per-frame and per-track cues for action recognition. With multiple targets in a scene, our model simultaneously captures the natural harmony of an individual's action in a scene and the flow of actions of an individual in a video sequence, inferring valid tracks in the process. Our motivation is based on the unlikely discordance of an action in a structured scene, both at the track level (e.g., a person jogging then dancing) and the frame level (e.g., a person jogging in a dance studio). While we can utilize sampling approaches for inference in our model, we instead devise a global inference algorithm by decomposing the problem and solving the subproblems exactly and efficiently, recovering a globally optimal joint solution in several cases. Finally, we improve on the state-of-the-art action recognition results for two publicly available datasets.

Sameh

199

edits

Anonymous

Search

Changes

Namespaces

More

Page actions

Main Page (view source)

Revision as of 16:33, 3 September 2012

Navigation

Navigation

Wiki tools

Wiki tools

Anonymous

Search

Changes

Main Page (view source)

Revision as of 16:33, 3 September 2012

Navigation

Wiki tools

Page tools