Speaker: Alex Vasilescu (UCLA Computer Science)
Title: Multilinear (Tensor) Framework for Computer Vision, Computer Graphics, and Machine Learning
Abstract:
Most multivariate observable data such as images, videos, human
motion capture data, and speech are the result of the multiple
causal factors (hidden variables) that are not directly
measurable, but which are of interest in data analysis. These
causal factors can be fundamental physical or biological
processes that cause patterns of variation in the observational
data, which comprise a set of measurements or response variables
that are affected by the causal factors. For example, natural
images which comprise a set of pixels are the consequence of
multiple causal factors related to scene structure, illumination,
and imaging.
The key to solving statistical data analysis problems of the sort
that arise in the domains of computer graphics and vision is to
find a suitable multivariate data representation that facilitates
the analysis, visualization, compression, approximation,
recognition and/or interpretation of the observed data. This is
often done by applying suitable transformations to the
observational data space. Multilinear algebra offers a potent
mathematical framework for extracting and explicitly representing
the multifactor structure of image datasets.
Representations founded on linear transformations of the original observed data have traditionally been preferred due to their conceptual and computational simplicity based on linear algebra, the algebra of vectors and matrices. Linear transformations, such as PCA and independent components analysis (ICA) are limited in their ability to facilitate data analysis, however, since they are well suited for modeling observed data that results only from single-factor linear variation or from the linear combination of multiple sources.
Multilinear transformations, which subsume the common linear transformations as special cases, lead to generative models that explicitly capture how the observed data are influenced by multiple underlying causal factors. We will define data tensors, discuss tensor models (CP,Multilinear PCA,Multilinear ICA, Multilinear KPCA, Multilinear KICA) for data analysis, compression and rendering. Recognition is achieved with a novel Multilinear Projection Operator.
I will demonstrate the potency of our novel statistical learning approach in the context of facial image biometrics, where the relevant factors include different facial geometries, expressions, lighting conditions, and viewpoints. When applied to the difficult problem of automated face recognition, our multilinear representations, called TensorFaces (M-mode PCA) and Independent TensorFaces (MICA), yields significantly improved recognition rates relative to the standard PCA and ICA approaches.
Bio:
M. Alex O. Vasilescu is a Research Assistant Professor in the Computer
Science Department at UCLA. She received her education at MIT and the
University of Toronto. She was a research scientist at the MIT Media
Lab from 2005-07 and at New York University's Courant Insitute of
Mathematical Sciences from 2001-05. She has pioneered the developement
of multilinear (tensor) algebra for computer vision, computer graphics
and machine learning. She has published papers in computer vision and
computer graphics, particularly in the areas of face recognition,
human motion analysis/synthesis, image-based rendering, and
physics-based modeling (deformable models). She has given several
invited talks and keynote addresses about her work and has several
patents and patents pending. Her face recognition research, known as
TensorFaces, has been funded by the TSWG, the Department of Defense's
Combating Terrorism Support Program. MIT's Technology Review Magazine
named her to their 2003 TR100 List of Top 100 Young Innovators.
(www.media.mit.edu/~maov)