Structured output models for image segmentation
Aurelien Lucchi
Machine Learning Workshop (MLWS), IDIAP - EPFL
Monday November 19th, 2012
Collaborators: Yunpeng Li, Kevin Smith, Raphael Sznitman, Bohumil Maco, Graham Knott, Pascal Fua.
Outline
1. Review Conditional Random Fields (CRF)
2. Maximum likelihood training for CRFs
3. Maximum Margin Training for CRFs
   1. Cutting plane (Structured SVM)
   2. Online subgradient descent
1. Review CRF
Structured prediction
● Non structured output
  – inputs X can be any kind of objects
  – output y is a real number
● Prediction of complex outputs
  – Structured output y is complex (images, text, audio...)
● Ad hoc definition of structured data: data that consists of several parts, where not only the parts themselves contain information, but also the way in which the parts belong together
Slide courtesy: Christoph Lampert
Structured prediction for image segmentation
Histograms, Filter responses, ...
CRF for image segmentation
Maximum-a-posteriori (MAP) solution:
[Figure panels: Data (D), Unary likelihood, Pair-wise Terms, MAP Solution]
Boykov and Jolly [ICCV 2001], Blake et al. [ECCV 2004]
Slide courtesy: Pushmeet Kohli
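The equation on this slide did not survive extraction. As a hedged reminder, the form commonly used for pairwise CRF segmentation is sketched below; the symbols D_i (unary term), V_ij (pairwise term), and the neighborhood set N are assumptions chosen for illustration, not necessarily the slide's notation.

```latex
% Assumed notation: y_i is the label of node i, \mathcal{N} the set of neighboring node pairs.
E(y) = \sum_{i} D_i(y_i) + \sum_{(i,j) \in \mathcal{N}} V_{ij}(y_i, y_j),
\qquad
y^{\mathrm{MAP}} = \operatorname*{arg\,min}_{y} E(y)
```

Minimizing E(y) is the same as maximizing the posterior when p(y | D) is proportional to exp(-E(y)).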
CRF for image segmentation
Pair-wise terms: favor the same label for neighboring nodes.
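A minimal sketch of one pairwise term with this behavior, assuming a Potts penalty on a 4-connected grid (the slides do not say which pairwise term is used; the function name and the `weight` parameter are hypothetical):

```python
def potts_pairwise_energy(labels, weight=1.0):
    """Sum a constant penalty over 4-connected neighbor pairs with different labels.

    `labels` is a 2D list of integer labels. The Potts form is an assumption
    made for illustration only.
    """
    rows, cols = len(labels), len(labels[0])
    energy = 0.0
    for r in range(rows):
        for c in range(cols):
            # Right neighbor
            if c + 1 < cols and labels[r][c] != labels[r][c + 1]:
                energy += weight
            # Bottom neighbor
            if r + 1 < rows and labels[r][c] != labels[r + 1][c]:
                energy += weight
    return energy


smooth = [[0, 0, 1],
          [0, 0, 1]]
noisy = [[0, 1, 0],
         [1, 0, 1]]
print(potts_pairwise_energy(smooth), potts_pairwise_energy(noisy))
```

The smooth labeling pays only for the single boundary between its two regions, while the checkerboard labeling pays on every edge.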
Energy minimization
● MAP inference for discrete graphical models:
  ● Dynamic programming
    – Exact on non loopy graphs
  ● Graph-cuts (Boykov, 2001)
    – Optimal solution if energy function is submodular
  ● Belief propagation (Pearl, 1982)
    – No theoretical guarantees on loopy graphs but seems to work well in practice.
  ● Mean field (root in statistical physics)
  ● ...
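To illustrate the dynamic programming case, here is a minimal min-sum (Viterbi-style) sketch for a chain, the simplest non loopy graph. The function name `chain_map`, the cost tables, and the toy numbers are assumptions for illustration, not taken from the talk.

```python
def chain_map(unary, pairwise):
    """Exact MAP labeling of a chain by dynamic programming.

    unary[i][k]    : cost of assigning label k to node i
    pairwise[j][k] : cost of labels (j, k) on two adjacent nodes
    Returns (labels, energy) for the minimum-energy labeling.
    """
    n, num_labels = len(unary), len(unary[0])
    cost = list(unary[0])   # best cost of any prefix ending in each label
    back = []               # back-pointers, one list per node after the first
    for i in range(1, n):
        new_cost, ptr = [], []
        for k in range(num_labels):
            best_j = min(range(num_labels), key=lambda j: cost[j] + pairwise[j][k])
            new_cost.append(cost[best_j] + pairwise[best_j][k] + unary[i][k])
            ptr.append(best_j)
        cost = new_cost
        back.append(ptr)
    # Trace the optimal labeling back from the best final label.
    last = min(range(num_labels), key=lambda k: cost[k])
    energy = cost[last]
    labels = [last]
    for ptr in reversed(back):
        labels.append(ptr[labels[-1]])
    labels.reverse()
    return labels, energy


unary = [[0.0, 2.0], [1.5, 1.0], [0.0, 2.0]]
potts = [[0.0, 1.0], [1.0, 0.0]]
labels, energy = chain_map(unary, potts)
print(labels, energy)
```

In this toy example the middle node's unary cost slightly prefers label 1, but the pairwise term makes the uniform labeling cheaper overall. The running time is linear in the chain length and quadratic in the number of labels.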
Training a structured model ?
● First rewrite the energy function as a log-linear model.
● Efficient learning/training: we need to efficiently learn the parameters w from training data.
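The slide's equation was lost in extraction. A common way to write a log-linear model is sketched below; the joint feature map Ψ, the unary/pairwise feature split, and the normalizer Z are assumed notation and may differ from the slide.

```latex
% Assumed notation: \Psi(x, y) stacks unary and pairwise features, w stacks the matching weights.
E_w(x, y) = w^{\top} \Psi(x, y)
          = \sum_{i} w_u^{\top} \phi_u(x, y_i)
          + \sum_{(i,j) \in \mathcal{N}} w_p^{\top} \phi_p(x, y_i, y_j),
\qquad
p(y \mid x; w) = \frac{\exp\bigl(-E_w(x, y)\bigr)}{Z(x, w)}
```

The energy is linear in w, which is what makes both maximum likelihood and maximum-margin training tractable to formulate.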
Training a structured model ?
● Energy function is parametrized by vector w
[Figure: example values of the parameter vector w, with labelings marked "Low energy" and "High energy"]
2. Maximum likelihood training
Maximum likelihood
Note: We assumed that p is a Gibbs distribution
Maximum likelihood
● L(w) is differentiable and convex (its Hessian is positive semi-definite), so gradient descent can find a global optimum.
● For general CRFs, computing the derivative is still a problem because the number of possible configurations for y is typically exponentially large.
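The cost of the derivative can be made concrete with a brute-force sketch. It assumes a Gibbs distribution p(y | x; w) ∝ exp(-wᵀΨ(x, y)) and takes L(w) to be the negative log-likelihood; the feature map `features` and all names below are hypothetical. The gradient is the feature vector of the true labeling minus the expected feature vector under the model, and that expectation is computed here by enumerating every labeling.

```python
import itertools
import math


def features(x, y):
    """Hypothetical joint feature map for a chain: [unary mismatch, label changes]."""
    unary = sum(abs(xi - yi) for xi, yi in zip(x, y))
    pair = sum(1.0 for a, b in zip(y, y[1:]) if a != b)
    return [unary, pair]


def energy(w, x, y):
    return sum(wi * fi for wi, fi in zip(w, features(x, y)))


def nll_and_grad(w, x, y_true, num_labels=2):
    """Negative log-likelihood and its gradient, by enumerating all labelings."""
    configs = list(itertools.product(range(num_labels), repeat=len(x)))
    weights = [math.exp(-energy(w, x, y)) for y in configs]
    Z = sum(weights)
    expected = [0.0] * len(w)
    for y, wt in zip(configs, weights):
        for d, f in enumerate(features(x, y)):
            expected[d] += wt / Z * f
    nll = energy(w, x, y_true) + math.log(Z)
    grad = [f - e for f, e in zip(features(x, y_true), expected)]
    return nll, grad, len(configs)


w = [1.0, 0.5]
x = [0.1, 0.2, 0.9, 0.8]
y_true = (0, 0, 1, 1)
nll, grad, num_configs = nll_and_grad(w, x, y_true)
print(nll, grad, num_configs)
```

With 4 binary nodes the sum runs over 2^4 = 16 labelings; for an image with thousands of nodes the same sum is intractable, which motivates the alternatives that follow.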
Training a structured model ?
● Other solutions exist:
  – Pseudo-likelihood
  – Variational approximation
  – Contrastive divergence
  – Maximum-margin framework (e.g. Structured SVM)
3.1. Maximum Margin Training of Structured Models: cutting plane (structured SVM)
Structured SVM
● Given a set of N training examples with ground truth labels, we can write one set of constraints per example, stating that the energy of the correct labeling is at least as low as the energy of any incorrect labeling.
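The constraint itself was lost in extraction. Written out under assumed notation (training pairs (x^n, y^n), energy E_w), the verbal statement corresponds to the first line below. The second line is the margin-rescaled form commonly used in structured SVMs, with a task loss Δ and slack variables ξ_n; whether the talk uses exactly this variant is an assumption.

```latex
% Constraint stated in words on the slide (assumed notation):
\forall n \in \{1, \dots, N\},\ \forall y \neq y^{n}: \quad
E_w(x^{n}, y^{n}) \le E_w(x^{n}, y)

% Common margin-rescaled relaxation (assumed, not confirmed by the source):
\forall n,\ \forall y \neq y^{n}: \quad
E_w(x^{n}, y) - E_w(x^{n}, y^{n}) \ge \Delta(y^{n}, y) - \xi_n,
\qquad \xi_n \ge 0
```

There is one constraint per training example and per candidate labeling, so the full set is exponentially large. The cutting plane approach named in the section title handles this by adding constraints incrementally rather than all at once.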