Reconstruction and Clustering with Graph optimization and Priors on

Jul 3, 2017 - We note xi,j the binary label of edge presence: xi,j = {. 1 if ei,j ∈ E∗, ... We look for a discrete solution for x ⇔ x ∈ {0,1}E. July 3th ...... Page 81 ...
6MB taille 2 téléchargements 296 vues
Reconstruction and Clustering with Graph optimization and Priors on Gene Networks and Images Aurélie Pirayre

PhD supervisors:

Frédérique BIDARD-MICHELOT Camille COUPRIE Laurent DUVAL Jean-Christophe PESQUET

IFP Energies nouvelles Facebook A.I. Research IFP Energies nouvelles CentraleSupélec

July 3th , 2017

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

1 / 45

I NTRODUCTION

An overview

Gene regulatory networks

Signals and images

Our framework

Variational

Bayes variational

Method

BRANE

HOGMep

Reconstruction

Clustering

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

2 / 45

I NTRODUCTION

Context

Biological motivation Second generation bio-fuel production Cellulases from Trichoderma reesei Lignocellulosic Biomass

Pre-

trea

tme n

t

Cellulose Hemi-cellulose

July 3th , 2017

atic ym Enz sis roly Hyd

Sugar F erm ent a

tio

n

Ethanol

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

g xin Mi s uel hf wit

Bio-fuels

3 / 45

I NTRODUCTION

Context

Biological motivation Second generation bio-fuel production Cellulases from Trichoderma reesei Lignocellulosic Biomass

Pre-

trea

tme n

t

Cellulose Hemi-cellulose

atic ym Enz sis roly Hyd

Sugar F erm ent a

tio

n

Ethanol

g xin Mi s uel hf wit

Bio-fuels

Improve Trichoderma reseei cellulase production Understand cellulase production mechanisms July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

3 / 45

I NTRODUCTION

Context

Biological motivation Second generation bio-fuel production Cellulases from Trichoderma reesei Lignocellulosic Biomass

Pre-

trea

tme n

t

Cellulose Hemi-cellulose

atic ym Enz sis roly Hyd

Sugar F erm ent a

tio

n

Ethanol

g xin Mi s uel hf wit

Bio-fuels

Improve Trichoderma reseei cellulase production Understand cellulase production mechanisms ⇒ Use of Gene Regulatory Network (GRN) July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

3 / 45

I NTRODUCTION

GRN overview

What is a Gene Regulatory Network (GRN)? GRN: a graph G(V, E) V = {v1 , . . . , vG }: a set of G nodes (corresponding to genes) E: a set of edges (corresponding to interactions between genes)

v1

v2

July 3th , 2017

v3

vX

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

4 / 45

I NTRODUCTION

GRN overview

What is a Gene Regulatory Network (GRN)? GRN: a graph G(V, E) V = {v1 , . . . , vG }: a set of G nodes (corresponding to genes) E: a set of edges (corresponding to interactions between genes)

A gene regulatory network... v1

v2

v3

vX

... models biological gene regulatory mechanisms DNA Gene 1

Gene 2

Gene 3

TF2

TF3

Gene X

mRNA

Protein

July 3th , 2017

TF1





TFX ⊕⊕

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

4 / 45

I NTRODUCTION

Data overview

What biological data can be used? For a given experimental condition, transcriptomic data answer to: which genes are expressed? in which amount?

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

5 / 45

I NTRODUCTION

Data overview

What biological data can be used? For a given experimental condition, transcriptomic data answer to: which genes are expressed? in which amount? How to obtain transcriptomic data? Microarray and RNAseq experiments

z −0.948  0.737  M = −0.253  3.747 1.383 

July 3th , 2017

S conditions }| −0.013 . . . −1.308 0.619 . . . −0.141 −0.175 . . . −0.859 1.115 . . . −0.418 1.184 . . . −0.493

{ −0.977 −0.803 −0.595 −0.084 −0.562

          

G genes

What do transcriptomic data look like? Gene expression data (GED): G genes × S conditions

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

5 / 45

I NTRODUCTION

Links between data and GRNs

How to use GED to produce a GRN ? sj 

From gene expression data...

July 3th , 2017

M=

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

gi

  ···

. . . mi,j

  , 

6 / 45

I NTRODUCTION

Links between data and GRNs

How to use GED to produce a GRN ? sj 

From gene expression data...

M=

gi

  ···

. . . mi,j

  , 

V = {v1 , · · · , vG } a set of vertices (genes) and E a set of edges

leading to a complete graph...

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

6 / 45

I NTRODUCTION

Links between data and GRNs

How to use GED to produce a GRN ? sj 

From gene expression data...

M=

gi

  ···

. . . mi,j

  , 

V = {v1 , · · · , vG } a set of vertices (genes) and E a set of edges Each edge ei,j is weighted by ωi,j gj

leading to a complete weighted graph...

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

 W=

gi

  ···

. . . ωi,j

  , 

6 / 45

I NTRODUCTION

Links between data and GRNs

How to use GED to produce a GRN ? sj 

From gene expression data...

M=

gi

  ···



. . . mi,j

 , 

V = {v1 , · · · , vG } a set of vertices (genes) and E a set of edges Each edge ei,j is weighted by ωi,j gj

leading to a complete weighted graph...

 W=

gi

  ···

. . . ωi,j

  , 

We look for a subset of edges E ∗ reflecting regulatory links between genes to infer a meaningful gene network.

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

( Wi,j =

1 0

if ei,j ∈ E ∗ otherwise.

6 / 45

I NTRODUCTION

Difficulties in GRN inference

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

7 / 45

I NTRODUCTION

Difficulties in GRN inference

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

7 / 45

I NTRODUCTION

Difficulties in GRN inference

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

7 / 45

I NTRODUCTION

Difficulties in GRN inference

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

7 / 45

I NTRODUCTION

Our BRANE strategy What is the subset of edges E ∗ reflecting real regulatory links between genes? ⇒ what is the binary adjacency matrix W ∈ {0, 1}G×G ?

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

8 / 45

I NTRODUCTION

Our BRANE strategy What is the subset of edges E ∗ reflecting real regulatory links between genes? ⇒ what is the binary adjacency matrix W ∈ {0, 1}G×G ? ( 1 if ei,j ∈ E ∗ , We note xi,j the binary label of edge presence: xi,j = 0 otherwise.

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

8 / 45

I NTRODUCTION

Our BRANE strategy What is the subset of edges E ∗ reflecting real regulatory links between genes? ⇒ what is the binary adjacency matrix W ∈ {0, 1}G×G ? ( 1 if ei,j ∈ E ∗ , We note xi,j the binary label of edge presence: xi,j = 0 otherwise. ( ∗ = 1 if ωi,j > λ, Classical thresholding: xi,j 0 otherwise.

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

8 / 45

I NTRODUCTION

Our BRANE strategy What is the subset of edges E ∗ reflecting real regulatory links between genes? ⇒ what is the binary adjacency matrix W ∈ {0, 1}G×G ? ( 1 if ei,j ∈ E ∗ , We note xi,j the binary label of edge presence: xi,j = 0 otherwise. ( ∗ = 1 if ωi,j > λ, Classical thresholding: xi,j 0 otherwise. Given by a cost function for given weights ω: maximize E x∈{0,1}

July 3th , 2017

P

ωi,j xi,j + λ(1 − xi,j )

(i,j)∈V2

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

8 / 45

I NTRODUCTION

Our BRANE strategy What is the subset of edges E ∗ reflecting real regulatory links between genes? ⇒ what is the binary adjacency matrix W ∈ {0, 1}G×G ? ( 1 if ei,j ∈ E ∗ , We note xi,j the binary label of edge presence: xi,j = 0 otherwise. ( ∗ = 1 if ωi,j > λ, Classical thresholding: xi,j 0 otherwise. Given by a cost function for given weights ω: maximize E x∈{0,1}

July 3th , 2017

P (i,j)∈V2

ωi,j xi,j + λ(1 − xi,j ) ⇔ minimize E x∈{0,1}

P

ωi,j (1 − xi,j ) + λ xi,j

(i,j)∈V2

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

8 / 45

I NTRODUCTION

Our BRANE strategy BRANE: Biologically Related A priori Network Enhancement Extend classical thresholding Integrate biological priors into the functional to be optimized Enforce modular networks Additional knowledge: Transcription factors (TFs): regulators Non transcription factors (TFs): targets

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

9 / 45

I NTRODUCTION

Our BRANE strategy BRANE: Biologically Related A priori Network Enhancement Extend classical thresholding Integrate biological priors into the functional to be optimized Enforce modular networks Additional knowledge: Transcription factors (TFs): regulators Non transcription factors (TFs): targets Method

a priori

Formulation

Algorithm

Inference

BRANE Cut BRANE Relax

Gene co-regulatiton TF-connectivity

Discrete Continuous

Maximal flow Proximal method

Joint inference and clustering

BRANE Clust

Gene grouping

Mixed

Alternating scheme

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

9 / 45

BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS

A discrete method: BRANE Cut We look for a discrete solution for x ⇔ x ∈ {0, 1}E

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

10 / 45

BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS

A priori

A priori: modular structure and gene co-regulation minimize x∈{0,1}E

July 3th , 2017

P

ωi,j ϕ(xi,j − 1) + λi,j ϕ(xi,j ) + µΨ(xi,j )

(i,j)∈V2

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

11 / 45

BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS

A priori

A priori: modular structure and gene co-regulation minimize x∈{0,1}E

P

ωi,j ϕ(xi,j − 1) + λi,j ϕ(xi,j ) + µΨ(xi,j )

(i,j)∈V2

Modular network: favors links between TFs and TFs   2η λi,j = 2λTF   λTF + λTF

July 3th , 2017

if (i, j) ∈ / T2 if (i, j) ∈ T2 otherwise.

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

11 / 45

BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS

A priori

A priori: modular structure and gene co-regulation minimize x∈{0,1}E

P

ωi,j ϕ(xi,j − 1) + λi,j ϕ(xi,j ) + µΨ(xi,j )

(i,j)∈V2

Modular network: favors links between TFs and TFs   2η λi,j = 2λTF   λTF + λTF

if (i, j) ∈ / T2 if (i, j) ∈ T2 otherwise.

with: T: the set of TF indices η > max {ωi,j | (i, j) ∈ V2 } λTF > λTF

A linear relation is sufficient: λTF = βλTF with β = July 3th , 2017

|V| |T |

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

11 / 45

BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS

A priori

A priori: modular structure and gene co-regulation minimize E x∈{0,1}

July 3th , 2017

P

ωi,j ϕ(xi,j − 1) + λi,j ϕ(xi,j ) + µΨ(xi,j )

(i,j)∈V2

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

12 / 45

BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS

A priori

A priori: modular structure and gene co-regulation minimize E x∈{0,1}

P

ωi,j ϕ(xi,j − 1) + λi,j ϕ(xi,j ) + µΨ(xi,j )

(i,j)∈V2

Gene co-regulation: favors edge coupling Ψ(xi,j ) =

X 0

ρi,j,j0 |xi,j − xi,j0 | 2

(j,j )∈T i∈V\T

ρi,j,j0 : co-regulation probability

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

12 / 45

BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS

A priori

A priori: modular structure and gene co-regulation minimize E x∈{0,1}

P

ωi,j ϕ(xi,j − 1) + λi,j ϕ(xi,j ) + µΨ(xi,j )

(i,j)∈V2

Gene co-regulation: favors edge coupling Ψ(xi,j ) =

X 0

ρi,j,j0 |xi,j − xi,j0 | 2

(j,j )∈T i∈V\T

ρi,j,j0 : co-regulation probability with X ρi,j,j0 =

1(min{ωj,j0 , ωj,k , ωj0 ,k } > γ)

k∈V\(T ∪{i}) |V\T |−1

γ: the (|V| − 1)th of the normalized weights ω July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

12 / 45

BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS

Formulation and resolution

A maximal flow for a minimum cut formulation minimize x∈{0,1}E

July 3th , 2017

P

ωi,j |xi,j − 1| + λi,j xi,j +

(i,j)∈V2 j>i

P

ρi,j,j0 |xi,j − xi,j0 |

i∈V\T (j,j0 )∈T2 , j0 >j

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

13 / 45

BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS

Formulation and resolution

A maximal flow for a minimum cut formulation minimize x∈{0,1}E

July 3th , 2017

P

ωi,j |xi,j − 1| + λi,j xi,j +

(i,j)∈V2 j>i

P

ρi,j,j0 |xi,j − xi,j0 |

i∈V\T (j,j0 )∈T2 , j0 >j

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

13 / 45

BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS

Formulation and resolution

A maximal flow for a minimum cut formulation minimize x∈{0,1}E

P

ωi,j |xi,j − 1| + λi,j xi,j +

(i,j)∈V2 j>i

P

s

July 3th , 2017

ρi,j,j0 |xi,j − xi,j0 |

i∈V\T (j,j0 )∈T2 , j0 >j

t

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

13 / 45

BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS

Formulation and resolution

A maximal flow for a minimum cut formulation minimize x∈{0,1}E

P

ωi,j |xi,j − 1| + λi,j xi,j +

(i,j)∈V2 j>i

P

ρi,j,j0 |xi,j − xi,j0 |

i∈V\T (j,j0 )∈T2 , j0 >j

x1,2 x1,3 x1,4 s x2,3

t

x2.4 x3,4 July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

13 / 45

BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS

Formulation and resolution

A maximal flow for a minimum cut formulation minimize x∈{0,1}E

P

ωi,j |xi,j − 1| + λi,j xi,j +

(i,j)∈V2 j>i

P

ρi,j,j0 |xi,j − xi,j0 |

i∈V\T (j,j0 )∈T2 , j0 >j

x1,2

s

8 5 5 10 5 1

x1,3 x1,4 x2,3

t

x2.4 x3,4 July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

13 / 45

BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS

Formulation and resolution

A maximal flow for a minimum cut formulation minimize x∈{0,1}E

P

P

ωi,j |xi,j − 1| + λi,j xi,j +

(i,j)∈V2 j>i

ρi,j,j0 |xi,j − xi,j0 |

i∈V\T (j,j0 )∈T2 , j0 >j

x1,2

s

8 5 5 10 5 1

x1,3

v1

x1,4

v2

x2,3

v3

x2.4

v4

t

x3,4 July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

13 / 45

BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS

Formulation and resolution

A maximal flow for a minimum cut formulation minimize x∈{0,1}E

P

P

ωi,j |xi,j − 1| + λi,j xi,j +

(i,j)∈V2 j>i

ρi,j,j0 |xi,j − xi,j0 |

i∈V\T (j,j0 )∈T2 , j0 >j

x1,2

s

8 5 5 10 5 1

x1,3

v1

x1,4

v2

x2,3

v3

x2.4

v4

∞ ∞ ∞

t



x3,4 July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

13 / 45

BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS

Formulation and resolution

A maximal flow for a minimum cut formulation minimize x∈{0,1}E

P (i,j)∈V2 j>i

ρi,j,j0 |xi,j − xi,j0 |

i∈V\T (j,j0 )∈T2 , j0 >j

x1,2

s

P

ωi,j |xi,j − 1| + λi,j xi,j +

8 5 5 10 5 1

η=6 λTF = 3 λTF = 1

x1,3

v1

x1,4

v2

x2,3

v3

x2.4

v4

∞ ∞ ∞

t



x3,4 July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

13 / 45

BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS

Formulation and resolution

A maximal flow for a minimum cut formulation minimize x∈{0,1}E

P (i,j)∈V2 j>i

ρi,j,j0 |xi,j − xi,j0 |

i∈V\T (j,j0 )∈T2 , j0 >j

x1,2

s

P

ωi,j |xi,j − 1| + λi,j xi,j +

8 5 5 10 5 1

η=6 λTF = 3 λTF = 1

x1,3

v1

x1,4

v2

x2,3

v3

x2.4

v4

∞ ∞ ∞

t



3 x3,4

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

13 / 45

BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS

Formulation and resolution

A maximal flow for a minimum cut formulation minimize x∈{0,1}E

P (i,j)∈V2 j>i

ρi,j,j0 |xi,j − xi,j0 |

i∈V\T (j,j0 )∈T2 , j0 >j

x1,2

s

P

ωi,j |xi,j − 1| + λi,j xi,j +

8 5 5 10 5 1

η=6 λTF = 3 λTF = 1

x1,3

v1

x1,4

v2

x2,3

v3

x2.4

v4

∞ ∞ ∞

t



3 x3,4

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

13 / 45

BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS

Formulation and resolution

A maximal flow for a minimum cut formulation minimize x∈{0,1}E

P

P

ωi,j |xi,j − 1| + λi,j xi,j +

(i,j)∈V2 j>i

x1,2

s=1

ρi,j,j0 |xi,j − xi,j0 |

i∈V\T (j,j0 )∈T2 , j0 >j

8 5 5 10 5 1

η=6 λTF = 3 λTF = 1

x1,3

v1

x1,4

v2



x2,3

v3



x2.4

v4

∞ ∞

t=0

3 x3,4

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

13 / 45

BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS

Formulation and resolution

A maximal flow for a minimum cut formulation minimize x∈{0,1}E

P

P

ωi,j |xi,j − 1| + λi,j xi,j +

(i,j)∈V2 j>i

η=6 λTF = 3 λTF = 1

1 8

s=1

ρi,j,j0 |xi,j − xi,j0 |

i∈V\T (j,j0 )∈T2 , j0 >j

5 5 10 5 1

1

v1

0

v2



1

v3



∞ ∞

t=0

v4

0 3

0 July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

13 / 45

BRANE Cut — NETWORK INFERENCE WITH GRAPH CUTS

Formulation and resolution

A maximal flow for a minimum cut formulation minimize x∈{0,1}E

P

P

ωi,j |xi,j − 1| + λi,j xi,j +

(i,j)∈V2 j>i

η=6 λTF = 3 λTF = 1

1 8

s=1

ρi,j,j0 |xi,j − xi,j0 |

i∈V\T (j,j0 )∈T2 , j0 >j

5 5 10 5 1

1

v1

0

v2



1

v3



∞ ∞

t=0

v4

0 3

0 July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

13 / 45

BRANE Relax — NETWORK INFERENCE AS A RELAXED PROBLEM

A continuous method: BRANE Relax

We look for a continuous solution for x ⇔ x ∈ [0, 1]E

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

14 / 45

BRANE Relax — NETWORK INFERENCE AS A RELAXED PROBLEM

A priori

A priori: modular structure and TF connectivity minimize E x∈{0,1}

July 3th , 2017

P

ωi,j ϕ(xi,j − 1) + λi,j ϕ(xi,j ) + µΨ(xi,j )

(i,j)∈V2

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

15 / 45

BRANE Relax — NETWORK INFERENCE AS A RELAXED PROBLEM

A priori

A priori: modular structure and TF connectivity minimize E x∈{0,1}

P

ωi,j ϕ(xi,j − 1) + λi,j ϕ(xi,j ) + µΨ(xi,j )

(i,j)∈V2

Modular network: favors links between TFs and TFs   2η λi,j = 2λTF   λTF + λTF

July 3th , 2017

if (i, j) ∈ / T2 if (i, j) ∈ T2 otherwise.

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

15 / 45

BRANE Relax — NETWORK INFERENCE AS A RELAXED PROBLEM

A priori

A priori: modular structure and TF connectivity minimize E x∈{0,1}

P

ωi,j ϕ(xi,j − 1) + λi,j ϕ(xi,j ) + µΨ(xi,j )

(i,j)∈V2

Modular network: favors links between TFs and TFs   2η λi,j = 2λTF   λTF + λTF

if (i, j) ∈ / T2 if (i, j) ∈ T2 otherwise.

TF connectivity: constraint TF node degree Ψ(xi,j ) =

X i∈V\T

  X φ xi,j − d j∈V

φ(·): a convex distance function with β-Lipschitz continuous gradient July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

15 / 45

BRANE Relax — NETWORK INFERENCE AS A RELAXED PROBLEM

Formulation

A convex relaxation for a continuous formulation ! minimize x∈{0,1}E

July 3th , 2017

P

ωi,j (1 − xi,j ) + λi,j xi,j +

(i,j)∈V2 j>i

µ

P

φ

i∈V\T

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

P

xi,j − d

j∈V

16 / 45

BRANE Relax — NETWORK INFERENCE AS A RELAXED PROBLEM

Formulation

A convex relaxation for a continuous formulation ! P

minimize x∈{0,1}E

ωi,j (1 − xi,j ) + λi,j xi,j +

(i,j)∈V2 j>i

µ

P

P

φ

xi,j − d

j∈V

i∈V\T

Relaxation and vectorization: minimize x∈[0,1]E

E X

ωl (1 − xl ) + λl xl + µ

l=1

P X

φ

E X

i=1

! Ωi,k xk − d ,

k=1

P×E

where Ω ∈ {0, 1} encodes the degree of the P TFs nodes in the complete graph. ( 1 if j is the index of an edge linking the TF node vi in the complete graph, Ωi,j = 0 otherwise.

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

16 / 45

BRANE Relax — NETWORK INFERENCE AS A RELAXED PROBLEM

Formulation

Distance function in BRANE Relax minimize x∈[0,1]E

E P

ωl (1 − xl ) + λl xl + µ

l=1

P P

 φ

i=1

E P

 Ωi,k xk − d

k=1

Choice of φ: node degree distance function, with respect to d zi =

E P

Ωi,k xk − d

k=1

squared `2 norm: φ(z) = ( ||z||2 z2 if |zi | ≤ δ Huber function: φ(zi ) = i 1 2δ(|zi | − 2 δ) otherwise

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

17 / 45

BRANE Relax — NETWORK INFERENCE AS A RELAXED PROBLEM

Resolution

Optimization strategy via proximal methods Splitting minimize ω > (1E − x) + λ> x + µΦ(Ωx − d) + ι[0,1]E (x) | {z } x∈RE | {z } f2

f1

f1 ∈ Γ0 (RE ): proper, convex, and lower semi-continuous f2 : convex, differentiable with an L−Lipschitz continuous gradient

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

18 / 45

BRANE Relax — NETWORK INFERENCE AS A RELAXED PROBLEM

Resolution

Optimization strategy via proximal methods Splitting minimize ω > (1E − x) + λ> x + µΦ(Ωx − d) + ι[0,1]E (x) | {z } x∈RE | {z } f2

f1

f1 ∈ Γ0 (RE ): proper, convex, and lower semi-continuous f2 : convex, differentiable with an L−Lipschitz continuous gradient Algorithm 1: Forward-Backward Fix x0 ∈ RE for k = 0, 1, . . . do Select the index jk ∈ {1, . . . , J} of a block of variables (j ) (j ) zk k = xk k − γk A−1 jk ∇jk f2 (xk ) (j )

k xk+1 = proxγ −1 ,A k

(j )

(jk ) jk f1

(zk k )

(j¯k ) (j¯ ) xk+1 = xk k , j¯k = {1, . . . , J}\{jk } July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

18 / 45

BRANE Relax — NETWORK INFERENCE AS A RELAXED PROBLEM

Resolution

Optimization strategy via proximal methods Splitting minimize ω > (1E − x) + λ> x + µΦ(Ωx − d) + ι[0,1]E (x) | {z } x∈RE | {z } f2

f1

f1 ∈ Γ0 (RE ): proper, convex, and lower semi-continuous f2 : convex, differentiable with an L−Lipschitz continuous gradient Algorithm 2: Preconditioned Forward-Backward Fix x0 ∈ RE for k = 0, 1, . . . do Select the index jk ∈ {1, . . . , J} of a block of variables (j ) (j ) zk k = xk k − γk A−1 jk ∇jk f2 (xk ) (j )

k xk+1 = proxγ −1 ,A k

(j )

(jk ) jk ,f1

(zk k )

(j¯k ) (j¯ ) xk+1 = xk k , j¯k = {1, . . . , J}\{jk } July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

18 / 45

BRANE Relax — NETWORK INFERENCE AS A RELAXED PROBLEM

Resolution

Optimization strategy via proximal methods Splitting minimize ω > (1E − x) + λ> x + µΦ(Ωx − d) + ι[0,1]E (x) | {z } x∈RE | {z } f2

f1

f1 ∈ Γ0 (RE ): proper, convex, and lower semi-continuous f2 : convex, differentiable with an L−Lipschitz continuous gradient Algorithm 3: Block Coordinate + Preconditioned Forward-Backward Fix x0 ∈ RE for k = 0, 1, . . . do Select the index jk ∈ {1, . . . , J} of a block of variables (j ) (j ) zk k = xk k − γk A−1 jk ∇jk f2 (xk ) (j )

k xk+1 = proxγ −1 ,A k

(j )

(jk ) jk ,f1

(zk k )

(j¯k ) (j¯ ) xk+1 = xk k , j¯k = {1, . . . , J}\{jk } July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

18 / 45

BRANE Clust — NETWORK INFERENCE WITH CLUSTERING

A mixed method: BRANE Clust We look for a discrete solution for x and a continuous one for y

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

19 / 45

BRANE Clust — NETWORK INFERENCE WITH CLUSTERING

A priori

A priori: gene grouping and modular structure maximize x∈{0,1}E y∈NG

July 3th , 2017

P

f (yi , yj )ωi,j xi,j + λ(1 − xi,j ) + Ψ(yi )

(i,j)∈V2

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

20 / 45

BRANE Clust — NETWORK INFERENCE WITH CLUSTERING

A priori

A priori: gene grouping and modular structure maximize x∈{0,1}E y∈NG

P

f (yi , yj )ωi,j xi,j + λ(1 − xi,j ) + Ψ(yi )

(i,j)∈V2

Clustering-assisted inference Node labeling y ∈ NG Weight ωi,j reduction if nodes vi and vj belong to distinct clusters Cost function: β − 1(yi 6= yj ) f (yi , yj ) = β

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

20 / 45

BRANE Clust — NETWORK INFERENCE WITH CLUSTERING

A priori

A priori: gene grouping and modular structure maximize x∈{0,1}E y∈NG

P

f (yi , yj )ωi,j xi,j + λ(1 − xi,j ) + Ψ(yi )

(i,j)∈V2

Clustering-assisted inference Node labeling y ∈ NG Weight ωi,j reduction if nodes vi and vj belong to distinct clusters Cost function: β − 1(yi 6= yj ) f (yi , yj ) = β

TF-driven clustering promoting modular structure

Ψ(yi ) =

X

µi,j 1(yi = j)

i∈V j∈T

µi,j : modular structure controlling parameter July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

20 / 45

BRANE Clust — NETWORK INFERENCE WITH CLUSTERING

Formulation and resolution

Alternating optimization strategy maximize n x∈{0,1} y∈NG

July 3th , 2017

P (i,j)∈V2

β−1(yi 6=yj ) ωi,j xi,j β

+ λ(1 − xi,j ) +

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

P

µi,j 1(yi = j)

i∈V j∈T

21 / 45

BRANE Clust — NETWORK INFERENCE WITH CLUSTERING

Formulation and resolution

Alternating optimization strategy Alternating optimization maximize n x∈{0,1} y∈NG

July 3th , 2017

P (i,j)∈V2

β−1(yi 6=yj ) ωi,j xi,j β

+ λ(1 − xi,j ) +

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

P

µi,j 1(yi = j)

i∈V j∈T

21 / 45

BRANE Clust — NETWORK INFERENCE WITH CLUSTERING

Formulation and resolution

Alternating optimization strategy Alternating optimization maximize n x∈{0,1} y∈NG

P (i,j)∈V2

β−1(yi 6=yj ) ωi,j xi,j β

+ λ(1 − xi,j ) +

At y fixed and x variable: X β − 1(yi 6= yj ) maximize ωi,j xi,j β x∈{0,1}n 2

P

µi,j 1(yi = j)

i∈V j∈T

+ λ(1 − xi,j )

(i,j)∈V

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

21 / 45

BRANE Clust — NETWORK INFERENCE WITH CLUSTERING

Formulation and resolution

Alternating optimization strategy Alternating optimization maximize n x∈{0,1} y∈NG

P (i,j)∈V2

β−1(yi 6=yj ) ωi,j xi,j β

+ λ(1 − xi,j ) +

P

µi,j 1(yi = j)

i∈V j∈T

At y fixed and x variable: X β − 1(yi 6= yj ) maximize ωi,j xi,j β x∈{0,1}n 2

+ λ(1 − xi,j )

(i,j)∈V

At x fixed and y variable: X ωi,j xi,j 1(yi 6= yj ) + minimize β y∈NG 2 (i,j)∈V

July 3th , 2017

X

µi,j 1(yi 6= j)

i∈V, j∈T

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

21 / 45

BRANE Clust — NETWORK INFERENCE WITH CLUSTERING

Formulation and resolution

Alternating optimization strategy Alternating optimization maximize n x∈{0,1} y∈NG

P (i,j)∈V2

β−1(yi 6=yj ) ωi,j xi,j β

+ λ(1 − xi,j ) +

P

µi,j 1(yi = j)

i∈V j∈T

At y fixed and x variable: X β − 1(yi 6= yj ) maximize ωi,j xi,j β x∈{0,1}n 2

+ λ(1 − xi,j )

(i,j)∈V

Explicit form:

∗ xi,j

=

( 1 0

if ωi,j >

λβ β−1(yi 6=yj )

otherwise.

At x fixed and y variable: X ωi,j xi,j 1(yi 6= yj ) + minimize β y∈NG 2 (i,j)∈V

July 3th , 2017

X

µi,j 1(yi 6= j)

i∈V, j∈T

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

21 / 45

BRANE Clust — NETWORK INFERENCE WITH CLUSTERING

Formulation and resolution

Clustering optimization strategy At x fixed and y variable: X ωi,j xi,j X minimize 1(yi 6= yj ) + µi,j 1(yi 6= j) β y∈NG 2 (i,j)∈V

July 3th , 2017

i∈V, j∈T

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

22 / 45

BRANE Clust — NETWORK INFERENCE WITH CLUSTERING

Formulation and resolution

Clustering optimization strategy At x fixed and y variable: X ωi,j xi,j X minimize 1(yi 6= yj ) + µi,j 1(yi 6= j) β y∈NG 2 (i,j)∈V

July 3th , 2017

(NP)

i∈V, j∈T

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

22 / 45

BRANE Clust — NETWORK INFERENCE WITH CLUSTERING

Formulation and resolution

Clustering optimization strategy At x fixed and y variable: X ωi,j xi,j X minimize 1(yi 6= yj ) + µi,j 1(yi 6= j) β y∈NG 2 (i,j)∈V

(NP)

i∈V, j∈T

discrete problem ⇒ quadratic relaxation T-class problem ⇒ T binary sub-problems (t)

label restriction to T: {s(1) , . . . , s(T) } such that sj = 1 if j = t and 0 otherwise. Y = {y(1) , . . . , y(T) } such that y(t) ∈ [0, 1]G

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

22 / 45

BRANE Clust — NETWORK INFERENCE WITH CLUSTERING

Formulation and resolution

Clustering optimization strategy At x fixed and y variable: X ωi,j xi,j X minimize 1(yi 6= yj ) + µi,j 1(yi 6= j) β y∈NG 2 (i,j)∈V

(NP)

i∈V, j∈T

discrete problem ⇒ quadratic relaxation T-class problem ⇒ T binary sub-problems (t)

label restriction to T: {s(1) , . . . , s(T) } such that sj = 1 if j = t and 0 otherwise. Y = {y(1) , . . . , y(T) } such that y(t) ∈ [0, 1]G

Problem re-expressed as:   T 2  2 X X ωi,j xi,j  (t) X (t) (t) (t)   minimize yi − yj + µi,j yi − sj Y β 2 t=1

July 3th , 2017

(i,j)∈V

i∈V, j∈T

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

22 / 45

BRANE Clust — NETWORK INFERENCE WITH CLUSTERING

Formulation and resolution

Clustering optimization strategy

minimize Y

T X t=1



  2 2 X ωi,j xi,j  (t) X (t) (t) (t)   yi − yj + µi,j yi − sj β 2 (i,j)∈V

i∈V, j∈T

This is the Combinatorial Dirichlet problem Minimization via solving a linear system of equations [Grady, 2006]

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

23 / 45

BRANE Clust — NETWORK INFERENCE WITH CLUSTERING

Formulation and resolution

Clustering optimization strategy

minimize Y

T X t=1



  2 2 X ωi,j xi,j  (t) X (t) (t) (t)   yi − yj + µi,j yi − sj β 2 i∈V, j∈T

(i,j)∈V

This is the Combinatorial Dirichlet problem Minimization via solving a linear system of equations [Grady, 2006] (t)

Final labeling: node i is assigned to label t for which yi is maximal (t)

y∗i = argmax yi t∈T

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

23 / 45

BRANE Clust — NETWORK INFERENCE WITH CLUSTERING

Formulation and resolution

Random walker in graphs y1

We want to obtain the optimal labeling y∗ based on a weighted graph ⇒ Random Walker algorithm

6 10 y5

y4 7

10 y3

July 3th , 2017

3 12 5

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

3 5 9

y2

24 / 45

BRANE Clust — NETWORK INFERENCE WITH CLUSTERING

Formulation and resolution

Random walker in graphs 1 y1

We want to obtain the optimal labeling y∗ based on a weighted graph ⇒ Random Walker algorithm

6 10 y5

y4 7

10 y3 3

July 3th , 2017

3 12 5

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

3 5 9

y2 2

24 / 45

BRANE Clust — NETWORK INFERENCE WITH CLUSTERING

Formulation and resolution

Random walker in graphs 1 y1

We want to obtain the optimal labeling y∗ based on a weighted graph ⇒ Random Walker algorithm

6 10 y5

3 12 y4

5

7

10 y3 3

3 5 9

y2 2

1

0.95 0.35

0.46

0.03

0.01

0

0

y(1) July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

24 / 45

BRANE Clust — NETWORK INFERENCE WITH CLUSTERING

Formulation and resolution

Random walker in graphs 1 y1

We want to obtain the optimal labeling y∗ based on a weighted graph ⇒ Random Walker algorithm

6 10 y5

7

3

0

0.95

0.01

0.35

0.46

0.03

0.01

0

0

y(1) July 3th , 2017

0.19

y4

10 y3

1

3 12 5 3 5 9

y2 2

0.28

0.02

0.97

0

1

y(2) Recons. and Clust. with Graph Optim. and Priors on GRN and Images

24 / 45

BRANE Clust — NETWORK INFERENCE WITH CLUSTERING

Formulation and resolution

Random walker in graphs 1 y1

We want to obtain the optimal labeling y∗ based on a weighted graph ⇒ Random Walker algorithm

6 10 y5

7

3

0

0

0.95

0.01

0.03

0.35

0.46

0.03

0.01

0

0

y(1) July 3th , 2017

0.19

0.28

0.02

0.97

0

1

y(2)

0.46

y4

10 y3

1

3 12 5 3 5 9

y2 2

0.26

0.95

0.02

1

0

y(3)

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

24 / 45

BRANE Clust — NETWORK INFERENCE WITH CLUSTERING

Formulation and resolution

Random walker in graphs 1 y1

We want to obtain the optimal labeling y∗ based on a weighted graph ⇒ Random Walker algorithm

6 10 y5

3 12 y4

5

7

10 3 5 9

y3

y2

3

1

0

0

0.95

0.01

0.03

0.35

0.46

0.19

0.28

1

0.46

0.26

0.03

0.01

0.02

0.97

0.95

0.02

0

0

0

1

1

0

y(1) July 3th , 2017

y(2)

2

y(3)

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

3

1

3

2

y∗ = {1, 1, 2, 3, 3} 24 / 45

BRANE Clust — NETWORK INFERENCE WITH CLUSTERING

Formulation and resolution

hard- vs soft- clustering in BRANE Clust minimize Y

T P

P

t=1

(i,j)∈V2

ωi,j xi,j β



(t) yi



 (t) 2 yj

+

P

µi,j



(t) yi



 (t) 2 sj

i∈V, j∈T

hard-clustering

soft-clustering

# clusters = # TF ( → ∞ if i = j µi,j = 0 otherwise.

# clusters ≤ # TF   if i = j α µi,j = α1(ωi,j > τ ) if i 6= j and i ∈ T   ωi,j 1(ωi,j > τ ) if i 6= j and i ∈ /T

July 3th , 2017

!

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

25 / 45

BRANE RESULTS

It’s time to test the BRANE philosophy...

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

26 / 45

BRANE RESULTS

Methodology

Numerical evaluation strategy AUPR Ref Reference Precision-Recall curve Classical thresholding

P=

|TP| |TP|+|FP|

R=

|TP| |TP|+|FN|

July 3th , 2017

AUPR BRANE BRANE Precision-Recall curve

BRANE edge selection Gene-gene interaction scores (ND)-CLR or (ND)-GENIE3 Gene expression data DREAM4 or DREAM5 challenges

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

27 / 45

BRANE RESULTS

DREAM4 synthetic results

BRANE performance on in-silico data DREAM4 [Marbach et al., 2010]

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

28 / 45

BRANE RESULTS

DREAM4 synthetic results

BRANE performance on in-silico data DREAM4 [Marbach et al., 2010] 1

2

3

4

5

Average

Gain

CLR BRANE Cut BRANE Relax BRANE Clust

0.256 0.282 0.278 0.275

0.275 0.308 0.293 0.337

0.314 0.343 0.336 0.360

0.313 0.344 0.333 0.335

0.318 0.356 0.345 0.342

0.295 0.327 0.317 0.330

10.9 % 7.8 % 12.2 %

GENIE3 BRANE Cut BRANE Relax BRANE Clust

0.269 0.298 0.293 0.287

0.288 0.316 0.320 0.348

0.331 0.357 0.356 0.364

0.323 0.344 0.345 0.371

0.329 0.352 0.354 0.367

0.308 0.333 0.334 0.347

8.4 % 8.5 % 12.8 %

Network

1

2

3

4

5

Average

Gain

ND-CLR BRANE Cut BRANE Relax BRANE Clust

0.254 0.271 0.270 0.258

0.250 0.277 0.264 0.251

0.324 0.334 0.327 0.327

0.318 0.335 0.325 0.337

0.331 0.343 0.332 0.342

0.295 0.312 0.304 0.303

5.9 % 3.1 % 2.5 %

ND-GENIE3 BRANE Cut BRANE Relax BRANE Clust

0.263 0.275 0.276 0.273

0.275 0.312 0.307 0.311

0.336 0.367 0.369 0.354

0.328 0.346 0.347 0.373

0.354 0.368 0.371 0.370

0.309 0.334 0.334 0.336

7.2 % 7.3 % 8.1 %

Network

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

28 / 45

BRANE RESULTS

DREAM4 synthetic results

BRANE performance on in-silico data DREAM4 [Marbach et al., 2010] CLR

GENIE3

ND-CLR

ND-GENIE3

BRANE Cut

10.9 %

8.4 %

5.9 %

7.2 %

BRANE Relax

7.8 %

8.5 %

3.1 %

7.3 %

BRANE Clust

12.2 %

12.8 %

2.5 %

8.1 %

BRANE approaches validated on small synthetic data BRANE methodologies outperform classical thresholding First and second best performers: BRANE Clust and BRANE Cut ⇒ Validation on more realistic synthetic data July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

29 / 45

BRANE RESULTS

DREAM5 synthetic results

BRANE performance on in-silico data DREAM5 [Marbach et al., 2012]

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

30 / 45

BRANE RESULTS

DREAM5 synthetic results

BRANE performance on in-silico data DREAM5 [Marbach et al., 2012] CLR BRANE Cut BRANE Relax BRANE Clust

ND-CLR BRANE Cut BRANE Relax BRANE Clust

July 3th , 2017

AUPR

Gain

0.252 0.268 0.272 0.301

6.3 % 7.9 % 19.4 %

AUPR

Gain

0.272 0.277 0.274 0.289

1.9 % 0.6 % 6.2 %

GENIE3 BRANE Cut BRANE Relax BRANE Clust

ND-GENIE3 BRANE Cut BRANE Relax BRANE Clust

AUPR

Gain

0.283 0.295 0.294 0.336

4.2 % 3.8 % 18.6 %

AUPR

Gain

0.313 0.317 0.314 0.345

1.1 % 0.3 % 10.2 %

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

30 / 45

BRANE RESULTS

DREAM5 synthetic results

BRANE performance on in-silico data DREAM5 [Marbach et al., 2012] CLR BRANE Cut BRANE Relax BRANE Clust

ND-CLR BRANE Cut BRANE Relax BRANE Clust

AUPR

Gain

0.252 0.268 0.272 0.301

6.3 % 7.9 % 19.4 %

AUPR

Gain

0.272 0.277 0.274 0.289

1.9 % 0.6 % 6.2 %

AUPR

Gain

0.283 0.295 0.294 0.336

4.2 % 3.8 % 18.6 %

AUPR

Gain

0.313 0.317 0.314 0.345

1.1 % 0.3 % 10.2 %

GENIE3 BRANE Cut BRANE Relax BRANE Clust

ND-GENIE3 BRANE Cut BRANE Relax BRANE Clust

BRANE approaches validated on realistic synthetic data and outperform classical thresholding First and second best performer: BRANE Clust and BRANE Cut ⇒ Validation of BRANE Cut and BRANE Clust on real data July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

30 / 45

BRANE RESULTS

Escherichia coli results

BRANE Clust performance on real data Escherichia coli dataset AUPR CLR BRANE Clust

July 3th , 2017

0.0378 0.0399

Gain 5.5 %

GENIE3 BRANE Clust

AUPR

Gain

0.0488 0.0536

9.8 %

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

31 / 45

BRANE RESULTS

Escherichia coli results

BRANE Clust performance on real data Escherichia coli dataset AUPR CLR BRANE Clust

0.0378 0.0399

Gain 5.5 %

GENIE3 BRANE Clust

AUPR

Gain

0.0488 0.0536

9.8 %

BRANE Clust predictions using GENIE3 weights

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

31 / 45

BRANE RESULTS

Escherichia coli results

BRANE Clust performance on real data Escherichia coli dataset AUPR CLR BRANE Clust

0.0378 0.0399

Gain 5.5 %

GENIE3 BRANE Clust

AUPR

Gain

0.0488 0.0536

9.8 %

BRANE Clust predictions using GENIE3 weights

BRANE Clust validated on real dataset July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

31 / 45

BRANE RESULTS

Trichoderma results results

BRANE Cut in the real life GRN of T. reesei obtained with BRANE Cut using CLR weights

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

32 / 45

BRANE RESULTS

Trichoderma results results

BRANE Cut in the real life GRN of T. reesei obtained with BRANE Cut using CLR weights

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

32 / 45

BRANE RESULTS

Trichoderma results results

BRANE Cut in the real life GRN of T. reesei obtained with BRANE Cut using CLR weights

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

32 / 45

BRANE RESULTS

Trichoderma results results

BRANE Cut in the real life GRN of T. reesei obtained with BRANE Cut using CLR weights

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

32 / 45

C ONCLUSIONS

It’s time to conclude...

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

33 / 45

C ONCLUSIONS

Conclusions Inference: BRANE Cut and BRANE Relax Joint inference and clustering: BRANE Clust

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

34 / 45

C ONCLUSIONS

Conclusions Inference: BRANE Cut and BRANE Relax Joint inference and clustering: BRANE Clust The BRA- in BRANE: integrating biological a priori constrains the search of relevant edges The -NE in BRANE: proposed graph inference methods lead to promising results and outperforms state-of-the-art methods

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

34 / 45

C ONCLUSIONS

Conclusions Inference: BRANE Cut and BRANE Relax Joint inference and clustering: BRANE Clust The BRA- in BRANE: integrating biological a priori constrains the search of relevant edges The -NE in BRANE: proposed graph inference methods lead to promising results and outperforms state-of-the-art methods ⇒ Average improvements around 10 % ⇒ Biological relevant inferred networks ⇒ Negligible time complexity with respect to graph weight computation

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

34 / 45

C ONCLUSIONS

Conclusions Inference: BRANE Cut and BRANE Relax Joint inference and clustering: BRANE Clust The BRA- in BRANE: integrating biological a priori constrains the search of relevant edges The -NE in BRANE: proposed graph inference methods lead to promising results and outperforms state-of-the-art methods ⇒ Average improvements around 10 % ⇒ Biological relevant inferred networks ⇒ Negligible time complexity with respect to graph weight computation Biological a priori relevance for network inference BRANE Clust  BRANE Cut  BRANE Relax

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

34 / 45

C ONCLUSIONS

Perspectives From biological graphs...

GRN

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

35 / 45

C ONCLUSIONS

Perspectives From biological graphs...

TF-based a priori

GRN

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

35 / 45

C ONCLUSIONS

Perspectives From biological graphs...

TF-based a priori

GRN

July 3th , 2017

Clustering

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

35 / 45

C ONCLUSIONS

Perspectives From biological graphs...

TF-based a priori

GRN

July 3th , 2017

Clustering

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

35 / 45

C ONCLUSIONS

Perspectives From biological graphs...

TF-based a priori

Extend TF-based a priori for GRN

July 3th , 2017

Clustering

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

35 / 45

C ONCLUSIONS

Perspectives From biological graphs...

TF-based a priori

GRN

July 3th , 2017

Clustering

Extend TF-based a priori for GRN, clustering

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

35 / 45

C ONCLUSIONS

Perspectives From biological graphs...

TF-based a priori

GRN

Clustering

Extend TF-based a priori for GRN, clustering

Gene-gene scores

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

35 / 45

C ONCLUSIONS

Perspectives From biological graphs...

TF-based a priori

Clustering

GRN

Extend TF-based a priori for GRN, clustering

Gene-gene scores

Normalized gene expression data

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

35 / 45

C ONCLUSIONS

Perspectives From biological graphs...

TF-based a priori

Clustering

GRN

Extend TF-based a priori for GRN, clustering , graph weighting,

Gene-gene scores

Normalized gene expression data

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

35 / 45

C ONCLUSIONS

Perspectives From biological graphs...

TF-based a priori

Clustering

GRN

Extend TF-based a priori for GRN, clustering , graph weighting, data normalization...

Gene-gene scores

Normalized gene expression data

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

35 / 45

C ONCLUSIONS

Perspectives From biological graphs...

TF-based a priori

Clustering

GRN

Extend TF-based a priori for GRN, clustering , graph weighting, data normalization... Integrate transcriptomic data treatment

Gene-gene scores

Normalized gene expression data

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

35 / 45

C ONCLUSIONS

Perspectives From biological graphs...

TF-based a priori

Clustering

GRN

Extend TF-based a priori for GRN, clustering , graph weighting, data normalization... Integrate transcriptomic data treatment

Gene-gene scores

Normalized gene expression data

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

35 / 45

C ONCLUSIONS

Perspectives From biological graphs...

TF-based a priori

GRN

Clustering

Extend TF-based a priori for GRN, clustering , graph weighting, data normalization... Integrate transcriptomic data treatment

Normalized gene expression data

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

35 / 45

C ONCLUSIONS

Perspectives From biological graphs...

TF-based a priori

Clustering

GRN

Extend TF-based a priori for GRN, clustering , graph weighting, data normalization... Integrate transcriptomic data treatment

Omics data

Integrate a priori, omics- data and treatments

Omics data

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

35 / 45

C ONCLUSIONS

Perspectives ... to general graphs BRANE-like applications for non biological graphs Coupled edge inference: social networks Node-degree constraint: telecommunication Coupling between inference and clustering: temperature networks, brain networks

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

36 / 45

C ONCLUSIONS

Perspectives ... to general graphs BRANE-like applications for non biological graphs Coupled edge inference: social networks Node-degree constraint: telecommunication Coupling between inference and clustering: temperature networks, brain networks Topological constraint in graph inference Expected node degree distribution Scale-free networks: webgraphs, financial networks, social networks...

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

36 / 45

C ONCLUSIONS

Perspectives ... to general graphs BRANE-like applications for non biological graphs Coupled edge inference: social networks Node-degree constraint: telecommunication Coupling between inference and clustering: temperature networks, brain networks Topological constraint in graph inference Expected node degree distribution Scale-free networks: webgraphs, financial networks, social networks... Laplacian-based approach for graph comparison Spectral view of the graph Modularity Local and topological-based criteria July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

36 / 45

C ONCLUSIONS

Publications Journal papers — published D. Poggi-Parodi, F. Bidard, A. Pirayre, T. Portnoy. C. Blugeon, B. Seiboth, C.P. Kubicek, S. Le Crom and A. Margeot Kinetic transcriptome reveals an essentially intact induction system in a cellu- lase hyper-producer Trichoderma reesei strain Biotechnology for Biofuels, 7:173, Dec. 2014 A. Pirayre, C. Couprie, F. Bidard, L. Duval, and J.-C. Pesquet. BRANE Cut: biologically-related a priori network enhancement with graph cuts for gene regulatory network inference BMC Bioinformatics, 16(1):369, Dec. 2015. A. Pirayre, C. Couprie, L. Duval, and J.-C. Pesquet. BRANE Clust: Cluster-Assisted Gene Regulatory Network Inference Refinement IEEE/ACM Transactions on Computational Biology and Bioinformatics, Mar. 2017.

Journal papers — in preparation Y. Zheng, A. Pirayre, L. Duval and J.-C. Pesquet Joint restoration/segmentation of multicomponent images with variational Bayes and higher-order graphical models (HOGMep) To be submitted to IEEE Transactions on Computational Imaging, Jul. 2017. A. Pirayre, D. Ivanoff, L. Duval, C. Blugeon, C. Firmo, S. Perrin, E. Jourdier, A. Margeot and F. Bidard Growing Trichoderma reesei on a mix of carbon sources suggests links between development and cellulase production To be submitted to BMC Genomics, Jul. 2017. July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

37 / 45

C ONCLUSIONS

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

38 / 45

HOGMep

HOGMep for non-blind inverse problems y = Hx + n x: unknown signal to be recovered H: known degradation operator n: additive noise y: observations

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

39 / 45

HOGMep

HOGMep — Bayesian framework

Estimation of x from the knowledge of the posterior pdf p(x|y) p(x|y) =

p(x)p(y|x) p(y)

p(x): the marginal pdf encoding information about x p(y|x): the likelihood highlighting the uncertainty in y p(y): the marginal pdf of y

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

40 / 45

HOGMep

HOGMep — Variational Bayesian Approximation

q(x): approximation of p(x|y) qopt (x) = argmin KL(q(x) || p(x | y)) q(x)

Separable distribution: q(x) =

J Y

qj (xj ),

j=1

with

  Q qopt (x ) ∝ exp hln p(y, x)i j j i6=j qi (xi )

Estimation of the distributions in an iterative manner

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

41 / 45

HOGMep

HOGMep — Bayesian formulation Likelihood prior: p(y | x, γ) = N (Hx, γ −1 I) p(z): prior on hidden variables z ⇒ generalized Potts model p(x|z): prior on x conditionally to z ⇒ MEP distribution restricted to Gaussian Scale Mixtures GSM(m, Ω, β) Hyperpriors: p(γ), p(ml ) and p(Ωl )

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

42 / 45

HOGMep

HOGMep — Bayesian formulation Likelihood prior: p(y | x, γ) = N (Hx, γ −1 I) p(z): prior on hidden variables z ⇒ generalized Potts model p(x|z): prior on x conditionally to z ⇒ MEP distribution restricted to Gaussian Scale Mixtures GSM(m, Ω, β) Hyperpriors: p(γ), p(ml ) and p(Ωl )

Joint posterior distribution p(y | x, γ)

N  Y

L  Y p(xi | zi , ui , m, Ω)p(ui | β) p(z)p(γ) p(ml )p(Ωl )

i=1

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

l=1

42 / 45

HOGMep

HOGMep — VBA strategy Separable form for the approximation: q(Θ) =

N Y

(q(xi , zi )q(ui )) q(γ)

i=1

L Y

(q(ml )q(Ωl ))

l=1

with q(xi |zi = l) = N (η i,l , Ξi,l ), q(zi = l) = πi,l , q(ml ) = N (µl , Λl ), q(Ωl ) = W(Γl , νl ), q(γ) = G(a, b).

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

43 / 45

HOGMep

HOGMep — Some restoration results Restoration Original

Degraded

DR

3MG

VB-MIG

HOGMep

SNR

6.655

9.467

6.744

12.737

12.895

Original

Degraded

DR

3MG

VB-MIG

HOGMep

SNR

19.659

18.728

17.188

15.486

19.555

July 3th , 2017

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

44 / 45

HOGMep

HOGMep — Some segmentation results Segmentation ICM ICM

SC

SC

VB-MIG

HOGMep

ICM

SC

SC

VB-MIG

HOGMep

July 3th , 2017

ICM

Recons. and Clust. with Graph Optim. and Priors on GRN and Images

45 / 45