evolving views on HOA - GyronymO : Sound Spatialisation

Jun 2, 2009 - evolving views on HOA: from technological to pragmatic concerns. Jérôme Daniel. Orange Labs. Ambisonics Symposium, Graz, 2009/06/25 ...
5MB taille 1 téléchargements 317 vues
evolving views on HOA: from technological to pragmatic concerns Jérôme Daniel Orange Labs Ambisonics Symposium, Graz, 2009/06/25

r&d legal direction

outline 1 „ introduction (chronology) 2 „ main concepts and promises 3 „ focus on: 5.0 decoding and HOA microphone array 4 „ HOA tools and integration (plugins) 5 „ format, standardization, coding 6 „ HOA in "real life": learning from recording / production experiments

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 2

r&d legal direction

France Telecom Group

introduction / earlier study and motivations

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 3

r&d legal direction

France Telecom Group

Ambisonics and HOA: a chronology of space expansion… [Gerzon] [Gerzon, Craven,…] [Malham]

70's

90's

[Bamford] [Poletti] [Daniel] [Nicol] [Sontacchi…]

96-00

Ambisonics Symposium…

MPEG AudioBIFS

00-03

03-06

[Laborie et al]

SoundField SRP (Trinnov)

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 4

[Fazzi] [Solvang] [Adriensen] [IEM…]

[Daniel, Nicol, Moreau, Bertet]

r&d legal direction

07/08

09…

Head-tracked binaural Plugins VST, etc.

EigenMike (mh-acoustics) France Telecom Group

higher order ambisonics in short

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 5

r&d legal direction

France Telecom Group

Higher Order Ambisonics (HOA) „

increase angular discrimination thanks to additional encoding directivities „ „ „

cos θ

cos 2θ

spatial encoding ≈ circular Fourier Transform spatial spectrum = {ambisonic components} spatial bandwidth = highest angular frequency

cos 3θ

cos 4θ

enriched spatial bandwidth 0th order

1st order

sin θ Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 6

2nd order

sin 2θ

3rd order

4th order

sin 3θ

sin 4θ

r&d legal direction

France Telecom Group

Higher Order Ambisonics (HOA) Front (X) „

increase angular discrimination thanks to additional encoding directivities „ „

Left (Y)

Right

„

„

spatial encoding ≈ circular Fourier Transform spatial spectrum = {ambisonic components} spatial bandwidth = highest angular frequency

enhance spatial separation to feed loudspeakers more selectively „ „

synthesize directivities with enhanced beamwidth spatial decoding ≈ multi-directional beamforming

Back

+

+ =

+ =

+ =

= enhanced beamwidth

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 7

r&d legal direction

France Telecom Group

Higher Order Ambisonics (HOA) „

increase angular discrimination thanks to additional encoding directivities „ „ „

„

enhance spatial separation to feed loudspeakers more selectively „ „

„

1st order

2nd order

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 8

spatial encoding ≈ circular Fourier Transform spatial spectrum = {ambisonic components} spatial bandwidth = highest angular frequency

synthesize directivities with enhanced beamwidth spatial decoding ≈ multi-directional beamforming ≈ inverse discrete circular Fourier Transform more accurate sound images (reduced spread angle)

3rd order

r&d legal direction

4th order

France Telecom Group

HOA as a "holophonic" approach NFC-HOA [Daniel] „ filter implementation improvement [Adriaensen] „ More workable scheme for close sources: High-passed (NFC)-HOA [Daniel, Moreau] „ Further connections with WFS [Nicol, Daniel] [Fazzi], … „

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 9

r&d legal direction

France Telecom Group

HOA claims and promises „

claims „

format flexibility • (wrt reproduction setups, spatial manipulation, scalability)

„

spatial "objectivity" and predictability • representation Æ reproduction ⇒ spatial fidelity and transparency?

„

„

high res, "true 3D" recording technology

promises for many application contexts „

a format for new 3D audio content generation and consumption!? • one generic content for various terminals, transport constraints, consumption styles

„ „

immersive telecommunication: teleconferencing, ambience sharing interactive 3D navigation, games…

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 10

r&d legal direction

France Telecom Group

A way to turn dreams into reality „

assess or criticize HOA claims to improve the techno

„

consider format standardization „

„

test out HOA with "real life" concerns „ „

„

(also matter of maturity)

current format and equipment standards (5.0), current practices get lessons…

bring HOA techno into the hands of content creators / sound engineers „

make advertising, adapt practices, facilitate the convergence between research and production worlds

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 11

r&d legal direction

France Telecom Group

focus on: decoding for standard 5.0 setup spherical HOA microphones

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 12

r&d legal direction

France Telecom Group

HOA decoding for standard 5.0 setup (+8.0)

energy vector optimized

Craven [AES24]

energy vector optimized

+ = target, ie ideal sound image) * = "energy vector" (HF prediction) □ = "velocity vector" (LF prediction) = physical limit for energy vector(pair-wise pan-pot) „

concerns: image stability, consistency, homogeneity, discreteness…

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 13

r&d legal direction

France Telecom Group

HOA decoding for standard 5.0 setup (+8.0) Mainly these two decoders were involved in recording+production experiments reported later „ Many other potential decoders, by combining criteria in different ways „

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 14

r&d legal direction

France Telecom Group

HOA sphere microphone: basics „

Q sensors on the sphere „ „

processing = matrix + EQ „

Q HOA signals

EQ: • theoretically -mx6dB / oct ! • one must limit bass-boost

0 1 2 3 4

180 160

rough( r / c ) B00+1

EQ0( r / c , R / c )

rough( r / c ) B11+1

EQ1( r / c , R / c )

 NFC( R / c ) B00+1  NFC( R / c ) B11+1

EQ1( r / c , R / c )

 NFC( R / c ) B11−1

EQ1( r / c , R / c )

 NFC( R / c ) B10+1

rough ( r / c ) B11−1

a = 5 cm 200

rough( r / c ) B10+1

# 140

Matrix

#

120 Amplitude (dB)

„

Æ sound field sampling Q=32 Æ 4th order, K=25 HOA components

#

100

NxK

#

+1 rough( r / c ) B mm

EQm( r / c , R / c )

 +1 NFC( R / c ) Bmm

−1 rough( r / c ) B mm

EQm( r / c , R / c )

 −1 NFC( R / c ) Bmm

#

80 60

B mn

40

#

σ rough ( r / c )

rough ( r / c ) B m+10

20

#

0 2

10

3

10 Fréquence (Hz)

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 15

#

# (r / c,R / c) m

EQ

# EQm( r / c , R / c )

#

#  σ NFC( R / c ) Bmn #

 NFC( R / c ) Bm+10

#

4

10

r&d legal direction

France Telecom Group

HOA sphere mic: limits and tradeoff correct estimation

spatial aliasing 10 0

Order

2

-10 -20

3

4

reduced spatial bandwidth

-30 -40

(dB)

estimation error

0 1

∅7cm, 32 sensors Æ25 components (4th order)

-50

10

2

3

10 Frequency [Hz]

10

4

-60

shift towards LF when radius increases shift towards HF when radius decreases

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 16

r&d legal direction

France Telecom Group

spatial aliasing

rigid sphere arrays: trade-offs (for a given radius)

60 50

increase EQ max level… „ „ „

etc.

40

bandwidth enlarges towards LF, but… less benefits for higher orders and increase of noise level

(dB)

„

+18dB

+15dB SNR

30 20

3 oct

2

10

increase the number of sensors Q… „ „ „

1 oct

10 0

„

1,5 oct

3

10 Frequency (Hz)

1 oct4 10

Qx4

4 x more sensors to gain 1 octave towards HF SNR and robustness improvment: +10*log10(Q) dB => e.g. 15dB for Q = 32 sensors

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 17

r&d legal direction

France Telecom Group

HOA microphones built and experimented FTR&D

(DPA4060)

(Panasonic 2€)

32 caps Æ 4th order 12 caps Æ 2th order 20 caps Æ 3th order

(Sennheiser mke4)

EigenMike™ (mh-acoustics)

⇒ great improvement in terms of usability!

32 caps Æ 4th order

alternative [Epain & Daniel]

8 caps Æ 3th order, 2D reduced spatial aliasing

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 18

r&d legal direction

France Telecom Group

Impact of imperfect encoding For instance: on Craven, 4th order 5.0 decoding Craven o4 /o4

4th order

Craven o4 /o3

Craven o4 /o2

2nd order truncation

3rd order truncation

Craven o4 /o1

1st order truncation

to lower frequencies

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 19

r&d legal direction

France Telecom Group

HOA tools, integration and demonstrators

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 20

r&d legal direction

France Telecom Group

„

HOA VST plugins (Orange Labs) „ „ „ „ „

„

HOAEncoder HOAMicProcessor HOARotator HOASpkDecoder HOABinDecoder

Tested host applications „ „

PlogueBidule, (MaxMSP) (Cubase, Nuendo), Podium

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 21

r&d legal direction

France Telecom Group

format, standardization, compression

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 22

r&d legal direction

France Telecom Group

HOA format: which need for standardization? for interoperability between HOA processing units „ for sharing HOA sound files „

„ „

„

for insertion in 3D interactive multimedia contents „

„

no or few concern with compression ⇒ associated data, extended file header, new 'trunk' ⇒ eg MPEG4 (cf AudioBIFS V3 norm)

for generalized consumption: broadcast, exchange „ „

high concern with compression issues extended issues: format conversion, spatial audio object coding

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 23

r&d legal direction

France Telecom Group

HOA in "real life" (incl. mass market concerns) learning from experimental and collaborative recording opportunities

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 24

r&d legal direction

France Telecom Group

Various recording conditions and constraints immersive ambience „ music / theatre performance (often codified spatial organisation⇒cultural expectations) „ without or with video or even stereoscopic video „

„ „

⇒ mic positioning constraints; ⇒ sound and visual image coherency issues

with or without spot/close microphone mixing „ direct mixing of post-produced „ with concurrent mutlichannel system (trees) Æ EWO „ in collaboration with professional sound engineers Æ RadioFrance

„

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 25

r&d legal direction

France Telecom Group

Fully immersive "nature ambience"

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 26

r&d legal direction

France Telecom Group

classic organization: frontal scene „ „

front loudspeakers provide the orchestra image (quite dry) rear loudspeakers for field reverberated by the room 32 DPA4060, 4th order

Orchestre National de France, Studio 104, Radio-France, Juin 2008

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 27

r&d legal direction

France Telecom Group

very large front scene Workshop Ears Wide Open, Le Tambour, Rennes, Mars 2008 multi-microphone trees

HOA sphere 20 DPA, order 3

„

large scene Æ more or less contributions from rear loudspeakers

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 28

r&d legal direction

France Telecom Group

panoramic scene SoundPainting session, Chapelle des Ursulines, Lannion, November 2008 EigenMike, ordre 4

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 29

r&d legal direction

France Telecom Group

Opéra de Rennes: La Trahison Orale

"with height" spatial organisation

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 30

r&d legal direction

France Telecom Group

tuba

front wall

Spatial configuration: "La Trahison Orale"

balcony

stage

„

orchestra difficult trade-off between sound pit

nce) ie d u a ( s ll a t s

balance and spatial readibility „ 3DÆ2D projection issues depending on the microphone positioning and "pointing direction" Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 31

r&d legal direction

France Telecom Group

Don Giovanni, Opéra de Rennes (2 June 2009) „ 3D

video and audio capture „ direct satellite broadcast to different places •

+ Mezzo TV in SD (in 39 countries)

„ audio

part: Orange Labs + Radio-France

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 32

r&d legal direction

France Telecom Group

Don Giovanni „

Technical aspects „ „ „ „

„

HOA sphere positioning trials Æ 5.0 provided to Radio-France real time mixing with 4 dozens of spot microphones artificial reverb added (even on "ambience HOA 5.0") ⇒ no longer "HOA" spatial model

Results „ „ „ „

great success (artistic, technical, popular) very nice and appreciated sound ! … but not really faithful wrt the actual theatre ;-) great communication impact for HOA ! • (websearch: HOA + "Don Giovanni")

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 33

r&d legal direction

France Telecom Group

Some lessons „

Mistrust monitoring over another setup: „ „

„ „ „

eg 8.0 vs 5.0 ⇒ front-back instability + influence of the reproduction room (and ldspk, and array size), esp. with particular sounds (applauses, broadband signals, etc.) or recorded scene (very large, etc.) : dry, spherical vs hemi-spherical, spherical vs horizontal ⇒ projection of elevated source binaural vs loudspeaker presentation ⇒ wave interferences, potential coloration and phasing

Shall content creation and recording be generic, or specific to a target setup? „ Use HOA as end-to-end approach or intermediary toolset?

„

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 34

r&d legal direction

France Telecom Group

Conclusion „

further improvements of the technologic part still expected „ „ „ „

„

system transparency (spatial decoding) 3D scene analysis and manipulation: spatial editing tools (benefiting from new ext. dev.: mic array, et. ) coding / compression, format conversion

get lessons from experiments „

adapt HOA to sound engineers practice… then reciprocally! • improve ergonomics • HOA as end-to-end techno and format? Or as a toolkit?

„

issues regarding: • spatial organisation, • mic positioning, etc. • generic of specific content production

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 35

r&d legal direction

France Telecom Group

Thank you for your attention

Ambisonics Symposium/2009-06-25/Jérôme Daniel – p 36

r&d legal direction

France Telecom Group