for references chains, each type of context can give the reference domain (so both hypotheses must be tested). Haptics and dialogue history. ⢠Interpretations ...
Overview • Research domain: Interpretation of natural language and spontaneous gestures.
• Background: A model of contextual interpretation of multimodal referring expressions in visual and task contexts.
• Objective: To show that our model can be extended to an interaction mode including tactile and kinesthetic feedback.
• Context: Conception phase of the IST-MIAMM European project, with DFKI, TNO, SONY & CANON (Multidimensional Information Access using Multiple Modalities).
1
Reference domains and visual context • The use of perceptual grouping « these three objects » fi {
,
,
}
« the two circles » fi { , }
• The use of salience « the triangle » fi {
}
Multimodal fusion architecture user model
referential expression
referential expression request domains
underspecification
language module history
dialogue manager
request domains
visual module history
referential gesture request
referent domains
reference domain(s)
task module history
2
Haptics and deixis • Haptic gestures can take the three classical functions of gesture in man-machine interaction: – semiotic function: ‘select this object’ – ergotic function: ‘reduce the size of this object’ – epistemic function: ‘save the compliance of this object’
• How can the system identify the function(s)? – linguistic clues (referential expression, predicate) – task indications (possibilities linked to a type of objects)
• Deixis role: to make the object salient, whatever the function, in order to focus the addressee’s attention on it.
Haptics and perceptual grouping • Interest: formalism for the focalization on a subset of objects • Grouping factors: – objects which have similar tactile or haptic properties (shape, consistency, texture) – objects that have been browsed by the user (the elements of such a group are ordered) – objects that are stuck together, parts of a same object...
3
Haptics and perceptual domains • Can visual and tactile perceptions work together? – simultaneous visual and tactile perception implies the same world of objects (and synchronized feedbacks) – a referring expression can be interpreted in visual context or in tactile context
• How can the system identify the nature of perception? – for immediate references, the visual context gives the reference domain and haptic gives the starting point in it – for references chains, each type of context can give the reference domain (so both hypotheses must be tested)
Haptics and dialogue history • Interpretations that need an order within the reference domain: ‘the first one’, ‘the next one’, ‘the last one’ – in visual perception, guiding lines can be helpful (if none, an order can always be built with the reading direction) – in haptic perception, the only criterion can be the manipulation order
• Some referring expressions that do not need an order may be interpreted in the haptic manipulation history – ‘the big one’ (in the domain of browsed objects) – ‘them’ (the most pressured objects)
4
Architecture for speech-haptic referring user model
referential expression
referential expression request
language
domains
underspecification
module history
dialogue manager
request domains
visuotactile module
history
haptic gesture request
referent domains
reference domain(s)
task module history
Summary • What does not change from deictic to haptic – – – – –
the status of speech and gesture in the architecture the repartition of information among speech and gesture the need of reference domain the use of salience and the use of orders in domains the algorithms for the exploitation of all these notions
• What does change – some unchanged notions can have one more cause – objects must be browsed to be grouped in a haptic domain – one aspect of the architecture: the visual perception module becomes the visuo-tactile perception module
5
Future work • Within the dialogue manager module, domains may be confronted, using a relevance criterion The way the linguistic contraints of the referring expression apply in the different domains may be such a criterion.
• Validation in the MIAMM framework The transition from deictic to haptic may not be an additional cost for the development of a dialogue system, both from the architecture point of view and the dialogue management point of view.
expression and action of human gestures. However, in man-machine interaction, haptic gestures concern virtual objects for which the application should.
man (or at least the M.I.T. student!) is cognitively a .... handed subjects; seven were male, one was female. ... All experiments were performed using a man-.
An indefinite noun phrase is generally used to introduce a new referent; a definite is an indicator .... for personal pronouns, and Figure 6 for demon-.
or three-dimensional entities existent in time and capable of movement in space. ... already organised two programs of this type at Pontignano (Siena, Italy) and in Paris ( ... The school is organised in an hotel at Vieille Perrotine in l'Ile d'Olero
other primates use to identify objects in their environment. Nevertheless, we .... neural substrate for visual and haptic object recognition. As suggested earlier ..... imagery, but reflected the activity of a multi-modal network. In fact, Amedi et a
Judged âimpossibleâ correctly. Incorrectly interpreted. 2.4 Strict Judgment of the Correctness. Theorem 1. The picture represents a polyhedral scene if and only if.
noticed that the nature of hand movements critically depends on the stimulus property to be perceived (e.g., Davidson et al., 1974; Klatzky and Lederman, 1987).
Another indicator is given by the disposition of the objects in the .... implies an extraction of objects of a given category in a .... to ask a question like âthe two?â.
May 4, 2006 - The latter aspect is linked to sensitivity to both low-level (i.e. geometry), and high-level object 'affordances' (i.e. mechanical properties), which.
Sep 10, 2004 - Abstract The control and perception of body orientation and motion are sub- served by multiple sensory and motor mechanisms ranging from relatively simple, peripheral .... The vestibular system is illustrated in Figure 1. ... brain reg
Dec 7, 2006 - Free and open-source software!! O.Flores ... You can recall commands, for correction or re-use ... Download R from the CRAN (Comprehensive R Archive Network) at ... Objects can have non-intrinsic attributes (dimension of matrices,. . .
PowerPoint, Visual Basic, Visual C++, Visual InterDev, Visual Studio, Windows, Windows. Media ...... Represents values supplied by the provider for a data source. ... Applications or scripts, written by using ADSI, work with any directory service.
in order to get a concrete result when dealing with a commutative ring see what happens after ... Dimension of Boolean valued lattices and rings. J. Pure Appl.
programming with a simple, interpreted and object-oriented language: S, statistical ... R comes with very detailed help information on objects and the system.
tutorial, but you can survive this experience without learning the com- .... (Besides Apple, Digital Equipment and Hewlett-Packard also have the option to sell.
tutorial, but you can survive this experience without learning the com- .... (Besides Apple, Digital Equipment and Hewlett-Packard also have the option to sell.
The three wise men came to. Bethlehem in search of the. Lord. They brought to him precious gifts: gold to honor the newborn king, incense to the true God in ...
conceptual structure for understanding referring gestures (Wolff et al. 1998, De Angeli et ... The ecological approach is an established psychological theory to per- ception ..... the two effects were tested separately using non-parametric statistics
Sep 8, 2016 - features layer, one hidden layer and the output units. For read- ..... conditional random fields,â in Proc. NIPS Workshop, 2009, pp. 1â8. [9] G. E. ...
wacky way that male-female relations work appearances are sometimes more important than reality. (More on that later.) How to be a Pickup Artist: A Practical ...
A robust, task independent spoken Language Identi cation ... have the necessary discriminative power to provide good ... The pdf's are represented as mix-.
Proceedings of the Sixth International Conference on Multimodal Interfaces, Penn .... DREYFUS H.L., DREYFUS S.E., Mind over Machine: The Power of Human ...