ANALEC is a new tool which aim is to bring together corpus annotation, visualization and query management. The main idea is to provide a unified and dynamic ...
UMR 8094 Langues, Textes, Traitements Informatiques, Cognition
Frédéric Landragin, Thierry Poibeau and Bernard Victorri ANALEC is a new tool which aim is to bring together corpus annotation, visualization and query management. The main idea is to provide a unified and dynamic way of annotating textual data. ANALEC allows researchers to dynamically build their own annotation scheme and use the possibilities of scheme revision, data querying and graphical visualization during the annotation process. Each query result can be visualized using a graphical representation that puts forward a set of annotations that can be directly corrected or completed. Text annotation is then considered as a cyclic process. Statistics like frequencies and correlations make it possible to verify annotated data on the fly during the annotation.
This window allows the user to navigate in the annotation scheme
Corpus visualization zone
Corpus annotation zone: In this example, the features are classified using three levels
Some features can be edited, others not depending on the ‘view’
Correlation window
Each type, each element and each value of an element can be renamed and deleted
Geometrical representation window Each point in the graph is linked to a linguistic unit, and interesting groupings may appear depending on the features taken into account
The user can choose a unit and two different features, and ANALEC automatically computes a table displaying their correlations. The cells in the table are coloured following the result of a classical chi-squared test
This module is especially useful to check that all the elements of a certain type are grouped together or not
Designing an annotation scheme is a major issue that requires a ‘trial and error’ strategy. Moreover, data annotation is not a static process but a dynamic one, which depends on corpus visualization and data queries. The goal of ANALEC is to allow researchers to dynamically build their own annotation scheme, using the possibilities of partial annotation, dynamic scheme revision, data querying and visualization during the annotation procedure itself. ANALEC includes three kinds of computation and visualization modules to analyse annotated data: frequencies, correlations, and geometrical representations. With these modules, it is possible to observe and prove linguistic hypotheses in vivo, directly from the observation of the data. This research is partially sponsored by the contract PEPS MC4 (Modélisation Contrastive et Computationnelle des Chaînes de Coréférence) from CNRS INSHS and INS2I.
Proceedings of the Sixth International Conference on Multimodal Interfaces, Penn .... DREYFUS H.L., DREYFUS S.E., Mind over Machine: The Power of Human ...
Feb 6, 2015 - First results: thematic issue of Langages journal, number 195 (sept 2014). 3 ... Free and open source (JAVA), cf. http://www.lattice.cnrs.fr/.
Gesture and salience. ⢠Gesture is the first way to make an object salient. ⢠With no gesture, an object may be salient when it has a property that the other objects ...
linguistic context and the task context. There has ... linguistic and the task contexts, considering components such as ... [13]), others focus on the management of a visual focus of ... previous methods, visual salience also depends on the.
Feb 27, 2009 - syntactic analysis: verb with imperative mood with two arguments. ⢠semantic analysis: âthatâ corresponds to the object; âthereâ to the place;.
picking up and taking into account the ambient noise and the ambient luminosity. ⢠ergotic constraints (transforming, changing the state of the environment):.
on the reluctant purpose of Macbeth, who felt com- ... had the art of covering treacherous purposes with .... man, for none of woman born should have power to.
The quintessence of ornamental knots is exemplified by The Book of Kells, an ... ranging from molecular geneticsâto help us understand how to unravel a loop ...
as possible with future concurrent annotations. 1 Introduction .... The manual annotation of this corpus is planned to be done with. Analec (Landragin ... Used by TXM to mark words (tokens) for lemmatisation and POS-tagging. â Requires ...
we study and characterize visual salience and linguistic salience in parallel .... several communicative acts. (film, comic book, discourse, conversation).
monumental art, so aerial and close to the public at the same time. His stainless steel sculptures project originates from this legacy. Purity lies within the heart of.
write down together rules such as the second and third of the above ex- ample. ... of final states, and. 2Chapter 3 on page 11 deals with Context-Free Grammars.
Fern's sneakers were sopping by the time she caught up with her father ... âYou go back to the house and ... life and death, and you talk about controlling myself.â ...
fully assimilated to character, for example in Catch us if you can (1965). He also made a semi-â autobiographical movie taking place during World War II called ...
de Montréal, elles représentent une option fascinante d'enseignement et d'apprentissage. Elles peuvent accroître la blématique complexe qui demande une importante réflexion, car .... l'enquête internationale PISA (OCDE,. 2001), il semble que les TIC .
histoire de frdric guillaume i roi prusse et lecteur brandebourg etc are a good way to achieve details about operating certainproducts. Many products that you buy can be obtained using instruction manuals. These user guides are clearlybuilt to give s