how to de-identify a large clinical corpus in 10 days - Xavier Tannier

Construction of a FrenchâLSF corpus - Xavier Tannier

an average of 39 words. They normally describe the five. 'W's of the reported event: what, when, where, who and, as much as possible, why. For example: (1).

Temporal Annotation - Xavier Tannier

Aggregates â answer question "how often/how frequently?" Dates â answer question "when?" Non-atomic temporal expressions ( + +.

digiteo - Xavier Tannier

The destroyed UCR headquaterss is in the Moreno district of Buenos. Aires. Alleged guerrilla urban commandos .... #1 [armed:amod] man. [attack:nsubj, kill:nsubj]. Generative Event Schema Induction with Entity Disambiguation. 5 ... Page 39 ...

Poster - Xavier Tannier

RUN 2: Word Embeddings: vectors calculated on the MIMIC II corpus using word2vec. DR subtask performance. CR subtask performance. Features. Run.

Slides - Xavier Tannier

Jul 10, 2012 - Context. â¢ Our ultimate goal: build automatic timelines from a query. â âTunisian revolutionâ. 2010, Dec. 17: Mohamed Bouazizi sets himself.


May 25, 2012 - Event nouns. All nouns. Singular. 80.1%. 83.4%. Plural. 19.9%. 16.6%. Event nouns All nouns. Definite article. 27.9%. 19.9%. Indefinite article.

Evaluating Web-as-corpus Topical Document ... - Xavier Tannier

May 30, 2014 - (tuple length, no. tuples, no. documents per query). Cons: - Non reproducible: the Web and search engines are changing. - Costly: can't easily ...

Diapositive 1 - Xavier Tannier

Needs for manually annotating Web pages are many: â Text tagging. e.g. named entities. â Image tagging. e.g. image retrieval. â Web page cleaning. e.g. ad ...


Ben Ali left Tunis after a monthlong popular protest which Tunisians called the. Jasmine Revolution .

poster - Xavier Tannier

Temporal information extraction from clinical text ... Event classification. Method: ... Word embeddings computed with word2vec. Language. Classifier. Algorithm.

author version - Xavier Tannier

Abstract This article presents FIDJI, a question-answering (QA) system for French. ...... Learning to Paraphrase: An Unsupervised Approach Using Multiple-Sequence. Alignment. in .... http://pascallin.ecs.soton.ac.uk/Challenges/RTE/.

A Dataset for Open Event Extraction in English - Xavier Tannier

Submit the document title to the Google search engine. 2. Keep only ... with SpotSigs and mcl. 3. Clean semiautomatically remaining texts. #docs. #sentences # ...

Graduate internship Automatic ... - Xavier Tannier

Automatic Classification of Claims from Political. Debates and Declarations. Keywords: natural language processing, text mining, machine learning, com-.

Graduate internship Distant ... - Xavier Tannier

Distant supervision for event extraction from a newswire corpus. Keywords: natural language processing, text mining, machine learning, distant supervision.

Supervised Machine Learning Techniques to Detect ... - Xavier Tannier

to Detect TimeML Events in French and English. BÃ©atrice ... event expressions (), with their class and attributes (time, aspect, polarity, modality).

Natural Language Queries for Information Retrieval in ... - Xavier Tannier

Natural Language Queries for Information Retrieval in Structured Documents. Xavier Tannier, Jean-Jacques Girardot and Mihaela Mathieu. Ãcole Nationale ...

Retrieval Status Values in Information Retrieval ... - Xavier Tannier

evaluation measures have been proposed. Most of these measures are based on the ranking of documents retrieved by IR systems in response to queries.

NLP-driven Data Journalism - Xavier Tannier

cific subject defined by a user query (e.g. situation in Syria, nuclear proliferation, North Pole ownership). The evolution of the relations over time are then ...

Research Informatics Infrastructure i2b2 implemented ... - Xavier Tannier

Mar 15, 2018 - Data Volume Challenge. â» Query data in place without duplication. â» Allows big-data handling i2b2 implemented over SMART-on-FHIR ...

Retrieval Status Values in Information Retrieval ... - Xavier Tannier

Page 1 ... Abstract. Retrieval systems rank documents according to their retrieval ... Each IRS has a particular way to compute document RSV according to the IR.

XGTagger, a generic interface for analysing XML ... - Xavier Tannier

That is why the words must be given to the system in the order a human could ... A characteristic of XML documents is that they contain several consecutive, potentially overlapping. "reading context s ..... 5 .3 Le x ica l enric h ment. The user 's .

Industrial Engineering and Computer Sciences ... - Xavier Tannier

that analyse NLQs and translate them into a formal language (NEXI) query. ..... scores. Here, we present the results of both the strict metric, that only re-.

XTM a robust temporal processor for running text - Xavier Tannier

Google also offers now, in an experimental way, a timeline view to provide results of a .... We call this last class of nouns âtime span nounsâ. Examples of such ...

A Content Management Perspective on Fact-Checking - Xavier Tannier

Apr 27, 2018 - point in the existing literature as well as help develop a roadmap ..... question of a space in which to search for such elements and efficient ..... 18See e.g. the course âCalling Bullshit: Data Reasoning in a Digital Worldâ ....

how to de-identify a large clinical corpus in 10 days - Xavier Tannier

des documents recommandant