Which Semantics for Neighbourhood Semantics?

talaris Natural Language Processing: representation, inference and semantics

Perception, Cognition, Interaction

Audio, Speech, and Language Processing

Patrick Blackburn INRIA Chercheur

Nancy

Team Leader, Research Director (DR2), INRIA oui Claire Gardent CNRS Chercheur

Nancy

Research Director (DR2), SHS Department CNRS oui Isabelle Blanchard CNRS Assistant

Nancy

Corinna Anderson UnivEtrangere PhD

Nancy

Yale University, USA Carlos Areces INRIA Chercheur

Nancy

Research Scientist (CR1), INRIA. On sabatical leave 2009-04-01 / 2010-04-01 Lotfi Bellalem UnivFr Enseignant

Nancy

PRAG, ESIAL, UHP Nancy 1 Nadia Bellalem UnivFr Enseignant

Nancy

Assistant Professor, IUT Nancy-Charlemagne, University of Nancy 2 Samuel Cruz-Lara UnivFr Enseignant

Nancy

Assistant Professor, IUT Nancy-Charlemagne, University of Nancy 2 Christine Fay-Varnier UnivFr Enseignant

Nancy

Assistant Professor, School of Geology, INPL Jean-Charles Lamirel UnivFr Enseignant

Nancy

Assistant Professor, IUT Robert Schuman, University of Strasbourg oui Fabienne Venant UnivFr Enseignant

Nancy

Assistant Professor, IUT Nancy-Charlemagne, University of Nancy 2. On maternity and parental leave from 2010-06-01 Paul Bedaride UnivFr PhD

Nancy

UHP Nancy 1, Ministry grant, from 2006-10-15 until 2010-10-15 Luciana Benotti UnivFr PhD

Nancy

UHP Nancy 1, CORDIS grant, from 2006-06-10 until 2010-01-28 Ingrid Falk UnivFr PhD

Nancy

University of Nancy 2, SEMbySEM project, from 2008-10-01 until 2011-10-01 Guillaume Hoffmann UnivFr PhD

Nancy

UHP Nancy 1, Ministry grant, from 2007-10-15 until 2010-11-15 Laura Perez UnivFr PhD

Nancy

[University of Nancy 2, Ministry grant, from 2009-10-01 until 2012-10-01 Alejandra Lorenzo UnivFr PhD

Nancy

[UHP Nancy 1, Région/ALLEGRO Interreg IV A project, from 2010-10-15 Mathieu Desnouveaux INRIA Technique

Nancy

INRIA Engineer on CPER MISN/TALC, 2008-09-01 until 2010-09-01 Emmanuel Didiot INRIA Technique

Nancy

INRIA Engineer on CPER MISN/TALC, from 2010-12-01. Treveur Bretaudiere INRIA Technique

Nancy

INRIA Engineer on ADT MoViTAL, from 2010-12-01 Tarik Osswald INRIA Technique

Nancy

Engineer, ITEA Project MetaVerse Overall Objectives Background

TALARIS stands for Traitment Automatique des Langues: Representation, Inference, et Semantique. As this name suggests, the aim of the TALARIS team is to investigate semantic phenomena (broadly constructed) in natural language from a computational perspective. More concretely, TALARIS's goal is to develop grammars (with a special emphasis on French) with a semantic dimension, to explore the linguistic and computational issues involved in such areas as natural language generation, textual entailment recognition, discourse and dialogue modeling, pragmatics, and multilinguality, and to investigate the interplay between representation and inference in computational semantics for natural language.

Organization

The work of the TALARIS team can be subdivided into four overlapping and mutually supporting categories.

Computational Semantics. This theme is devoted to the theoretical and computational issues involved in building semantic representations for natural language. Special emphasis is placed on developing large scale semantic coverage for the French Language.

Discourse, Dialogue and Pragmatics. This theme is devoted to developing theoretical and computational models of discourse and dialogue processing, and investigating the inferential impact of pragmatic factors (that is, the factors affecting how humans being actually use language).

Logics for Natural Language and Knowledge Representation. The theme is devoted to theoretical and computational tools for working with logics suitable for natural language inference and knowledge representation. Special emphasis is place on hybrid logic, higher order logic, and discourse representation theory (DRT).

Multilinguality for Multimedia. This theme is devoted to creating generic ISO-based mechanisms for representing and dealing with multilingual textual information. The center of this activity is the MLIF (Multi Lingual Information Framework) specification platform for elementary multilingual units.

Overall Objectives

The major long term computational goals of the TALARIS team are:

The design and implementation of incremental clustering techniques that can handle heterogeneous textual data collections.

The creation of a large scale computational semantics framework for French that supports deep semantic analysis and surface realisation (the production of sentences from meaning representations).

The integration and use of this framework in systems interfacing 3D worlds and Natural Language Processing (NLP) technologies e.g., extending a serious game with dialog capabilities or exploiting Natural Language Generation to automate the production of language learning exercices in a 3D setting.

The creation of efficient inference systems for logics that are capable of representing natural language content and the background knowledge required to support reasoning.

Integrating language technology and semantic resources into multimedia applications.

These computational goals will be pursued in the context of theoretical investigations that will rigorously map out the required scientific and mathematical context.

Highlights

New results on dynamic modal logics (e.g., memory logics) in terms of lower and upper complexity bounds, tableaux algorithms, expressive power, axiomatization and interpolation.

Mature expositions of inference frameworks for hybrid logics, together with stable implementations of inference systems.

Talaris systems ranked first and third in the international GIVE (Giving Instruction in Virtual Environment) challenge .

The 3 day Natal workshop, organized by Claire Gardent, which gathers together students and researchers from Nancy, Saarbrúcken and neighbouring areas around current themes in NLP themes, was successfully held (for the third running) from 16–18 June 2010 http:// talc. loria. fr/ NATAL-10-16-18-juin-2010-LORIA,178. html. As in 2008 and 2009, there were two thematic days with invited speakers (this year, on Natural Language Processing and Computer Aided Language Learning) and a third Masters Student Day where Erasmus Mundus students from Nancy and Saarbrúcken met and presented their research.

Scientific Foundations Computational Linguistics and Computational Logic

We said above that the central research theme of TALARIS was computational semantics (where “semantics” is broadly construed to cover various pragmatic and discourse level phenomena) and that TALARIS is particularly focused on investigating the interplay between representation and inference. Another way of putting this would be to say that the scientific foundations of TALARIS's work boil down to the motto: computational linguisticsmeets computational logicand knowledge representation.

From computational linguistics we take the large linguistic and lexical semantics resources, the parsing and generation algorithms, and the insight that (whenever possible) statistical information should be employed to cope with ambiguity. From computational logic and knowledge representation we take the various languages and methodologies that have been developed for handling different forms of information (such as temporal information), the computational tools (such as theorem provers, model builders, model checkers, sat-solvers and planners) that have been devised for working with them, together with the insight that, whenever possible, it is better to work with inference tools that have been tuned for particular problems, and moreover that, whenever possible, it is best to devote as little computational energy to inference as possible.

This picture is somewhat idealized. For example, for many languages (and French is one of them) the large scale linguistic resources (lexicons, grammars, WordNet, FrameNet, PropBank, etc.) that exist for English are not yet available. In addition, the syntax/semantics interface often cannot be taken for granted, and existing inference tools often need to be adapted to cope with the logics that arise in natural language applications (for example, existing provers for Description Logic, though excellent, do not cope with temporal reasoning). Thus we are not simply talking about bringing together known tools, and investigating how they work once they are combined — often a great deal of research, background work and development is needed. Nonetheless, the ideal of bringing together the best tools and ideas from computational linguistics, knowledge representation and computational logic and putting them to work in coordination is the guiding line.

Semantics and Inference

Over the next decade, progress in natural language semantics is likely to depend on obtaining a deeper understanding of the role played by inference. One of the simplest levels at which inference enters natural language is as a disambiguation mechanism. Utterances in natural language are typically highly ambiguous: inference allows human beings to (seemingly effortlessly) eliminate the irrelevant possibilities and isolate the intended meaning. But inference can be used in many other processes, for example, in the integration of new information into a known context. This is important when generating natural language utterances. For this task we need to be sure that the utterance we generate is suitable for the person being addressed. That is, we need to be sure that the generated representations fit in well with the recipient's knowledge and expectations of the world, and it is inference which guides us in achieving this.

Much recent semantic research actively addresses such problems by systematically integrating inference as a key element. This is an interesting development, as such work redefines the boundary between semantics and pragmatics. For example, van der Sandt's algorithm for presupposition resolution (a classic problem of pragmatics) uses inference to guarantee that new information is integrated in a coherent way with the old information.

The TALARIS team investigates such semantic/pragmatic problems from various angles (for example, from generation and discourse analysis perspectives) and tries to combine the insights offered by different approaches. For example, for some applications (e.g., the textual entailment recognition task) shallow syntactic parsing combined with fast inference in description logic may be the most suitable approach. In other cases, deep analysis of utterances or sentences and the use of a first-order inference engine may be better. Our aim is to explore these approaches and their limitations.

Linguistic Resources

In an ideal world, computational semanticists would not have to worry overly much about linguistic resources. Large scale lexica, treebanks, and wide coverage grammars (supported by fast parsers and offering a flexible syntax semantics interface) would be freely available and easy to combine and use. The semanticist could then focus on modeling semantic phenomena and their interactions.

Needless to say, in reality matters are not nearly so straightforward. For a start, for many languages (including French) there are no large-scale resources of the sort that exist for English. Furthermore even in the case of English, the idealized situation just sketched does not obtain. For example, the syntax/semantics interface cannot be regarded as a solved problem: phenomena such as gapping and VP-ellipsis (where a verb, or verb phrase, in a coordinated sentence is missing and has to be somehow “reconstructed” from the previous context) still offer challenging problems for semantic construction.

Thus a team like TALARIS simply cannot focus exclusively on semantic issues: it must also have competence in developing and maintaining a number of different lexical resources (and in particular, resources for French).

TALARIS is involved in such aspects in a number of ways. For example, it participates in the development of an open source syntactic and synonymic lexicon for French, in an attempt to lay the ground for a French version of FrameNet; and it also works on developing a large scale, reversible (i.e., usable both for parsing and for generation) Tree Adjoining Grammar for French.

Logic Engineering

Once again, in the ideal world, not only would computational semanticists not have to worry about the linguistic resources at their disposal, but they would not have to worry about the inference tools available either. These could be taken for granted, applied as needed, and the semanticist could concentrate on developing linguistically inspired inference architectures. But in spite of the spectacular progress made in automated theorem proving (both for very expressive logics like predicate logics, and for weak logics like description logics) over the last decade, we are not yet in the ideal world. The tools currently offered by the automated reasoning community still have a number of drawbacks when it comes to natural language applications.

For a start, most of the efforts of the first-order automated reasoning community have been devoted to theorem proving; model building, which is also a useful technology for natural language processing, is nowhere nearly as well developed, and far fewer systems are available. Secondly, the first-order reasoning community has adopted a resolutely `classical' approach to inference problems: their provers focus exclusively on the satisfiability problem. The description logic community has been much more flexible, offering architectures and optimisations which allow a greater range of problems to be handled more directly. One reason for this has been that, historically, not all description logics offered full Boolean expressivity. So there is a long tradition in description logic of treating a variety of inference problems directly, rather than via reduction to satisfiability. Thirdly, many of the logics for which optimised provers exists do not directly offer the kinds of expressivity required for natural language applications. For example, it is hard to encode temporal inference problems in implemented versions of description logics. Fourth, for very strong logics (notably higher-order logics) few implementations exists and their performance is currently inadequate.

These problems are not insurmountable, and TALARIS members are actively investigating ways of overcoming them. For a start, logics such as higher-order logic, description logic and hybrid logic are nowadays thought of as various fragments of (or theories expressed in) first-order logic. That is, first-order logic provides a unifying framework that often allows transfer of tools or testing methodologies to a wide range of logics. For example, the hybrid logics used in TALARIS (which can be thought of as more expressive versions of description logics) make heavy use of optimization techniques from first-order theorem proving.

Empirical Studies

The role of empirical methods (model learning, data extraction from corpora, evaluation) has greatly increased in importance in both linguistics and computer science over the last fifteen years. TALARIS members have been working for many years on the creation, management and dissemination of linguistic resources reusable by the scientific community, both in the context of implementation of data servers, and in the definition of standardized representation formats like TAG-ML. In addition, they have also worked on the applications of linguistic ideas in multimodal settings and multimedia.

Such work is important to our scientific goals. As we said above, one of the most important points that needs to be understood about logical inference is how its use can be minimized and intelligently guided. Ultimately, such minimization and guidance must be based on empirical observations concerning the kinds of problems that arise repeatedly in natural language applications.

Finally, is should be remarked that the emphasis on empirical studies lends another dimension to what is meant by inference. While much of TALARIS's focus is on symbolic approaches to inference, statistical and probabilistic methods, either on their own or blended with symbolic approaches, are likely to play an increasingly important role in the future. TALARIS researchers are well aware of the importance of such approaches and are interested in exploring their strengths and weaknesses, and where relevant, intend to integrate them into their work.

Application Domains Modular Grammar Building

The development of large scale grammars is a complex task which usually involves factorising information as much as possible. While good grammar writing and factorisation environments exist for “non tree grammars” (e.g., HPSG, LFG), this is not the case for “tree based grammars” such as TAG, Interaction Grammars or Tree Description Grammars. The Extended Metagrammar Compiler (XMG) developed at TALARIS remedies this shortcoming while additionally providing a clean and modular way to describe several linguistic dimensions thereby supporting the production of tree grammars with semantic information , , , , , , , , , , , , .

Referential Expressions

TALARIS has a longstanding interest in the semantics and the processing of referential expressions. In recent years, an extensive corpus annotation has been carried out on 5.000 definite descriptions , , , , , , , ; an algorithm for generating bridging definite descriptions has been specified and implemented which illustrates the interaction of realisation and inference , , , ; a constraint based algorithm for definite description has been proposed which differs from the standard one in that it uses constraints to produce a minimal description , ; and a shallow anaphora resolver for French has been developed and evaluated within the national evaluation campaign MeDIA.

Surface Realization

The tree adjoining grammar for French developed by TALARIS associates with each NL expression not only a syntactic tree but also a semantic representation. Interestingly, the semantic calculus used is reversible in that the association between strings and semantic representations is non-directional (declarative). We put this feature to work and have been working over the years towards developing surface realisers for French called GenI , and RTGen , . While GenI manipulates full elementary trees to build derived trees, RTGEn uses an RTG (Regular Tree Grammar) encoding of the Tree Ajoining Grammar to first build derivation trees. A preliminary comparison suggests that RTGen outperforms GenI. Current work concentrates on further optimising RTGen and on integrating top down control to reduce the search space and constrain the output.

Textual Entailment Recognition

In essence, the textual entailment recognition task is an inference task, namely deciding whether the information contained in a given text T₁can be inferred from the information provided by another text T₂.

It is crucial to be able to answer this question. One important characteristic of natural language is the large number of ways in which it can express the same information. Many natural language processing applications like question answering, information retrieval, generation, and anaphora resolution need to deal with this diversity efficiently and accurately, and recognising textual entailments is a key step towards this.

Textual entailment recognition is a difficult task. We work on developing linguistically principled approaches to tackle specific entailment sources such as syntactic variations or the compositional semantics of factive verbs , , .

Computational Logics and Computational Semantics.

Members of TALARIS have long actively proposed and developed the idea of using inference (and in particular, using computational tools like model builders and theorem provers) as an integral part of different tasks in computational semantics, mainly during semantic construction , . The book “Representation and Inference for Natural Language: A First Course in Computational Semantics” by Patrick Blackburn and Johan Bos is nowadays an important reference in this area.

Hybrid Automated Deduction

TALARIS's main contribution in this topic has been the design of resolution and tableaux calculi for hybrid logics, calculi that were then implemented in the HyLoResand HTabtheorem provers. For example, TALARIS members have proved that the resolution calculus for hybrid logics can be enhanced with optimisations of order and selection functions without losing completeness. Moreover, a number of `effective' (i.e., directly implementable) termination proofs for the hybrid logic $Im1 ${\#8459 (@)}$$ has been established, for both resolution and tableaux based approaches, and the techniques are being extended to more expressive languages. Current work includes adding a temporal reasoning component to the provers, extending the architecture to allow querying against a background theory without having to explore again the theory with each new query, and testing the hybrid provers performance against dedicated state-of-the-art provers from other domains (firs-order logic, description logics) using suitable translations.

Moreover, we are interested in providing a range of inference services beyond satisfiability checking. For example, the current version of HyLoResand HTabincludes model generation (i.e., the provers can generate a model when the input formula is satisfiable).

We have also started to explore other decision methods (e.g., game based decision methods) which are useful for non-standard semantics like topological semantics. The prover HyLoBanis an example of this work.

Multimedia

MLIF (Multi Lingual Information Framework) is intended to be a generic ISO-based mechanism for representing and dealing with multilingual textual information. A preliminary version of MLIF has been associated with digital media within the ISO/IEC MPEG context and dealing with subtitling of video content, dialogue prompts, menus in interactive TV, and descriptive information for multimedia scenes. MLIF comprises a flexible specification platform for elementary multilingual units that may be either embedded in other types of multimedia content or used autonomously to localise existing content.

Interfacing Virtual Worlds and Natural Language Processing

In 2010, Talaris addressed a new application domain namely, the integration of deep natural language processing (NLP) techniques with 3D worlds and games. A first foray into that theme has been the submission of two systems to the international GIVE (Giving instructions in a virtual environment). Two recently accepted EU funded projects (Interreg project Allegro and Eurostar project Emo-Speech) on that theme will permit a fully blown exploration of the research issues and of the technological problems arising in this area. This new theme builds on the tools and techniques developped by Talaris over the last 5 years for deep NLP and in particular, on the availability of an expressive grammar writing environnement (XMG), of wide coverage deep grammars for French and English (SemTAG and SemXTAG), of a grammar based surface realiser (GenI) and of parsers (LLP2, SemConst) using these grammars.

Software AGREE

AGREE (Asynchronous Grounding of REferential Expressions) is a set of modules that manage the grounding process at the reference level. It contains an interpretation evaluation module that construes understanding judgments made by the system and those manifested in the dialogue by the user, a dialogue module that maintains a coherent state of the dialogue (adjacency pairs), and a generation module (GenI) in order to produce paraphrases of the understood referents. The whole system has been implemented in Java and uses the same semantic/referential representation that was used in the MEDIA project.

Version: 0.1

License: GPL

Last update: 2008-11-12

Web site: http:// www. loria. fr/ ~denis/ grounding. html

Authors: Alexandre Denis

Contact: Alexandre Denis

The eXtended Meta-Grammar ( XMG) Compiler and Tools

A metagrammar compiler generates automatically a grammar from a reduced description called a MetaGrammar. This description captures the linguistic properties underlying the syntactical rules of a grammar. Various past and present TALARIS members have been working on metagrammar compilation since 2001 and several tools have been developed within this framework starting with the MGC system of Bertrand Gaiffe (now of ATILF, Analyse et Traitment Informatique de la Langue Francaise, a Nancy-based CNRS unit) to the newly developed XMGsystem of Crabbé et al.

The XMGsystem is a 2nd generation compiler that proposes (a) a representation language allowing the user to describe in a factorised and flexible way the linguistic information contained in the grammar, and (b) a compiler for this language (using a Warren Abstract Machine-like architecture). An innovative feature of this compiler is the fact that it makes it possible to describe several linguistic dimensions, and in particular it is possible to define a natural Syntax/Semantics interface within the Metagrammar.

The compiler actually supports two syntactic formalisms (Tree Adjoining Grammars and Interaction Grammars) and the description both of the syntactic and of the semantic dimension of natural language. The generated grammars are in XML format, which makes them easy to reuse. Plug-ins have been realised with the LLP2 parser, with Eric de la Clergerie's DyALog parser and with the GenIgenerator. Future work will deal with the modularisation and the extension of XMGto define a library of languages describing linguistic data allowing the user to describe his own target formalism.

Developed under the supervision of Denys Duchier, the XMGcompiler is the result of an intensive collaboration with CALLIGRAMME. It has been implemented in Oz/Mozart and runs under the Linux, Mac, and Windows platforms. It is available with tools easing its use with parsers and generators (tree viewer, duplicate remover, anchoring module, metagrammar browser).

The system is currently being used and tested by Owen Rambow (University of Columbia, USA) and Laura Kallmeyer (University of Tuebingen, Germany).

Version: 1.1.4

License: CeCILL

Last update: 27/09/2005

Web site: http:// sourcesup. cru. fr/ xmg/

Documentation: http:// sourcesup. cru. fr/ xmg/ #Documentation

Authors: Benoit Crabbé, Denys Duchier, Joseph Le Roux, Yannick Parmentier

Contact: Benoit Crabbé, Yannick Parmentier

Frolog

Frolog is a dialogue system based on current technology from computational linguistics, artificial intelligence planning, and theorem proving. It implements a text adventure game engine that uses natural language processing techniques to analyse the player's input and generate the system's output.

The Frolog core is implemented in Prolog and Java, but it uses external tools for the most heavy-loaded tasks. It performs syntactic analysis of the input based on an English grammar developed using XMGand computes a flat semantic representation using the Tulepa parser. It then uses the constructed semantic representation and an off-the-shelf planner to interpret the player's intention and change the world model accordingly. The world is modelled as a knowledge base in description logics, and accessed using the Description Logic theorem prover Racer. Finally, the results of the action, or descriptions of objects, are generated automatically, using the GenIgenerator.

Frolog is intended to serve as a laboratory in order to test pragmatic theories about the phenomenon of accommodation. It is also result in the first integrated system to use SemTag(the LORIA toolbox for TAG-based Parsing and Generation).

Version: 1.0

License: GPL

Last update: 2008-11-07

Authors: Luciana Benotti, Alejandra Lorenzo, Laura Perez

Contact: Luciana Benotti

GenIsurface realiser

The GenIsurface realiser is a successor of the InDiGen realiser. Also based on a chart algorithm, it is implemented in Haskell and aims for modularity, re-usability and extensibility. The system is “stand-alone” as we use the Glasgow Haskell compiler to obtain executable code for Windows, Solaris, Linux and Mac OS X.

The GenIgenerator uses efficient datatypes and intelligent rule application to minimise the generation of redundant structures. It also uses a notion of polarities as a means, first, of coping with lexical ambiguity and second, of selecting variants obeying given syntactic constraints.

GenIis compatible with both a grammar for French ( SemTag) and for English ( SemXTag), both grammars beeing produced using the MetaGrammar Compiler. SemTagcovers the basic syntactic structures of French as described in Anne Abeillé's book “An Electronic Grammar for French”. SemXTaghas a coverage similar to that of XTAG, the TAG grammar for English developped by the University of Pennsylvannia . Both grammars are additionnally equiped with a compositional semantics supporting semantic construction (during parsing) and/or surface realisation.

The system can process the output of the XMGMetagrammar compiler mentioned above.

Version: 0.20.2

License: GPL

Last update: 2009-11-16

Web site: http:// talc. loria. fr/ GenI-un-realisateur-de-surface. html

Project(s): GenI

Authors: Carlos Areces, Claire Gardent, Eric Kow

Contact: Claire Gardent

HyLoRes, a Resolution Based Theorem Prover for Hybrid Logics

HyLoResis a resolution based theorem prover for hybrid logics (it is complete for the hybrid language H(@, $Im2 $\#8595 $$ ), a very expressive but undecidable language, and it implements a decision method for the sublanguage H(@)). It implements a version of the “given clause” algorithm which is the underlying framework of many current state of the art resolution-based theorem provers for first-order logic; and uses heuristics of order and selection function to prune the search space on the space of possible generated clauses.

HyLoResis implemented in Haskell, and compiled with the Glasgow Haskell compiler (thus, users need no additional software to use the prover). We have also developed a graphical interface.

The interest of HyLoResis twofold: on one hand it is the first mature theorem prover for hybrid languages, and on the other, it is the first modern resolution based prover for modal-like languages implementing optimisations and heuristics like order resolution with selection functions.

Version: 2.5

License: GPL

Last update: 2009-04-09

Web site: http:// www. glyc. dc. uba. ar/ content/ tools. php

Authors: Carlos Areces, Daniel Gorín and Juan Heguiabehere

Contact: Carlos Areces

HTab, a Tableau Based Theorem prover for Hybrid Logics

The main goal behind HTabis to make available an optimised tableaux prover for hybrid logics, using algorithms that ensure termination. We ultimately aim to cover a number of frame conditions (i.e., reflexivity, symmetry, antisymmetry, etc.), as far as we can ensure termination. Moreover, we are interested in providing a range of inference services beyond satisfiability checking. For example, the current version of HTabincludes model generation (i.e., HTabcan generate a model from a saturated open branch in the tableau).

HTaband HyLoResare actually being developed in coordination, and a generic inference system involving both provers is being designed. The aim is to take advantage of the dual behaviour existing between the resolution and tableaux algorithms: while resolution is usually most efficient for unsatisfiable formulas (because a contradiction can be reported as soon as the empty clause is derived), tableaux methods are better suited to handle satisfiable formulas (because a saturated open branch in the tableaux represents a model for the input formula).

Version: 1.5.4

License: GPL

Last update: 2010-11-10

Web site: http:// www. glyc. dc. uba. ar/ content/ tools. php

Authors: Carlos Areces, Guillaume Hoffmann

Contact: Guillaume Hoffmann

HyLoBan, a Game Based Theorem Prover for Topological Hybrid Logics

HyLoBanis a game-based prover, resulting from a direct implementation of Sustretov's game-based proofs of the PSPACE-completeness of the hybrid logics of T0 and T1 topological spaces. The interest of this approach is that termination is guaranteed and in addition the underlying game-based architecture is of independent interest; its disadvantage is that (at present) it is still extremely inefficient.

Version: 0.2

License: GPL

Last update: 2009-10-29

Web site: http:// www. glyc. dc. uba. ar/ content/ tools. php

Authors: Carlos Areces, Guillaume Hoffmann, Dmitry Sustretov

Contact: Guillaume Hoffmann

hGEN, a Random Formula Generator

hGen is a random CNF (conjunctive normal form) generator of formulas for sublanguages of H(@, $Im2 $\#8595 $$ , A, P). It is an extension of the latest proposal of Patel-Schneider and Sebastiane, nowadays considered the standard testing environment for classical modal logics. The random generator is used for assessing the performance of different provers.

Version: 1.2

License: GPL

Last update: 2009-06-17

Web site: http:// www. glyc. dc. uba. ar/ content/ tools. php

Authors: Carlos Areces, Daniel Gorín, Juan Heguiabehere and Guillaume Hoffmann

Contact: Carlos Areces

MEDIA

In the framework of the MEDIA project, software has been developed to process transcriptions of a spoken dialogue corpus and to provide a semantic representation of their task-related content. This software contains a tokeniser, a LTAG parser (LLP2), a LTAG grammar, an OWL ontology and a set of rules in description logic, and works together with a reasoner such as RACER. The current version contains a reference resolution module (anaphora and deixis) which is based on the referential domains theory. The package also contains ways to project the semantic form (referentially solved) into the MEDIA formalism and to evaluate the accuracy of the representation using a test corpus. The whole system has been implemented in Java and communicates with other modules using TCP/IP.

Version: 0.5

License: GPL

Website: http:// www. loria. fr/ ~denis/ media. html

Last update: 12/11/2008

Project(s): MEDIA

Authors and Contact: Alexandre Denis

Nessie

Nessie is a semantic construction tool written in OCaml. It takes a lexicon and a syntax tree as input and produces a semantic representation taking the form of a simply typed lambda term. Simply typed lambda calculus is used not only as the target language, but also as the glue language for assembling the representations provided by the lexicon.

This tool has been successfully used in several applications, the most notable of which being the computation of discourse semantics according to two different theories, namely the compositional DRT (Muskens 95) and the compositional treatment of dynamicity (de Groote 2006).

Future developments of Nessie may include using richer typing systems, and interfacing it with inference and rewriting tools to simplify the representations it produces.

Last update: 2008-11-14

Authors: Sébastien Hinderer

Contact: Sébastien Hinderer

DeDe Corpus

DeDe is a corpus of roughly 50.000 words where around 5.000 definite descriptions have been annotated as coreferential, contextually dependent, non referential or autonomous. The corpus consists of articles from the newspaper Le Mondeand is annotated with Multext-based morphosyntactic information .

Authors: Claire Gardent, Hélène Manuelian

Web site: Distributed by the CNRTL http:// www. cnrtl. fr/

Contact: Claire Gardent

SemTAG

A TAG grammar developed with the XMGmetagrammar compiler and which describes both the syntax and the semantics of a core fragment coverage of French. Syntactically, the grammars covers the constructions described in A. Abeillé 's book. Additionnally, it is equipped with a unification based compositional semantics which supports both semantic construction (using LLP2, Tulipa or SemCONST) and surface realisation (using GenI).

Authors: Claire Gardent, Benoit Crabbé

Contact: Claire Gardent

SemXTAG

A TAG grammar for English developed with the XMGmetagrammar compiler and which describes both the syntax and the semantics of English. Syntactically, the grammar has a coverage comparable to that of the XTAG grammar developed by the University of Pennsylvannia. Additionnally, the grammar integrates a unification based compositional semantics. Used both for parsing (by LLP2 and SemCONST) and for generation (by GenI).

Authors: Claire Gardent, Katya Alahverdzhieva

Contact: Claire Gardent

WikipediaAnnotator

The WikipediaAnnotator program provides semantic annotation of Wikipedia discussion pages. It annotates French Wikipedia participants utterances on the connotation and subjective levels using deep syntax and shallow semantics.

Developing context: CCCP-Prosodie

Programming language: Java

Development effort: 18 man/month

Type of license: GPL

Partners: Telecom Paris Tech

Web site: – (in construction)

ATOOL

ATOOL is a semantic Annotation Tool for the High-Level Semantic Representation MMIL which takes as input the MEDIA corpus (TEI format according to the specifications document of September 2009)

Developing context: PORT-MEDIA

Programming language: Java

Development effort: 4 months (student) + 1 month of evaluation and improvements.

Expected users: Annotators

Type of license: Open Source.

Web site: http:// www. port-media. org/ doku. php?id=start

SRL-Web Annotation

This tool allows the annotation of the utterances in the MEDIA corpus and stores all the information about predicates and arguments in a relational database.

Developing context: PORT-MEDIA

Programming language: JSP-JAVA

Development effort: 1 month.

Expected users: Annotators

Type of license: Open Source.

Web site: http:// talc. loria. fr/

PORT-MEDIA

PORT-MEDIA is a framework supporting the automatic annotation of the MEDIA corpus with the high-level semantic representation MMIL. It is a blackboard architecture interfaced with the Tree Tagger, the Malt parser, the frames, the semantic role labeling and the HLSBuilder for the automatic annotation of the MEDIA Corpus. Additionally it is interfaced with RACER, two ontologies in owl and a relational database with all the information at different linguistic levels. Finally the evaluation software is also provided.

Developing context: PORT-MEDIA

Programming language: Java, MySql, XML, XSLT, OWL.

Development effort: 12 months.

Expected users: Annotators

Type of license: Open Source.

Web site: http:// www. port-media. org/ doku. php?id=start

WSMACI

The Web Service for the Multilingual-Assisted Chat Interface program (WSMACI) is a linguistic assistant for virtual worlds. Its first version is dedicated to English assistance in such worlds. It has been developed in the context of the Metaverse1 project. It provide the end-users with MLIF-based provision of sentence analysis and word information (synonyms, definitions, translations) based on Google Translate, WordNet and the Brown Corpus.

Programming language: MLIF, PHP, SQL

Development effort: 6 man/month

Type of license: INRIA specific

Authors : Tarik Oswald, Samuel Cruz-Lara

Contact: Tarik Oswald

Web site: – (in construction)

4LED

The 4 Layers Emotion Detection program (4LED) is an emotion detection tool. The emotions are extracted from texts in particular, from chat interfaces in virtual worlds. It has been developed in the context of the Metaverse1 project. The emotion detection process is based on SMILEY detection using WordNet-Domains and Tree-Tagger-based rules, WordNet-Affect, and keywords.

Programming language: MLIF, PHP, SQL

Development effort: 6 man/month

Type of license: INRIA specific

Partners: ArtefactO (for rendering)

Authors : Tarik Oswald, Samuel Cruz-Lara

Contact: Tarik Oswald

Web site: – (in construction)

SLMC

The Second Life Magic Carpet program (SLMC) is an assistant whose role is to guide people through virtual worlds with textual instructions. It has been developed in the context of the Metaverse1 project. It has been developed in the context of the Metaverse1 project. It analyses the instructions of the visitors in order to find where they want to go, using web services for the analysis, for synonyms retrieving and for path finding.

Developing context: Metaverse1

Programming language: LSL, PHP, SQL

Development effort: 8 man/month

Type of license: INRIA specific

Partners: Innovalia Spain, Utrecht University

Authors : Tarik Oswald, Samuel Cruz-Lara

Contact: Tarik Oswald

Web site: – (in construction)

DiacXis

The DiacXis program seeks to analyse the evolution over time of textual information. It has been developed in the context of the CPER TALC action McFiiD. DiacXis addresses the analysis of textual information evolving over time using a diachronic approach based both on the Multiview Data Analysis (MVDA) paradigm and on specific cluster labeling techniques especially developed for the statistical analysis of complex data, like textual data . Given a set of textual datasets stemming from different time periods, DiacXis provides the information analyst with lists of stable, appearing, disappearing, merging and splitting topics over these time periods.

Programming language: C + Java

Development effort: 6 man/month

Type of license: INRIA specific

Partners: INIST, ITT Kampur

Authors : Jean-Charles Lamirel, Navesh Pryankar

Contact: Jean-Charles Lamirel

Web site: – (in construction)

IGNGF

The IGNG-F program implements a new incremental clustering algorithm whose main domain of application is the statistical analysis of continuous flow of evolving textual data. It has been developed in the context of the CPER TALC (McFiiD action). It is based on a generic adaptation of the classical neural-based clustering approaches using gas of neurons with free topology. This adaptation resulted in a description space independent and parameter-free, neural clustering technique using Hebbian learning and labeling expectation maximization instead of classical Euclidean or correlation distances . We showed that this approach is more efficient than the usual techniques for the analysis of highly polythematic textual data. Given a continuous flow of textual data, the IGNG-F tool provides the information analyst with precise online detection of diachronic topic changes.

Programming language: C

Development effort: 6 man/month

Type of license: INRIA specific

Partners: INIST, ITT Hyderhabad

Authors : Jean-Charles Lamirel, Raghvendra Mall

Contact: Jean-Charles Lamirel

Web site: – (in construction)

TextClus

The TextClus interface is a java interface whose role is to provide end-users with a whole set of clustering techniques applied to texts. It has been developed in the context of the CPER TALC McFiiD operation. The platform uses a vectorial representation of text data. It includes, in a federating interface, the management of different kinds of data preprocessing techniques (IDF, entropy, random mapping, ...), different kinds of clustering techniques (from standard methods to elaborated neural ones), the management of multiple experiments and comparison of their results by method-independent clustering quality measures specifically adapted to the analysis of textual data , . Each step can be precisely tuned by the end-user through the TextClus program interface to construct a global experimental process.

Programming language: C

Development effort: 6 man/month

Type of license: INRIA specific

Partners: INIST

Authors : Jean-Charles Lamirel, Pascal Cuxac

Contact: Jean-Charles Lamirel

Web site: – (in construction)

New Results Dynamic Modal Logics

We are investigating modal logics that include operators that not only allow for exploration of the model in which they are evaluated, but that can also modifyit. These logics could be specially suitable to describe, for example, the semantic of utterances, to model the fact that uttering a sentence changes the context it was uttered in.

We have investigated in detail a family of dynamic modal logics called memory logicsand established lower and upper complexity bounds, mapped their expressive power, devised tableaux and model checking algorithms, and sound and complete axiomatizations.

Automated Deduction for Hybrid Logics

This year we have finally released stable versions of the two theorem provers (HyLoRes and HTab) for hybrid logics developed by the team. Moreover, the theoretical framework behind each of them has been described in detail in a PhD thesis and a journal article.

Integrating inference with dialogue

This work concentrates on integrating techniques from logic and artificial intelligence ( notably planning) with work on pragmatics and the structure of dialogue , .

Giving Instruction in a Virtual World (GIVE)

We developed two generation systems for the international GIVE challenge , . Interfaced with the 3D game provided by the challenge organisers, these systems guide the player with natural language instructions they generate in real time and according to the player's position in the game. The two systems were ranked first and third. Their development revived a long standing Talaris/Led interest in the generation of referring expressions and launched a new line of research on situated referring expression generation and on the interface between 3D game and NLP.

Using 3D worlds to teach French (I-FLEG)

Within the Interreg IV A Allegro project, we investigated how embedding text generation in a 3D game could help automating the generation of situated language exercises i.e., exercises whose content varies with the 3D world context, with the learner level and with the teaching goal. We developed a system (I-FLEG) illustrating this interaction and made contact with language teachers to arrange for learner use and teacher testing. I-FLEG will be deployed in 2011 in language learning classes and its usability for language teaching tested.

Semantic construction using rewriting

Paul Bédaride and Claire Gardent developed a new method for constructing semantic representations for textual data which is based on graph rewriting , , . They tested it on artificial data with good results and showed that it permits constructing deep semantic representations from dependency structures.

Using Regular Tree Grammars to enhance surface realisation

Making use of the fact that Regular Tree Grammars can generate the derivation trees of a Tree Adjoining Grammar, we developed an alternative surface realiser to Geni called RTGen. Preliminary results suggest that RTGen outperforms GenI. Current work concentrates on further optimising RTGen and on integrating top down control to reduce the search space and constrain the output , .

Verb classes and Semantic Role Labelling for French

Although verb classes and Semantic Role Labelling (SRL) have been shown to be an essential component of semantic processing, there is to date no such resource and tool for French. Using Formal Concept Analysis and information from the LADL tables, we developed a method for classifying verbs based on their subcategorisation information; and a method for projecting thematic roles onto the Paris 7 dependency bank. We are currently extending the verb classification approach to integrate thematic grids and working on evaluating and improving the result. In the long run, we aim to provide a Propbank style corpus for French to and to develop a semantic role labeller based on this corpus , , .

Deep analysis of interaction patterns in texts

In the context of the CCCP-Prosodie project, we showed that connotation and subjective markers are good evidence for conflicts between participants on Wikipedia discussion pages. We observed striking patterns of interaction during conflicts, especially the alternation of 1st person and 2nd person subjective markers in negatively connotated utterances. Further work will involve examine the various conflicts type, aiming to automatically distinguish argument-based conflicts from personal ones (ad hominem)

MLIF

TALARIS contributes to ISO TC 37 committee “Terminologies and other Language Resources”, and more specifically to the activities of its SC3 “Computer Applications in Terminology”, and SC4 “Linguistic Resources Management”. Within TC37/SC4, TALARIS is currently contributing, as project leader, to the definition and specification of the Multi Lingual Information Framework (MLIF) [ISO DIS 24616]. MLIF is being designed with the objective of providing a common abstract model being able to generate several formats used in the framework of translation and localization. MLIF will soon be released as FDIS (Final Draft International Standard) and it should finally be published as an official ISO Standard within the first semester of 2011 , , .

New clustering quality metrics for the analysis of complex textual data

Cluster quality evaluation is a key issue for many data analysis tasks. As we showed in previous work, the classical distance based quality indices are often strongly biased and highly dependent on the clustering method. To cope with such problems, we proposed in earlier work specific Macro-Recall/Precision and F-measures metrics that exploit the properties of cluster associated data. However, our more recent experiments showed that these new metrics failed to highlight degenerated clustering results when analyzing complex textual data , . To remedy this shortcoming, we devised two extensions of these metrics namely Micro-Measures , , and Cumulated Micro-measures . We then experimentally showed the effectiveness of our extended approach by applying it to different documentary corpus of highly polythematic bibliographic records issued from the PASCAL CNRS scientific database.

New diachronic method for the analysis of slowly time-varying textual data

The literature taking into account the chronological aspect in textual information flows focuses on "DataStream" whose main idea is the "on the fly" management of incoming data. However the proposed algorithms are intended to treat very large volumes of data and are thus not optimal for detecting emergent topics such as, for example, the evolution of a research theme within bibliographical records. To address this issue, we proposed a new approach based on our Multiview Data Analysis Paradigm (MVDA) , , , in combination with specific cluster labeling techniques especially developed for the statistical analysis of complex data, like textual data. We applied our approach to the IST PROMTECH reference dataset related to optoelectronic research. When compared with state-of the art approaches, our method proved to provide very significant added-value by permitting to precisely highlight and quantify the observed evolutions, and their related context, ranging from vocabulary changes in a given topic to overall appearing/disappearing of topics, or even to splitting or merging between topics .

New incremental clustering algorithms for the analysis of highly time-varying textual data

Neural clustering algorithms show high performance in the general context of the analysis of homogeneous textual datasets. However, we showed that there is a drastic decrease of performance of these algorithms, as well as of the more classical algorithms, when applied to heterogeneous or polythematic textual datasets. Such result degradation indicates that most of the exiting clustering methods, even those that are considered incremental, are not really able to deal with highly time-varying data . We therefore proposed a new approach to incremental clustering based on a generic adaptation of the classical neural-based clustering methods relying on gas of neurons with free topology. This adaptation resulted in a description space independent and parameter-free, neural clustering technique using Hebbian learning and labeling expectation maximization instead of classical Euclidean or correlation distances. We have proved that our approach very significantly outperformed the existing ones in all experimental contexts, and specifically, in the case of highly time-varying data , , .

Building up evolving networks of wikis

The WICRI Project aims to explore a new concept of "wikis network" with adaptive capabilities. In particular, one aim is to propose strategies for enriching the content of the wiki network by dynamic integration and exploitation of Web data. Hence, Web mining represents an important challenge for enhancing the dynamicity, the flexibility and the scope of such a network. On the one hand, this process is mandatory for assisting the potential contributors with elaborated and reliable redaction guidelines during the network construction phase. On the other hand, it is essential for supplying end-users with external information whose added value is to maintain significant relationships with the semantic context of the wiki network. Although the WICRI project is still in his launching phase, our preliminary prototype of "network of wikis" is already acting as an on-line collaborative research platform .

Other Grants and Activities Regional Initiatives McFIID

Theme:Clustering; Statistical Analysis; Textual data; Time-evolving data; Distributed data

Description:The McFIID project is a CPER project continuing the CPER CLASSIF project. It concerns the development of incremental multi-clustering techniques for managing distributed and evolving flows of textual data. New approach of diachronic analysis based on the use of multiple viewpoints combined with unsupervised bayesian reasoning, as well as new online incremental clustering techniques based on non standard similarity measures, are tested in the curse of these project.

Administrative context:CPER

Web site: http:// wikitalc. loria. fr/ dokuwiki/ doku. php?id=operations:mcfiid

Period:start 2007-01-01 / 2011-12-31

Contact:Jean-Charles Lamirel

Partner(s):INIST, LORIA

National Initiatives CCCP-Prosodie

Theme:Discourse, Dialogue and Pragmatics; Logics for Natural Language and Knowledge Representation

Description:The goal of CCCP-Prosodie is to empirically investigate the functioning of online communities (such as Wikipedia), and particular to link their activities and their use of language (as recorded in such corpora as email exchanges, for example). The TALARIS team is involved in this project for three reasons: to provide Natural language processing tools, to design an annotation scheme capable of dealing with information from both the social sciences (sociology and economics) and the humanities (psychology and ergonomics), and to provide help with inference technology.

Administrative context:ANR CONTINT

Web site: http:// recherche. telecom-bretagne. eu/ labo_communicant/ cccp-prosodie/

Period:start 2008-01-12 / end 2011-31-06

Contact:Alexandre Denis

Partner(s):Institut Télécom, UTC Compiégne, UNSA (Univ. Nice Sophia-Antipolis), Univ. de Versailles St-Quentin

PORT-MEDIA

Theme:Corpus Linguistics, Semantic Annotation of Corpora. Discourse, Dialogue and Pragmatics; Natural Language Understanding and Knowledge Representation

Description:The PORT-MEDIA project is an ANR project that aims to collect linguistic data for multiple domains and to investigate the use of a high-level semantic representation for annotating dialogue corpora. TALARIS contributed to the high-level semantics specification for annotating the MEDIA corpus and to the development of tools for the manual annotation (e.g., ATOOL and SRL-Web Annotation) as well as to the development of the blackboard architecture for the automatic annotation of the MEDIA corpus. Additionnally, Talaris provided the automatic annotation of the whole corpus and its evaluation.

Administrative context:ANR CONTINT

Web site: http:// www. port-media. org/ doku. php?id=start

Period:start 2009-03-01 / end 2012-03-01

Contact:Matthieu Quignard, Lina M. Rojas-Barahona

Partner(s):ELDA, LIG/GETALP, LIA, LIUM, LORIA

Passage

Theme:Computational Semantics

Description:The PASSAGE project has two main aims. The first is to improve the robustness and precision of existing computational grammars for French, and to use them on large corpora (corpora containing several million words). The second is to exploit the resulting syntactical analyzes to create richer linguistic resources (such as Treebanks) for the French language.

Administrative context:ANR MDCA

Web site: http:// atoll. inria. fr/ passage/ home-fr. html

Period:start 2007-01-01 / end 2010-30-06

Contact:Claire Gardent

Partner(s):CEA-LIST, LIMSI, INRIA Rocquencourt, CNRS

European Initiatives Allegro

Theme:Computational Semantics

Description:The Allegro project aims to develop NLP techniques that support language teaching for French and German.

Administrative context:INTERREG IV A

Web site: http:// talc. loria. fr/ -ALLEGRO-Nancy-. html

Period:start 2010-01-01 / end 2012-12-31

Contact:Claire Gardent

Partner(s):Saarbrücken University, Supelec Metz, INRIA Nancy Grand Est

EMOSPEECH

Theme:Computational Semantics

Description:The EMOSPEECH project aims to augment serious games with natural language (spoken and written dialog) and emotional abilities (gesture, intonation, facial expressions).

Administrative context:Eurostars

Period:start 2010-09-01 / end 2013-08-31

Contact:Claire Gardent

Partner(s):Artefacto, Acapella, INRIA Nancy Grand Est

METAVERSE

Theme:Multilinguality for Multimedia

Description:Metaverse is an exciting project whose goal is to provide a standardized global framework enabling the interoperability between virtual worlds (for example Second Life, World of Warcraft, IMVU, Active Worlds, Google Earth and many others) and the Real world (sensors, actuators, vision and rendering, social and welfare systems, banking, insurance, travel, real estate and many others).

Administrative context:ITEA2 07016

Web site: http:// www. metaverse1. org/

Period:start 2009-01-01 / end 2011-12-31

Contact:Samuel Cruz-Lara

Partner(s):Belgian partners: Alcatel-Lucent Bell N.V., Nazooka, IBBT-SMIT; French partners: Alcatel-Lucent France, Orange Labs, CEA List, Artefacto; Greek partners: Forthnew S.A., Ellinogermanki Agogi; Dutch partners: Philips Research, Philips I-Lab, DevLab, Technical University Eindhoven, University of Twente, Stg. EPN, VU Economics & BA, VU CAMeRA; Spanish partners: Innovalia, Ceeda, VirtualWare, CBT, Nextel, Corsa, Avantalia, I&IMS, VicomTECH, E-PYME, CIC Tour Game, UPF-MTH; Israeli partners: Metaverse Labs.

SEMbySEM: Services management by Semantics

Theme:Multilinguality for Multimedia

Description:The goal of the SEMbySEM project is to develop a new open source supervision system adapted to the increasing complexity of “systems of systems”. This new supervisions system will be based on the extensive use of semantic technologies (notably ontologies). It will provide a set of tools allowing the set up of dedicated supervision systems according to the various stakeholders' needs and domain knowledge.

The TALARIS team's contribution to this project will center on providing language technology for developing, maintaining, and enriching ontologies and on developing ISO standards for multilingual user interfaces.

Administrative context:ITEA2 07021

Web site: http:// www. sembysem. org/

Period:start 2008-07-31 / end 2010-12-31

Contact:Samuel Cruz-Lara

Partner(s):Finnish partners: Identoi, LogiNets, Oliotalo, VTT; French partners: Thales (Project Leader), ArcInformatique, CityPassenger, LISSI (Université de Paris 12), LIG (IMAG GRenoble); Spanish partners: Trimek, DataPixel, SQS, CBT, Innovalia; Turkish partners: AGM Lab, METU.

International Initiatives InToHyLo: Inference Tools for Hybrid Logics

Theme:Logics for Natural Language and Knowledge Representation

Description:The main aim of the InToHyLo project is to investigate inference methods for hybrid logics, to develop highly optimized inference tools based on these methods, and to use these tools in natural language applications. Talaris and GLyC are currently leaders in automated theorem proving for hybrid logics, and they are the developers of the two provers HyLoRes (based on resolution) and HTab (based on tableaux). With the InToHyLo project we want to investigate how to combine resolution and tableaux algorithms to allow our provers to collaborate and share partial results. We will integrate our tools in a platform suitable for inference in NLP applications (focusing on Dialogue Systems and Textual Entailment). This platform will include not only tools for satisfiability testing, but also for model building, model checking, bisimulation checking, and knowledge maintenance and retrieval. Finally, we want to develop parallel inference algorithms to improve performance, and distributed testing to speed up developing.

Administrative context:INRIA (Equipes Associées)

Web site: http:// led. loria. fr/ dokuwiki/ doku. php?id=intohylo_-_inria_equipes_associees

Period:start 2009-01 / end 2012-01

Contact:Carlos Areces

Partner(s):Universidad de Buenos Aires, Argentina.

Dissemination PhD Theses

Luciana Benotti defended her PhD thesis at the Université Henri Poincaré entitled Implicature as an Interactive Processsupervised by Patrick Blackburn, on 28 January 2010.

Dimitri Sustretov defended his PhD thesis at the Université Henri Poincaré entitled Topological semantics for hybrid logicsupervised by Patrick Blackburn, on 9 July 2010.

Paul Bedaride defended his PhD thesis at the Université Henri Poincaré entitled Implication textuelle et réécrituresupervised by Claire Gardent, on 18 October 2010. He is now a Postdoc at Stuttgart University.

Guillaume Hoffman defended his PhD thesis at the Université Henri Poincaré entitled Taches de raisonnement en logiques hybridessupervised by Patrick Blackburn and Carlos Areces, on 13 December 2010.

Habilitations

Jean-Charles Lamirel defended his habilitation (HdR) at the Université Henri Poincaré entitled Vers un approche systémique et multivues pour l'analyse de données et la recherche d'information : un nouveau paradigmeon 6 December 2010 .

Service to the Scientific Community

Carlos Areces:

Member of the Management Board of the Association of Logic, Language and Information (FoLLI), 2005-2010.

Patrick Blackburn

Liaison officer for the Erasmus MundusMasters in Language and Communication Technology.

Samuel Cruz-Lara

In charge, at the national level, of the reception of Mexican students in the “Professional Licences of Computer Science”.

Member of W3C's SYnchronized MultiMedia Group http:// www. w3. org/ AudioVideo/ Group/

Member of ISO's TC37 “Terminologies and other Language Resources” / SC4 “Linguistic Resources Management”. Project leader of the Multi Lingual Information Framework (ISO DIS 24616).

Christine Fay-Varnier

Vice president of the Council of studies and university life of the INPL.

Representative of the INPL for the steering committee TICE (Information and Communication Technology for Education) for Nancy University.

Claire Gardent

Member of the LORIA steering committee.

Coordinator of the TALC theme (Computational Linguistics and Computational Approaches to Knowledge) for the MISN CPER (National and Regional Research Funding).

Organiser of the LORIA TALC seminar http:// talc. loria. fr/ -TALC-Seminar-. html

Local organiser for the NaTAL 2010 workshop http:// talc. loria. fr/ NATAL-10-16-18-juin-2010-LORIA,178. html

Jean-Charles Lamirel:

Member of the Management Board of the Collnet international research group in Scientometrics/Informetrics/Webometrics, 2005-2010.

Fabienne Venant

Member of the Administrative Council of ATALA, the French national organisation for computational linguistics (see http:// www. atala. org/ ).

Editorial and Program Committee Work

Carlos Areces

Member of the Editorial Board of the FoLLI Publications on Logic, Language, and Information (part of the Lecture Notes in Artificial Intelligence series published by Springer-Verlag). Since 2006.

Member of the Scientific Board of The Baltic International Yearbook of Cognition, Logic and Comunication. Since 2005

Member of the Editorial Board of the Journal of Logic, Language and Information. Since 2004.

Member of the Editorial Board of the Journal of Applied Logics. Since 2004.

Member of the Organizing Committee of the Workshop on NLP and Web-based technologiesheld in conjunction with IBERAMIA 2010, Bahía Blanca, Argentina.

Member of the Program Committee of the 36th Latin American Conference of Informatics, Asunción, Paraguay.

Member of the Program Committee of the 2010 Workshop on Hybrid Logics (HyLo 10), Edinburgh, United Kingdom.

Member of the Program Committee of the 1era Escuela de Lingüística Computacional (ELiC-1), Buenos Aires, Argentina.

Member of the Program Committee of the 2010 International Workshop on Description Logics (DL2010), Waterloo, Canada.

Member of the Program Committee of the International Joint Conference on Automated Reasoning (IJCAR10)Edinburgh, United Kingdom.

Member of the Program Committee of Advances in Modal Logic (AiML10)Moscu, Russia.

Patrick Blackburn

editor of Review of Symbolic Logic

editor of Notre Dame Journal of Formal Logic

editorial board of Logique et Analyse

subject editor (Logic and Language), Stanford Encyclopedia of Philosophy

Program committee of Advances in Modal Logic 2010 (AiML 2010)

Program committee of Hybrid Logic 2010 (HyLo 2010)

Program committee of Workshop on Theories of Information Dynamics and Interaction and their Application to Dialogue 2010 (TIDIAD@ESSLLI 10)

Nadia Bellalem

PC Member for ICEIS 2010 (The 12th International Conference on Enterprise Information Systems), Madeira, Portugal.

Samuel Cruz-Lara

PC member for KEOD 2010 (The International Conference on Knowledge Engineering and Ontology Development), Valencia, Spain.

Member of the Editorial Board of Revista Iberoamericana de Tecnologías del Aprendizaje

Claire Gardent

PC member for TALN 2010 (Traitement Automatique des Langues Naturelles) 2010, Montréal, Canada.

PC member for SemDial 2010 (14th Workshop on the Semantics and Pragmatics of Dialogue), Poznan, Poland.

PC member for RFIA 2010 (17ème Colloque Francophone sur la Reconnaissance des Formes et l'Intelligence Artificielle), Caen, France.

PC member for EMNLP 2010 (Conference on Empirical Methods in Natural Language Processing), MIT, USA.

PC member for LREC 2010 (The seventh international conference on Language Resources and Evaluation), Malta.

PC member for INLG 2010 (12th European Workshop on Natural Language Generation),Dublin, Irland.

PC member for ACL 2010 (Annual meeting of the Association for Computational Linguistics), Uppsala, Sweden.

Jean-Charles Lamirel

Member of editorial board of the new international journal “COLLNET Journal of Scientometrics and Information Management”, Taru Publications, New Delhi, India ( http:// www. tarupublications. com/ ).

Reviewer for the Neural Networks, Geographical Information Systems and Collnet International Journals.

Program chair of VSST Technological and Strategic Survey Conference VSST 2010, Toulouse, France, October 2010.

Organizer of Special Session on Incremental clustering and novelty detection techniques and their application to intelligent analysis of time varying information in the framework of IEA/IAE International Conference, Syracuse, NY, USA, June 2011.

Co-organizer of the ECG Workshop: Clustering incrémental et méthodes de détection de nouveauté et leur application à l'analyse intelligente d'information évoluant au cours du temps, EGC 2011 Workshop, Brest, France, January 2011.

Christine Fay-Varnier

Member of the organizing committee of TICE 2010 conference (http://www.tice2010.nancy-universite.fr/)

Teaching

Carlos Areces

Invited one week course at ELiC-1, Buenos Aires, Argentina. Invited one week course at NASSLLI 2010, Indiana, USA. Invited talk at JCC 2010, Rosario, Argentina.

Patrick Blackburn

M2 course “Mathematics for Computer Science: Introduction to computability and computational complexity”. 15 hours, Erasmus Mundus Master “Language and Communication Technology”, University of Nancy 2, France, http:// webloria. loria. fr/ ~blackbur/ courses/ math/ .

M2 course “Discourse and Dialogue”. 15 hours, Erasmus Mundus Master “Language and Communication Technology”, University of Malta, http:// webloria. loria. fr/ ~blackbur/ courses/ dad/

Samuel Cruz-Lara

M2 course “Cognitive Sciences and Digital Media Technologies”. 22 hours, Cognitive Sciences, University of Nancy 2, France.

M2 course “Declarative Languages and Multimedia Applications”. 40 hours, Cognitive Sciences, University of Nancy 2, France.

M2 course “Video: Streaming and Captioning”. 12 hours, Cognitive Sciences, University of Nancy 2, France.

Claire Gardent

Software project tutoring “Error mining and Surface Realisation”, Erasmus Mundus Master “Language and Communication Technology”, Nancy 2.

Invited tutorial on “Natural language processing and computer aided language learning”, 4th intensive Summer School and Collaborative Franco-Thai Workshop on Natural Language Processing, Kasetsart University, Bangkok, Thailand.

Jean-Charles Lamirel

Master and PhD course on “Text Mining techniques applied to Linguistics: Introduction to the use of statistical methods for the analysis of literature. Case studies in French Literature. 20 hours, University of Alger.

Which Semantics for Neighbourhood Semantics? Carlos Areces C. Santiago Figueira S. Proceedigns of IJCAI 09 Pasadena, California, USA 2009 671–676 Hybrid Logics Carlos Areces C. Balder ten Cate B. Patrick Blackburn P. Frank Wolter F. Johan van Benthem J. Handbook of Modal Logics Elsevier 2006 Incomplete Knowledge and Tacit Action: Enlightened Update in a Dialogue Game Luciana Benotti L. DECALOG 2007 Workshop on the Semantics and Pragmatics of Dialogue Rovereto, Italy 2007 Pure Extensions, Proof Rules, and Hybrid Axiomatics Patrick Blackburn P. Balder ten Cate B. Studia Logica 84 2006 277–322 Handbook of Modal Logic Patrick Blackburn P. Johan van Benthem J. Frank Wolter F. Elsevier 2007 Termination for Hybrid Tableaus Thomas Bolander T. Patrick Blackburn P. Journal of Logic and Computation 17 2007 517–554 An efficient, streamable text format for multimedia captions and subtitles Dick C. A. Bulterman D. C. A. A. J. Jansen A. J. Pablo Cesar P. Samuel Cruz-Lara S. DocEng '07: Proceedings of the 2007 ACM symposium on Document engineering New York, NY, USA ACM 2007 101–110 http:// doi. acm. org/ 10. 1145/ 1284420. 1284451 Incorporating Asymmetric and Asychronous Evidence of Understanding in a Grounding Model Alexandre Denis A. Guillaume Pitel G. Matthieu Quignard M. Patrick Blackburn P. DECALOG 2007 Workshop on the Semantics and Pragmatics of Dialogue Rovereto, Italy 2007 A symbolic approach to near-deterministic surface realisation using Tree Adjoining Grammar Claire Gardent C. Eric Kow E. Proceedings of ACL Prague 2007 Création d'un corpus annoté pour le traitement des descriptions définies Claire Gardent C. Hélène Manuélian H. Traitement Automatique des Langues 46 1 2005 Generating Bridging Definite Descriptions Claire Gardent C. Kristina Striegnitz K. H. Bunt H. R. Muskens R. Computing Meaning Studies in Linguistics and Philosophy Series 3 Kluwer Academic Publishers 2007 Special Issue on Hybrid Logics Carlos Areces C. Patrick Blackburn P. 8 Elsevier 2010 http:// hal. inria. fr/ inria-00549320/ en Special Issue of the Journal of Applied Logic. C. Areces and P. Blackburn (editors). A new multi-viewpoint and multi-level clustering paradigm for efficient data mining tasks Jean-Charles Lamirel J.-C. INTECH Open Access Publisher 2010 http:// hal. inria. fr/ inria-00535971/ en Vers une approche systémique et multivues pour l'analyse de données et la recherche d'information : un nouveau paradigme Jean-Charles Lamirel J.-C. 2010 Implication Textuelle et Réécriture Paul Bedaride P. Université Henri Poincaré - Nancy I October 2010 http:// hal. inria. fr/ tel-00541581/ en Ph. D. Thesis L'implicature comme un Processus Interactif Luciana Benotti L. Université Henri Poincaré - Nancy I January 2010 http:// hal. inria. fr/ tel-00541571/ en Ph. D. Thesis Tâches de raisonnement en logiques hybrides Guillaume Hoffmann G. Université Henri Poincaré - Nancy I December 2010 http:// hal. inria. fr/ tel-00541664/ en Ph. D. Thesis The Expressive Power of Memory Logics Carlos Areces C. Santiago Figueira S. Sergio Mera S. 1755-0203 Review of Symbolic Logic August 2010 http:// hal. inria. fr/ inria-00519040/ en Coinductive models and normal forms for modal logics (or how we learned to stop worrying and love coinduction) Carlos Areces C. Daniel Gorin D. 1570-8683 Journal of Applied Logic August 2010 http:// hal. inria. fr/ inria-00519038/ en Resolution with Order and Selection for Hybrid Logics Carlos Areces C. Daniel Gorin D. 0168-7433 Journal of Automated Reasoning February 2010 http:// hal. inria. fr/ inria-00519035/ en Lightweight Hybrid Tableaux Guillaume Hoffmann G. 1570-8683 Journal of Applied Logic 8 4 2010 397–408 http:// hal. inria. fr/ hal-00535976/ en Recherche des évolutions technologiques à partir de bases de données bibliographiques : apport de la classification incrémentale Jean-Charles Lamirel J.-C. Pascal Cuxac P. Claire François C. STI Collection S. Technologies de la Connaissance et Recherche d'Information en Contexte Hermes Science Publishing Ltd 2010 http:// hal. inria. fr/ inria-00535972/ en Modal Logics with Counting Carlos Areces C. Guillaume Hoffmann G. Alexandre Denis A. 17th Workshop on Logic, Language, Information and Computation - WoLLIC 2010 Brazil Brasilia 2010 http:// hal. inria. fr/ hal-00482337/ en Workshop on Logic, Language, Information and Computation 17 WoLLIC Benchmarking for syntax-based sentential inference Paul Bedaride P. Claire Gardent C. The 23rd International Conference on Computational Linguistics - COLING 2010 China Beijing August 2010 http:// hal. inria. fr/ inria-00536022/ en International Conference on Computational Linguistics 23 COLING Syntactic testsuites and Textual Entailment Recognition Paul Bedaride P. Claire Gardent C. The seventh International Conference on Language Resources and Evaluation - LREC 2010 Malta Valletta May 2010 http:// hal. inria. fr/ inria-00536020/ en International Conference on Language Resources and Evaluation 7 LREC Negotiating causal implicatures Luciana Benotti L. Patrick Blackburn P. Conference of the Special Interest Group in Dialogue Japan Tokyo September 2010 http:// hal. inria. fr/ inria-00547508/ en Conference of the Special Interest Group in Dialogue 2010 Building and Exploiting a Dependency Treebank for French Radio Broadcast Christophe Cerisara C. Claire Gardent C. Corinna Anderson C. TLT9 – the ninth international workshop on Treebanks and Linguistic Theories Estonia Tartu November 2010 http:// hal. inria. fr/ inria-00537147/ en International Workshop on Treebanks and Linguistic Theories 9 TLT MLIF: A Metamodel to Represent and Exchange Multilingual Textual Information Samuel Cruz-Lara S. Gil Francopoulo G. Laurent Romary L. Nasredine Semar N. LREC (Language Resources and Evaluation Conference) Malta Valletta May 2010 http:// hal. inria. fr/ inria-00518060/ en International Conference on Language Resources and Evaluation 7 LREC Standards for communication and E-learning in virtual worlds: The Multilingual-Assisted Chat Interface Samuel Cruz-Lara S. Tarik Osswald T. Jordan Guinaud J. Nadia Bellalem N. Lotfi Bellalem L. José Cordeiro J. Joaquim Filipe J. 12th International Conference on Enterprise Information Systems - ICEIS 2010 Portugal Funchal SciTePress June 2010 http:// hal. inria. fr/ inria-00512632/ en International Conference on Enterprise Information Systems 12 ICEIS Les méthodes de classification non supervisées appliquées aux textes : mesure de la performance des résultats de clustering de documents Pascal Cuxac P. Jean-Charles Lamirel J.-C. Maha Ghribi M. Association Canadienne des Science de l'Information - ACSI 2010 Canada Montreal 2010 http:// hal. inria. fr/ inria-00535941/ en Association Canadienne des Science de l'Information - ACSI 2010 ACSI Generating Referring Expressions with Reference Domain Theory Alexandre Denis A. INLG 2010 Ireland Dublin July 2010 27-35 http:// hal. inria. fr/ hal-00502414/ en International Natural Language Generation Conference 6 INLG Reference reversibility with Reference Domain Theory Alexandre Denis A. Proceedings of the 11th annual SIGdial Meeting on Discourse and Dialogue - SIGDIAL 2010 Japan Tokyo September 2010 http:// hal. inria. fr/ hal-00516998/ en SIGDIAL Meeting on Discourse and Dialogue 11 SIGDIAL Extending MMIL Semantic Representation: Experiments in Dialogue Systems and Semantic Annotation of Corpora Alexandre Denis A. Lina Maria Rojas Barahona L. M. Matthieu Quignard M. Fifth Joint ISO-ACL/SIGSEM Workshop on Interoperable Semantic Annotation isa-5 China Hong-Kong 2010 http:// hal. inria. fr/ hal-00481868/ en ISO-ACL/SIGSEM Workshop on Interoperable Semantic Annotation isa-5 5 Metadata for Wicri, a network of semantic Wikis for communities in research and innovation Jacques Ducloy J. Thierry Daunois T. Muriel Foulonneau M. Alice Hermann A. Jean-Charles Lamirel J.-C. Stéphane Sire S. Jean-Pierre Thomesse J.-P. Christine Vanoirbeek C. International Conference on Dublin Core and Metadata Applications - DC-2010 United States Pittsburgh 2010 http:// hal. inria. fr/ inria-00535962/ en International Conference on Dublin Core and Metadata Applications 2010 DC CH Multilingual Lexical Support for the SEMbySEM project. Ingrid Falk I. Samuel Cruz-Lara S. Nadia Bellalem N. Tarik Osswald T. Vincent Herrmann V. LREC Workshop Language Resource and Technology Standards Malta La Valetta May 2010 http:// hal. inria. fr/ inria-00509763/ en LREC Workshop Language Resource and Technology Standards 2010 LREC Bootstrapping a Classification of French Verbs Using Formal Concept Analysis. Ingrid Falk I. Claire Gardent C. Interdisciplinary Workshop on Verbs Italy Pisa November 2010 6 http:// hal. inria. fr/ hal-00521268/ en Interdisciplinary Workshop on Verbs 2010 Using Formal Concept Analysis to Acquire Knowledge about Verbs Ingrid Falk I. Claire Gardent C. Alejandra Lorenzo A. 7th International Conference on Concept Lattices and Their Applications - CLA 2010 Spain Sevilla October 2010 12 http:// hal. inria. fr/ hal-00516789/ en International Conference on Concept Lattices and their Applications 7 CLA The role of argumentation in online epistemic communities: the anatomy of a conflict in Wikipedia Dominique Fréard D. Alexandre Denis A. Françoise Détienne F. Michael Baker M. Matthieu Quignard M. Flore Barcellini F. European Conference on Cognitive Ergonomics - ECCE 2010 Netherlands Delft August 2010 91-98 http:// hal. inria. fr/ hal-00516994/ en European Conference for Cognitive Ergonomics 28 ECCE Semi-Automatic Propbanking for French Claire Gardent C. Christophe Cerisara C. TLT9 - The Ninth International Workshop on Treebanks and Linguistic Theories Estonia Tartu November 2010 http:// hal. inria. fr/ inria-00537148/ en International Workshop on Treebanks and Linguistic Theories 9 TLT Comparing the performance of two TAG-based surface realisers using controlled grammar traversal Claire Gardent C. Benjamin Gottesman B. Laura Perez-Beltrachini L. Coling 2010: Posters China Beijing August 2010 http:// hal. inria. fr/ inria-00537017/ en COLING Workshop on Cross-Framework and Cross-Domain Parser Evaluation 2010 COLING DE Identifying Sources of Weakness in Syntactic Lexicon Extraction Claire Gardent C. Alejandra Lorenzo A. Nicoletta Calzolari N. Khalid Choukri K. Bente Maegaard B. Joseph Mariani J. Jan Odijk J. Stelios Piperidis S. Mike Rosner M. Daniel Tapias D. The seventh international conference on Language Resources and Evaluation - LREC'10 Malta Malta European Language Resources Association (ELRA) November 2010 http:// hal. inria. fr/ inria-00537150/ en International Conference on Language Resources and Evaluation 7 LREC ISBN : 2-9517408-6-7 RTG based surface realisation for TAG Claire Gardent C. Laura Perez-Beltrachini L. Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010) China Beijing August 2010 367–375 http:// hal. inria. fr/ inria-00537159/ en International Conference on Computational Linguistics 23 COLING Mesures de qualité de clustering de documents : prise en compte de la distribution des mots clés. Maha Ghribi M. Pascal Cuxac P. Jean-Charles Lamirel J.-C. Alain Lelu A. Nicolas Béchet N. Évaluation des méthodes d'Extraction de Connaissances dans les Données- EvalECD'2010 Tunisia Hammamet Fatiha Saïs January 2010 14 http:// www. lirmm. fr/ ~bechet/ EvalECD2010/ Actes. html http:// hal. inria. fr/ inria-00442953/ en Évaluation des méthodes d'Extraction de Connaissances dans les Données- EvalECD 2010 EvalECD Construction dynamique du sens : application à la prédication verbale Guillaume Jacquet G. Jean-Luc Manguin J.-L. Fabienne Venant F. Bernard Victorri B. Rochebrune 2010 France Rochebrune 2010 http:// hal. inria. fr/ hal-00526804/ en Rencontres interdisciplinaires sur les systèmes complexes naturels et artificiels 2010 A new incremental growing neural gas algorithm based on clusters labeling maximization: application to clustering of heterogeneous textual data Jean-Charles Lamirel J.-C. Zied Boulila Z. Maha Ghribi M. Pascal Cuxac P. Claire François C. 23rd International Conference on Industrial, Engineering & Other Applications of Applied Intelligent Systems (IEA-AIE 2010) Spain Cordoba 2010 http:// hal. inria. fr/ inria-00535942/ en International Conference on Industrial, Engineering & Other Applications of Applied Intelligent Systems (IEA-AIE) 23 IEA-AIE A new incremental neural clustering approach for performing reliable large scope scientometrics analysis Jean-Charles Lamirel J.-C. Zied Boulila Z. Maha Ghribi M. Pascal Cuxac P. Claire François C. 6th International Conference on Webometrics, Informetrics and Scientometrics and 11th COLLNET Meeting India Mysore 2010 http:// hal. inria. fr/ inria-00535967/ en International Conference on Webometrics, Informetrics and Scientometrics 6 WIS Un nouvel algorithme incrémental de gaz neuronal croissant basé sur des l'étiquetage des clusters par maximisation de vraisemblance : application au clustering des gros corpus de données textuelles hétérogènes Jean-Charles Lamirel J.-C. Zied Boulila Z. Maha Ghribi M. Pascal Cuxac P. Claire François C. Technological and Strategic Survey Conference – VSST 2010 France Toulouse 2010 http:// hal. inria. fr/ inria-00535968/ en Colloque Veille Stratégique Scientifique et Technologique 6 VSST A new general paradigm for mining research topics evolving over time Jean-Charles Lamirel J.-C. Pascal Cuxac P. Charif Haydar C. Claire François C. 3rd International Conference on Information Systems and Economic Intelligence - SIIE'2010 Tunisia Sousse 2010 http:// hal. inria. fr/ inria-00535938/ en Conférence Internationale sur les Systèmes d'Informations et Intelligence Economique 3 SIIE Exploitation d'une mesure de micro-précision cumulée non supervisée pour l'évaluation fiable de la qualité des résultats de clustering Jean-Charles Lamirel J.-C. Maha Ghribi M. Pascal Cuxac P. 42èmes Journées de Statistique France Marseille Société Française de Statistique (SFdS) 2010 http:// hal. inria. fr/ inria-00494723/ en Journées de Statistique 42 JDS Micro rappel précision non supervisés : vers de nouvelles mesures de qualité de clustering Jean-Charles Lamirel J.-C. Maha Ghribi M. Pascal Cuxac P. XVIIèmes Rencontres de la Société Francophone de Classification- SFC'10 France Saint-Denis de La Réunion 2010 http:// hal. inria. fr/ inria-00535940/ en Rencontres de la Société Francophone de Classification 17 SFC Unsupervised recall and precision measures: a step towards new efficient clustering quality indexes Jean-Charles Lamirel J.-C. Maha Ghribi M. Pascal Cuxac P. 19th International Conference on Computational Statistics - COMPSTAT'2010 France Paris 2010 http:// hal. inria. fr/ inria-00535961/ en Symposium of the International Association for Statistical Computing 19 COMPSTAT Use of distance-based indexes might well lead to misinterpretation of clustering quality results Jean-Charles Lamirel J.-C. Ghada Safi G. 1st International Workshop on Validation Statistics Germany Berlin 2010 http:// hal. inria. fr/ inria-00535963/ en International Workshop on Validation Statistics 1 Mining research topics evolving over time using a diachronic multi-source approach Jean-Charles Lamirel J.-C. Ghada Safi G. Navesh Pryankar N. Pascal Cuxac P. The Fourth International Workshop on Mining Multiple Information Sources - ICDM 2010 Australia Sydney 2010 http:// hal. inria. fr/ inria-00535969/ en International Workshop on Mining Multiple Information Sources - ICDM 4 ICDM Extraction of Clinical Information from Clinical Reports: an Application to the Study of Medication Overuse Headaches in Italy. Cristiana Larizza C. Matteo Gabetta M. Lina Maria Rojas Barahona L. M. Giusseppe Milani G. Elena Guaschino E. Grazia Sances G. Cristina Cereda C. Riccardo Bellazzi R. AMIA Summit on Translational Bioinformatics United States San Francisco 2010 http:// hal. inria. fr/ inria-00519826/ en AMIA Summit on Translational Bioinformatics 2010 IT Memory-Based Active Learning for French Broadcast News Frédéric Tantini F. Christophe Cerisara C. Claire Gardent C. INTERSPEECH 2010 Japan Tokyo September 2010 1377-1380 http:// hal. inria. fr/ inria-00540423/ en Annual Conference of the International Speech Communication Association 11 INTERSPEECH Meaning representation: from continuity to discreteness Fabienne Venant F. Seventh international conference on Language Resources and Evaluation (LREC) Malta Malte 2010 http:// hal. inria. fr/ hal-00526811/ en LREC Workshop Language Resource and Technology Standards 2010 LREC The GIVE-2 Nancy Generation Systems NA and NM Alexandre Denis A. Marilisa Amoia M. Luciana Benotti L. Laura Perez-Beltrachini L. Claire Gardent C. Tarik Osswald T. INRIA Nancy Grand Est July 2010 http:// hal. inria. fr/ inria-00541578/ en Research Report Topological semantics for hybrid logic Dmitry Sustretov D. July 2010 http:// hal. inria. fr/ inria-00541569/ en Using set constraints to generate distinguishing descriptions Marilisa Amoia M. Claire Gardent C. Stefan Thater S. Proc. of the 7th International Workshop on Natural Language Understanding and Logic Programming - NLULP'02, Copenhagen, Denmark Jul 2002 Analyzing the Core of Categorial Grammar Carlos Areces C. Raffaella Bernardi R. Journal of Logic, Language and Information 13 2 2004 121–137 Computational Semantics Patrick Blackburn P. Johan Bos J. Theoria 18 46 Jan 2003 27-45 Representation and Inference for Natural Language: A First Course in Computational Semantics Patrick Blackburn P. Johan Bos J. CSLI Press Aug 2005 Alternations, monotonicity and the lexicon: an application to factorising information in a Tree Adjoining Grammar Benoît Crabbé B. Proc. of the European Summer School of Logic Language and Computation, Student Session - ESSLLI'03, Vienna, Austria Aug 2003 69-80 Lexical Classes for structuring the lexicon of a TAG Benoît Crabbé B. Lorraine-Saarland Workshop series: prospects and advances in the syntax semantics interface, Nancy, France Oct 2003 Reprsentation et gestion de grammaires d'arbres adjoints lexicalises Benoît Crabbé B. Bertrand Gaiffe B. Azim Roussanaly A. Traitement Automatique des Langues 44 3 Dec 2003 67-91 Une plateforme de conception et d'exploitation de grammaire d'arbres adjoints lexicaliss Benoît Crabbé B. Bertrand Gaiffe B. Azim Roussanaly A. Traitement Automatique du Langage Naturel 2003 - TALN 2003, Batz-sur-Mer, France Jun 2003 Metagrammar Redux Benoît Crabbé B. Denys Duchier D. Proc. of the International Workshop on Constraint Solving and Language Processing - CSLP 2004, Copenhagen, Norway Sep 2004 The MetaGrammar Compiler: An NLP Application with a Multi-paradigm Architecture Denys Duchier D. Joseph Le Roux J. Yannick Parmentier Y. Proc. of the 2nd International Mozart/Oz Conference - MOZ 2004, Charleroi, Belgium Oct 2004 A New Metagrammar Compiler Bertrand Gaiffe B. Benoît Crabbé B. Azim Roussanaly A. Proc. of the 6th International Workshop on Tree Adjoining Grammars and Related Frameworks - TAG+6, Venice, Italy May 2002 Generating minimal definite descriptions Claire Gardent C. Proc. of the 40th Annual Meeting of the Association for Computational Linguistic - ACL'02, Philadelphia, USA Jul 2002 Generating and selecting paraphrases Claire Gardent C. Eric Kow E. Proc. of the 10th European Workshop on Natural Language Generation - ENLG 05, Aberdeen, Scotland Aug 2005 191-196 Cration d'un corpus annot pour le traitement des descriptions dfinies Claire Gardent C. Hélène Manuélian H. Traitement automatique des langues 46 1 2005 Which bridges for bridging definite descriptions? Claire Gardent C. Hélène Manuélian H. Eric Kow E. Proc. of the 4th International Workshop on Linguistically Interpreted Corpora - LINC'03, Budapest, Hungary Apr 2003 Generating Definite Descriptions: Non incrementality, inference and data Claire Gardent C. Hélène Manuélian H. Kristina Striegnitz K. Marilisa Amoia M. Speech production Walter de Gruyter, Berlin Dec 2003 Large scale semantic construction for Tree Adjoining Grammars Claire Gardent C. Yannick Parmentier Y. Philippe Blache P. Ed Stabler E. Joan Busquets J. Richard Moot R. Proc. of Logical Aspects of Computational Linguistics - LACL'05, Bordeaux, France Lecture Notes in Computer Science 3492 Springer Apr 2005 131-146 Generating Bridging Definite Descriptions Claire Gardent C. Kristina Striegnitz K. H. Bunt H. R. Muskens R. Computing Meaning Studies in Linguistics and Philosophy Series 3 Kluwer Academic Publishers Dec 2003 Adapting polarised disambiguation to surface realisation Eric Kow E. Proc. of the 17th European Summer School in Logic, Language and Information - ESSLLI'05, Edinburgh, United Kingdom Aug 2005 Annotation des descriptions dfinies : le cas des reprises par les rles thmatiques Hélène Manuélian H. Rencontre des tudiants Chercheurs en Informatique pour le Traitement Automatique des Langues - RECITAL'2002, Nancy, France Jun 2002 Coreferential Definite and Demonstrative Descriptions in French: A Corpus Study for Text Generation Hélène Manuélian H. Proc. of the ESSLLI'03 Student Session, Vienna, Austria Aug 2003 Descriptions Dfinies et Dmonstratives : Analyses de corpus pour la gnration Hélène Manuélian H. Universit de Nancy 2 Nov 2003 Ph. D. Thesis Gnration de descriptions dfinies et dmonstratives Hélène Manuélian H. Huitime Atelier des doctorants en linguistique - ADL'2003, Paris, France Jul 2003 Une analyse des emplois du dmonstratif en corpus Hélène Manuélian H. Traitement Automatique des Langues Naturelles - TALN 2003, Batz sur Mer, France Jun 2003 Generating Coreferential Descriptions from a Structured Model of the Context Hélène Manuélian H. Proc. of the International Conference on Language Ressources and Evaluation - LREC 2004, Lisbon, Portugal May 2004 XMG: a Multi-formalism Metagrammatical Framework Yannick Parmentier Y. Joseph Le Roux J. Proc. of the 17th European Summer School in Logic, Language and Information - ESSLLI '05, Edinburgh, United Kingdom Aug 2005 Des arbres de dérivation aux forêts de dépendance : un chemin via les forêts partagées Djamé Seddah D. Bertrand Gaiffe B. Traitement automatique des langues Naturelles - TALN'05, Dourdan, France Jun 2005 How to Build Argumental graphs Using TAG Shared Forest: a view from control verbs problematic Djamé Seddah D. Bertrand Gaiffe B. Proc. of the 5th International Conference on the Logical Aspect of Computional Linguistic - LACL'05, Bordeaux, France Apr 2005 Using both Derivation tree and Derived tree to get dependency graph in derivation forest Djamé Seddah D. Bertrand Gaiffe B. Proc. of the 6th International Workshop on Computational Semantics - IWCS-6, Tilburg, The Netherlands Jan 2005 Synchronisation des connaissances syntaxiques et sémantiques pour l'analyse d'énoncés en langage naturel à l'aide des grammaires d'arbres adjoints lexicalisées Djamé Seddah D. Université Henri Poincaré - Nancy 1 Nov 2004 Ph. D. Thesis Génération d'expressions anaphoriques. Raisonnement contextuel et planification de phrases Kristina Striegnitz K. Université Henri Poincaré Nov 2004 Ph. D. Thesis