TALARIS is an INRIA project-team (UMR 7503) common to INRIA, the CNRS, the University of Nancy 1 (Henri Poincaré), the University of Nancy 2, and the National Polytechnic Institute of
Lorraine. For more details, we invite the reader to consult the team web site at
http://
TALARIS stands for Traitement Automatique des Langues : Représentation, Inférence et Sémantique. As this name suggests, the aim of the TALARIS team is to investigate semantic phenomena (broadly construed) in natural language from a computational perspective. More concretely, TALARIS's goal is to develop grammars (with a special emphasis on French) with a semantic dimension, to explore the linguistic and computational issues involved in such areas as natural language generation, textual entailment recognition, discourse and dialogue modeling, pragmatics, and multilinguality, and to investigate the interplay between representation and inference in computational semantics for natural language.
The work of the TALARIS team can be subdivided into four overlapping and mutually supporting categories.
Computational Semantics. This theme is devoted to the theoretical and computational issues involved in building semantic representations for natural language. Special emphasis is placed on developing large scale semantic coverage for the French language.
Discourse, Dialogue and Pragmatics. This theme is devoted to developing theoretical and computational models of discourse and dialogue processing, and investigating the inferential impact of pragmatic factors (that is, the factors affecting how human beings actually use language).
Logics for Natural Language and Knowledge Representation. This theme is devoted to theoretical and computational tools for working with logics suitable for natural language inference and knowledge representation. Special emphasis is placed on hybrid logic, higher-order logic, and discourse representation theory (DRT).
Multilinguality for Multimedia. This theme is devoted to creating generic ISO-based mechanisms for representing and dealing with multilingual textual information. The center of this activity is the MLIF (Multi Lingual Information Framework) specification platform for elementary multilingual units.
The major long term computational goals of the TALARIS team are:
The creation of a large scale computational semantics framework for French that supports deep semantic analysis and surface realisation (the production of sentences from meaning representations),
The creation of dialogue systems (in particular, for French) that support flexible and realistic interaction with the user.
The creation of efficient inference systems for logics that are capable of representing natural language content and the background knowledge required to support reasoning.
These computational goals will be pursued in the context of theoretical investigations that will rigorously map out the required scientific and mathematical context.
The main highlight of 2007 is simply this: it is the first year of TALARIS's existence! Like many other INRIA project teams, TALARIS was born out of a previously existing project — but the past year has been marked by heavy personnel changes. The result is a smaller team that is tightly focused on the goals just listed.
But our birthday year was marked by another highlight — the successful inauguration of a brand new Erasmus Mundus Masters degree in Language and Communication Technology (for detailed information, see
http://
Needless to say, this Masters degree is not just relevant to TALARIS — indeed it is interesting precisely because it is relevant to the entire Nancy computational linguistics and language engineering community (which includes such LORIA-based teams as CALLIGRAMME, ORPAILLEUR, PAROLE and READ and the Lexique team at ATILF). Nonetheless, TALARIS members (notably Carlos Areces, Patrick Blackburn and Claire Gardent) were heavily involved in setting up the program — and so its successful start in 2007 is special cause for celebration in the team. Seven Erasmus Mundus Masters students (3 first year, 4 second year) chose to come to Nancy 2 this year; we hope this success will be repeated in the remaining four years of the program.
We said above that the central research theme of TALARIS was computational semantics (where “semantics” is broadly construed to cover various pragmatic and discourse level phenomena) and that TALARIS is particularly focused on investigating the interplay between representation and inference. Another way of putting this would be to say that the scientific foundations of TALARIS's work boil down to the motto: computational linguistics meets computational logic and knowledge representation.
From computational linguistics we take the large linguistic and lexical semantics resources, the parsing and generation algorithms, and the insight that (whenever possible) statistical information should be employed to cope with ambiguity. From computational logic and knowledge representation we take the various languages and methodologies that have been developed for handling different forms of information (such as temporal information), the computational tools (such as theorem provers, model builders, model checkers, sat-solvers and planners) that have been devised for working with them, together with the insight that, whenever possible, it is better to work with inference tools that have been tuned for particular problems, and moreover that, whenever possible, it is best to devote as little computational energy to inference as possible.
This picture is somewhat idealized. For example, for many languages (and French is one of them) the large scale linguistic resources (lexicons, grammars, WordNet, FrameNet, PropBank, etc.) that exist for English are not yet available. In addition, the syntax/semantics interface often cannot be taken for granted, and existing inference tools often need to be adapted to cope with the logics that arise in natural language applications (for example, existing provers for Description Logic, though excellent, do not cope with temporal reasoning). Thus we are not simply talking about bringing together known tools, and investigating how they work once they are combined — often a great deal of research, background work and development is needed. Nonetheless, the ideal of bringing together the best tools and ideas from computational linguistics, knowledge representation and computational logic and putting them to work in coordination remains the guiding principle.
Another simplification involved in the “computational linguistics meets computational logic and knowledge representation” motto is that often the goal is to find out when the use of computational logic can be avoided or minimized. Logical inference can be computationally expensive, and if simpler statistical methods can be used, or if only computationally tractable inference methods (such as model checking) are required, then it is highly desirable to turn to them. Empirically inspired heuristics are needed so that the tools of computational logic are only applied when truly needed, and only to the smallest problems possible.
To ensure that theoretically plausible ideas really are applicable, and to gain insight as to when empirically oriented methods can be usefully employed, TALARIS focuses on concrete semantic phenomena (for example, tense and aspect, presupposition and anaphora resolution, dialogue structure, etc.). By carefully examining the empirical data, we aim to determine which phenomena require inference and which not; which can be dealt with using weak logics and which not; which can be handled statistically and which not; what scales up successfully and what does not...
Over the next decade, progress in natural language semantics will likely depend on obtaining a deeper understanding of the role played by inference. One of the simplest levels at which inference enters natural language is as a disambiguation mechanism. Utterances in natural language are typically highly ambiguous: inference allows human beings to (seemingly effortlessly) eliminate the irrelevant possibilities and isolate the intended meaning. But inference can be used in many other processes, for example, in the integration of new information into a known context. This is important when generating natural language utterances. For this task we need to be sure that the utterance we generate is suitable for the person being addressed. That is, we need to be sure that the generated representations fit in well with the recipient's knowledge and expectations of the world, and it is inference which guides us in achieving this.
Much recent semantic research actively addresses such problems by systematically integrating inference as a key element. This is an interesting development, as such work redefines the boundary between semantics and pragmatics. For example, van der Sandt's algorithm for presupposition resolution (a classic problem of pragmatics) uses inference to guarantee that new information is integrated in a coherent way with the old information.
The TALARIS team investigates such semantic/pragmatic problems from various angles (for example, from generation and discourse analysis perspectives) and tries to combine the insights offered by different approaches. For example, for some applications (e.g., the textual entailment task) shallow syntactic parsing combined with fast inference in description logic may be the most suitable approach. In other cases, deep analysis of utterances or sentences and the use of a first-order inference engine may be better. Our aim is to explore these approaches and their limitations.
In an ideal world, computational semanticists would not have to worry overly much about linguistic resources. Large scale lexica, treebanks, and wide coverage grammars (supported by fast parsers and offering a flexible syntax semantics interface) would be freely available and easy to combine and use. The semanticist could then focus on modeling semantic phenomena and their interactions.
Needless to say, in reality matters are not nearly so straightforward. For a start, for many languages (including French) there are no large-scale resources of the sort that exist for English. Furthermore, even in the case of English, the idealized situation just sketched does not obtain. For example, the syntax/semantics interface cannot be regarded as a solved problem: phenomena such as gapping and VP-ellipsis (where a verb, or verb phrase, in a coordinated sentence is missing and has to be somehow “reconstructed” from the previous context) still offer challenging problems for semantic construction.
Thus a team like TALARIS simply cannot focus exclusively on semantic issues: it must also have competence in developing and maintaining a number of different lexical resources (and in particular, resources for French).
TALARIS is involved in such aspects in a number of ways. For example, it participates in the development of an open source syntactic and synonymic lexicon for French, in an attempt to lay the ground for a French version of FrameNet; and it also works on developing a large scale, reversible (i.e., usable both for parsing and for generation) Tree Adjoining Grammar for French.
Once again, in the ideal world, not only would computational semanticists not have to worry about the linguistic resources at their disposal, but they would not have to worry about the inference tools available either. These could be taken for granted, applied as needed, and the semanticist could concentrate on developing linguistically inspired inference architectures. But in spite of the spectacular progress made in automated theorem proving (both for very expressive logics like predicate logic, and for weak logics like description logics) over the last decade, we are not yet in the ideal world. The tools currently offered by the automated reasoning community still have a number of drawbacks when it comes to natural language applications.
For a start, most of the efforts of the first-order automated reasoning community have been devoted to theorem proving; model building, which is also a required technology for natural language processing, is nowhere near as well developed, and far fewer systems are available. Secondly, the first-order reasoning community has adopted a resolutely `classical' approach to inference problems: their provers focus exclusively on the satisfiability problem. The description logic community has been much more flexible, offering architectures and optimisations which allow a greater range of problems to be handled more directly. One reason for this has been that historically, not all description logics offered full Boolean expressivity. So there is a long tradition in description logic of treating a variety of inference problems directly, rather than via reduction to satisfiability. Thirdly, many of the logics for which optimised provers exist do not directly offer the kinds of expressivity required for natural language applications. For example, it is hard to encode temporal inference problems in implemented versions of description logics. Fourthly, for very strong logics (notably higher-order logics) few implementations exist and their performance is currently inadequate.
These problems are not insurmountable, and TALARIS members are actively investigating ways of overcoming them. For a start, logics such as higher-order logic, description logic and hybrid logic are nowadays thought of as various fragments of (or theories expressed in) first-order logic. That is, first-order logic provides a unifying framework that often allows transfer of tools or testing methodologies to a wide range of logics. For example, the hybrid logics used in TALARIS (which can be thought of as more expressive versions of description logics) make heavy use of optimization techniques from first-order theorem proving.
Moreover — and from a logical perspective, this is the most interesting point — the interaction between natural language and computational logic is not a one way street. The problems that arise in natural language may well be significant for developments in computational logic. As an example of this, early versions of the CURT software (an educational system for computational semantics developed by Patrick Blackburn and Johan Bos) made use of a standard first-order model builder called MACE. The inference problems that the system generated were then used as tests when the PARADOX model builder was developed, leading to considerable performance improvements. Similarly, natural language applications have also inspired significant performance enhancements to the RACER description logic prover. Feedback from natural language to logic is likely to be an important theme in future developments.
The role of empirical methods (model learning, data extraction from corpora, evaluation) has greatly increased in importance in both linguistics and computer science over the last fifteen years. TALARIS members have been working for many years on the creation, management and dissemination of linguistic resources reusable by the scientific community, both in the context of implementation of data servers, and in the definition of standardized representation formats like TAG-ML. In addition, they have also worked on the applications of linguistic ideas in multimodal settings and multimedia.
The work in this area is in concordance with our scientific projects. As we said above, one of the most important points that needs to be understood about logical inference is how its use can be minimized and intelligently guided. Ultimately, such minimization and guidance must be based on empirical observations concerning the kinds of problems that arise repeatedly in natural language applications.
Finally, it should be remarked that the emphasis on empirical studies lends another dimension to what is meant by inference. While much of TALARIS's focus is on symbolic approaches to inference, statistical and probabilistic methods, either on their own or blended with symbolic approaches, are likely to play an increasingly important role in the future. TALARIS researchers are well aware of the importance of such approaches and are interested in exploring their strengths and weaknesses, and where relevant, intend to integrate them into their work.
The development of large scale grammars is a complex task which usually involves factorising information as much as possible. While good grammar writing and factorisation environments exist for “non-tree grammars” (e.g., HPSG, LFG), this is not the case for “tree-based grammars” such as TAG, Interaction Grammars or Tree Description Grammars. The Extended Metagrammar Compiler (XMG) developed at TALARIS remedies this shortcoming while additionally providing a clean and modular way to describe several linguistic dimensions, thereby supporting the production of tree grammars with semantic information.
TALARIS has a longstanding interest in the semantics and the processing of referential expressions. In recent years, an extensive corpus annotation has been carried out on 5.000 definite descriptions.
The tree adjoining grammar for French developed by TALARIS associates with each NL expression not only a syntactic tree but also a semantic representation. Interestingly, the semantic calculus used is reversible in that the association between strings and semantic representations is non-directional (declarative). We put this feature to work and have been working over the years towards developing a surface realiser for French called GenI.
In essence, the textual entailment recognition task is an inference task, namely deciding whether the information contained in a given text T1 can be inferred from the information provided by another text T2.
It is crucial to be able to answer this question. One important characteristic of natural language is the large number of ways in which it can express the same information. Many natural language processing applications like question answering, information retrieval, generation, and anaphora resolution need to deal with this diversity efficiently and accurately, and recognising textual entailments is a key step towards this.
Textual entailment recognition is a difficult task. The approach we are experimenting with is to encode lexical information as a description logic ontology (or a hybrid logic theory) and then to use logical inference to compute the result.
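To make the idea concrete, here is a toy sketch in Python with invented lexical data. The actual approach encodes the lexicon as a description logic ontology and calls a real prover; this sketch only illustrates the underlying intuition, using hypernym generalisation as a stand-in for logical inference:

```python
# Toy lexical "ontology" (hypothetical data, not a real resource):
# each word maps to a more general word.
HYPERNYMS = {
    "car": "vehicle",
    "vehicle": "artifact",
    "poodle": "dog",
    "dog": "animal",
}

def generalise(word):
    """Return the word together with all its hypernyms."""
    seen = {word}
    while word in HYPERNYMS:
        word = HYPERNYMS[word]
        seen.add(word)
    return seen

def entails(text_facts, hypothesis_facts):
    """Each fact is a (predicate, argument) pair.  The hypothesis is
    entailed if every one of its facts follows from some text fact
    under hypernym generalisation."""
    return all(
        any(p == q and b in generalise(a) for (q, a) in text_facts)
        for (p, b) in hypothesis_facts
    )

# "John owns a poodle" entails "John owns a dog", but not vice versa.
print(entails({("own", "poodle")}, {("own", "dog")}))   # True
print(entails({("own", "dog")}, {("own", "poodle")}))   # False
```

A real system must of course also handle syntactic variation, quantification and negation, which is precisely where logical inference replaces this simple closure test.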
Members of TALARIS are among the main figures proposing the idea of using inference (and in particular, using computational tools like model builders and theorem provers) as an integral part of different tasks in computational semantics, mainly during semantic construction.
TALARIS's main contribution in this topic has been the design of resolution calculi for hybrid logics, which were then implemented in the HyLoRes theorem prover. In particular, TALARIS members have proved that such calculi can be enhanced with ordering and selection-function optimisations without losing completeness. Moreover, the first `effective' (i.e., directly implementable) termination proof for such a calculus has been recently established, and the technique is being extended to more expressive languages.
During this year, and making use of modularized code from HyLoRes, we have implemented a tableaux-based prover for hybrid logics called HTab. HTab is an optimised tableaux prover for hybrid logics, using algorithms that ensure termination. It ultimately aims to cover a number of frame conditions (reflexivity, symmetry, antisymmetry, etc.), as far as it is possible to ensure termination. Moreover, we are interested in providing a range of inference services beyond satisfiability checking. For example, the current version of HTab includes model generation (i.e., HTab can generate a model from a saturated open branch in the tableau).
We have also started to explore other decision methods (e.g., game based decision methods) which are useful for non-standard semantics like topological semantics. The prover HyLoBan is an example of this work.
MLIF (Multi Lingual Information Framework) is being designed as a generic ISO-based mechanism for representing and dealing with multilingual textual information. A preliminary version of MLIF has been associated with digital media within the ISO/IEC MPEG context, dealing with subtitling of video content, dialogue prompts, menus in interactive TV, and descriptive information for multimedia scenes. MLIF comprises a flexible specification platform for elementary multilingual units that may be either embedded in other types of multimedia content or used autonomously to localise existing content.
A metagrammar compiler automatically generates a grammar from a reduced description called a MetaGrammar. This description captures the linguistic properties underlying the syntactic rules of a grammar. Various past and present TALARIS members have been working on metagrammar compilation since 2001, and several tools have been developed within this framework, ranging from the MGC system of Bertrand Gaiffe (now of ATILF, Analyse et Traitement Informatique de la Langue Française, a Nancy-based CNRS unit) to the newly developed XMG system of Crabbé et al.
The XMG system is a second-generation compiler that provides (a) a representation language allowing the user to describe in a factorised and flexible way the linguistic information contained in the grammar, and (b) a compiler for this language (using a Warren Abstract Machine-like architecture). An innovative feature of this compiler is that it makes it possible to describe several linguistic dimensions; in particular, it is possible to define a natural syntax/semantics interface within the metagrammar.
The compiler currently supports two syntactic formalisms (Tree Adjoining Grammars and Interaction Grammars) and the description of both the syntactic and the semantic dimension of natural language. The generated grammars are in XML format, which makes them easy to reuse. Plug-ins have been realised for the LLP2 parser, for Eric de la Clergerie's DyALog parser and for the GenI generator. Future work concerns the modularisation and extension of XMG to define a library of languages for describing linguistic data, allowing the user to describe his own target formalism.
Developed under the supervision of Denys Duchier, the XMG compiler is the result of an intensive collaboration with CALLIGRAMME. It has been implemented in Oz/Mozart and runs under the Linux, Mac, and Windows platforms. It is available with tools easing its use with parsers and generators (tree viewer, duplicate remover, anchoring module, metagrammar browser).
The system is currently being used and tested by Owen Rambow (Columbia University, USA) and Laura Kallmeyer (University of Tübingen, Germany).
Version: 1.1.4
License: CeCILL
Last update: 27/09/2005
Web site:
http://
Documentation:
http://
Authors: Benoit Crabbé, Denys Duchier, Joseph Le Roux, Yannick Parmentier
Contact: Benoit Crabbé, Yannick Parmentier
Frolog is a dialogue system based on current technology from computational linguistics, artificial intelligence planning and theorem proving. It implements a text adventure game engine that uses natural language processing techniques to analyse the player's input and generate the system's output.
The Frolog core is implemented in Prolog, but it uses external tools for the heaviest tasks. It performs syntactic analysis of the input based on an English grammar developed using XMG and computes a flat semantic representation using the SemConst semantic construction tool. It then uses the constructed semantic representation and an off-the-shelf planner to interpret the player's intention and change the world model accordingly. The world is modelled as a knowledge base in description logic, and accessed using the description logic theorem prover RACER. Finally, the results of the action, or descriptions of objects, are generated automatically, using the GenI generator.
Frolog's main utility is to serve as a laboratory for testing pragmatic theories about presupposition accommodation. However, it will also result in the first integrated system to use SemTag (the LORIA toolbox for TAG-based parsing and generation).
Version: 0.9
License: GPL
Last update: 2007-11-26
Web site:
http://
Documentation:
http://
Authors: Luciana Benotti, Alejandra Lorenzo, Laura Perez
Contact: Luciana Benotti
The GenI generator is a successor of the InDiGen generator. Also based on a chart algorithm, it is implemented in Haskell (one of the leading functional programming languages available nowadays) and aims for modularity, re-usability and extensibility. The system is “stand-alone”, as we use the Glasgow Haskell compiler to obtain executable code for Windows, Solaris, Linux and Mac OS X.
The GenI generator uses efficient datatypes and intelligent rule application to minimise the generation of redundant structures. It also uses a notion of polarities as a means, first, of coping with lexical ambiguity and, second, of selecting variants obeying given syntactic constraints.
The grammar used by the GenI generator is produced using the MetaGrammar Compiler and covers the basic syntactic structures of French as described in Anne Abeillé's book “An electronic grammar for French”.
The system can process the output of the XMG Metagrammar compiler mentioned above.
Version: 0.8
License: GPL
Last update: 2005-10-17
Web site:
http://
Documentation:
http://
Project(s): GenI
Authors: Carlos Areces, Claire Gardent, Eric Kow
Contact: Claire Gardent
HyLoRes is a resolution-based theorem prover for hybrid logics (it is complete for the hybrid language H(@, ↓), a very expressive but undecidable language, and it implements a decision method for the sublanguage H(@)). It implements a version of the “given clause” algorithm, which is the underlying framework of many current state-of-the-art resolution-based theorem provers for first-order logic, and uses ordering and selection-function heuristics to prune the space of possible generated clauses.
HyLoRes is implemented in Haskell, and compiled with the Glasgow Haskell compiler (thus, users need no additional software to use the prover). We have also developed a graphical interface.
The interest of HyLoRes is twofold: on the one hand, it is the first mature theorem prover for hybrid languages; on the other, it is the first modern resolution-based prover for modal-like languages implementing optimisations and heuristics like ordered resolution with selection functions.
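For readers unfamiliar with it, the “given clause” framework can be sketched in a few lines. The following is a minimal propositional version in Python, purely for illustration (HyLoRes itself is a Haskell prover for hybrid logic and does far more, including ordering and selection heuristics):

```python
# Minimal given-clause loop for propositional resolution.
# A clause is a frozenset of literals; a literal is (atom, polarity).

def resolvents(c1, c2):
    """All binary resolvents of two clauses."""
    out = []
    for (atom, pol) in c1:
        if (atom, not pol) in c2:
            out.append((c1 - {(atom, pol)}) | (c2 - {(atom, not pol)}))
    return out

def given_clause(clauses):
    """Return True iff the clause set is unsatisfiable
    (i.e., the empty clause is derived)."""
    new = set(map(frozenset, clauses))   # the "passive" clause set
    processed = set()                    # the "active" clause set
    while new:
        given = min(new, key=len)        # crude heuristic: smallest clause first
        new.remove(given)
        if not given:
            return True                  # empty clause derived: contradiction
        for other in processed:
            for r in resolvents(given, other):
                fr = frozenset(r)
                if fr not in processed and fr not in new:
                    new.add(fr)
        processed.add(given)
    return False                         # saturated without the empty clause

# p, (not p or q), not q  --  unsatisfiable
print(given_clause([{("p", True)}, {("p", False), ("q", True)}, {("q", False)}]))  # True
```

Real provers replace the "smallest clause first" choice by sophisticated clause-selection strategies; the correctness of the loop does not depend on that choice, only its efficiency.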
Version: 2.4
License: GPL
Last update: 2007-12-01
Web site:
http://
Documentation:
http://
Authors: Carlos Areces, Daniel Gorín and Juan Heguiabehere
Contact: Carlos Areces
The main goal behind HTab is to make available an optimised tableaux prover for hybrid logics, using algorithms that ensure termination. We ultimately aim to cover a number of frame conditions (reflexivity, symmetry, antisymmetry, etc.), as far as we can ensure termination. Moreover, we are interested in providing a range of inference services beyond satisfiability checking. For example, the current version of HTab includes model generation (i.e., HTab can generate a model from a saturated open branch in the tableau).
HTab and HyLoRes are actually being developed in coordination, and a generic inference system involving both provers is being designed. The aim is to take advantage of the dual behaviour of the resolution and tableaux algorithms: while resolution is usually most efficient on unsatisfiable formulas (i.e., a contradiction can be reported as soon as the empty clause is derived), tableaux methods are better suited to handling satisfiable formulas (i.e., a saturated open branch in the tableau represents a model for the input formula).
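The model-generation behaviour of tableaux (a saturated open branch yields a model, while a clash closes the branch) can be sketched for propositional logic. This is a deliberately tiny Python illustration; the formula encoding is invented, and hybrid-logic tableaux need much more machinery to guarantee termination:

```python
# Propositional tableau sketch.  Formulas are in negation normal form:
#   ("lit", name, polarity), ("and", f, g), ("or", f, g).
# Returns a model (dict: atom -> truth value) or None if unsatisfiable.

def tableau(branch, todo):
    if not todo:
        return dict(branch)                 # saturated open branch = model
    f, rest = todo[0], todo[1:]
    if f[0] == "lit":
        _, name, pol = f
        if branch.get(name, pol) != pol:
            return None                     # clash: the branch closes
        return tableau({**branch, name: pol}, rest)
    if f[0] == "and":                       # conjunctive expansion
        return tableau(branch, [f[1], f[2]] + rest)
    # disjunctive expansion: try each branch in turn
    return tableau(branch, [f[1]] + rest) or tableau(branch, [f[2]] + rest)

# (p or q) and not p  --  satisfiable; the open branch gives the model
phi = ("and", ("or", ("lit", "p", True), ("lit", "q", True)), ("lit", "p", False))
print(tableau({}, [phi]))    # {'q': True, 'p': False}
```

The duality mentioned above is visible even here: this procedure answers "satisfiable" only after fully saturating a branch, whereas the resolution loop answers "unsatisfiable" the moment the empty clause appears.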
Version: 1.2.1
License: GPL
Last update: 2007-12-01
Web site:
http://
Documentation:
http://
Authors: Carlos Areces, Guillaume Hoffmann
Contact: Guillaume Hoffmann
HyLoBan is a game-based prover, resulting from a direct implementation of Sustretov's game-based proofs of the PSPACE-completeness of the hybrid logics of T0 and T1 topological spaces. The interest of this approach is that termination is guaranteed, and in addition the underlying game-based architecture is of independent interest; its disadvantage is that (at present) it is still extremely inefficient.
Version: 0.2
License: GPL
Last update: 2007-12-01
Web site:
http://
Documentation:
http://
Authors: Carlos Areces, Guillaume Hoffmann, Dmitry Sustretov
Contact: Guillaume Hoffmann
hGen is a random CNF (conjunctive normal form) generator of formulas for sublanguages of H(@, ↓, A, P). It is an extension of the latest proposal of Patel-Schneider and Sebastiani, nowadays considered the standard testing environment for classical modal logics. The random generator is used for assessing the performance of different provers.
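In the same spirit, a much-simplified random generator might look as follows (Python; the parameters and the clause encoding are invented for illustration and bear no relation to hGen's actual input format):

```python
import random

def random_literal(n_props, max_depth, rng):
    """One random literal, possibly nested under modal boxes."""
    lit = ("p%d" % rng.randrange(n_props), rng.choice([True, False]))
    for _ in range(rng.randrange(max_depth + 1)):
        lit = ("box", lit)
    return lit

def random_cnf(n_clauses, clause_len, n_props, max_depth, seed=0):
    """A random CNF formula: a list of clauses, each a list of literals.
    Seeding makes test sets reproducible, which matters for benchmarking."""
    rng = random.Random(seed)
    return [[random_literal(n_props, max_depth, rng) for _ in range(clause_len)]
            for _ in range(n_clauses)]

formula = random_cnf(n_clauses=3, clause_len=2, n_props=4, max_depth=2)
print(len(formula))    # 3
```

The key design point such generators share is explicit control over the parameters (number of clauses, clause length, vocabulary size, modal depth), since prover performance varies sharply across these dimensions.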
Version: 1.1
License: GPL
Last update: 2007-12-01
Web site:
http://
Documentation:
http://
Authors: Carlos Areces, Daniel Gorín, Juan Heguiabehere and Guillaume Hoffmann
Contact: Carlos Areces
Maurice Gross' grammar lexicon contains extremely rich and exhaustive information about the morphosyntactic and semantic properties of French syntactic functors (verbs, adjectives, nouns). Yet its use within natural language processing systems is still restricted.
The aim of our work is to translate this information into a format which is more suitable for use by NLP systems and also compatible with the state of the art practice in lexical data representation.
The lexicon should assign to each verb a set of subcategorisation frames. Frames are defined by a list of atoms (e.g., A0 V A1 ) representing the verb and its arguments, and by a list of atoms/feature structure pairs specifying the feature values associated with each of these atoms.
Two sets of subcategorisation lexicons (called LADL-SynLex and NLP-SynLex) were extracted from the LADL tables. The current SynLex contains the LADL- and NLP-SynLex lexicons for the LADL tables 1, 2, 4, 5, 7, 8, 10, 11, 13, 14 and 16, which together amount to roughly 2.000 verb usages. Work is underway to process the remaining available tables, which should yield a description of roughly 6.500 verbs.
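To make the frame format concrete, here is a hypothetical rendering in Python. The atom names follow the A0 V A1 convention above, but the feature values are invented for illustration and do not come from the LADL tables:

```python
from dataclasses import dataclass, field

@dataclass
class Frame:
    """A subcategorisation frame: an ordered list of atoms (the verb and
    its arguments) plus a feature structure constraining each atom."""
    atoms: list                                    # e.g. ["A0", "V", "A1"]
    features: dict = field(default_factory=dict)   # atom -> feature structure

# "A0 V A1" with a sentential A1 introduced by "que" (illustrative values)
frame = Frame(
    atoms=["A0", "V", "A1"],
    features={"A0": {"cat": "np"},
              "V":  {"lemma": "penser"},
              "A1": {"cat": "s", "comp": "que"}},
)
print(frame.atoms)    # ['A0', 'V', 'A1']
```

An NLP system can then select lexical entries by matching a verb's frames against the structures proposed by the parser or generator.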
SynLex is the result of joint work between TALARIS, ATILF and CALLIGRAMME.
Last update: 2005-10-14
Web site:
http://
Documentation:
http://
Project(s): SynLex
Authors: Claire Gardent, Guy Perrier, Bruno Guillaume, Ingrid Falk
Contact: Claire Gardent
In the framework of the MEDIA project, software has been developed to process transcriptions of a spoken dialogue corpus and to provide a semantic representation of their task-related content. This software contains a tokeniser, a TAG parser (LLP2), a TAG grammar, an OWL ontology and a set of rules in description logic, and works together with a reasoner such as RACER. The modularity of its architecture and the use of the Java programming language enable this software to be run on multiple platforms and to be easily adapted to other transactional contexts besides hotel reservation (its original application domain). This software is to be further improved to implement reference resolution and dialogic contextual understanding (during the second stage of the MEDIA project) and eventually to be embedded in dialogue systems.
Version: 0.3
License: GPL
Last update: 08/11/2005
Project(s): MEDIA
Authors and Contact: Alexandre Denis
The CURT (Clever Use of Reasoning Tools) family is a series of simple dialogue systems which illustrate how tools for building semantic representations can be combined with inference tools.
The behaviour of the different CURT programs is as follows: the user extends CURT's knowledge by entering English sentences, and can query it about its acquired knowledge.
The CURT family is composed of Baby CURT (the backbone of the CURT system, using no inference services), Rugrat CURT (which includes either a simple free-variable tableau prover or a resolution prover to check the consistency of the current dialogue), Clever CURT (which performs consistency checking by running a sophisticated first-order theorem prover and model checker in parallel), Sensitive CURT (which checks in addition for informativeness of the discourse), Scrupulous CURT (which eliminates equivalent interpretations), Knowledgeable CURT (which adds lexical and world knowledge) and Helpful CURT (which is able to handle simple natural language questions from the user).
A multilingual version of CURT is being developed (covering French, Romanian and Spanish).
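The update policy shared by the CURT variants can be illustrated with a toy propositional sketch: a new sentence is rejected if it contradicts the current knowledge (Clever CURT's consistency check) or if it is already entailed by it (Sensitive CURT's informativeness check). The real systems run first-order provers and model builders in parallel; the brute-force checker and all formulas below are purely illustrative.

```python
from itertools import product

def evaluate(formula, assignment):
    """Evaluate a formula given as nested tuples:
    ('var', name), ('not', f), ('and', f, g), ('or', f, g)."""
    op = formula[0]
    if op == 'var':
        return assignment[formula[1]]
    if op == 'not':
        return not evaluate(formula[1], assignment)
    if op == 'and':
        return evaluate(formula[1], assignment) and evaluate(formula[2], assignment)
    if op == 'or':
        return evaluate(formula[1], assignment) or evaluate(formula[2], assignment)
    raise ValueError('unknown connective: %s' % op)

def variables(formula, acc=None):
    acc = set() if acc is None else acc
    if formula[0] == 'var':
        acc.add(formula[1])
    else:
        for sub in formula[1:]:
            variables(sub, acc)
    return acc

def satisfiable(formulas):
    # Brute-force model search -- a stand-in for CURT's theorem prover
    # and model builder running in parallel.
    vs = sorted(set().union(*[variables(f) for f in formulas]))
    for values in product([False, True], repeat=len(vs)):
        assignment = dict(zip(vs, values))
        if all(evaluate(f, assignment) for f in formulas):
            return True
    return False

def update(kb, sentence):
    """CURT-style update: reject inconsistent or uninformative input."""
    if not satisfiable(kb + [sentence]):
        return kb, 'inconsistent'       # Clever CURT's consistency check
    if not satisfiable(kb + [('not', sentence)]):
        return kb, 'uninformative'      # Sensitive CURT's extra check
    return kb + [sentence], 'accepted'
```

For instance, after asserting "it rains", the contradictory "it does not rain" is rejected as inconsistent, and the entailed "it rains or it is sunny" as uninformative.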
Version: 1.0
License: GPL
Last update: 2005-10-11
Web site:
http://
Documentation:
http://
Authors: Carlos Areces, Patrick Blackburn, Johan Bos, Sébastien Hinderer, Daniela Solomon
Contact: Carlos Areces, Patrick Blackburn, Sébastien Hinderer.
Nessie is a library providing facilities for semantic construction. It is written in OCaml and uses typed lambda-calculus and first-order logic as underlying formalisms. It allows the user to flexibly build terms and term trees; once a lambda-term tree is built, its semantic representation can be efficiently computed.
For test purposes, this library has been interfaced with CURT's parser.
Future developments of Nessie will include an extension to other logics (modal logic, hybrid logic, ...), interfaces with French grammars (e.g., those generated by the XMG system), interfaces with inference tools, etc.
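The style of semantic construction Nessie provides can be sketched as follows: lexical entries are typed lambda terms, and the meaning of a phrase is computed by beta-reducing their applications. The sketch below is a minimal Python analogue of this idea, not Nessie's actual (OCaml) API; the term representation and lexical entries are invented for illustration, and the toy reducer omits capture-avoiding renaming.

```python
class Var:
    def __init__(self, name): self.name = name

class Lam:
    def __init__(self, param, body): self.param, self.body = param, body

class App:
    def __init__(self, fun, arg): self.fun, self.arg = fun, arg

class Pred:
    def __init__(self, name, args): self.name, self.args = name, args

def substitute(term, name, value):
    """Replace free occurrences of variable `name` by `value`."""
    if isinstance(term, Var):
        return value if term.name == name else term
    if isinstance(term, Lam):
        if term.param == name:                 # bound occurrence shadows `name`
            return term
        return Lam(term.param, substitute(term.body, name, value))
    if isinstance(term, App):
        return App(substitute(term.fun, name, value),
                   substitute(term.arg, name, value))
    return Pred(term.name, [substitute(a, name, value) for a in term.args])

def normalise(term):
    """Beta-reduce to normal form."""
    if isinstance(term, App):
        fun, arg = normalise(term.fun), normalise(term.arg)
        if isinstance(fun, Lam):
            return normalise(substitute(fun.body, fun.param, arg))
        return App(fun, arg)
    if isinstance(term, Lam):
        return Lam(term.param, normalise(term.body))
    if isinstance(term, Pred):
        return Pred(term.name, [normalise(a) for a in term.args])
    return term

def show(term):
    if isinstance(term, Var):
        return term.name
    if isinstance(term, Lam):
        return '\\%s.%s' % (term.param, show(term.body))
    if isinstance(term, App):
        return '(%s %s)' % (show(term.fun), show(term.arg))
    return '%s(%s)' % (term.name, ', '.join(show(a) for a in term.args))

# Toy lexical entry: "loves" is \y.\x.love(x,y); proper names are constants.
loves = Lam('y', Lam('x', Pred('love', [Var('x'), Var('y')])))
```

Applying `loves` to `mary` and then `john` beta-reduces to the representation `love(john, mary)`.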
Last update: 2005-10-11
Authors: Sébastien Hinderer
Contact: Sébastien Hinderer
DeDe is a corpus of roughly 50,000 words in which around 5,000 definite descriptions have been annotated as coreferential, contextually dependent, non-referential or autonomous. The corpus consists of articles from the newspaper Le Monde and is annotated with Multext-based morphosyntactic information.
Authors: Claire Gardent, Hélène Manuelian
Contact: Claire Gardent
A TAG grammar developed with the XMG metagrammar compiler which describes both the syntax and the semantics of natural language expressions. Syntactically, the grammar covers the TSNLP test suite, and work is in progress to acquire an equivalent semantic coverage. It is used both for parsing and for generation.
Authors: Claire Gardent, Benoit Crabbé
Contact: Claire Gardent
An HPSG grammar of French developed with the LKB platform. The grammar incorporates a treatment of interface phenomena (syntax-semantics, phonology-syntax, morphology-syntax) in a constraint-based framework designed for bidirectionality (parsing and generation).
Version: 0.1
License: LGPL-LR
Last update: 01/11/2005
Project(s): Delph-In
Authors and Contact: Jesse Tseng
LLP2 is an LTAG parser based on the bottom-up algorithm described in Patrice Lopez's thesis. The present version is restricted to TIG. The parser is compliant with the TAGML2 resource format and is capable of processing a graph of words as input. Furthermore, an external utterance segmenter can be plugged in. The distribution comes with graphical exploration and debugging tools.
Version: 1.0
Last update: 31/05/2005
Web site:
http://
Documentation:
http://
Project(s): Passage
Authors and Contact: Azim Roussanaly
To structure our discussion of the new results obtained by TALARIS in 2007, we shall discuss the four main themes in turn:
Computational Semantics
Discourse, Dialogue and Pragmatics
Logics for Natural Language and Knowledge Representation
Multilinguality for Multimedia
The Computational Semantics group in TALARIS focuses on two main points:
The development of a computational infrastructure for the semantic processing of French
The interfacing of natural language processing (NLP) systems with knowledge-based inference
The bulk of the work in this area is led by Claire Gardent, who guides the work of Marilisa Amoia, Paul Bedaride, Ingrid Falk, Eric Kow, Yannick Parmentier, Sylvain Schmitz, and Fabienne Venant. In addition, Patrick Blackburn and Sébastien Hinderer work on computational semantics (though applications in French are not the focus of their work). Many of the tools produced by the Computational Semantics group are used by other TALARIS researchers, notably those in the Discourse, Dialogue and Pragmatics group.
In 2007, work concentrated mainly on finalising a basic parsing and generation architecture for the semantic processing of French. This involves developing, integrating and evaluating several modules, including: a syntactic lexicon (SynLex) and a lexicon for verb synonyms (Syn2), a grammar (SemFrag), a module for semantic construction (SemConst) and a module for surface realisation (GenI). The Computational Semantics group also initiated and organised, in collaboration with CALLIGRAMME, a workshop on XMG, a grammar writing environment developed in Nancy.
A verbal syntactic lexicon lists for each verb the type and the number of its arguments. Such a lexicon is required for any NLP application involving either parsing or generation.
However, until recently, no such lexicon was available for French. Over the last three years, TALARIS has worked on extracting such a lexicon from a large-scale linguistic resource manually developed under Maurice Gross's guidance, namely the LADL tables. Although these tables are extremely rich, they cannot be used directly in NLP for two reasons. First, their format does not fit the requirements of NLP systems and, second, much of their information (in particular the inter-dependencies) is only implicitly represented. To remedy these shortcomings, we devised a method to extract the relevant information from these tables and translate it into a format amenable to NLP.
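The extraction step can be pictured as follows: each LADL table row marks, with + and - signs, which constructions a verb accepts, and turning the row into an NLP-usable lexicon means making that implicit information explicit as subcategorisation frames. The miniature below is hypothetical (the real tables, their column labels and their inter-dependencies are far richer):

```python
# Hypothetical miniature of a LADL-style table: one row per verb, with
# '+'/'-' cells saying whether the verb accepts a given construction.
# Column labels and verbs are invented for illustration.
TABLE_COLUMNS = ['N0 V', 'N0 V N1', 'N0 V que P']
TABLE = {
    'dormir': ['+', '-', '-'],
    'manger': ['+', '+', '-'],
    'penser': ['+', '-', '+'],
}

def extract_frames(table, columns):
    """Make the implicit information explicit: translate +/- cells into
    a verb -> subcategorisation-frame lexicon usable by an NLP system."""
    lexicon = {}
    for verb, cells in table.items():
        lexicon[verb] = [col for col, cell in zip(columns, cells) if cell == '+']
    return lexicon
```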
To reason about the meaning of text, lexical semantic knowledge is necessary, in particular knowledge about synonyms (to detect that two sentences carry the same meaning, one must be able to detect synonymous words). Together with ATILF, we initiated the MISN CPER operation Syn2, whose aim is to merge five synonym dictionaries on the basis of the TLFi definitions. As a first step, we defined a method based on similarity measures between definitions which permits regrouping synonyms by sense. It remains to be seen how this method performs on the data. Next, we intend to evaluate and compare several variants of the proposed method with respect to a gold standard. Once the best method is identified, a single synonym lexicon will be created which assigns to a word and each of its possible meanings the set of synonyms corresponding to that meaning.
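The regrouping step can be sketched with a simple word-overlap measure between definitions: synonyms whose definitions resemble each other are assumed to share a sense. The actual Syn2 method over TLFi definitions is more elaborate; the measure, the threshold and all data below are invented for illustration.

```python
def similarity(def_a, def_b):
    """Word-overlap (Jaccard) similarity between two definitions."""
    a, b = set(def_a.lower().split()), set(def_b.lower().split())
    return len(a & b) / len(a | b)

def group_by_sense(definitions, threshold=0.2):
    """Greedy regrouping: each (synonym, definition) pair joins the first
    group whose representative definition it resembles enough."""
    groups = []
    for word, definition in definitions:
        for group in groups:
            if similarity(definition, group[0][1]) >= threshold:
                group.append((word, definition))
                break
        else:
            groups.append([(word, definition)])
    return groups
```

On invented entries for a word with a "brightness" sense and a "weight" sense, the procedure separates the two sense groups.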
To parse and generate sentences, a grammar is required. CALLIGRAMME and TALARIS have been developing over the last five years medium-size grammars for French using the XMG grammar writing environment developed by Denys Duchier, Yannick Parmentier and Joseph LeRoux. The TALARIS grammar SemFrag is a Tree Adjoining Grammar augmented with a unification-based compositional semantics. A distinguishing feature of SemFrag is its reversibility: the grammar can be used with a parser to derive the semantic representation of a sentence or, conversely, with a realiser to produce the sentences associated by the grammar with a given meaning. In , we show that the type of semantics used in SemFrag obeys some general principles which are common to other types of unification-based semantics, such as the glue semantics used in Lexical Functional Grammar, and which make it easier to integrate the required semantic information into a large-scale grammar. In future, we plan to use these principles to reformulate the semantic dimension of SemFrag in a more general and compact way.
To derive semantic representations from sentences, we use SemFrag together with Eric de la Clergerie's TAG parser and a semantic construction module implemented by Claire Gardent. The module is available on the web and was presented both at TALN (the French NLP conference) and at ACL (the Association for Computational Linguistics) , . A more detailed description of the system and of its theoretical underpinnings is given in Yannick Parmentier's PhD thesis .
The surface realiser GenI takes as input SemFrag and a semantic representation, and produces as output the set of strings associated by the grammar with the input semantic representation. In , , we show how the realiser's input can be parameterised to enforce symbolic determinism (the realiser produces a single output obeying the symbolic constraints encoded in the enriched input). In , we show further how GenI can be used to detect over-generation in a grammar; in particular, the proposed method permits reducing over-generation by 70% in roughly 13 hours of linguist work. A detailed description of GenI and of its theoretical underpinnings is given in Eric Kow's PhD thesis .
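The realiser's core task, finding the strings a grammar associates with an input semantics, can be sketched as a search for lexical items whose combined semantics exactly covers the input. GenI itself works with flat semantics and TAG trees; the toy lexicon and literal names below are invented, and linearisation is ignored.

```python
# Toy lexicon mapping words to the semantic literals they contribute.
LEXICON = {
    'john':  frozenset({'name(j,john)'}),
    'mary':  frozenset({'name(m,mary)'}),
    'runs':  frozenset({'run(j)'}),
    'loves': frozenset({'love(j,m)'}),
}

def realise(input_semantics, lexicon=LEXICON):
    """Return the word sets whose combined semantics exactly cover the
    input; linearisation is left to the grammar proper."""
    goal = frozenset(input_semantics)
    results = []

    def search(chosen, covered, remaining):
        if covered == goal:
            results.append(sorted(chosen))
            return
        if not remaining:
            return
        word, rest = remaining[0], remaining[1:]
        sem = lexicon[word]
        if sem <= goal and not (sem & covered):   # contributes new, needed literals
            search(chosen + [word], covered | sem, rest)
        search(chosen, covered, rest)             # also try skipping the word

    search([], frozenset(), sorted(lexicon))
    return results
```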
The XMG grammar writing environment was developed by Denys Duchier, Yannick Parmentier and Joseph LeRoux in 2006 and is now used by a medium-sized international (France, Germany, Israel, USA) community. To assess the needs of the users and better plan further extensions, TALARIS and CALLIGRAMME organised a 2-day workshop in Nancy.
Work on inference has concentrated on textual entailment recognition for English. shows how description logic can be used to model lexical-semantics-based reasoning when assessing whether one sentence entails another, while presents a first-order semantics for adjectives which permits capturing the complex interplay of compositional, lexical and morpho-derivational semantics determining whether or not entailment holds. In addition, Patrick Blackburn and Sébastien Hinderer have worked on the automatic generation of models for Polish temporal expressions.
The Discourse, Dialogue and Pragmatics group in TALARIS focuses on two main themes:
The study of grounding, mutual understanding, and collaboration.
The study of presupposition and information accommodation from a planning-based perspective.
The bulk of the work in this area is conducted by Matthieu Quignard, Patrick Blackburn, Alexandre Denis, Luciana Benotti, Daniel Coulon and Carlos Areces. The group is making increasingly heavy use of the tools provided by the Computational Semantics group (notably the GenI generator, XMG-based grammars, and the SemConst semantic construction module), a trend that is likely to continue. In addition, the group is making increasing use of inference tools, which leads to links with the themes explored by the Logics for Natural Language and Knowledge Representation group. Indeed, this is as it should be. Linguists have long viewed pragmatics, the study of how natural languages are actually used, as providing insight into a level of meaning over and above (though mediated by) the meaning provided by semantics. And, crucially, this level of meaning involves inference. Thus, in a sense, this group provides a bridge between the semantically and logically oriented work of the other themes.
One of the most difficult problems in discourse is guaranteeing mutual understanding. Early AI approaches to dialog (that is, work from the 1970s and 1980s) tended to ignore the problem: such models assumed perfect understanding or had very crude models of repair.
From a theoretical perspective such approaches are clearly inadequate: dialog is essentially one long mutual effort at negotiating and checking understanding. Moreover, from a practical perspective such a model leads to inflexible dialogue systems.
One of the successes of the group this year was to propose a detailed computational model of the process of grounding, that is, the exchange of signals that takes place during the process of accepting/rejecting dialog information . A notable feature of this model is that it offers an approach to dealing with situations where the evidence of understanding has itself been misunderstood.
This model has now been implemented, and forms part of the ongoing work by Alexandre Denis and Matthieu Quignard on obtaining a detailed computational model of dialog suitable for implementing robust dialogue systems — one of the long term goals of the group.
A pervasive feature of the way we use natural language is the heavy use made of inference to smooth the process of communication. We don't have to spell everything out: we rely on the fact that the people we talk with have lots of knowledge and experience that lets them find their way to the correct interpretation.
For example, when giving people instructions, we typically don't give all the details: if we ask someone to make a salad, we typically don't tell them that they should wash the lettuce as a part of this process. We rely on the fact that people can successfully “fill in” such tacit actions. The study of such linguistic inferences belongs to the area known as pragmatics, and in particular, the study of presupposition and accommodation.
An explicit computational model of part of this process was given by Luciana Benotti . Taking as her starting point a text adventure game called FrOz that made use of Description Logic inference tools, she added planning capabilities to it (yielding FrOzA, the FrOz Advanced system). The use of planning techniques enables the game dialogue system to “fill in” the tacit actions required by the player's instructions, which results in far more natural, and linguistically plausible, interactions.
This work is currently being extended by Luciana Benotti and Patrick Blackburn. A reimplementation of this system, called Frolog, is nearing completion. This reimplementation hands all language processing, inference and planning tasks over to external modules, so that the ideas can be applied in more sophisticated settings.
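The way planning lets a system "fill in" tacit actions can be sketched with a tiny STRIPS-style domain: if the player says "take the apple" while the chest is still closed, the planner supplies the missing "open the chest" step. The domain, action names and depth bound below are invented for illustration; FrOzA and Frolog use Description Logic inference and a full planner.

```python
# STRIPS-style actions: name -> (preconditions, added facts, deleted facts).
ACTIONS = {
    'open-chest': ({'closed(chest)'}, {'open(chest)'}, {'closed(chest)'}),
    'take-apple': ({'open(chest)', 'in(apple,chest)'},
                   {'holding(apple)'}, {'in(apple,chest)'}),
}

def plan(state, goal, actions=ACTIONS, depth=4):
    """Depth-bounded forward search; returns a list of action names or None."""
    if goal <= state:
        return []
    if depth == 0:
        return None
    for name, (pre, add, delete) in sorted(actions.items()):
        if pre <= state:
            rest = plan((state - delete) | add, goal, actions, depth - 1)
            if rest is not None:
                return [name] + rest
    return None
```

Given the instruction's goal `holding(apple)` in a state where the chest is closed, the planner returns the two-step plan including the tacit opening action.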
The Logics for Natural Language and Knowledge Representation group focuses on two main points:
The theoretical study of hybrid logics (propositional, first-order, and higher-order) and the implementation of efficient proof methods for them.
Investigating other logics of relevance to natural language and knowledge representation, notably description logic, dedicated planning methods, and discourse representation theory (DRT).
The bulk of the work in this area is conducted by Carlos Areces, Patrick Blackburn, Dmitry Sustretov, Guillaume Hoffmann, Daniel Gorín and Sergio Mera. The inference methods studied by this group are relevant to the work of the Computational Semantics group and the Discourse, Dialogue and Pragmatics group, in particular the work of Luciana Benotti on presupposition and information accommodation (which makes heavy use of description logic and planning) and Paul Bedaride's work on textual entailment (which explores the use of description and hybrid logics).
During 2007, this topic received an important boost. On the one hand, HyLoRes, the resolution-based theorem prover for hybrid logics, has finally reached a very stable stage of development. Important work has been done this year on modularization, testing and performance improvement. A graphical interface has been developed, and model generation capabilities have been included. More importantly, we have started working on the two topics which we want to address during the next year: parallelization and moving to a client-server architecture. We discuss these two topics in more detail below.
Taking advantage of the modularized code of HyLoRes, the second important result of 2007 was the development of HTab, a tableaux-based prover for hybrid logics . HTab is an optimised tableaux prover for hybrid logics, using algorithms that ensure termination . It ultimately aims to cover a number of frame conditions (e.g., reflexivity, symmetry, antisymmetry, etc.), as far as it is possible while ensuring termination. Moreover, we are interested in providing a range of inference services beyond satisfiability checking. For example, the current version of HTab includes model generation (i.e., HTab can generate a model from a saturated open branch in the tableau). Even though other provers for similar languages exist, HTab has a number of particularities that make it a potentially useful tool. It has already outperformed HyLoTab.
After our experience in designing and developing HyLoRes and HTab, we are currently drawing the main lines of a new system that we call InToHyLo.
InToHyLo is actually an integrated collection of tools that work together to offer a varied spectrum of inference services for different hybrid logics. The system will be developed under the GPL licence and the source code will be available on-line. In addition to the core inference tools, we will also make available the testing environment used to test InToHyLo optimizations and heuristics, in order to encourage independent development.
The main inference task addressed by InToHyLo will be satisfiability checking, but the system will also offer more varied and complex services, like model generation, model checking and instance retrieval. Initially, InToHyLo will be created from the integration of HyLoRes and HTab, and this is what we discuss in detail below. But in the future we will consider the addition of other tools (like HyLoBan for topological semantics).
The core idea behind InToHyLo is to take advantage of the inherent duality between the resolution-based and the tableaux-based calculi: while the resolution method performs better on unsatisfiable formulas (during resolution we only need to derive the empty clause to detect that a formula is unsatisfiable), the tableau method performs better on satisfiable formulas (while constructing a tableau we only need to find a saturated open branch to detect that a formula is satisfiable).
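This duality suggests the simplest form of collaboration: submit the same problem to both provers and accept whichever verdict arrives first. A minimal sketch of such a racing client, with stub deciders standing in for HyLoRes and HTab (the function names are invented and the sleep times merely simulate the complementary performance profiles):

```python
import time
from concurrent.futures import ThreadPoolExecutor, FIRST_COMPLETED, wait

def resolution_stub(problem):
    # Pretend resolution derives the empty clause quickly on unsatisfiable input.
    time.sleep(0.01 if problem == 'unsat' else 0.5)
    return 'unsatisfiable' if problem == 'unsat' else 'satisfiable'

def tableau_stub(problem):
    # Pretend the tableau finds a saturated open branch quickly on satisfiable input.
    time.sleep(0.01 if problem == 'sat' else 0.5)
    return 'satisfiable' if problem == 'sat' else 'unsatisfiable'

def race(problem, provers):
    """Submit the same problem to all provers; the first verdict wins."""
    with ThreadPoolExecutor(max_workers=len(provers)) as pool:
        futures = {pool.submit(fn, problem): name for name, fn in provers.items()}
        done, _ = wait(futures, return_when=FIRST_COMPLETED)
        winner = next(iter(done))
        return futures[winner], winner.result()
```

On an unsatisfiable input the resolution stub answers first; on a satisfiable one, the tableau stub does.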
Our first step will be to transform HyLoRes and HTab into server applications, while HyLoRun will act as a client application which connects to the provers, submitting queries and displaying the results. HyLoRun will detect whether HyLoRes and/or HTab are running as servers and connect to them using either HTTP or TCP services. This architecture is the one used by the description logic prover RACER, and we believe that it has some important benefits:
To start with, the different components of InToHyLo (currently, the two provers and the front-end) can evolve independently without interfering with applications making use of them, as long as the communication protocols are maintained. In addition, new inference tools can be added as additional servers, and only the front-end will need to be modified to offer these additional services.
Secondly, and as we explain further below, we want to investigate ways in which the two provers can collaborate while working on a given problem. With this idea in mind, we want the provers to be able to exchange information (i.e., partial results) in a manner that is transparent to the user.
But the most important reason for choosing this architecture is that it lets us implement a notion of `proof state'. This idea is again a fundamental characteristic of DL provers like RACER: the user should be able to `load' a problem into the system, and then query it for answers. Perhaps many different queries will be posed to the prover about the same problem, and the prover can take advantage of previous results to answer future queries.
As an example, let us discuss points (2) and (3) above for the case of HyLoRes. With respect to point (2), given an input formula φ, during the computation of the saturated set of clauses corresponding to φ, HyLoRes can derive unit clauses which can be used by HTab to close branches. Similarly, formulas which are common to all branches of a tableau being constructed by HTab can be sent as unit clauses to be used by HyLoRes. This idea is intuitively appealing, but care must be taken in coordinating the way that new nominals (which are used both by HyLoRes and HTab to decompose modal formulas into simpler cases) are introduced.
With respect to point (3), consider a formula φ and suppose that we want to decide whether φ ∧ ψi is satisfiable for the formulas ψ1, ..., ψn. With the current version of HyLoRes, we can only check these formulas for satisfiability one by one, reporting unsatisfiability every time that the prover finds an inconsistency. In doing this, we are recomputing time and again the saturated set for φ. It would be much more efficient to first load the formula φ into HyLoRes as our current problem, and then send the queries ψi one by one. HyLoRes would then be able to compute the saturated set for φ only once, and simply expand it as necessary for each query ψi.
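The "load once, query many" idea can be sketched with a propositional Horn-clause stand-in: compute the consequences of a background theory a single time, then answer each query against the cached closure. HyLoRes's saturated clause set plays the role of the closure here; the code below is an illustrative analogy, not the actual HyLoRes protocol, and the theory is invented.

```python
class ProverSession:
    """A toy 'proof state': load a background theory once, query it many times."""

    def __init__(self, facts, rules):
        # rules: list of (premises, conclusion) pairs over atomic propositions
        self.closure = self._saturate(set(facts), rules)

    @staticmethod
    def _saturate(facts, rules):
        # Forward chaining to a fixed point -- the analogue of computing the
        # saturated clause set for the loaded problem once and for all.
        changed = True
        while changed:
            changed = False
            for premises, conclusion in rules:
                if set(premises) <= facts and conclusion not in facts:
                    facts.add(conclusion)
                    changed = True
        return facts

    def entails(self, query):
        # Each query is answered against the cached closure: no recomputation.
        return query in self.closure
```

The expensive saturation runs once in the constructor; each subsequent query is a cheap lookup.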
Higher-order logic is a classical formalism for natural language semantics. In previous years, we have investigated how the addition of hybrid operators to the classical framework of higher-order logic can improve language modelling .
This year we have taken up this issue again and we are currently working on a sound and complete axiomatization for higher-order hybrid logics. We expect to obtain a general completeness result (i.e., covering extensions of the basic axiomatization with pure formulas and existential saturation rules), as is the case for first-order hybrid logics. Such a result would be of interest when providing semantics for different natural language phenomena (e.g., time and aspect) which assume special conditions on their formal models.
Work in this domain is primarily carried out by Samuel Cruz-Lara, Nadia Bellalem, and Lotfi Bellalem. This is the most applied part of the TALARIS team and also the most independent. Their work centers on:
MLIF (the Multi Lingual Information Framework), a generic ISO-based mechanism for representing and dealing with multilingual textual information.
The W3C's SMIL (Synchronized Multimedia Integration Language), which allows an author to describe the temporal behaviour of a multi-media presentation.
This has been a transitional year for the group. The group was responsible for the PASSEPARTOUT project on interactive television, which came (very successfully) to an end on 31 May 2007. Since then the group has been involved in developing a number of research proposals, though at the time of writing it was unclear whether these had been successful. One of the most interesting of these is the BIONORM project, a regional project whose main objective is to develop and test a platform for manipulating, in a standard and interoperable way, the results of biological examinations published by medical biology laboratories.
In order to help the group achieve its goals, TALARIS recently expanded it: Lotfi Bellalem joined TALARIS on 30 September 2007. With the group thus increased in size, it is hoped that the coming years will see a closer integration of its work into the mainstream of TALARIS research. It is also possible that Samuel Cruz-Lara will request temporary attachment to INRIA to aid this process.
Description:On January 1st, 2005, Nancy became the fourth host of the TEI consortium (Text Encoding Initiative -
http://
Partner(s):ATILF, INIST
Theme:Ambient Intelligence
Description:Within the current trend of AI research on ambient intelligence, the European AMIGO project focuses on the design of a middleware architecture supporting optimal interoperability between devices and services for home care and family life. Amongst those services, a particular effort is planned on providing the most convenient ways for systems and human users to interact, based on use case scenarios (health and security; home information and entertainment; extended home activities such as working at home) and multimodal interfaces (voice, text messages, 2D and 3D gestures). The participation of TALARIS in this project is motivated by the design of an enhanced multimodal fusion module, which would extend the one designed in the former OZONE project (voice + 2D paths on a tablet PC) to also process 3D pointing gestures.
Although 2D and 3D devices provide more or less the same type of information (2D paths on a projection screen or display) and convey the same communicative intention (designation), the introduction of 3D gestures into our multimodal fusion will imply deep changes in our fusion algorithms. In the OZONE system, the moves of the pencil on the touch screen allowed users to select objects with very good accuracy. The low number of ambiguities enabled us to process the fusion in a quite restricted verbal context. The introduction of massive ambiguities at the level of selected objects will require a better structuring of the dialogue history, to eliminate those objects that are not salient in the current dialogue focus and thus should not be relevant for the fusion.
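The needed structuring of the dialogue history amounts to filtering gesture candidates by salience before fusion: a pointed-at object is only considered if it matches the verbal description and has been mentioned recently. The sketch below is hypothetical; the object names, turn structure and recency window are invented for illustration.

```python
def fuse(gesture_candidates, verbal_type, dialogue_history, recency_window=5):
    """Keep only the gesture candidates that match the verbal description
    and are salient, i.e. mentioned within the last `recency_window` turns."""
    salient = set()
    for turn in dialogue_history[-recency_window:]:
        salient.update(turn)
    return [obj for obj, obj_type in gesture_candidates
            if obj_type == verbal_type and obj in salient]
```

Shrinking the recency window discards candidates that have dropped out of the dialogue focus.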
Administrative context:IST European Program
Web site:
http://
Person(s) in charge:Harmke de Groot (Philips, Eindhoven)
Period:start 2004-09-01 / end 2008-02-28
Contact:Matthieu Quignard
Partner(s):Philips Research (Eindhoven)
Theme:Linguistic and multimedia resources
Description:The digitisation of society is constantly accelerating, and a key factor in this acceleration is software technology. This project focuses on the convergence of digital systems and applications in home media-centres, in compliance with the ITEA roadmap “The Road towards Convergence”, thus matching the vision of industry, institutional, SME and government partners. New technologies are expected to emerge from this project that will propel the European software industries towards convergence, over terminals and networks, towards the final goal of ambient intelligence.
The project aims at coupling home media-centres to home networks for rendering scalable content, from high-definition television (HDTV) down to lower definitions, in a seamless fashion. Integral to the content will be reactive access to, and interactivity of, high-resolution graphics using ISO and W3C standards for object-oriented TV. In line with the project's goal of making a step towards ambient intelligence through mass personalisation of reactive content (RAMPEG), the implementation shall use the most practical elements of MPEG-4 and MPEG-7 together with W3C standards such as SMIL, and related content synthesis and syndication in XML. The implications will stretch far beyond infrastructure and basic services, also affecting content, human-system interaction and engineering.
Implementation will be based on content access using a PVR media-centre as a server to new generations of access networks, including Blu-ray optical storage and WIMAX wireless technology. These networks will support the creation of home media-centres that move beyond current STBs and PVR-DVD players using MPEG-2 technology, to create a true mass-customisation device for family entertainment, with the goals of content packaging and personalisation matching the cultural and linguistic needs of the EU member states and their economies.
Administrative context:ITEA
Period:start 2005-01-01 / end 2007-03-31
Contact:Samuel Cruz-Lara
Partner(s):Cybercultus, Centre de Recherche Publique Henry Tudor, INT, Thomson, RTL, Philips, Telvent, Universidad Politecnica de Madrid, Universidad de Vigo, CharToon, Stoneroos, ETRI, VTT Electronics, V2, CWI, Technische Universiteit Eindhoven, Gradient
Theme:Lexicon: syntax and semantics
Description:The main aim of this project is to establish cooperation between several teams specialising in computational linguistics based on computational models of French. The accent is on the lexicon (both syntactic and semantic aspects are considered), but a global perspective which takes into account the interface between lexicon and grammar is emphasized.
Web site:
http://
Period:start 2005-01-01 / end 2007-12-31
Contact:Claire Gardent
Theme:Computational linguistics and resources for French
Description:The principal motivation of this ARC is to see a detailed and well-motivated computational grammar of French emerge that is based on a high-level formalism and is freely available to the computational linguistics community.
Web site:
http://
Period:start 2006-01-01 / end 2007-12-31
Contact:Claire Gardent
Theme:Computational linguistics and resources for French
Description:The PASSAGE project has two main aims. The first is to improve the robustness and precision of existing computational grammars for French, and to use them on large corpora (corpora containing several million words). The second is to exploit the resulting syntactic analyses to create richer linguistic resources (such as treebanks) for the French language.
Administrative context:ANR MDCA
Web site:
http://
Period:start 2007-01-01 / end 2009-12-31
Contact:Claire Gardent
Eric Kow defended his University of Nancy 1 (UHP) PhD thesis entitled Surface realisation: ambiguity and determinism, supervised by Claire Gardent, on 13 November 2007.
Yannick Parmentier defended his University of Nancy 1 (UHP) PhD thesis entitled SemTAG : une plate-forme pour le calcul sémantique à partir de Grammaires d'Arbres Adjoints, supervised by Claire Gardent, on 6 April 2007.
Carlos Areces
Member of the Management Board of the Association of Logic, Language and Information, 2005–2008.
Liaison officer for the Erasmus Mundus Masters in Language and Communication Technology.
Patrick Blackburn
Member of the INRIA Nancy-Grand Est steering committee.
Member of the Management Board of the Association of Logic, Language and Information, 2005–2008.
Liaison officer for the Erasmus Mundus Masters in Language and Communication Technology.
Samuel Cruz-Lara
Person in charge, at the national level, of the reception of Mexican students in the “Professional Licences of Computer Science”.
Christine Fay-Varnier
Vice president of the Council of studies and university life of the INPL.
Representative of the INPL for the steering committee TICE (Information and Communication Technology for Education) for Nancy University.
Claire Gardent
Member of the nominating committee of the European Chapter of the Association for Computational Linguistics (EACL).
Member of the FOLLI Editorial Board for the new series of books in Logic, Language and Information to be published by Springer-Verlag as Lecture Notes in Computer Science (LNCS) and/or Lecture Notes in Artificial Intelligence (LNAI).
Member of the ESSLLI Standing Committee.
Member of the LORIA steering committee.
Coordinator of the theme TALC (Computational Linguistics and Computational Approaches to Knowledge) for the CPER-MISN (National and Regional Research Funding).
Organiser of the LORIA seminar on Computational Linguistics.
Member of the recruiting committee for short term posts at INRIA Lorraine/LORIA.
Matthieu Quignard
Coordinator for TEI Nancy (Text Encoding Initiative, Nancy branch) concerning spoken corpus annotation normalisation.
Fabienne Venant
Member of the Administrative Council of ATALA, the French national organisation for computational linguistics (see
http://
Carlos Areces:
Editor of the Journal of Logic, Language, and Information, 2005 – Present.
Editor of Journal of Applied Logic, 2004 – Present.
Member of the Program Committee of the 2007 Eurolan Summer School (EUROLAN07), Iasi, Romania.
Co-chair of the Program Committee of the 5th Methods for Modalities Workshop (M4M5), Cachan, France.
Member of the Program Committee of the 2007 International Workshop on Description Logics (DL2007), Brixen-Bressanone, Italy.
Member of the Program Committee of the Noveno Simposio Argentino de Inteligencia Artificial (ASAI 2007), Mar del Plata, Argentina.
Member of the Program Committee of the 2007 Workshop on Hybrid Logics (HyLo 07), Dublin, Ireland.
Patrick Blackburn:
Chief Editor of the Journal of Logic, Language, and Information, 2002 – Present.
Editor of the Journal of Philosophical Logic, 2004 – Present.
Editor of the Notre Dame Journal of Formal Logic, 2005 – Present.
Editor of the Review of Symbolic Logic, from 2007.
Subject Editor (Logic and Language) for the Stanford Encyclopedia of Philosophy.
Foreign Correspondent of Logique et Analyse.
Claire Gardent
Member of the Program Committee for the 4th International Workshop on Constraint Solving and Language Processing, held together with CONTEXT'07 in Roskilde (Denmark), August 20-21, 2007.
Member of the Program Committee for the Lexis & Grammar Conference, Bonifacio (France), October 2-6, 2007.
Member of the Program Committee for the Joint Meeting of the Conference on Empirical Methods on Natural Language Processing (EMNLP) and the Conference on Natural Language Learning (CoNLL), Prague, June 28-30, 2007,
Member of the Program Committee for the TALN workshop "Formalismes syntaxiques de haut niveau" (high-level syntactic formalisms), Toulouse, June 2007.
Member of the Program Committee for Traitement Automatique des Langues Naturelles (TALN) 2007, Toulouse, June 12-15, 2007.
Member of the Program Committee for the 19th European Summer School in Logic, Language and Information (ESSLLI), Student Session - Language & Computation, Dublin (Ireland), 6-17 August, 2007.
Member of the Program Committee for the Journées Sémantique et Modélisation (JSM) 07, Paris, March 29-30 2007.
Member of the Program Committee for the 6th Discourse Anaphora and Anaphor Resolution Colloquium (DAARC'2007), Lagos (Portugal), March 29-30, 2007.
Member of the Program Committee of the 2007 Eurolan Summer School (EUROLAN07), Iasi, Romania.
Carlos Areces:
Invited lecture, ILCLI International Workshop on Logic and Philosophy of Knowledge, Communication and Action, Universidad del Pais Vasco, 28 November, 2007.
Patrick Blackburn:
Invited course, Linguistics Institute, Stanford University, 1–27 July, 2007.
Invited lecture, 16th Amsterdam Colloquium, 17 December 2007.
Claire Gardent
Invited Tutorial. Semantics in NLP. University of Buenos Aires (Argentina), November 2006. 15 hours, postgraduate.
Invited Tutorial. Tree Adjoining Grammar: Theory and Practice. LAICS Summer School (Language, Artificial Intelligence and Computer Science for Natural Language Processing applications), Bangkok (Thailand), October 2006. 4 hours.
Invited Tutorial. Natural Language Generation. LAICS Summer School (Language, Artificial Intelligence and Computer Science for Natural Language Processing applications), Bangkok (Thailand), October 2006. 3 hours.