EN FR
EN FR


Section: Partnerships and Cooperations

National Initiatives

ANR

Polymnie : Parsing and synthesis with abstract categorial grammars. From lexicon to discourse

Participants : Maxime Amblard, Philippe de Groote, Aleksandre Maskharashvili, Sylvain Pogodalla [coordinator] .

Polymnie (http://semagramme.loria.fr/doku.php?id=projects:polymnie) is a research project funded by the French national research agency (ANR) from September 2012 to February 2016. It relies on the grammatical framework of Abstract Categorial Grammars (ACG). A feature of this formalism is to provide the same mathematical perspective both on the surface forms and on the more abstract forms the latter correspond to. As a consequence:

  • ACG allows for the encoding of a large variety of grammatical formalisms such as context-free grammars, Tree Adjoining grammars (TAG), etc.

  • ACG defines two languages: an abstract language for the abstract forms, and an object language for the surface forms.

Importantly, the notions of object language and abstract language are relative to each other. If we can naturally see surface forms as strings for instance and abstract forms as the associated syntactic trees, we can also consider to associate this abstract form to a first order logical formula as surface (object) form. This property is central in our project as it offers a unified approach to text analysis and text generation, in particular considering the underlying algorithms and their complexity.

ACG definition uses type-theory and lambda-calculus. From this point of view, they smoothly integrate formal semantics models issuing from Montague's proposal. Theories that extend to the discourse level such as Discourse Representation Theory (DRT) and Dynamic Predicate Logic (DPL) were not initially formulated using lambda-calculus. But such formulations have been proposed. In particular, a formulation based on continuation semantics allows them to be expressed quite naturally in the ACG architecture. Dynamic effects of discourse, in particular those related to anaphora resolution or rhetorical relation inference, have then to be expressed by lexical semantics or computed from the syntactic rules as studied in the Inria Collaborative Research Project (ARC) CAuLD (https://members.loria.fr/SPogodalla/files/cauld).

It has been shown that the discourse structure of texts plays a key role in their understanding. This is the case for both human readers and automatic processing systems. For instance, it can enhance text transformation systems such as the ones performing automatic summarization.

Polymnie focuses on studying and implementing the modeling of sentences and discourses in a compositional paradigm that takes into account their dynamics and their structures, both in parsing and in generation. To that end, we rely on the ACG framework. The kind of processing we are interested in relates to the automatic construction of summaries or to text simplification. This has to be considered in the limits of the modeling of the linguistic processes (as opposed to inferential processes for instance) these tasks involve.

Partners:

  • Sémagramme people,

  • Alpage (Paris 7 university & Inria Paris-Rocquencourt): Laurence Danlos (local coordinator), C. Braud, C. Roze, Éric Villemonte de la Clergerie,

  • MELODI (IRIT, CNRS): Stergos Afantenos, Nicholas Asher (local coordinator), Juliette Conrath, Philippe Muller,

  • Signes (LaBRI, CNRS): Jérôme Kirman, Richard Moot, Christian Retoré (local coordinator), Sylvain Salvati, Noémie-Fleur Sandillon-Rezer.

The project has been presented during the journés du numérique de l'ANR [23]. A demonstration of the ACGtk software has been given during the TALN conference 2016 [22].

DGLFLF (Délégation générale à la langue française et aux langues de France)

ZombiLingo

Participants : Bruno Guillaume [coordinator] , Nicolas Lefebvre.

The goal of the ZombiLingo project is to develop an online GWAP (Game With A Purpose) to help the construction of linguistic resources. See 6.3.1 for more information.