TEXTE: Exploration and exploitation of textual data

We develop models and tools for the automatic syntactic and semantic analysis of natural language, and for building the resources this analysis requires.

Members

Permanent members

Non-permanent members

Research Topics

The TEXTE team develops methods, tools and resources for natural language processing, mainly of written language. This work focuses on syntax and on semantics, both logical and lexical. We mostly use symbolic methods, most often logic-based, hence our affiliation with the Artificial Intelligence department (pôle Intelligence artificielle). Although they are all interrelated, the activities of TEXTE can be grouped as follows:

  • Building and acquiring resources for natural language processing (lexicons, grammars).
  • Automatic analysis of the syntax and semantics of natural language.

This work calls for fundamental research, often unified by logic:

  • Constraint logic programming for model-driven syntax.
  • Syntactic and semantic analysis in type theory.
  • Inference rules in a lexical network.
  • Knowledge representation.

Other methods are also used: collaborative serious games, distributed graph algorithms (ant colonies), linear algebra (word vectors), and statistics (noise removal, part-of-speech tagging).

Publications since 2013 - 2019 Evaluation

International journal articles

2018

  1. Combining logical and distributional methods in type-logical grammars
    Richard Moot
    Journal of Language Modelling, Institute of Computer Science, Polish Academy of Sciences, Poland, In press.

2017

  1. An outline of type-theoretical approaches to lexical semantics
    Robin Cooper, Christian Retoré
    Journal of Language Modelling, Institute of Computer Science, Polish Academy of Sciences, Poland, 2017, 5 (2), pp.165-178.
  2. From logical and linguistic generics to Hilbert’s tau and epsilon quantifiers
    Stergios Chatzikyriakidis, Fabio Pasquali, Christian Retoré
    IfColog Journal of Logics and their Applications (FLAP), College Publications, 2017, Hilbert’s epsilon and tau in Logic, Informatics and Linguistics 4 (2), pp.231-255.

2016

  1. Conditions d’assertion de "chaque" et de "tout" et règles de déduction du quantificateur universel
    Alda Mari, Christian Retoré
    Travaux de Linguistique : Revue Internationale de Linguistique Française, De Boeck Université, 2016, 72, pp.89-106.
  2. Quantification in Ordinary Language and Proof Theory
    Michele Abrusci, Fabio Pasquali, Christian Retoré
    Philosophia Scientiae, Paris; Editions Kime; [2014], 2016, pp.185-205.

2015

  1. Recognition of logical units in log files
    Hassan Saneifar, Stéphane Bonniol, Pascal Poncelet, Mathieu Roche
    Intelligent Data Analysis, IOS Press, 2015, 19 (2), pp.431-448.
  2. Software understanding: Automatic classification of software identifiers
    Pattaraporn Warintarawej, Anne Laurent, Marianne Huchard, Mathieu Lafourcade, Pierre Pompidor
    Intelligent Data Analysis, IOS Press, 2015, 19 (4), pp.761-778.

2014

  1. From Logical to Distributional Models
    Anne Preller
    Electronic Proceedings in Theoretical Computer Science, EPTCS, 2014, 171, pp.113-131.
  2. Deverbal semantics and the Montagovian generative lexicon ΛTyn
    Livy-Maria Real-Coelho, Christian Retoré
    Journal of Logic, Language and Information, Springer Verlag, 2014, 23 (3), pp.347-366.
  3. Partially Commutative Linear Logic and Lambek Calculus with Product: Natural Deduction, Normalisation, Subformula Property
    Maxime Amblard, Christian Retoré
    IfColog Journal of Logics and their Applications (FLAP), College Publications, 2014, 1 (1), pp.53-94.
  4. A natural framework for natural language semantics: many sorted logic and Hilbert operators in type theory
    Christian Retoré
    Bulletin of Symbolic Logic, Association for Symbolic Logic, 2014, 20 (2), pp.241-241.
  5. Category theory, logic and formal linguistics: some connections, old and new
    Jean Gillibert, Christian Retoré
    Journal of Applied Logic, Elsevier, 2014, 12 (1), pp.1-13.
  6. Natural Language Semantics in Biproduct Dagger Categories
    Anne Preller
    Journal of Applied Logic, Elsevier, 2014, 12, pp.88-108.
  7. How can catchy titles be generated without loss of informativeness?
    Cédric Lopez, Violaine Prince, Mathieu Roche
    Expert Systems with Applications, Elsevier, 2014, 41 (4), pp.1051-1062.
  8. How to Combine Text-Mining Methods to Validate Induced Verb-Object Relations?
    Nicolas Béchet, Jacques Chauché, Violaine Prince, Mathieu Roche
    Computer Science and Information Systems, ComSIS Consortium, 2014, 11 (1), pp.133-155.
  9. Are opinions expressed in land-use planning documents?
    Eric Kergosien, Bernard Laval, Mathieu Roche, Maguelonne Teisseire
    International Journal of Geographical Information Science, Taylor & Francis, 2014, 28 (4), pp.739-762.

2013

  1. Can Mammographic Assessments Lead to Consider Density as a Risk Factor for Breast Cancer?
    Catherine Colin, Violaine Prince, Pierre-Jean Valette
    European Journal of Radiology, Elsevier, 2013, 82, pp.404-411.
  2. Sud4science, de l'acquisition d'un grand corpus de SMS en français à l'analyse de l'écriture SMS
    Rachel Panckhurst, Catherine Détrie, Cédric Lopez, Claudine Moïse, Mathieu Roche, Bertrand Verine
    Episteme, Cambridge University Press (CUP), 2013, Communication électronique et écritures numériques, pp.107-138.

International conference papers

2018

  1. Cheap, Fast and Good! Voting Games with a Purpose
    Karën Fort, Mathieu Lafourcade, Nathalie Le Brun
    Games4NLP: Games and Gamification for Natural Language Processing, May 2018, Miyazaki, Japan.

2017

  1. An Empirical Study for a Machine Aided Translation of French Prepositions 'à', 'de' and 'en' into English
    Violaine Prince
    8th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, Nov 2017, Poznan, Poland.
  2. Ontolex JeuxDeMots and Its Alignment to the Linguistic Linked Open Data Cloud
    Andon Tchechmedjiev, Théophile Mandon, Mathieu Lafourcade, Anne Laurent, Konstantin Todorov
    ISWC: International Semantic Web Conference, Oct 2017, Vienna, Austria. 16th International Semantic Web Conference, LNCS (10587), pp.678-693, 2017.
  3. Ambiguss, a game for building a Sense Annotated Corpus for French
    Mathieu Lafourcade, Nathalie Le Brun
    IWCS: International Conference on Computational Semantics, Sep 2017, Montpellier, France. 12th International Conference on Computational Semantics, 2017. <https://www.lirmm.fr/iwcs2017/>
  4. Explicative Path Finding in a Semantic Network
    Kévin Cousot, Mathieu Lafourcade
    IWCS: International Conference on Computational Semantics, Sep 2017, Montpellier, France. 12th International Conference on Computational Semantics, 2017. <https://www.lirmm.fr/iwcs2017/>
  5. Identifying Polysemous Words and Inferring Sense Glosses in a Semantic Network
    Maxime Chapuis, Mathieu Lafourcade
    IWCS: International Conference on Computational Semantics, Sep 2017, Montpellier, France. 12th International Conference on Computational Semantics, 2017. <https://www.lirmm.fr/iwcs2017/>
  6. If mice were reptiles, then reptiles could be mammals or How to detect errors in the JeuxDeMots lexical network?
    Mathieu Lafourcade, Alain Joubert, Nathalie Le Brun
    RANLP: Recent Advances in Natural Language Processing, Sep 2017, Varna, Bulgaria. International Conference on Recent Advances in Natural Language Processing, 2017. <http://lml.bas.bg/ranlp2017/>
  7. Towards the Automatic Detection of Nutritional Incompatibilities Based on Recipe Titles
    Nadia Clairet, Mathieu Lafourcade
    Andreas Holzinger; Peter Kieseberg; A Min Tjoa; Edgar Weippl. 1st International Cross-Domain Conference for Machine Learning and Knowledge Extraction (CD-MAKE), Aug 2017, Reggio, Italy. Springer International Publishing, Lecture Notes in Computer Science, LNCS-10410, pp.346-366, 2017, Machine Learning and Knowledge Extraction.
  8. Parcourir, reconnaître et réfléchir. Combinaison de méthodes légères pour l'extraction de relations sémantiques.
    Mathieu Lafourcade, Nathalie Le Brun
    TALN: Traitement Automatique des Langues Naturelles, Jun 2017, Orléans, France. 24th French Conference on Natural Language Processing, 2017. <http://taln2017.cnrs.fr>

2016

  1. Compilation de grammaire de propriétés pour l'analyse syntaxique par optimisation de contraintes
    Jean-Philippe Prost, Remi Coletta, Christophe Lecoutre
    TALN: Traitement Automatique des Langues Naturelles, Jul 2016, Paris, France. 23ème Conférence sur le Traitement Automatique des Langues Naturelles, 2016. <https://jep-taln2016.limsi.fr/>
  2. Patrons sémantiques pour l'extraction de relations entre termes - Application aux comptes rendus radiologiques
    Lionel Ramadier, Mathieu Lafourcade
    TALN 2016, Jul 2016, Paris, France. Actes de la conférence conjointe JEP-TALN-RECITAL 2016, jep-taln2016.
  3. Construire un lexique de sentiments par crowdsourcing et propagation
    Mathieu Lafourcade, Nathalie Le Brun, Alain Joubert
    TALN: Traitement Automatique des Langues Naturelles, Jul 2016, Paris, France. 5ème édition conjointe de la conférence JEP-TALN-RECITAL 23e conférence sur le Traitement Automatique des Langues Naturelles (TALN), 2016. <https://jep-taln2016.limsi.fr/actes/index.php?lang=fr>
  4. Mixing Crowdsourcing and Graph Propagation to Build a Sentiment Lexicon
    Mathieu Lafourcade, Nathalie Le Brun, Alain Joubert
    Feelings are contagious. NLDB: Natural Language to Information Systems, Jun 2016, Manchester, United Kingdom. 21st International Conference on Applications of Natural Language to Information Systems, LNCS (9612), pp.258-266, 2016.
  5. Découverte des patrons de connaissance grâce à la modélisation sémantique des phrases d'instructions
    Nadia Clairet, Sylvie Despres, Mathieu Lafourcade
    TOTh, Jun 2016, Chambéry, France. 10ème édition de la Conférence TOTh, 2016, « Tournant linguistique et renouveau conceptuel ». <http://porphyre.org/toth/toth-2016>
  6. Using Constraints on a general Knowledge lexical networK for domain-specific semantic relation extraction and modeling
    Nadia Bebeshina-Clairet, Lionel Ramadier, Mathieu Lafourcade
    Dialogue 2016, Jun 2016, Moscow, Russia. 22nd International Conference on Computational Linguistics and Intellectual Technologies, 15 (22), 2016, Computational Linguistics and Intellectual Technologies. <http://www.dialog-21.ru/en/dialogue2016/results/>
  7. Semantic Relation Extraction with Semantic Patterns: Experiment on Radiology Report
    Mathieu Lafourcade, Lionel Ramadier
    LREC 2016 Conference on Language Resources and Evaluation, May 2016, Portorož, Slovenia. 10th, LREC 2016 Proceedings.

2015

  1. "Chaque vin a sa lie." versus "Toute nuit a un jour." - does the difference in the human processing of "chaque" and "tout" match the difference between the proof rules for conjunction and quantification?
    Alda Mari, Christian Retoré
    (In)coherence of Discourse, Dec 2015, Nancy, France. (In)coherence of Discourse 3, 2015. <http://discours.loria.fr/2015/>
  2. Are Books Events? Ontological Inclusions as Coercive Sub-Typing, Lexical Transfers as Entailment
    Bruno Mery, Christian Retoré
    Eric McReady. LENLS12: Logic and Engineering of Natural Language Semantics 12, Nov 2015, Tokyo, Japan. ISBN 978-4-915905-68-1, pp.74-87.
  3. A Case Study of Copredication over a Deverbal that Reconciles Empirical Data with Computational Semantics
    Livy Real, Christian Retoré
    Eric McReady. LENLS12: Logic and Engineering of Natural Language Semantics 12, Nov 2015, Tokyo, Japan. ISBN: 978-4-915905-68-1, 2015.
  4. Medical Imaging Report Indexing: Enrichment of Index through an Algorithm of Spreading over a Lexico-semantic Network
    Mathieu Lafourcade, Lionel Ramadier
    RANLP: Recent Advances in Natural Language Processing, Sep 2015, Hissar, Bulgaria. 2015.
  5. Typed Hilbert Operators for the Lexical Semantics of Singular and Plural Determiner Phrases
    Bruno Mery, Christian Retoré
    Epsilon: Hilbert’s Epsilon and Tau in Logic, Informatics and Linguistics, Aug 2015, Montpellier, France. 2015. <https://sites.google.com/site/epsilon2015workshop/>
  6. Type Theories and Lexical Networks: Using Serious Games as the Basis for Multi-Sorted Typed Systems
    Stergios Chatzikyriakidis, Mathieu Lafourcade, Lionel Ramadier, Manel Zarrouk
    ESSLLI: European Summer School in Logic, Language and Information, Aug 2015, Barcelona, Spain. 2015.
  7. Vous aimez ?...ou pas ? LikeIt, un jeu pour construire une ressource lexicale de polarité.
    Mathieu Lafourcade, Nathalie Le Brun, Alain Joubert
    TALN: Traitement Automatique des Langues Naturelles, Jun 2015, Caen, France. 22e conférence sur le Traitement Automatique des Langues Naturelles, 2015. <http://www.atala.org/taln_archives/TALN/TALN-2015/>
  8. Quantifier scope: a formal and experimental study
    Arthur Capelier-Mourguy, Philippe Blache, Christian Retoré, Laurent Prevot
    CJC-SC: Colloque des Jeunes Chercheurs en Sciences Cognitives, Jun 2015, Compiègne, France. 2015. <http://cjcsc.sciencesconf.org>

2014

  1. Computing the Semantics of Plurals and Massive Entities Using Many-Sorted Types
    Bruno Mery, Christian Retoré
    Koji Mineshima. LENLS: Logic and Engineering of Natural Language Semantics, Nov 2014, Kanagawa, Japan. Keio University Press, JSAI-isAI 2014 Workshops, LENLS, JURISIN, and GABA, Kanagawa, Japan, October 27-28, 2014, Revised Selected Papers The Eleventh International Workshop of Logic and Engineering of Natural Language Semantics 11 (LENLS11), LNCS (9067), pp.144-159, 2015, New Frontiers in Artificial Intelligence.
  2. From NL Preference Expressions to Comparative Preference Statements: A Preliminary Study in Eliciting Preferences for Customised Decision Support.
    Souhila Kaci, Namrata Patel, Violaine Prince
    ICTAI: International Conference on Tools with Artificial Intelligence, Nov 2014, Limassol, Cyprus. 26th International Conference on Tools with Artificial Intelligence, pp.591-598, 2014.
  3. Mining Tweet Data - Statistic and semantic information for political tweet classification
    Guillaume Tisserant, Violaine Prince, Mathieu Roche
    KDIR: Knowledge Discovery and Information Retrieval, Oct 2014, Rome, Italy. KDIR'14: International Conference on Knowledge Discovery and Information Retrieval, pp.523-529, 2014, Text-Mining Session.
  4. Typed Hilbert Epsilon Operators and the Semantics of Determiner Phrases (Invited Lecture)
    Christian Retoré
    Glyn Morrill; Frank Richter; Rainer Osswald; Reinhard Muskens. FG: Formal Grammar, Aug 2014, Tübingen, Germany. Springer, 19th Conference on Formal Grammar, held August 16-17, 2014, in conjunction with the 26th European Summer School in Logic, Language and Information (ESSLLI 2014), LNCS (8612), pp.15-33, 2014.
  5. Jugement exact de grammaticalité d'arbre syntaxique probable
    Jean-Philippe Prost
    TALN: Traitement Automatique des Langues Naturelles, Jul 2014, Marseille, France. Actes de la 21ème conférence sur le Traitement Automatique des Langues Naturelles (TALN'2014), 2014. <http://www.atala.org/taln_archives/TALN/TALN-2014/>
  6. Les couleurs des gens
    Mathieu Lafourcade, Nathalie Le Brun, Virginie Zampa
    TALN: Traitement Automatique des Langues Naturelles, Jul 2014, Marseille, France. 21ème conférence sur le Traitement Automatique des Langues Naturelles, 2014. <https://www.atala.org/TALN-RECITAL-2014-21eme-conference>
  7. Crowdsourcing Word-Color Associations
    Mathieu Lafourcade, Nathalie Le Brun, Virginie Zampa
    Métais E.; Roche M.; Teisseire M. NLDB: Natural Language in the Database and Information Systems, Jun 2014, Montpellier, France. Springer, Cham, 19th International Conference on Applications of Natural Language to Information Systems, LNCS (8455), pp.39-44, 2014.
  8. Propa-L: a Semantic Filtering Service from a Lexical Network Created using Games With A Purpose
    Mathieu Lafourcade, Karën Fort
    International Conference on Language Resources and Evaluation (LREC), May 2014, Reykjavik, Iceland. 2014.
  9. From Natural Language to RDF Graphs with Pregroups
    Antonin Delpeuch, Anne Preller
    EACL'2014: 14th Conference of the European Chapter of the Association for Computational Linguistics, Apr 2014, Gothenburg, Sweden. EACL, pp.55-62, 2014.
  10. Spreading Relation Annotations in a Lexical Semantic Network Applied to Radiology
    Lionel Ramadier, Manel Zarrouk, Mathieu Lafourcade, Antoine Micheau
    CICLing: Computational Linguistics and Intelligent Text Processing, Apr 2014, Kathmandu, Nepal. 15th International Conference, CICLing 2014, Kathmandu, Nepal, April 6-12, 2014, Proceedings, Part I, LNCS (8403), pp.40-51, 2014, Computational Linguistics and Intelligent Text Processing.
  11. Vectorisation paramétrée des données textuelles
    Célia Da Costa Pereira, Mathieu Lafourcade, Patrick Lloret, Cédric Lopez, Mathieu Roche
    EGC: Extraction et Gestion des Connaissances, Jan 2014, Rennes, France. 14èmes Journées Internationales Francophones sur l’Extraction et la Gestion des Connaissances, RNTI-E-26, pp.593-596, 2014.

2013

  1. How to extract unit of measure in scientific documents?
    Soumia Lilia Berrahou, Patrice Buche, Juliette Dibie, Mathieu Roche
    KDIR: Knowledge Discovery and Information Retrieval, Sep 2013, Vilamoura, Portugal. Springer, 5th International Conference on Knowledge Discovery and Information Retrieval, pp.454-459, 2013.
  2. From Functional to Distributional Models
    Anne Preller
    Quantum Physics and Logic 2013, Jul 2013, Barcelona, Spain. pp.17, 2013.
  3. GenDesc: A Partial Generalization of Linguistic Features For Text Classification
    Guillaume Tisserant, Violaine Prince, Mathieu Roche
    NLDB'2013: International Conference on Applications of Natural Language to Information Systems, Jun 2013, United Kingdom. pp.6, 2013.
  4. Text2Geo: from textual data to geospatial information
    Sabiha Tahrat, Eric Kergosien, Sandra Bringay, Mathieu Roche, Maguelonne Teisseire
    WIMS: Web Intelligence, Mining and Semantics, Jun 2013, Madrid, Spain. 13th International Conference on Web Intelligence, Mining and Semantics, 2013. <http://aida.ii.uam.es/wims13/>
  5. Inference and Reconciliation in a Crowdsourced Lexical-Semantic Network
    Manel Zarrouk, Mathieu Lafourcade, Alain Joubert
    CICLING: International Conference on Intelligent Text Processing and Computational Linguistics, Mar 2013, Samos, Greece. 14th International Conference on Intelligent Text Processing and Computational Linguistics March 24–30, 2013. University of the Aegean, Samos, Greece, 2013. <http://www2.lirmm.fr/~mzarrouk/publications/CICLING2013.pdf>
  6. Approaches of anonymisation of an SMS corpus
    Namrata Patel, Pierre Accorsi, Diana Inkpen, Cédric Lopez, Mathieu Roche
    CICLing: Conference on Intelligent Text Processing and Computational Linguistics, Mar 2013, Samos, Greece. Springer-Verlag, 14th International Conference on Intelligent Text Processing and Computational Linguistics, LNCS (7816), pp.77-88, 2013.

Keywords

Syntax, lexical semantics, compositional semantics. Construction of linguistic resources. Automatic analysis.

Last updated on 27/06/2018