ZENITH: Gestion de données scientifiques

Zenith s’attaque aux défis posés par la gestion (stockage, partage, traitement, recherche analyse) des données massives (big data, données scientifiques). Ces défis (correspondant aux trois big V : Volume, Velocity, Variety) peuvent se résumer ainsi:

1. très grande échelle (big data, big analytics) ;

2. données en continu (produits par des capteurs, des appareils mobiles, …) ;

3. hétérogénéité et complexité des données (différences sémantiques, données incertaines ou multi-échelles, …).

Notre objectif est d’apporter des solutions innovantes, en démontrant des avantages en termes de passage à l’échelle, fonctionnalité, facilité d’usage et performance, dans des environnements distribués et parallèles (P2P, grid, cloud).

Nous cherchons à produire des résultats fondamentaux et algorithmiques, que nous pouvons implémenter dans des environnements spécifiques, par ex. Grid5K. Pour valider nos solutions, nous collaborons avec des partenaires scientifiques (INRA, CIRAD, IRD, etc.) et industriels (Data Publica, Bull, EDF, Orange, Microsoft, MonetDB, Sparsity, etc.).

Membres

Permanents

Non permanents

Collaborateurs réguliers

  • Hervé Goëau
  • Michel Riveill
  • Christophe Pradal

Thématiques de recherche

Le projet Zenith est organisé en trois thèmes complémentaires :

1. Gestion de données et métadonnées : gestion et intégration de données et métadonnées (schémas, ontologies) à grande échelle, en particulier, stockage de big data, résolution d’entités incertaines et traitement de requêtes probabilistes.

2. Partage de données et processus : gestion des données et processus scientifiques dans des environnements distribués et parallèles, avec partage de données en P2P, recommandation dans les communautés en ligne et support des workflows scientifiques.

3. Analyse de données : fouille de données et recherche de données par contenu en exploitant le parallélisme du cloud et les nouvelles technologies NoSQL et MapReduce.

Ces trois thèmes reflètent le continuum qui va de la capture des données, en passant par leur intégration, gestion et partage, jusqu’à leur analyse, afin de produire informations et connaissances.

Publications majeures depuis 2008

R. Akbarinia, P. Valduriez, G. Verger, Efficient Evaluation of SUM Queries Over Probabilistic Data. IEEE Transactions on Knowledge and Data Engineering, Data. Vol. 25, No. 4, 764-775, 2013.

M. El Dick, E. Pacitti, R. Akbarinia, B. Kemme, Building a Peer-to-Peer Content Distribution Network with High Performance, Scalability and Robustness, Information Systems, Vol. 36, No 2, p. 222-247, 2011.

P. Letessier, O. Buisson, A. Joly, N. Boujemaa, Scalable Mining of Small Visual Objects, ACM Multimedia Conf.,  2012.

E. Ogasawara, D. De Oliveira, P. Valduriez, J. Dias, F. Porto, M. Mattoso, An Algebraic Approach for Data-Centric Scientific Workflows, Proceedings of VLDB, Vol. 4, No 11, p. 1328-1339, 2011. 

F. Petitjean, F. Masseglia, P. Gançarski, G. Forestier, Discovering Significant Evolution Patterns from Satelllite Image Time Series, International Journal of Neural Systems, Vol. 21, No 6, 475-489, 2011.

Publications depuis 2014 - Evaluation 2019

Articles de revues internationales

2019

  1. CountNet: Estimating the Number of Concurrent Speakers Using Supervised Learning
    Fabian-Robert Stöter, Soumitro Chakrabarty, Bernd Edler, Emanuël Habets
    IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2019, 27 (2), pp.268-282.
  2. Musical Source Separation: An Introduction
    Estefania Cano, Derry Fitzgerald, Antoine Liutkus, Mark Plumbley, Fabian-Robert Stöter
    IEEE Signal Processing Magazine, Institute of Electrical and Electronics Engineers, 2019, 36 (1), pp.31-40.
  3. Parallel Computation of PDFs on Big Spatial Data Using Spark
    Ji Liu, Noel Moreno Lemus, Esther Pacitti, Fábio Porto, Patrick Valduriez
    Distributed and Parallel Databases, Springer, In press, pp.1-38.

2018

  1. ParCorr: efficient parallel methods to identify similar time series pairs across sliding windows
    Djamel-Edine Yagoubi, Reza Akbarinia, Boyan Kolev, Oleksandra Levchenko, Florent Masseglia, Patrick Valduriez, Dennis Shasha
    Data Mining and Knowledge Discovery, Springer, 2018, 32 (5), pp.1481-1507.
  2. Genetic and environmental dissection of biomass accumulation in multi-genotype maize canopies
    Tsu-Wei Chen, Llorenç Cabrera-Bosquet, Santiago Alvarez Prado, Raphael Perez, Simon Artzet, Christophe Pradal, Aude Coupel-Ledru, Christian Fournier, Francois Tardieu
    Journal of Experimental Botany, Oxford University Press (OUP), 2018. <10.1093/jxb/ery309>
  3. DfAnalyzer: Runtime Dataflow Analysis of Scientific Applications using Provenance
    Vítor Silva, Daniel De Oliveira, Patrick Valduriez, Marta Mattoso
    Proceedings of the VLDB Endowment (PVLDB), VLDB Endowment, 2018, 11 (12), pp.2082-2085.
  4. Non-parametric Bayesian annotator combination
    Maximilien Servajean, Romain Chailan, Alexis Joly
    Information Sciences, Elsevier, 2018, 436-437, pp.131-145.
  5. Species distribution modeling based on the automated identification of citizen observations
    Christophe Botella, Alexis Joly, Pierre Bonnet, Pascal Monestiez, François Munoz
    Applications in Plant Sciences, Wiley, 2018, Green Digitization: Online Botanical Collections Data Answering Real‐World Questions, 6 (2), pp.1-11.
  6. AutoWIG: automatic generation of python bindings for C++ libraries
    Pierre Fernique, Christophe Pradal
    PeerJ Computer Science, PeerJ, 2018, 4. <10.7717/peerj-cs.149>
  7. Distributed Management of Scientific Workflows for High-Throughput Plant Phenotyping
    Christophe Pradal, Sarah Cohen-Boulakia, Gaetan Heidsieck, Esther Pacitti, Francois Tardieu, Patrick Valduriez
    ERCIM News, ERCIM, 2018, Smart Farming, pp.36-37.
  8. In situ visualization and data analysis for turbidity currents simulation
    José Camata, Vitor Silva, Patrick Valduriez, Marta Mattoso, Alvaro Coutinho
    Computers & Geosciences, Elsevier, 2018, 110, pp.23-31.
  9. An Overview of Lead and Accompaniment Separation in Music
    Zafar Rafii, Antoine Liutkus, Fabian-Robert Stöter, Stylianos Ioannis Mimilakis, Derry Fitzgerald, Bryan Pardo
    IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2018. <10.1109/TASLP.2018.2825440>
  10. Data reduction in scientific workflows using provenance monitoring and user steering
    Renan Souza, Vitor Silva, Alvaro L.G.A. Coutinho, Patrick Valduriez, Marta Mattoso
    Future Generation Computer Systems, Elsevier, In press, pp.1-21.
  11. Efficient Scheduling of Scientific Workflows using Hot Metadata in a Multisite Cloud
    Ji Liu, Luis Pineda, Esther Pacitti, Alexandru Costan, Patrick Valduriez, Gabriel Antoniu, Marta Mattoso
    IEEE Transactions on Knowledge and Data Engineering, Institute of Electrical and Electronics Engineers, In press, pp.1-20.
  12. A Survey of Scheduling Frameworks in Big Data Systems
    Ji Liu, Esther Pacitti, Patrick Valduriez
    International Journal of Cloud Computing, Inderscience Publishers, 2018, 7 (2), pp.103-128.

2017

  1. Going deeper in the automated identification of Herbarium specimens
    Jose Carranza-Rojas, Herve Goeau, Pierre Bonnet, Erick Mata-Montero, Alexis Joly
    BMC Evolutionary Biology, BioMed Central, 2017, 17 (1), pp.181.
  2. Crowdsourcing Thousands of Specialized Labels: A Bayesian Active Training Approach
    Maximilien Servajean, Alexis Joly, Dennis Shasha, Julien Champ, Esther Pacitti
    IEEE Transactions on Multimedia, Institute of Electrical and Electronics Engineers, 2017, 19 (6), pp.1376-1391.
  3. InfraPhenoGrid: A scientific workflow infrastructure for Plant Phenomics on the Grid
    Christophe Pradal, Simon Artzet, Jerome Chopard, Dimitri Dupuis, Christian Fournier, Michael Mielewczik, Vincent Negre, Pascal Neveu, Didier Parigot, Patrick Valduriez, Sarah Cohen-Boulakia
    Future Generation Computer Systems, Elsevier, 2017, 67, pp.341-353.
  4. A Highly Scalable Parallel Algorithm for Maximally Informative k-Itemset Mining
    Saber Salah, Reza Akbarinia, Florent Masseglia
    Knowledge and Information Systems (KAIS), Springer, 2017, 50 (1), pp.1-26.
  5. Data placement in massively distributed environments for fast parallel mining of frequent itemsets
    Saber Salah, Reza Akbarinia, Florent Masseglia
    Knowledge and Information Systems (KAIS), Springer, 2017, 53 (1), pp.207-237.
  6. Scientific Workflow Scheduling with Provenance Data in a Multisite Cloud
    Ji Liu, Esther Pacitti, Patrick Valduriez, Marta Mattoso
    Transactions on Large-Scale Data- and Knowledge-Centered Systems, Springer Berlin / Heidelberg, 2017, 33, pp.80-112.
  7. Scientific workflows for computational reproducibility in the life sciences: Status, challenges and opportunities
    Sarah Cohen-Boulakia, Khalid Belhajjame, Olivier Collin, Jérôme Chopard, Christine Froidevaux, Alban Gaignard, Konrad Hinsen, Pierre Larmande, Yvan Le Bras, Frédéric Lemoine, Fabien Mareuil, Hervé Ménager, Christophe Pradal, Christophe Blanchet
    Future Generation Computer Systems, Elsevier, 2017. <10.1016/j.future.2017.01.012>
  8. Raw data queries during data-intensive parallel workflow execution
    Vítor Silva, José Leite, José Camata, Daniel De Oliveira, Alvaro Coutinho, Patrick Valduriez, Marta Mattoso
    Future Generation Computer Systems, Elsevier, 2017, 75, pp.402-422.

2016

  1. CloudMdsQL: Querying Heterogeneous Cloud Data Stores with a Common Language
    Boyan Kolev, Patrick Valduriez, Carlyna Bondiombouy, Ricardo Jiménez-Peris, Raquel Pau, José Pereira
    Distributed and Parallel Databases, Springer, 2016, 34 (4), pp.463-503.
  2. AgroLD API. Une architecture orientée services pour l'extraction de connaissances dans la base de données liées AgroLD
    Gildas Tagny Ngompe, Aravind Venkatesan, Nordine El Hassouni, Manuel Ruiz, Pierre Larmande
    Revue des Sciences et Technologies de l'Information - Série ISI : Ingénierie des Systèmes d'Information, Lavoisier, 2016, 21 (5-6), pp.133-158.
  3. Categorizing plant images at the variety level: Did you say fine-grained?
    Julien Champ, Titouan Lorieul, Pierre Bonnet, Najate Maghnaoui, Christophe Sereno, Thierry Dessup, Jean-Michel Boursiquot, Laurent Audeguin, Thierry Lacombe, Alexis Joly
    Pattern Recognition Letters, Elsevier, 2016, 81, pp.71-79.
  4. Database System Support of Simulation Data
    Hermano Lustosa, Fabio Porto, Pablo Blanco, Patrick Valduriez
    Proceedings of the VLDB Endowment (PVLDB), VLDB Endowment, 2016, 9 (13), pp.1329-1340.
  5. Categorizing plant images at the variety level: Did you say fine-grained?
    Julien Champ, Titouan Lorieul, Pierre Bonnet, Najate Maghnaoui, Christophe Sereno, Thierry Dessup, Jean-Michel Boursiquot, Laurent Audeguin, Thierry Lacombe, Alexis Joly
    Pattern Recognition Letters, Elsevier, 2016, In press. <10.1016/j.patrec.2016.05.022>
  6. Gigwa—Genotype investigator for genome- wide analyses
    Guilhem Sempéré, Florian Philippe, Alexis Dereeper, Manuel Ruiz, Gautier Sarah, Pierre Larmande
    GigaScience, BioMed Central, 2016. <10.1186/s13742-016-0131-8>
  7. Social Networks and Information Retrieval, How Are They Converging? A Survey, a Taxonomy and an Analysis of Social Information Retrieval Approaches and Platforms
    Mohamed Reda Bouadjenek, Hakim Hacid, Mokrane Bouzeghoub
    Information Systems, Elsevier, 2016, 56, pp.1-18.
  8. Multistore Big Data Integration with CloudMdsQL
    Carlyna Bondiombouy, Boyan Kolev, Oleksandra Levchenko, Patrick Valduriez
    Transactions on Large-Scale Data- and Knowledge-Centered Systems, Springer Berlin / Heidelberg, 2016, 28, pp.48-74.
  9. FP-Hadoop: Efficient Processing of Skewed MapReduce Jobs
    Miguel Liroz-Gistau, Reza Akbarinia, Divyakant Agrawal, Patrick Valduriez
    Information Systems, Elsevier, 2016, 60, pp.69-84.
  10. Analyzing Related Raw Data Files through Dataflows
    Vitor Silva, Daniel De Oliveira, Patrick Valduriez, Marta Mattoso
    Concurrency and Computation: Practice and Experience, Wiley, 2016, 28 (8), pp.2528-2545.
  11. Multi-Objective Scheduling of Scientific Workflows in Multisite Clouds
    Ji Liu, Esther Pacitti, Patrick Valduriez, Daniel De Oliveira, Marta Mattoso
    Future Generation Computer Systems, Elsevier, 2016, 63, pp.76-95.
  12. Effective and Efficient Similarity Search in Scientific Workflow Repositories
    Johannes Starlinger, Sarah Cohen-Boulakia, Sanjeev Khanna, Susan Davidson, Ulf Leser
    Future Generation Computer Systems, Elsevier, 2016, 56, pp.584-594.
  13. Guest Editorial: Environmental Multimedia Retrieval
    Stefanos Vrochidis, Kostas D. Karatzas, Ari Karppinen, Alexis Joly
    Multimedia Tools and Applications, Springer Verlag, 2016, 75 (3), pp.1557--1562.
  14. Plant identification: Man vs. Machine
    Pierre Bonnet, Alexis Joly, Hervé Goëau, Julien Champ, Christel Vignau, Jean-François Molino, Daniel Barthélémy, Nozha Boujemaa
    Multimedia Tools and Applications, Springer Verlag, 2016, LifeCLEF 2014 plant identification challenge, 75 (3), pp.1647-1665.
  15. A look inside the Pl@ntNet experience
    Alexis Joly, Pierre Bonnet, Hervé Goëau, Julien Barbe, Souheil Selmi, Julien Champ, Samuel Dufour-Kowalski, Antoine Affouard, Jennifer Carré, Jean-François Molino, Nozha Boujemaa, Daniel Barthélémy
    Multimedia Systems, Springer Verlag, 2016, 22 (6), pp.751-766.
  16. Query processing in multistore systems: an overview
    Carlyna Bondiombouy, Patrick Valduriez
    International Journal of Cloud Computing, Inderscience Publishers, 2016, pp.38.

2015

  1. Rank aggregation with ties: Experiments and Analysis
    Bryan Brancotte, Bo Yang, Guillaume Blin, Sarah Cohen-Boulakia, Alain Denise, Sylvie Hamel
    Proceedings of the VLDB Endowment (PVLDB), VLDB Endowment, 2015, pp.2051.
  2. Increasing Coverage in Distributed Search and Recommendation with Profile Diversity
    Maximilien Servajean, Esther Pacitti, Miguel Liroz-Gistau, Sihem Amer-Yahia, Amr El Abbadi
    Transactions on Large-Scale Data- and Knowledge-Centered Systems, Springer Berlin / Heidelberg, 2015, LNCS (9430), pp.115-144.
  3. Profile Diversity for Query Processing using User Recommendations
    Maximilien Servajean, Reza Akbarinia, Esther Pacitti, Sihem Amer-Yahia
    Information Systems, Elsevier, 2015, Information Systems, 48, pp.44-63.
  4. Data-Centric Iteration in Dynamic Workflows
    Jonas Dias, Gabriel Guerra, Fernando Rochinha, Alvaro Coutinho, Patrick Valduriez, Marta Mattoso
    Future Generation Computer Systems, Elsevier, 2015, 46, pp.114-126.
  5. A Survey of Data-Intensive Scientific Workflow Management
    Ji Liu, Esther Pacitti, Patrick Valduriez, Marta Mattoso
    Journal of Grid Computing, Springer Verlag, 2015, 13, 44 p. <10.1007/s10723-015-9329-8>
  6. FP-Hadoop: Efficient Execution of Parallel Jobs Over Skewed Data
    Miguel Liroz-Gistau, Reza Akbarinia, Patrick Valduriez
    Proceedings of the VLDB Endowment (PVLDB), VLDB Endowment, 2015, 8 (12), pp.1856-1867.

2014

  1. Autonomic Intrusion Detection: Adaptively Detecting Anomalies over Unlabeled Audit Data Streams in Computer Networks
    Wei Wang, Thomas Guyet, René Quiniou, Marie-Odile Cordier, Florent Masseglia, Xiangliang Zhang
    Knowledge-Based Systems, Elsevier, 2014.
  2. Special section on data-intensive cloud infrastructure
    Ashraf Aboulnaga, Beng Chin Ooi, Patrick Valduriez
    The VLDB Journal, Springer, 2014, pp.1.
  3. The anti-bouncing data stream model for web usage streams with intralinkings
    Chongsheng Zhang, Florent Masseglia, Yves Lechevallier
    Information Sciences, Elsevier, 2014, 278, pp.757-772.
  4. Similarity Search for Scientific Workflows
    Johannes Starlinger, Bryan Brancotte, Sarah Cohen-Boulakia, Ulf Leser
    Proceedings of the VLDB Endowment (PVLDB), VLDB Endowment, 2014, 7 (12), pp.1143-1154.
  5. Evaluation of Direct Manipulation using Finger Tracking for Complex Tasks in an Immersive Cube
    Emmanuelle Chapoulie, Maud Marchal, Evanthia Dimara, Maria Roussou, Jean-Christophe Lombardo, George Drettakis
    Virtual Reality, Springer Verlag, 2014, pp.15.
  6. Query Reformulation in PDMS Based on Social Relevance
    Angela Bonifati, Gianvito Summa, Esther Pacitti, Fady Draidi
    Transactions on Large-Scale Data- and Knowledge-Centered Systems, Springer Berlin / Heidelberg, 2014, Transactions on Large-Scale Data- and Knowledge-Centered Systems XIII, LNCS, pp.59-90.
  7. Dynamic Workload-Based Partitioning Algorithms for Continuously Growing Databases
    Miguel Liroz-Gistau, Reza Akbarinia, Esther Pacitti, Fabio Porto, Patrick Valduriez
    Transactions on Large-Scale Data- and Knowledge-Centered Systems, Springer Berlin / Heidelberg, 2014, pp.105.
  8. Entity Resolution for Probabilistic Data
    Ayat Naser, Reza Akbarinia, Hamideh Afsarmanesh, Patrick Valduriez
    Information Sciences, Elsevier, 2014, 277, pp.492-511.
  9. Interactive plant identification based on social image data
    Alexis Joly, Hervé Goëau, Pierre Bonnet, Vera Bakić, Julien Barbe, Souheil Selmi, Itheri Yahiaoui, Jennifer Carré, Elise Mouysset, Jean-François Molino, Nozha Boujemaa, Daniel Barthélémy
    Ecological Informatics, Elsevier, 2014, 23, pp.22-34.
  10. Object-based visual query suggestion
    Amel Hamzaoui, Pierre Letessier, Alexis Joly, Olivier Buisson, Nozha Boujemaa
    Multimedia Tools and Applications, Springer Verlag, 2014, Multimedia Tools and Applications, 68 (2), pp.429-454.

Communications internationales

2019

  1. Speech enhancement with variational autoencoders and alpha-stable distributions
    Simon Leglaive, Umut Simsekli, Antoine Liutkus, Laurent Girin, Radu Horaud
    ICASSP 2019 - IEEE International Conference on Acoustics Speech and Signal Processing, May 2019, Brighton, United Kingdom. IEEE, pp.1-5, 2019.
  2. Dirichlet Process Mixture Models made Scalable and Effective by means of Massive Distribution
    Khadidja Meguelati, Bénédicte Fontez, Nadine Hilgert, Florent Masseglia
    SAC: Symposium on Applied Computing, Apr 2019, Limassol, Cyprus. 34th ACM/SIGAPP Symposium On Applied Computing, 2019. <10.1145/3297280.3297327>

2018

  1. Parallel Polyglot Query Processing on Heterogeneous Cloud Data Stores with LeanXcale
    Boyan Kolev, Oleksandra Levchenko, Esther Pacitti, Patrick Valduriez, Ricardo Vilaça, Rui Gonçalves, Ricardo Jiménez-Peris, Pavlos Kranas
    IEEE BigData, Dec 2018, Seattle, United States. IEEE, IEEE International Conference on Big Data, pp.10, 2018.
  2. Spark-parSketch: A Massively Distributed Indexing of Time Series Datasets
    Oleksandra Levchenko, Djamel-Edine Yagoubi, Reza Akbarinia, Florent Masseglia, Boyan Kolev, Dennis Shasha
    CIKM: Conference on Information and Knowledge Management, Oct 2018, Turin, Italy. 27th ACM International Conference on Information and Knowledge Management, pp.1951-1954, 2018.
  3. SiSEC 2018: State of the art in musical audio source separation - subjective selection of the best algorithm
    Dominic Ward, Russel D. Mason, Chungeun Kim, Fabian-Robert Stöter, Antoine Liutkus, Mark Plumbley
    WIMP: Workshop on Intelligent Music Production, Sep 2018, Huddersfield, United Kingdom. 4th Workshop on Intelligent Music Production, 2018. <http://epubs.surrey.ac.uk/id/eprint/849086>
  4. Overview of LifeCLEF 2018: A Large-Scale Evaluation of Species Identification and Recommendation Algorithms in the Era of AI
    Alexis Joly, Hervé Goëau, Christophe Botella, Hervé Glotin, Pierre Bonnet, Willem-Pier Vellinga, Robert Planqué, Henning Müller
    CLEF: Cross-Language Evaluation Forum, Sep 2018, Avignon, France. 9th International Conference of the Cross-Language Evaluation Forum for European Languages, LNCS (11018), pp.247-266, 2018, Experimental IR Meets Multilinguality, Multimodality, and Interaction.
  5. Overview of ExpertLifeCLEF 2018: how far automated identification systems are from the best experts?
    Hervé Goëau, Pierre Bonnet, Alexis Joly
    CLEF: Conference and Labs of the Evaluation Forum, Sep 2018, Avignon, France. 9th International Conference of the CLEF Association, 2018.
  6. Location-based species recommendation using co-occurrences and environment-GeoLifeCLEF 2018 challenge
    Benjamin Deneu, Maximilien Servajean, Christophe Botella, Alexis Joly
    CLEF: Conference and Labs of the Evaluation Forum, Sep 2018, Avignon, France. Working Notes of CLEF - Conference and Labs of the Evaluation Forum, CEUR Workshop Proceedings (2125), 2018.
  7. How Can You Mend a Broken Inconsistent KBs in Existential Rules Using Argumentation
    Bruno Yun
    SSA: Summer School on Argumentation, Sep 2018, Varsovie, Poland. 3rd Summer School on Argumentation: Computational and Linguistic Perspectives, 2018. <http://ssa2018.argdiap.pl/>
  8. Discovering Tight Space-Time Sequences
    Riccardo Campisano, Heraldo Borges, Fábio Porto, Fabio Perosi, Esther Pacitti, Florent Masseglia, Eduardo Ogasawara
    DaWaK: Data Warehousing and Knowledge Discovery, Sep 2018, Regensburg, Germany. 20th International Conference on Big Data Analytics and Knowledge Discovery, LNCS (11031), pp.247-257, 2018.
  9. Answering Top-k Queries over Outsourced Sensitive Data in the Cloud
    Sakina Mahboubi, Reza Akbarinia, Patrick Valduriez
    DEXA: Database and Expert Systems Applications, Sep 2018, Regensburg, Germany. 29th International Conference on Database and Expert Systems Applications, LNCS (11029), pp.218-231, 2018.
  10. Privacy-Preserving Top-k Query Processing in Distributed Systems
    Sakina Mahboubi, Reza Akbarinia, Patrick Valduriez
    Euro-Par: European Conference on Parallel and Distributed Computing, Aug 2018, Turin, Italy. 24th International European Conference on Parallel and Distributed Computing, pp.281-292, 2018.
  11. Computation of PDFs on Big Spatial Data: Problem & Architecture
    Ji Liu, Noel Lemus, Esther Pacitti, Fábio Porto, Patrick Valduriez
    LADaS: Latin America Data Science Workshop, Aug 2018, Rio de Janeiro, Brazil. CEUR-WS.org, Proceedings of the Latin America Data Science Workshop co-located with 44th International Conference on Very Large Data Bases (VLDB), 2170, pp.6, 2018.
  12. Scientific Data Analysis Using Data-Intensive Scalable Computing: the SciDISC Project
    Patrick Valduriez, Marta Mattoso, Reza Akbarinia, Heraldo Borges, José Camata, Alvaro Coutinho, Daniel Gaspar, Noel Lemus, Ji Liu, Hermano Lustosa, Florent Masseglia, Fabricio Nogueira da Silva, Vitor Silva, Renan Souza, Kary Ocaña, Eduardo Ogasawara, Daniel Oliveira, Esther Pacitti, Fábio Porto, Dennis Shasha
    LADaS: Latin America Data Science Workshop, Aug 2018, Rio de Janeiro, Brazil. CEUR-WS.org, Proceedings of the Latin America Data Science Workshop co-located with 44th International Conference on Very Large Data Bases (VLDB), CEUR Workshop Proceedings (2170), 2018. <http://ceur-ws.org/Vol-2170>
  13. F ReeP: towards parameter recommendation in scientific workflows using preference learning
    Daniel Silva, Aline Paes, Esther Pacitti, Daniel De Oliveira
    SBBD: Simpósio Brasileiro de Banco de Dados, Aug 2018, Rio de Janeiro, Brazil. SBC, 33rd Annual Brazilian Symposium on Databases, 2018. <http://sbbd.org.br/2018>
  14. Detecçao de Anomalias Frequentes no Transporte Rodoviario Urbano
    Ana Cruz, João Ferreira, Diego Carvalho, Eduardo Mendes, Esther Pacitti, Rafaelli Coutinho, Fábio Porto, Eduardo Ogasawara
    SBBD: Simpósio Brasileiro de Banco de Dados, Aug 2018, Rio de Janeiro, Brazil. SBC, 33rd Annual Brazilian Symposium on Databases, pp.271-276, 2018.
  15. Constellation Queries over Big Data
    Fábio Porto, Amir Khatibi, Joao Rittmeyer, Eduardo Ogasawara, Patrick Valduriez, Dennis Shasha
    SBBD: Simpósio Brasileiro de Banco de Dados, Aug 2018, Rio de Janeiro, Brazil. SBC, 33rd Annual Brazilian Symposium on Databases, pp.85-96, 2018.
  16. Rumo à Integração da Álgebra de Workflows com o Processamento de Consulta Relacional
    João Ferreira, Jorge Soares, Fábio Porto, Esther Pacitti, Rafaelli Coutinho, Eduardo Ogasawara
    SBBD: Simpósio Brasileiro de Banco de Dados, Aug 2018, Rio de Janeiro, Brazil. SBC, 33rd Annual Brazilian Symposium on Databases, pp.205-210, 2018.
  17. Point Pattern Search in Big Data
    Fabio Porto, Joao Rittmeyer, Eduardo Ogasawara, Alberto Krone-Martins, Patrick Valduriez, Dennis Shasha
    SSDBM: Scientific and Statistical Database Management, Jul 2018, Bozen-Bolzano, Italy. ACM, 30th International Conference on Scientific and Statistical Database Management, 2018. <10.1145/3221269.3221294>
  18. The 2018 Signal Separation Evaluation Campaign
    Fabian-Robert Stöter, Antoine Liutkus, Nobutaka Ito
    LVA ICA: Latent Variable Analysis and Signal Separation, Jul 2018, Surrey, United Kingdom. 14th International Conference on Latent Variable Analysis and Signal Separation, 2018. <http://cvssp.org/events/lva-ica-2018/>
  19. Multichannel Audio Modeling with Elliptically Stable Tensor Decomposition
    Mathieu Fontaine, Fabian-Robert Stöter, Antoine Liutkus, Umut Simsekli, Romain Serizel, Roland Badeau
    LVA ICA 2018 - 14th International Conference on Latent Variable Analysis and Signal Separation, Jul 2018, Surrey, United Kingdom. 2018.
  20. A Distributed Collaborative Filtering Algorithm Using Multiple Data Sources
    Mohamed Bouadjenek, Esther Pacitti, Maximilien Servajean, Florent Masseglia, Amr Abbadi
    DBKDA: Advances in Databases, Knowledge, and Data Applications, May 2018, Nice, France. 10th International Conference on Advances in Databases, Knowledge, and Data Applications, 2018. <https://www.iaria.org/conferences2018/DBKDA18.html>
  21. A Differentially Private Index for Range Query Processing in Clouds
    Cetin Sahin, Tristan Allard, Reza Akbarinia, Amr Abbadi, Esther Pacitti
    ICDE: International Conference on Data Engineering, Apr 2018, Paris, France. 34th IEEE International Conference on Data Engineering, pp.857-868, 2018.
  22. Interference reduction on full-length live recordings
    Diego Di Carlo, Antoine Liutkus, Ken Déguernel
    ICASSP 2018 - IEEE International Conference on Acoustics, Speech, and Signal Processing, Apr 2018, Calgary, Canada. IEEE, pp.736-740.
  23. Blind Source Separation Using Mixtures of Alpha-Stable Distributions
    Nicolas Keriven, Antoine Deleforge, Antoine Liutkus
    ICASSP 2018 - IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2018, Calgary, Canada. IEEE, pp.771-775.
  24. Audio source separation with magnitude priors: the BEADS model
    Antoine Liutkus, Christian Rohlfing, Antoine Deleforge
    ICASSP 2018 – IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2018, Calgary, Canada. 43th IEEE International Conference on Acoustics, Speech and Signal Processing, pp.1-5, 2018, Signal Processing and Artificial Intelligence: Changing the World.
  25. Alpha-stable low-rank plus residual decomposition for speech enhancement
    Umut Simsekli, Halil Erdogan, Simon Leglaive, Antoine Liutkus, Roland Badeau, Gaël Richard
    ICASSP 2018 - IEEE International Conference on Acoustics, Speech, and Signal Processing, Apr 2018, Calgary, Canada. 2018.
  26. Maximally Informative k-Itemset Mining from Massively Distributed Data Streams
    Mehdi Zitouni, Reza Akbarinia, Sadok Ben Yahia, Florent Masseglia
    SAC: Symposium on Applied Computing, Apr 2018, Pau, France. 33rd ACM/SIGAPP Symposium On Applied Computing, pp.1-10, 2018.

2017

  1. DPiSAX: Massively Distributed Partitioned iSAX
    Djamel-Edine Yagoubi, Reza Akbarinia, Florent Masseglia, Themis Palpanas
    ICDM: International Conference on Data Mining, Nov 2017, New Orleans, United States. IEEE International Conference on Data Mining, pp.1-6, 2017.
  2. Massively Distributed Environments and Closed Itemset Mining: The DCIM Approach
    Mehdi Zitouni, Reza Akbarinia, Sadok Ben Yahia, Florent Masseglia
    BDA: Gestion de Données — Principes, Technologies et Applications, Nov 2017, Nancy, France. 33ème Conférence sur la Gestion de Données — Principes, Technologies et Applications, 4, pp.1-15, 2017.
  3. Efficient Scheduling of Scientific Workflows using Hot Metadata in a Multisite Cloud
    Ji Liu, Luis Pineda-Morales, Esther Pacitti, Alexandru Costan, Patrick Valduriez, Gabriel Antoniu, Marta Mattoso
    BDA: Gestion de Données — Principes, Technologies et Applications, Nov 2017, Nancy, France. 33ème conférence sur la Gestion de Données — Principes, Technologies et Applications, pp.13, 2017.
  4. End-to-end Graph Mapper
    Benjamin Billet, Mickaël Jurret, Didier Parigot, Patrick Valduriez
    BDA: Gestion de Données — Principes, Technologies et Applications, Nov 2017, Nancy, France. 33ème Conférence sur la Gestion de Données — Principes, Technologies et Applications, 2017. <https://project.inria.fr/bda2017/>
  5. Querying Key-Value Stores Under Simple Semantic Constraints : Rewriting and Parallelization
    Olivier Rodriguez, Corentin Colomier, Cecilie Rivière, Reza Akbarinia, Federico Ulliana
    BDA: Gestion de Données — Principes, Technologies et Applications, Nov 2017, Nancy, France. 33ème Conférence sur la Gestion de Données — Principes, Technologies et Applications, 2017. <https://project.inria.fr/bda2017/>
  6. Tracking of Online Parameter Fine-tuning in Scientific Workflows
    Renan Souza, Vitor Silva, José Camata, Alvaro Coutinho, Patrick Valduriez, Marta Mattoso
    WORKS: Workflows in Support of Large-scale Science, Nov 2017, Denver, United States. 12th Workflows in Support of Large-Scale Science (WORKS) Workshop in conjunction with ACM/IEEE Supercomputing, 2017.
  7. TARS: An Array Model with Rich Semantics for Multidimensional Data
    Hermano Lustosa, Noel Lemus, Fabio Porto, Patrick Valduriez
    ER FORUM 2017: Conceptual Modeling : Research In Progress, Nov 2017, Valencia, Spain. 2017.
  8. Pl@ntNet -My Business
    Alexis Joly, Pierre Bonnet, Antoine Affouard, Jean-Christophe Lombardo, Hervé Goëau
    MM: Multimedia, Oct 2017, Mountain View, United States. 25th ACM International Conference on Multimedia, pp.1-11, 2017.
  9. RadiusSketch: Massively Distributed Indexing of Time Series
    Djamel-Edine Yagoubi, Reza Akbarinia, Florent Masseglia, Dennis Shasha
    DSAA: Data Science and Advanced Analytics, Oct 2017, Tokyo, Japan. IEEE International Conference on Data Science and Advanced Analytics, pp.1-10, 2017.
  10. Spark Scalability Analysis in a Scientific Workflow
    Renan Souza, Vitor Silva, Pedro Miranda, Alexandre Lima, Patrick Valduriez, Marta Mattoso
    SBBD: Simpósio Brasileiro de Banco de Dados, Oct 2017, Uberlandia, Brazil. 32th Brazilian Symposium on Databases, pp.1-6, 2017.
  11. Automated Herbarium Specimen Identification using Deep Learning
    Jose Carranza-Rojas, Alexis Joly, Pierre Bonnet, Hervé Goëau, Erick Mata-Montero
    TDWG: Biodiversity Information Standards, Oct 2017, Ottawa, Canada. Annual Conference on Biodiversity Information Standards, 2017, Data Integration in a Big Data Universe: Associating Occurrences with Genes, Phenotypes, and Environments. <10.3897/tdwgproceedings.1.20302>
  12. Plant identification based on noisy web data: the amazing performance of deep learning (LifeCLEF 2017)
    Herve Goeau, Pierre Bonnet, Alexis Joly
    CLEF 2017 - Conference and Labs of the Evaluation Forum, Sep 2017, Dublin, Ireland. pp.1-13, 2017.
  13. LifeCLEF Bird Identification Task 2017
    Herve Goeau, Hervé Glotin, Willem-Pier Vellinga, Robert Planqué, Alexis Joly
    CLEF 2017 - Conference and Labs of the Evaluation Forum, Sep 2017, Dublin, Ireland. pp.1-9.
  14. LifeCLEF 2017 Lab Overview: Multimedia Species Identification Challenges
    Alexis Joly, Hervé Goëau, Hervé Glotin, Concetto Spampinato, Pierre Bonnet, Willem-Pier Vellinga, Jean-Christophe Lombardo, Robert Planque, Simone Palazzo, Henning Müller
    Gareth J.F. Jones; Séamus Lawless; Julio Gonzalo; Liadh Kelly; Lorraine Goeuriot; Thomas Mandl; Linda Cappellato; Nicola Ferro. CLEF: Cross-Language Evaluation Forum for European Languages, Sep 2017, Dublin, Ireland. Springer, 8th International Conference of the Cross-Language Evaluation Forum for European Language, LNCS (10456), pp.255-274, 2017, Experimental IR Meets Multilinguality, Multimodality, and Interaction.
  15. TARDIS: Optimal Execution of Scientific Workflows in Apache Spark
    Daniel Gaspar, Fabio Porto, Reza Akbarinia, Esther Pacitti
    DaWaK: Data Warehousing and Knowledge Discovery, Aug 2017, Lyon, France. 19th International Conference on Big Data Analytics and Knowledge Discovery, LNCS (10440), pp.74-87, 2017.
  16. Pre-processing and Indexing techniques for Constellation Queries in Big Data
    Amir Khatibi, Fabio Porto, Joao Rittmeyer, Eduardo Ogasawara, Patrick Valduriez, Dennis Shasha
    DaWaK: Data Warehousing and Knowledge Discovery, Aug 2017, Lyon, France. Springer, 19th International Conference on Big Data Analytics and Knowledge Discovery, LNCS (10440), pp.164-172, 2017, Big Data Analytics and Knowledge Discovery.
  17. Going deeper in the automated identification of Herbarium specimens
    Pierre Bonnet, Alexis Joly, Hervé Goëau, Jean-Christophe Lombardo, Antoine Affouard, Sen Wang, Rémi Knaff, Jean-François Molino, Daniel Barthélémy
    Botany 2017 - Botanical Crossroads, Jun 2017, Forth Worth, Texas, United States. 2017.
  18. Massively Distributed Environments and Closed Itemset Mining: The DCIM Approach
    Mehdi Zitouni, Reza Akbarinia, Sadok Ben Yahia, Florent Masseglia
    CAiSE: Advanced Information Systems Engineering, Jun 2017, Essen, Germany. 29th International Conference on Advanced Information Systems Engineering, LNCS (10253), pp.231-246, 2017.
  19. Pl@ntNet app in the era of deep learning
    Antoine Affouard, Hervé Goëau, Pierre Bonnet, Jean-Christophe Lombardo, Alexis Joly
    nnet, Jean-Christophe Lombardo, Alexis Joly. Pl@ntNet app in the era of deep learning. ICLR: International Conference on Learning Representations, Apr 2017, Toulon, France. 5th International Conference on Learning Representations, pp.1-6, 2017.

2016

  1. Benchmarking Polystores: the CloudMdsQL Experience
    Boyan Kolev, Raquel Pau, Oleksandra Levchenko, Patrick Valduriez, Ricardo Jiménez-Peris, José Pereira
    Vijay Gadepally. International Conference on Big Data, Dec 2016, Washington, DC, United States. IEEE Computing Society, IEEE BigData 2016: Workshop on Methods to Manage Heterogeneous Big Data and Polystore Databases, 2017. <10.1109/BigData.2016.7840899>
  2. Managing Hot Metadata for Scientific Workflows on Multisite Clouds
    Luis Pineda-Morales, Ji Liu, Alexandru Costan, Esther Pacitti, Gabriel Antoniu, Patrick Valduriez, Marta Mattoso
    IEEE BigData, Dec 2016, Washington, United States. IEEE International Conference on Big Data, 2016. <10.1109/BigData.2016.7840628>
  3. Privacy Preserving Query Processing in the Cloud
    Sakina Mahboubi, Reza Akbarinia, Patrick Valduriez
    BDA: Gestion de Données — Principes, Technologies et Applications, Nov 2016, Poitiers, France. 32ème Conférence sur la Gestion de Données - Principes, Technologies et Applications, 2016. <https://bda2016.ensma.fr>
  4. Demonstration of the CloudMdsQL Multistore System
    Boyan Kolev, Carlyna Bondiombouy, Patrick Valduriez, Ricardo Jiménez-Peris, Raquel Spain, José Pereira
    BDA: Gestion de Données — Principes, Technologies et Applications, Nov 2016, Poitiers, France. 32ème Conférence sur la Gestion de Données - Principes, Technologies et Applications, 2016. <https://bda2016.ensma.fr/>
  5. Extending CloudMdsQL with MFR for Big Data Integration
    Carlyna Bondiombouy, Boyan Kolev, Patrick Valduriez, Oleksandra Levchenko
    BDA: Gestion de Données — Principes, Technologies et Applications, Nov 2016, Poitiers, France. 32ème Conférence sur la Gestion de Données - Principes, Technologies et Applications, 2016. <https://bda2016.ensma.fr>
  6. Scientific Workflow Execution with Multiple Objectives in Multisite Clouds
    Ji Liu, Esther Pacitti, Patrick Valduriez, Daniel De Oliveira, Marta Mattoso
    BDA: Gestion de Données — Principes, Technologies et Applications, Nov 2016, Poitiers, France. 32ème Conférence sur la Gestion de Données - Principes, Technologies et Applications, 2016, Principes, Technologies et Applications. <https://bda2016.ensma.fr>
  7. Online Input Data Reduction in Scientific Workflows
    Renan Souza, Vítor Silva, Alvaro Coutinho, Patrick Valduriez, Marta Mattoso
    ACM SIGHPC; IEEE. WORKS: Workflows in Support of Large-scale Science, Nov 2016, Salt Lake City, United States. 11th Workshop on Workflows in Support of Large-scale Science, in conjunction with SC2016, 2016. <http://works.cs.cardiff.ac.uk>
  8. ThePlantGame: Actively Training Human Annotators for Domain-specific Crowdsourcing
    Maximilien Servajean, Alexis Joly, Dennis Shasha, Julien Champ, Esther Pacitti
    MM: Multimedia, Oct 2016, Amsterdam, Netherlands. 24th ACM International Conference on Multimedia, 2016. <http://www.acmmm.org/2016/>
  9. Crowdsourcing Biodiversity Monitoring: How Sharing your Photo Stream can Sustain our Planet
    Alexis Joly, Hervé Goëau, Julien Champ, Samuel Dufour-Kowalski, Henning Müller, Pierre Bonnet
    MM: Multimedia, Oct 2016, Amsterdam, Netherlands. 24th ACM International Conference on Multimedia, 2016. <http://www.acmmm.org/2016/>
  10. Unsupervised Individual Whales Identification: Spot the Difference in the Ocean
    Alexis Joly, Jean-Christophe Lombardo, Julien Champ, Anjara Saloma
    Working Notes of CLEF 2016 - Conference and Labs of the Evaluation forum, Sep 2016, Evora, Portugal. pp.469--480, 2016.
  11. Floristic participation at LifeCLEF 2016 Plant Identification Task
    Julien Champ, Hervé Goëau, Alexis Joly
    CLEF 2016 - Conference and Labs of the Evaluation forum, Sep 2016, Évora, Portugal. Working Notes of CLEF 2016 - Conference and Labs of the Evaluation forum, pp.450--458, 2016.
  12. Plant Identification in an Open-world (LifeCLEF 2016)
    Hervé Goëau, Pierre Bonnet, Alexis Joly
    CLEF 2016 - Conference and Labs of the Evaluation forum, Sep 2016, Évora, Portugal. Working Notes of CLEF 2016 - Conference and Labs of the Evaluation forum, pp.428--439, 2016.
  13. LifeCLEF Bird Identification Task 2016: The arrival of Deep learning
    Hervé Goëau, Hervé Glotin, Willem-Pier Vellinga, Robert Planqué, Alexis Joly
    Working Notes of CLEF 2016 - Conference and Labs of the Evaluation forum, Sep 2016, Evora, Portugal. pp.440--449, 2016.
  14. LifeCLEF 2016: Multimedia Life Species Identification Challenges
    Alexis Joly, Hervé Goëau, Hervé Glotin, Concetto Spampinato, Pierre Bonnet, Willem-Pier Vellinga, Julien Champ, Robert Planqué, Simone Palazzo, Henning Müller
    Norbert Fuhr; Paulo Quaresma; Teresa Gonçalves ; Birger Larsen ; Krisztian Balog ; Craig Macdonald; Linda Cappellato; Nicola Ferro. CLEF 2016 - 7th International Conference of the CLEF Association, Sep 2016, Evora, Portugal. Springer, pp.286--310, 2016, Experimental IR Meets Multilinguality, Multimodality, and Interaction.
  15. Enhancing Energy Production with Exascale HPC Methods
    José Camata, José Cela, Danilo Costa, Alvaro Lga Coutinho, Daniel Fernández-Galisteo, Carmen Jimenez, Vadim Kourdioumov, Marta Mattoso, Rafael Mayo-García, Thomas Miras, José Moríñigo, Jorge Navarro, Philippe Navaux, Daniel De Oliveira, Manuel Rodríguez-Pascual, Vítor Silva, Renan Souza, Patrick Valduriez
    CARLA: Latin American High Performance Computing Conference, Aug 2016, Mexico City, Mexico. Springer, 3rd Latin American High Performance Computing Conference, CCIS (697), pp.233-246, 2017.
  16. Scientific Workflow Scheduling with Provenance Support in Multisite Cloud
    Ji Liu, Esther Pacitti, Patrick Valduriez, Marta Mattoso
    VECPAR, Jun 2016, Porto, Portugal. 12th International Meeting on High Performance Computing for Computational Science, pp.8, 2016.
  17. The CloudMdsQL Multistore System
    Boyan Kolev, Carlyna Bondiombouy, Patrick Valduriez, Ricardo Jiménez-Peris, Raquel Pau, José Pereira
    ACM SIGMOD, Jun 2016, San Francisco, United States. ACM SIGMOD 35th International Conference on Management of Data, 2016. <10.1145/2882903.2899400>
  18. Development of a knowledge system for Big Data: Case study to plant phenotyping data
    Luyen Le Ngoc, Anne Tireau, Aravind Venkatesan, Pascal Neveu, Pierre Larmande
    WIMS: Web Intelligence, Mining and Semantics, Jun 2016, Nimes, France. ACM, 6th International Conference on Web Intelligence, Mining and Semantics, 2016. <10.1145/2912845.2912869>
  19. Exposing French agronomic resources as Linked Open Data
    Aravind Venkatesan, Nordine El Hassouni, Florian Phillipe, Cyril Pommier, Hadi Quesneville, Manuel Ruiz, Pierre Larmande
    IN-OLIVE, Jun 2016, Montpellier, France. Workshop In Ovive @ Ingenierie des Connaissances IC2016, 2016.
  20. Spatially Localized Visual Dictionary Learning
    Valentin Leveau, Alexis Joly, Olivier Buisson, Patrick Valduriez
    ICMR: International Conference on Multimedia Retrieval, Jun 2016, New York, United States. ACM, Proceedings of the ACM on International Conference on Multimedia Retrieval, pp.367-370, 2016.
  21. A New Privacy-Preserving Solution for Clustering Massively Distributed Personal Times-Series
    Tristan Allard, Georges Hébrail, Florent Masseglia, Esther Pacitti
    ICDE: International Conference on Data Engineering, May 2016, Helsinki, Finland. 32nd IEEE International Conference on Data Engineering, ICDE 2016, 2016. <http://icde2016.fi/>
  22. Design and Implementation of the CloudMdsQL Multistore System
    Boyan Kolev, Carlyna Bondiombouy, Oleksandra Levchenko, Patrick Valduriez, Ricardo Jimenez-Péris, Raquel Pau, Jose Pereira
    CLOSER: Cloud Computing and Services Science, Apr 2016, Roma, Italy. 6th International Conference on Cloud Computing and Services Science, 1, pp.352-359, 2016, DataDiversityConvergence Workshop.

2015

  1. Exposing French agronomic resources as Linked Open Data
    Aravind Venkatesan, Nordine El Hassouni, Florian Philippe, Cyril Pommier, Hadi Quesneville, Manuel Ruiz, Pierre Larmande
    SWAT4LS: Semantic Web Applications and Tools for Life Sciences, Dec 2015, Cambridge, United Kingdom. 1546, 2015. <http://ceur-ws.org/Vol-1546/>
  2. Managing Simulation Data with Multidimensional Arrays
    Hermano Lustosa, Fabio Porto, Ramon Costa, Pablo Blanco, Patrick Valduriez
    SBBD: Simpósio Brasileiro de Banco de Dados, Oct 2015, Petropolis, Brazil. 30th Brazilian Symposium on Databases, 2015. <http://dexl.lncc.br/sbbd2015/>
  3. Query Processing in Cloud Multistore Systems
    Carlyna Bondiombouy
    BDA: Gestion de Données — Principes, Technologies et Applications, Sep 2015, Île de Porquerolles, France. 31ème Conférence sur la Gestion de Données — Principes, Technologies et Applications, 2015, Gestion de données – principes, technologies et applications. <http://bda2015.univ-tln.fr>
  4. Ontology-based services and knowledge management in the Agronomic Domain
    Pierre Larmande
    RDA: Research Data Alliance, Sep 2015, Paris, France. The 6th Research Data Alliance plenary meeting, 2015. <https://rd-alliance.org/plenary-meetings/rda-sixth-plenary-meeting.html>
  5. A comparative study of fine-grained classification methods in the context of the LifeCLEF plant identification challenge 2015
    Julien Champ, Titouan Lorieul, Maximilien Servajean, Alexis Joly
    CEUR-WS. CLEF: Conference and Labs of the Evaluation forum, Sep 2015, Toulouse, France. Working Notes of CLEF 2015 - Conference and Labs of the Evaluation forum - Toulouse, France, September 8-11, 2015., 1391, 2015, CLEF2015 working notes. <http://ceur-ws.org/Vol-1391/>
  6. Shared nearest neighbors match kernel for bird songs identification -LifeCLEF 2015 challenge
    Alexis Joly, Valentin Leveau, Julien Champ, Olivier Buisson
    ceur-ws. CLEF: Conference and Labs of the Evaluation forum, Sep 2015, Toulouse, France. Working Notes of CLEF 2015 - Conference and Labs of the Evaluation forum - Toulouse, France, September 8-11, 2015., 1391, 2015, CLEF2015 working notes. <http://ceur-ws.org/Vol-1391/>
  7. LifeCLEF Plant Identification Task 2015
    Hervé Goëau, Pierre Bonnet, Alexis Joly
    CEUR-WS. CLEF: Conference and Labs of the Evaluation forum, Sep 2015, Toulouse, France. Working Notes of CLEF 2015 - Conference and Labs of the Evaluation forum - Toulouse, France, September 8-11, 2015., 1391, 2015, CLEF2015 Working notes. <http://ceur-ws.org/Vol-1391/>
  8. LifeCLEF Bird Identification Task 2015
    Hervé Goëau, Hervé Glotin, Willem-Pier Vellinga, Robert Planqué, Andreas Rauber, Alexis Joly
    CEUR-WS. CLEF: Conference and Labs of the Evaluation forum, Sep 2015, toulouse, France. Working Notes of CLEF 2015 - Conference and Labs of the Evaluation forum - Toulouse, France, September 8-11, 2015., 1391, 2015, CLEF2015 working notes. <http://ceur-ws.org/Vol-1391/>
  9. LifeCLEF 2015: Multimedia Life Species Identification Challenges
    Alexis Joly, Hervé Goëau, Hervé Glotin, Concetto Spampinato, Pierre Bonnet, Willem-Pier Vellinga, Robert Planqué, Andreas Rauber, Simone Palazzo, Bob Fisher, Henning Müller
    CLEF: Conference and Labs of the Evaluation forum, Sep 2015, Toulouse, France. Working Notes of CLEF 2015 - Conference and Labs of the Evaluation forum - Toulouse, France, September 8-11, 2015., 2015.
  10. Data Partitioning for Fast Mining of Frequent Itemsets in Massively Distributed Environments
    Saber Salah, Reza Akbarinia, Florent Masseglia
    DEXA: Database and Expert Systems Applications, Sep 2015, Valencia, Spain. 26th International Conference on Database and Expert Systems Applications, 2015. <http://www.dexa.org>
  11. A Prime Number Based Approach for Closed Frequent Itemset Mining in Big Data
    Mehdi Zitouni, Reza Akbarinia, Sadok Ben Yahia, Florent Masseglia
    DEXA: Database and Expert Systems Applications, Sep 2015, Valencia, Spain. 26th International Conference on Database and Expert Systems Applications, LNCS (9261), pp.509-516, 2015.
  12. Integrating Big Data and Relational Data with a Functional SQL-like Query Language
    Carlyna Bondiombouy, Boyan Kolev, Oleksandra Levchenko, Patrick Valduriez
    Qiming Chen; Abdelkader Hameurlain; Farouk Toumani; Roland Wagner; Hendrik Decker. Globe, Sep 2015, Valencia, Spain. 8th International Conference on Data Management in Cloud, Grid and P2P Systems (Globe) and 26th International Conference on Database and Expert Systems Applications (DEXA), LNCS (9261), pp.170-185, 2015.
  13. An Efficient Solution for Processing Skewed MapReduce Jobs
    Reza Akbarinia, Miguel Liroz-Gistau, Divyakant Agrawal, Patrick Valduriez
    Globe, Sep 2015, Valencia, Spain. 8th International Conference on Data Management in Cloud, Grid and P2P Systems and International Conference on Database and Expert Systems Applications, LNCS (9262), pp.417-429, 2015.
  14. Fast Parallel Mining of Maximally Informative k-Itemsets in Big Data
    Saber Salah, Reza Akbarinia, Florent Masseglia
    ICDM: International Conference on Data Mining, Aug 2015, Atlantic city, United States. 15th IEEE International Conference on Data Mining, pp.359-368, 2015.
  15. When sharing computer science with everyone also helps avoiding digital prejudices.
    Marie Duflot, Martin Quinson, Florent Masseglia, Didier Roy, Julien Vaubourg, Thierry Viéville
    Escape computer dirty magic: learn Scratch !. Scratch2015AMS, Aug 2015, Amsterdam, Netherlands. 2015.
  16. On Term Selection Techniques for Patent Prior Art Search
    Mona Golestan Far, Scott Sanner, Mohamed Reda Bouadjenek, Gabriela Ferraro, David Hawking
    SIGIR: Research and Development in Information Retrieval, Aug 2015, Santiago, Chile. ACM, 2015, SIGIR '15: 38th International SIGIR Conference on Research and Development in Information Retrieval. <10.1145/2766462.2767801>
  17. Aggregation-Aware Compression of Probabilistic Streaming Time Series
    Reza Akbarinia, Florent Masseglia
    MLDM: Machine Learning and Data Mining, Jul 2015, Hamburg, Germany. 11th International Conference on Machine Learning and Data Mining in Pattern Recognition, LNCS (9166), pp.232-247, 2015.
  18. Optimizing the Data-Process Relationship for Fast Mining of Frequent Itemsets in MapReduce
    Saber Salah, Reza Akbarinia, Florent Masseglia
    MLDM: Machine Learning and Data Mining, Jul 2015, Hamburg, Germany. 11th International Conference on Machine Learning and Data Mining in Pattern Recognition, LNCS (9166), pp.217-231, 2015.
  19. Towards efficient data integration and knowledge management in the Agronomic domain
    Aravind Venkatesan, Nordine El Hassouni, Florian Phillipe, Cyril Pommier, Hadi Quesneville, Manuel Ruiz, Pierre Larmande
    APIA: Applications Pratiques de l'Intelligence Artificielle , Jul 2015, Rennes, France. 1ère conférence sur les Application Pratiques de l'Intelligence Artificielle (APIA), 2015. <http://pfia2015.inria.fr/actes/index.php?procpage=apia>
  20. OpenAlea: Scientific Workflows Combining Data Analysis and Simulation
    Christophe Pradal, Christian Fournier, Patrick Valduriez, Sarah Cohen-Boulakia
    SSDBM: Scientific and Statistical Database Management, Jun 2015, San Diego, United States. 27th International Conference on Scientific and Statistical Database Management, 2015. <10.1145/2791347.2791365>
  21. DigInPix: Visual Named-Entities Identification in Images and Videos
    Pierre Letessier, Nicolas Hervé, Alexis Joly, Hakim Nabi, Mathieu Derval, Olivier Buisson
    ICMR: International Conference on Multimedia Retrieval, Jun 2015, Shanghai, China. ACM, Proceedings of the 5th ACM on International Conference on Multimedia Retrieval - ICMR '15, pp.661-664, 2015.
  22. Kernelizing Spatially Consistent Visual Matches for Fine-Grained Classification
    Valentin Leveau, Alexis Joly, Olivier Buisson, Patrick Valduriez
    ICMR: International Conference on Multimedia Retrieval, Jun 2015, Shangai, China. 5th ACM on International Conference on Multimedia Retrieval, pp.155-162, 2015.
  23. A Study of Query Reformulation for Patent Prior Art Search with Partial Patent Applications
    Mohamed Reda Bouadjenek, Scott Sanner, Gabriela Ferraro
    ICAIL: International Conference on Artificial Intelligence and Law, Jun 2015, San Diego, United States. 2015, ICAIL'2015: 15th International Conference on Artificial Intelligence and Law.
  24. Chiaroscuro: Transparency and Privacy for Massive Personal Time-Series Clustering
    Tristan Allard, Georges Hébrail, Florent Masseglia, Esther Pacitti
    ACM SIGMOD, May 2015, Melbourne, Australia. ACM SIGMOD 34th International Conference on Management of Data, pp.779-794, 2015.
  25. Data-intensive HPC: opportunities and challenges
    Patrick Valduriez
    BDEC: Big Data and Extreme-scale Computing, Jan 2015, Barcelone, Spain. 2015.

2014

  1. Fine-grained Visual Faceted Search
    Julien Champ, Alexis Joly, Bonnet Pierre
    MM: Multimedia, Nov 2014, Orlando, FL, United States. 22nd ACM International Conference on Multimedia, 2014. <10.1145/2647868.2654875>
  2. Recognizing Thousands of Legal Entities through Instance-based Visual Classification
    Valentin Leveau, Alexis Joly, Olivier Buisson, Pierre Letessier, Patrick Valduriez
    MM: Multimedia, Nov 2014, Orlando, FL, United States. 22nd ACM International Conference on Multimedia, 2014. <10.1145/2647868.2655038>
  3. NACluster: A Non-Supervised Clustering Algorithm for Matching Multi Catalogues
    Vinicius P. Freire, José A. F. De Macêdo, Fábio Porto, Reza Akbarinia
    IEEE e-Science Workshop, Oct 2014, Guarujá, SP, Brazil. 2014. <http://escience.ime.usp.br/preliminary-program/accepted-papers/accepted-papers-workshops>
  4. Layer Decomposition: An Effective Structure-based Approach for Scientific Workflow Similarity
    Johannes Starlinger, Sarah Cohen-Boulakia, Sanjeev Khanna, Susan Davidson, Ulf Leser
    IEEE e-Science conference, Oct 2014, Guarujá, Brazil. 2014.
  5. Multisite Management of Data-intensive Scientific Workflows in the Cloud
    Ji Liu
    BDA: Gestion de Données — Principes, Technologies et Applications, Oct 2014, Autrans, France. 30ème Conférence sur la Gestion de Données — Principes, Technologies et Applications, pp.28-30, 2014, Gestion de données - principes, technologies et applications.
  6. PlantRT : a Distributed Recommendation Tool for Citizen Science
    Maximilien Servajean, Esther Pacitti, Miguel Liroz-Gistau, Alexis Joly, Julien Champ
    BDA: Gestion de Données — Principes, Technologies et Applications, Oct 2014, Autrans, France. 30ème Conférence sur la Gestion de Données — Principes, Technologies et Applications, pp.48-50, 2014.
  7. Compression de flux de données probabilistes attentive à l'agrégation
    Reza Akbarinia, Florent Masseglia
    BDA: Gestion de Données — Principes, Technologies et Applications, Oct 2014, Autrans, France. 30ème Conférence sur la Gestion de Données — Principes, Technologies et Applications, 2014. <http://bda2014.imag.fr>
  8. Exploiting Diversification in Distributed Recommendation
    Maximilien Servajean, Esther Pacitti, Miguel Liroz-Gistau, Sihem Amer-Yahia, Amr El Abbadi
    BDA: Gestion de Données — Principes, Technologies et Applications, Oct 2014, Autrans, France. 30ème Conférence sur la Gestion de Données — Principes, Technologies et Applications, 2014. <http://bda2014.imag.fr>
  9. Instance-based bird species identication with undiscriminant features pruning - LifeCLEF 2014
    Alexis Joly, Julien Champ, Olivier Buisson
    CLEF: Conference and Labs of the Evaluation Forum, Sep 2014, Sheffield, United Kingdom. 2014, Information Access Evaluation meets Multilinguality, Multimodality, and Interaction. <http://clef2014.clef-initiative.eu>
  10. Lifeclef 2014: multimedia life species identification challenges
    Alexis Joly, Hervé Goëau, Hervé Glotin, Concetto Spampinato, Pierre Bonnet, Willem-Pier Vellinga, Robert Planque, Andreas Rauber, Bob Fisher, Henning Müller
    CLEF 2014 - Conference and Labs of the Evaluation forum, Sep 2014, Sheffield, United Kingdom. 5th International Conference of the CLEF Initiative, CLEF 2014, Sheffield, UK, September 15-18, 2014. Proceedings, LNCS (8685), pp.229-249, 2014, Information Access Evaluation. Multilinguality, Multimodality, and Interaction.
  11. LifeCLEF Bird Identification Task 2014
    Hervé Goëau, Hervé Glotin, Willem-Pier Vellinga, Robert Planqué, Andreas Rauber, Alexis Joly
    CLEF: Conference and Labs of the Evaluation Forum, Sep 2014, Sheffield, United Kingdom. 2014, Information Access Evaluation meets Multilinguality, Multimodality, and Interaction. <http://clef2014.clef-initiative.eu>
  12. Exploiting Diversification in Gossip-Based Recommendation
    Maximilien Servajean, Esther Pacitti, Miguel Liroz-Gistau, Sihem Amer-Yahia, Amr El Abbadi
    Globe, Sep 2014, Munich, Germany. 7th International Conference on Data Management in Cloud, Grid and P2P Systems, LNCS (8648), pp.25-36, 2014, Data Management in Cloud, Grid and P2P Systems.
  13. Scientific Workflow Partitioning in Multi-site Clouds
    Ji Liu, Esther Pacitti, Patrick Valduriez, Vitor Silva Souza, Marta Mattoso
    L. Lopes. Euro-Par: European Conference on Parallel Processing, Aug 2014, Porto, Portugal. Springer, Euro-Par 2014: Parallel Processing Workshops, LNCS (8805), pp.105-116, 2014.
  14. Towards Efficient Power Management in MapReduce: Investigation of CPU-Frequencies Scaling on Power Efficiency in Hadoop
    Shadi Ibrahim, Diana Moise, Houssem-Eddine Chihoub, Alexandra Carpen-Amarie, Luc Bougé, Gabriel Antoniu
    ARMS-CC: Adaptive Resource Management and Scheduling for Cloud Computing, Jul 2014, Paris, France. 1st International Workshop on Adaptive Resource Management and Scheduling for Cloud Computing Held in Conjunction with ACM Symposium on Principles of Distributed Computing (PODC), 2014. <10.1007/978-3-319-13464-2_11>
  15. The role of hydraulics FSPMs in the context of root breeding : a case study on Pearl Millet
    Adama Ndour, Christophe Pradal, Vincent Vadez, Sixtine Passot, Yann Guédon, Laurent Laplaze, Mikael Lucas
    EGU: European Geosciences Union, Apr 2014, Vienne, Austria. European Geosciences Union General Assembly, 20, 2018.
  16. LifeCLEF: Multimedia Life Species Identification
    Alexis Joly, Robert Planque, Concetto Spampinato, Henning Müller, Hervé Goëau, Andreas Rauber, Bonnet Pierre, Willem-Pier Vellinga, Robert B. Fisher, Hervé Glotin
    EMR: Environmental Multimedia Retrieval, Apr 2014, Glasgow, United Kingdom. 1st International Workshop on Environmental Multimedia Retrieval (EMR) co-located with ACM International Conference on Multimedia Retrieval (ICMR), 2014. <http://ceur-ws.org/Vol-1222/>
  17. Pl@ntNet Mobile 2014: Android port and new features
    Hervé Goëau, Bonnet Pierre, Alexis Joly, Antoine Affouard, Vera Bakić, Julien Barbe, Samuel Dufour-Kowalski, Souheil Selmi, Yahiaoui Itheri, Christel Vignau, Daniel Barthelemy, Nozha Boujemaa
    ICMR: International Conference on Multimedia Retrieval, Apr 2014, Glasgow, United Kingdom. ACM International Conference on Multimedia Retrieval, 2014. <10.1145/2578726.2582618>

Mots-clés

Big data, Données scientifiques, Gestion de données distribuées et parallèles, Analyse et fouille de données, Recommandation et recherche de contenus, Communautés en ligne, Workflows scientifiques, Intégration, Confidentialité, Recherche d’information par contenu, P2P, Grid, Cloud

Dernière mise à jour le 01/02/2019