===== Franck Michel ===== [[http://www.univ-cotedazur.fr|Université Côte d’Azur]], [[https://www.cnrs.fr/|CNRS]], [[https://www.inria.fr/|Inria]]. {{:franckmichel2_600.jpg?nolink&130 | }} I am a CNRS research engineer involved in the **integration of heterogeneous data** and their publication and sharing as **Knowledge Graphs** on the Web, using knowledge engineering, the Semantic Web and **Linked Open Data** technologies. I am a member of [[http://www.i3s.unice.fr/|I3S]] laboratory's [[https://sparks.i3s.unice.fr/|SPARKS]] group, and a member of [[https://www.inria.fr/|Inria]]'s [[https://team.inria.fr/wimmics/|Wimmics]] team. ===== Research activities ===== I am involved in research activities meant to enable the **integration of heterogeneous data** based on a **knowledge engineering** approach, as well as the sharing and reuse of these data. My work addresses several research questions: - How to **build FAIR Knowledge Graphs** complying with the Linked Data 4-star principles? - How to **overcome data structural and semantic heterogeneity** in order to reconcile and make sense of large data sets distributed at Web-scale? - **How to foster data reuse** by publishing them in machine-processable formats? This work is concerned with leveraging the **Linked Data principles** to integrate heterogeneous legacy data sources and make them available in the Web of Data. This was the main topic of my Ph.D that I defended in 2017, with a specific focus on the translation of data from NoSQL databases into RDF. - How to **enable the Web-scale discovery and consumption of data**? This work is concerned with methods to make data and query services discoverable, and the types of interfaces that are suitable to consume Linked Data. I hold several collaborations with researchers in the biodiversity domain. To understand the effects of climate change on biodiversity, researchers have a pressing need to make sense of myriad data produced all over the world by biodiversity-related projects. In this context, I work with the French National Museum of Natural History towards the publication of their data as Linked Open Data. More generally, together with communities like Bioschemas.org, we strive to enable Web-scale integration of biodiversity. Here are the main projects I'm currently involved in: * [[http://www.d2kab.org/|D2KAB]] (ANR): From Data to Knowledge in Agronomy, Agriculture and Biodiversity * [[https://anr.fr/Projet-ANR-19-CE23-0014|DeKaloG]] (ANR): Decentralized Knowledge Graphs * [[https://issa.cirad.fr/|ISSA 2]] (Collex-Persée): Indexation Sémantique d'une archive scientifique et Services Associés pour la science ouverte * [[https://github.com/frmichel/sparql-micro-service|SPARQL micro-services]] * [[https://github.com/frmichel/taxref-ld/|TAXREF-LD]]: the French Linked Data Taxonomic Registry * [[http://fr.dbpedia.org/|DBpedia French chapter]] * [[https://github.com/Wimmics/CovidOnTheWeb|Covid on the Web]] Here are the community projects I'm currently involved in: * [[https://bioschemas.org/|Bioschemas]]: Schema.org for Life Sciences * [[https://www.w3.org/community/kg-construct/|Knowledge Graph Construction W3C Community Group]] ===== Publications ===== {{url>https://haltools.inria.fr/Public/afficheRequetePubli.php?idHal=fmichel&CB_auteur=oui&CB_titre=oui&CB_article=oui&CB_vignette=oui&langue=Anglais&tri_exp=annee_publi&tri_exp2=typdoc&ordre_aff=TA&Fen=Aff&css=../css/styles_publicationsHAL.css%20100%,400px%20noborder}} Complete list of publications and communications: [[https://cv.archives-ouvertes.fr/fmichel/|HAL CV]]. Also find me on [[https://www.researchgate.net/profile/Franck_Michel3|ReasearchGate]]. ===== Contact ===== **Address**:\\ Université Côte d’Azur, CNRS, Inria - I3S, UMR 7271\\ 930 route des Colles - Bât. Les Templiers\\ BP 145 - 06903 Sophia Antipolis CEDEX - France **Email**: fmichel [at] i3s.unice [dot] fr, franck [dot] michel [at] inria [dot] fr\\ **Find me on**: [[https://www.researchgate.net/profile/Franck_Michel3|ResearchGate]], [[https://github.com/frmichel|Github]], [[https://www.linkedin.com/in/franck-michel-b3064b1a/|LinkedIn]], [[https://twitter.com/franck_michel2|Twitter]], [[https://www.slideshare.net/FranckMichel|SlideShare]], [[https://flickr.com/photos/franckmichel/|Flickr]], [[https://instagram.com/franck.michel.photo/|Instagram]] ===== Selected talks ===== === Open Science, reproducibible research, and the citation of articles, code and data alike === Given 2024-04-04. {{url>https://www.youtube.com/embed/MqlEqaBXUyY 200,150 noscroll YouTube player}} === ISSA: Generic Knowledge Model and Visualization tools to Help Scientists Make Sense of Archive === Wimmics Monthly Seminar 2022-12-15 / ISWC 2022 resource track replay {{url>https://www.youtube.com/embed/mbgz58rP-ps 200,150 noscroll |YouTube player}} === Covid-on-the-Web: Knowledge Graph and Services to Advance COVID-19 Research === Presented at the ISWC 2020 conference, resource track. {{url>https://www.youtube.com/embed/kZAtxFpm6N0 200,150 noscroll |YouTube player}} === Bioschemas: Marking up biodiversity websites for data discovery & integration === TDWG webinar series, 2021-03. {{url>https://www.youtube.com/embed/2GZ-YtUjzJM 200,150 noscroll |YouTube player}} === Integration of biodiversity data from web pages to knowledge graphs, a computer scientist view point === DIADE research unit seminars (http://diade.ird.fr), 2021-04-13. {{url>https://www.youtube.com/embed/0InzhhZ6T5k 200,150 noscroll |YouTube player}} ===== Creation/publication of datasets ===== **ISSA Agritrop Dataset**, Semantic index of the Agritrop open scientific archive. [[https://github.com/issa-project/issa-pipeline|github]] [[https://doi.org/10.5281/zenodo.10381606|DOI]] **TAXREF-LD**, Linked Data knowledge graph of the French taxonomic register. Franck MICHEL, Catherine FARON, Sandrine TERCERIE, Olivier GARGOMINY. 2017(2022. [[https://github.com/frmichel/taxref-ld/|github]] [[http://taxref.mnhn.fr/sparql|sparql]] [[https://hal.archives-ouvertes.fr/hal-01617708|article]] [[https://doi.org/10.5281/zenodo.5876775|DOI]] **Covid-on-the-Web**. Franck Michel, Fabien Gandon, Valentin Ah-Kane, Anna Bobasheva, Elena Cabrio, Olivier Corby, Raphaël Gazzotti, Alain Giboin, Santiago Marro, Tobias Mayer, Mathieu Simon, Serena Villata, Marco Winckler. 2020. [[https://github.com/Wimmics/CovidOnTheWeb|github]] [[https://covidontheweb.inria.fr/sparql|sparql]] [[https://hal.archives-ouvertes.fr/hal-02939363|article]] [[https://doi.org/10.5281/zenodo.4247134|DOI]]. **WASABI RDF Knowledge Graph**. An RDF representation of the WASABI corpus of songs enriched with metadata extracted from music databases on the Web, and resulting from the processing of song lyrics and from audio analysis. 2020. [[https://github.com/micbuffa/WasabiDataset|github]] [[http://wasabi.inria.fr/sparql|sparql]] [[https://hal.archives-ouvertes.fr/hal-03282619/|article]] [[https://doi.org/10.5281/zenodo.5603369|DOI]] **WeKG-MF**, Weather Knowledge Graph of Météo France Meteorological Observations. 2022. [[https://github.com/Wimmics/weather-kg|github]] [[http://weakg.i3s.unice.fr/sparql|sparql]] [[https://hal.inria.fr/hal-03619869/|article]] [[https://doi.org/10.5281/zenodo.5925413|DOI]] **WheatGenomicsSLKG**, Wheat Genomics Scientific Literature Knowledge Graph. 2023. [[https://github.com/Wimmics/WheatGenomicsSLKG|github]] [[http://weakg.i3s.unice.fr/sparql|sparql]] [[https://doi.org/10.5281/zenodo.10410742|DOI]] ===== Software Development ===== **ISSA visualization and search web application**: Franck MICHEL, Youssef Mekouar (2022). Github: [[https://github.com/issa-project/web-visualization/|visu]] and [[https://github.com/issa-project/web-backend/|backend]] **ISSA Processing Pipeline**: Anna Bobasheva, Franck MICHEL (2022). [[https://github.com/issa-project/issa-pipeline|github]] [[https://doi.org/10.5281/zenodo.10376913|DOI]] **SPARQL Micro-Services: Querying Web APIs with SPARQL**. Franck Michel. 2018. [[https://github.com/frmichel/sparql-micro-service|github]] **Morph-xR2RML: MongoDB-to-RDF translation and SPARQL rewriting**: Franck Michel, Freddy Pryiatna. Implementation of the xR2RML mapping language for MongoDB databases. 2017. [[https://github.com/frmichel/morph-xr2rml/|github]] [[https://doi.org/10.5281/zenodo.16547|DOI]] **The VO Administration and operations PORtal (VAPOR)**. Franck Michel, Flavien Forestier. 2014. [[https://wiki.egi.eu/wiki/VT_VAPOR|web]] [[https://doi.org/10.5281/zenodo.10276|DOI]] **EGI Virtual Organisations Support Tools**. Franck Michel. 2013. [[https://github.com/frmichel/vo-support-tools|web]] [[https://doi.org/10.5281/zenodo.10276|DOI]] **NeuroLOG platform**. Alban Gaignard, Franck Michel, Johan Montagnat, Javier Rojas Balderrama, Farooq Ahmad, Bacem Wali. 2008. [[http://neurolog.i3s.unice.fr/|web]] ===== Background and Position ===== * **CNRS Research engineer (IR)**, [[http://univ-cotedazur.fr/|Université Côte d'Azur]], [[https://www.cnrs.fr/|CNRS]], [[https://www.inria.fr/|Inria]], [[http://www.i3s.unice.fr/|I3S]] laboratory. Jan. 2011 until now. * **PhD in Computer Sciences** at [[http://univ-cotedazur.fr/|Université Côte d'Azur]], March 2017. [[https://hal.archives-ouvertes.fr/cel-01585312|Manuscript]] * **Expert software engineer** in IRISA team [[https://team.inria.fr/visages/|VisAGeS]], May 2008 to Dec. 2010 * **Expert telecom engineer**, company [[http://www.fr.capgemini.com/secteurs/media/|Capgemini Telecom, Media Networks]], 1999 à 2008 * **Development engineer**, company [[http://www.nortel.com/|Nortel Networks France]], 1995 à 1999 * **Engineering degree** in Computer Sciences, [[http://www.insa-rennes.fr|INSA de Rennes]], 1995 ===== Organizing and Program Committees ===== I was/am a member of the program committees for the following conferences and/or workshops: * [[https://www.ecai2024.eu/|ECAI 2024]], European Conference on AI * [[https://2024.eswc-conferences.org/|ESWC 2024]], The Extended Semantic Web Conference * [[https://www2024.thewebconf.org/|The Web Conference 2024]], The Extended Semantic Web Conference * [[https://2023-eu.semantics.cc/|SEMANTiCS 2023]], 15th International Conference on Semantic Systems * [[https://2022.eswc-conferences.org/|ESWC 2022]], The Extended Semantic Web Conference * [[https://https://kg-construct.github.io/workshop/2022/|KGCW 2022]], Third International Workshop on Knowledge Graph Construction * [[https://mosaicrown.github.io/scg2021/|SCG 2021]], First workshop on Squaring the circle on graphs * [[https://https://kg-construct.github.io/workshop/2021/|KGCW 2021]], Second International Workshop on Knowledge Graph Construction * [[https://2021.eswc-conferences.org/|ESWC 2021]], The Extended Semantic Web Conference * [[https://www.iccs-meeting.org/iccs2020/|ICCS 2020]], The International Conference on Computational Science * [[https://ijcai20.org/|IJCAI 2020]], 29th International Joint Conference on Artificial Intelligence * [[https://2019.semantics.cc/|SEMANTiCS 2019]], 15th International Conference on Semantic Systems * [[http://kgb-workshop.org/|Knowledge Graph Building]] (KBD), workshop of the Extended Semantic Web Conference 2019 (ESWC) * [[https://www.hyperagents.org/|Hypermedia Multi-Agent Systems]] (HyperAgents 2019), workshop of the Web Conference 2019 * [[https://project.inria.fr/ekaw2018/|EKAW 2018]], 21th International Conference on Knowledge Engineering and Knowledge Management * [[https://2018.semantics.cc/|SEMANTiCS 2018]], 14th International Conference on Semantic Systems * [[http://iswc2018.semanticweb.org/|ISWC 2018]], 17th International Semantic Web Conference * [[http://blogs.napier.ac.uk/iccs/|ICCS 2018]], 23rd International Conference on Conceptual Structures * [[https://www2018.thewebconf.org/|WWW 2018]], The Web Conference 2018 * [[http://iswc2017.semanticweb.org/|ISWC 2017]], 16th International Semantic Web Conference * [[https://2017.semantics.cc/|SEMANTiCS 2017]], 13th International Conference on Semantic Systems * [[https://www.irit.fr/ICCS2016/|ICCS 2016]], 22nd International Conferences on Conceptual Structures * [[http://inforsid.fr/Biarritz2015/wp-content/uploads/2015/05/SIIA2015.pdf|SI&IA 2015]], Systèmes d'Information et Intelligence Artificielle 2015 I was/am a member of the organizing committees for the following conferences and/or workshops: * [[https://oaei.ontologymatching.org/2022/|Ontology Alignment Evaluation Initiative 2022]]: ontology provider for the complex alignment and biodiversity tracks * [[https://oaei.ontologymatching.org/2021/|Ontology Alignment Evaluation Initiative 2021]]: ontology provider for the complex alignment and biodiversity tracks * [[https://www.tdwg.org/conferences/2021/session-list/#sym12%20connecting%20biodiversity%20data%20with%20knowledge%20graphs|TDWG 2021 Symposium on Connecting biodiversity data with knowledge graphs]] * [[https://fusion.cs.uni-jena.de/s4biodiv2021/|S4Biodiv : 3rd International Workshop on Semantics for Biodiversity]] * [[http://devlog.cnrs.fr/jdev2020|JDEV2020 : Journées CNRS du développement logiciels]] * [[http://devlog.cnrs.fr/apsem2019|APSEM2019 : écosystèmes pour la science ouverte et recherche par les données]] * [[http://devlog.cnrs.fr/apsem2018|APSEM2018 : Apprentissage et sémantique]] * [[http://devlog.cnrs.fr/jdev2017|JDEV2017 : Journées CNRS du développement logiciels ]] ===== Wild ideas ===== ==== Large Language Models as the components of a conscious AI? ==== AIs, and LLMs in particular, are not conscious. They are reactive systems, they respond to an input by producing an output. By contrast, consciousness can be defined as the ability to form thoughts for oneself, without the need for external stimulus. What if we fine-tuned several LLMs to collaborate together, following the model of human psyche. * The **"conscious" model** would be fine-tuned to remain in the realm of values, morality, norms, and logical thinking. This is the one that would interact with the "outside" world and provide material to the "unconscious" model. * The **"unconscious" model** (or "subconscious" depending on the definitions) would be fine-tuned to phrase drive, desires, regardless of any norms nor value system. * The **"preconscious" model** would be fine-tuned to filter/rewrite outputs of the "unconscious" to let only acceptable outputs make their way to the "conscious", while also providing it with material in a feed-back loop. This way, we could imagine being able to design some sort of a conscious AI system. But this raises multiple questions: How would it be bootstrapped? Individually, each of the 3 LLMs remains a question-answering system, it does not take the initiative of producing an output. So how to start this, and once this starts, how to control the flow? If we skip the conscious model (to try and simulate sleep) and leave the unconscious model talk with itself, could it come up with dreams? ==== Frugality by design: less is good ==== Multiple tools of the daily life consume energy and/or resource even though that's not the intend of the user. These tools must be redesigned with a specific **bias towards frugality**. This can be implemented as a default behavior, as a nudge etc. Examples: * Mixer tap delivers heated water by default: In the middle position, most mixer taps mix half ambient temperature water and half heated water. Although users may not need heated water. These should be redesigned with a middle position that only delivers ambient temperature water, so that getting heated water will require a deliberate action from the user. * Public space fountain water delivers cooled by default: Fountains in public spaces usually deliver cooled water by default although users may not want that. These should be redesigned with a default ambient temperature water, so that getting cooled water will require a deliberate action from the user.