PractiKPharma Project Homepage

PractiKPharma
Practice-based evidences for actioning Knowledge in Pharmacogenomics
ANR-15-CE23-0028 -- http://practikpharma.loria.fr
2016-2021
OBJECTIVES
PractiKPharma is a collaborative research project funded by the French National Research Agency (ANR). It studies computer sciences approaches to extract, compare, validate state of the art knowledge in the biomedical domain of Pharmacogneomics.
Pharmacogenomics studies how genetics impacts drug response phentoypes.

Units of knowledge in PGx typically have the form of ternary relationships gene variant–drug–adverse event, and can be formalized in various manner, in particular with biomedical ontologies.
1. Knowledge Extraction from reference databases & the biomedical literature
We extract from open-access biomedical articles (PubMed citations), strongly typed PGx relationships between fine grained PGx entities, i.e., gene variant, drug, adverse event.
We focus on the description of phenotypes related to drug responses in term of signs and symptoms e.g., “anemias”, or “low RBC”.
A challenge for the extraction of PGx knowledge is to associate with PGx relationships a validation level on how hypothetical a relationship is. For evaluating knowledge extraction quality we use the content of the Pharmacogenomic Knowledge Base (PharmGKB) developed at Stanford.
2. Knowledge Extraction from Electronic Health Records (EHRs)
We treat Electronic Health Record data from the HEGP and annotate their content with biomedical ontologies. Because EHRs are in French, we reuse and adapt the SIFR Annotator to annotate French clinical narratives with French ontologies.

3. Comparison of state-of-the-art and observational knowledge
Knowledge previously extracted from literature and clinical notes are from different nature and come from multilingual data. We develop a framework for comparing those knowledge units and detect concurring, ambiguous and conflicting knowledge units. We study Formal Concept Analysis to establish correspondences between knowledge representations and data sources. We consequently map ontologies natively used in both worlds (literature and EHRs), considering that they can be either in English or in French.
4. Explaining PGx knowledge using omics data
We consider several omics databases (e.g., DrugBank, Uniprot, KEGG) to investigate molecular mechanisms that may explain adverse reactions of pharmacogenomic drugs. Some of these databases are available as Linked Open Data, facilitating their reuse, some are not. Once these databases connected, we will evaluate novel data analysis methods to provide elements of explanation of how PGx genes impact drug response.
RESULTS

Publications
10 journals, 13 international workshops or conferences, 4 national workshops or conferences (as of May 2019)
We published in international journals (Journal of Biomedical Semantics, Gigascience, Scientific Reports, BMC Bioinformatics, Scientific Data), in international conferences (EKAW, ISMIS, MedInfo), in international peer-reviewed workshops (LOUHI, NETTAB, BioOntologies, etc.) and a popular article in ERCIM News magazine.
Nouvelle
Pharmacogenomic Linked Open Data (PGxLOD)
Linking PGx data
We started with six data sources which focuses on drugs (DrugBank), genes (ClinVar, DisGeNET) and drug responses (Side, MediSpan) and added pharmacogenomic knowledge units extracted from PharmGKB and the litterature. These data sources were mapped and transformed into a single RDF graph available at https://pgxlod.loria.fr. PGxLOD is intended to host pharmacogenomic-related data and knowledge units of various provenance. PGxLOD paper.
Pharmacogenomic Ontology (PGxO)
A simplistic representation of PGx relationships and their provenance
To structure and allow the coexistence of pharmacogenomic knowledge elements of diverse origins within PGxLOD, we propose a minimal ontology, called PGxO. This simple schema enables both representing pharmacogneomic knowledge units and documenting their provenance. PGxO paper.
Pharmacogenomic Corpus (PGxCorpus)
Deep Learning for knowledge extraction
We aim at enabling deep learning to extract PGx relationships from text. Because these approaches require large sets of training data we have built a corpus, called PGxCorpus, from PubMed articles currently made of 945 sentences with >80% with a PGx relationship (2875) manually annotated by 11 annotators.
Meanwhile, we are investigating transfer learning approaches to enable PGx relationship extraction.PGxCorpus paper.
SIFR Annotator
Semantic annotation of clinical notes
We have adapted the French Annotator developed within the SIFR project to annotate French clinical text. We added enhanced functionalities such as scoring, detection of context (negation, experiencer, temporality), new output formats, and coarse-grained concept recognition.
Clinical Data Bioinformatics Workflow
Facilitate clinical records process
We developed a bioinformatics pipeline using NextFlow to facilitate the processing and populating of HEGP Electronic Health Records. The resulting workflow enables the design of reproducible studies associated with metadata and parametrization. Molecular biologists use it to annotate tumor variants.
Software Developments
Our software is open source and available on GitHub: https://github.com/practikpharma
DISSEMINATION
La thèse de William Digan, en 180 secondes
PractiKPharma scientific meetings
4th (05/19): Invited: Grégoire Rey (directeur du CépIDC, INSERM) and Matthias Samwald (Medical University of Vienna).
3rd (11/18): The 3rd scientific meeting is replaced by out active involvement in the Biohackathon 2018 Paris.
2nd (12/17): Invited: Cédrick Fairon (Univ. Catholique de Louvain) and Nicolas Garcelon (Institut des maladies génétiques Imagine, Paris)
1st (12/16): Invited: Aaron Henricksson (Univ. of Stockholm) and Aurélie Névéol (LIMSI, Paris).
PractiKPharma plenary meetings
23/5/2019 at LIRMM, Montpellier
17/5/2018 at the Cordelier Research Center, Paris
1/12/2017 at the Cordelier Research Center, Paris
9/5/2017 at the Cordelier Research Center, Paris
Invited: Clement Goerhs (ERIAS/C-NAPPS)
8/12/2016 at Loria, Nancy
20/1/2016 at Loria, Nancy
Other co-organized events
27th French "Ingénierie des Connaissances" Conference (IC), 6-10 June 2016 , Montpellier, France.
2nd French Health & AI Workshop, 6 June 2016, Montpellier, France.
4th French Symposium "Ingénierie de l'Information Médicale" (SIIM), 23-24 November 2017, Toulouse, France.
33th French "Base de Données Avancées" Conference (BDA), Novembre 2017, Nancy, France.
21st Knowledge Acquisition, Modeling and Management (EKAW) International Conference, November 2018, Nancy, France.
External collaborations
CHU de Nancy
INSERM (UMR U1122 IGE-PCV)
Stanford BMIR (Musen & Shah labs)
ERIAS/Synapse-Medicine (Clement Goehrs)
Etablissement Français du Sang
Tutorials & Presentations
Jan. 2021: Poster at AI4Health Winter School (selected)
Dec. 2020: PhD thesis defenses. Both William Digan and Pierre Monnin successfully defended their PhD, end of December. Congratulations!
Oct. 2020: Open Science Colloquium at Université de Lorraine (invited)
Jun. 2020: Malotec seminar at Loria
Sept. 2019: Presentation at Grand Est Numérique (invited)
Aug. 2019: MedInfo:
Full paper of W. Digan (29/8, 16:50, peer reviewed)
Podium Abstract of FE Calvier (28/08, 8:30, peer reviewed)
Nov. 2018: Hacktask at the Biohackathon 2018 Paris (selected).
June 2017: 2nd Summer Datathon on Linguistic Linked Open Data.
June 2017: French Minister meeting on multi-terminology health portals (invited)
May 2017: French Infoday on health data security (invited)
April 2017: INSERM health data management workshop
June 2016: Tutorial on ontology management given at IC 2016
Publications & Software
Publications related to the project are available on HAL,
Code is on GitHub.
Hackathon
PractiKPharma has participated to the BioHackathon 2018 in Paris. Our team worked on exploring pharmacogenomic linked open data .
Prize of the EGC 2022 PhD Thesis
The EGC association granted his 2022 PhD prize to Pierre Monnin for his thesis untitled "Matching and mining in knowledge graphs of the Web of data : Applications in pharmacogenomics" prepared within PractiKPharma. Congrats Pierre!
CONSORTIUM

Lorraine Research Laboratory in Computer Science and its Applications
Nancy
Georges Pompidou European Hospital, APHP
Paris
Laboratory of Informatics, Robotics and Microelectronics of Montpellier
Montpellier
St Etienne University Hospital
St Etienne
TEAM

Adrien Coulet
Assistant Professor
LORIA
Bastien Rance
Research Scientist
HEGP
Cedric Bousquet
Hospital Practitioner
CHU St Etienne
Clement Jonquet
Assistant Professor
LIRMM
Joel Legrand
Assistant Professor (previously Postdoc)
CRIStAL (previously at LORIA)
Pierre Monnin
PhD student
LORIA
Andon Tchechmedjiev
Adjunct Assistant Professor (previously Postdoc)
LGI2P, Mines d'Alès (previously at LIRMM)
Amine Abdaoui
Postdoc
LIRMM
William Digan
PhD student
HEGP
Francois-Elie Calvier
Postdoc
CHU St Etienne
Anita Burgun
Professor
HEGP
Malika Smaïl-Tabbone
Associate Professor
LORIA
Gabin Personeni
PhD Student
LORIA
Amina Annane
PhD Student
LIRMM
Yannick Toussaint
Professor
LORIA
Sandra Bringay
Professor
LIRMM
Amedeo Napoli
DR CNRS
LORIA
Zohra Bellahsene
Professor
LIRMM
Stella Zevio
MSc Intern
LIRMM (now PhD student at LIPN)
Romain Gogdemir
MSc Intern
LORIA
Graziella Husson
MSc Intern
LORIA
Mehdi Mirzapour
Postdoc
LIRMM
JOBS

Research engineer position (closed)
Web architect – Semantic web technologies – Text and data mining, LIRMM, Montpellier
Postdoctoral position (closed)
Multilingual semantic annotations of Electronic Health Records and Pharmacogenomics data with ontologies, LIRMM, Montpellier
Postdoctoral position (closed)
Extracting pharmacogenomic relationships from text , considering domain knowledge, LORIA, Nancy
PhD position (closed)
Knowledge extraction and comparison for pharmacogenomics (in French), LORIA, Nancy
CONTACT
LORIA, Campus Scientifique, 615 Rue du Jardin-Botanique, 54506 Vandœuvre-lès-Nancy, France
adrien *dot* coulet *at* loria *dot* fr

Comment
SUPPORT

ANR-15-CE23-0028
OTHER SUPPORT

VSR program
Associate Team program

PractiKPharma

Practice-based evidences for actioning Knowledge in PharmacogenomicsANR-15-CE23-0028 -- http://practikpharma.loria.fr2016-2021

OBJECTIVES

1. Knowledge Extraction from reference databases & the biomedical literature

2. Knowledge Extraction from Electronic Health Records (EHRs)

3. Comparison of state-of-the-art and observational knowledge

4. Explaining PGx knowledge using omics data

RESULTS

Publications

10 journals, 13 international workshops or conferences, 4 national workshops or conferences (as of May 2019)

Pharmacogenomic Linked Open Data (PGxLOD)

Linking PGx data

Pharmacogenomic Ontology (PGxO)

A simplistic representation of PGx relationships and their provenance

Pharmacogenomic Corpus (PGxCorpus)

Deep Learning for knowledge extraction

SIFR Annotator

Semantic annotation of clinical notes

Clinical Data Bioinformatics Workflow

Facilitate clinical records process

Software Developments

DISSEMINATION

La thèse de William Digan, en 180 secondes

PractiKPharma scientific meetings

PractiKPharma plenary meetings

Other co-organized events

External collaborations

Tutorials & Presentations

Publications & Software

Hackathon

Prize of the EGC 2022 PhD Thesis

CONSORTIUM

Lorraine Research Laboratory in Computer Science and its Applications

Georges Pompidou European Hospital, APHP

Laboratory of Informatics, Robotics and Microelectronics of Montpellier

St Etienne University Hospital

TEAM

Adrien Coulet

Assistant Professor

Bastien Rance

Research Scientist

Cedric Bousquet

Hospital Practitioner

Clement Jonquet

Assistant Professor

Joel Legrand

Assistant Professor (previously Postdoc)

Pierre Monnin

PhD student

Andon Tchechmedjiev

Adjunct Assistant Professor (previously Postdoc)

Amine Abdaoui

Postdoc

William Digan

PhD student

Francois-Elie Calvier

Postdoc

Anita Burgun

Professor

Malika Smaïl-Tabbone

Associate Professor

Gabin Personeni

PhD Student

Amina Annane

PhD Student

Yannick Toussaint

Professor

Sandra Bringay

Professor

Amedeo Napoli

DR CNRS

Zohra Bellahsene

Professor

Stella Zevio

MSc Intern

Romain Gogdemir

MSc Intern

Graziella Husson

MSc Intern

Mehdi Mirzapour

Practice-based evidences for actioning Knowledge in Pharmacogenomics
ANR-15-CE23-0028 -- http://practikpharma.loria.fr
2016-2021