Exploring semantic deep learning for building reliable and reusable one health knowledge from PubMed systematic reviews and veterinary clinical notes

Arguello-Casteleiro, M.; Stevens, R.; Des Diz, José Julio; Wroe, C.; Fernandez-Prieto, M. J.; Maroto, N.; Maseda-Fernandez, D.; Demetriou, G.; Peters, S.; Noble, P. J. M.; Jones, P. H.; Dukes-McEwan, J.; Radford, A. D.; Keane, J.; Nenadic, G.

doi:http://dx.doi.org/10.1186/s13326-019-0212-6

Arguello-Casteleiro, M.; Stevens, R.; Des Diz, José Julio; Wroe, C.; Fernandez-Prieto, M. J.; Maroto, N.; Maseda-Fernandez, D.; Demetriou, G.; Peters, S.; Noble, P. J. M.; Jones, P. H.; Dukes-McEwan, J.; Radford, A. D.; Keane, J.; Nenadic, G.

Estadísticas

Ver Estadísticas de uso

Identificadores

URI: http://hdl.handle.net/20.500.11940/15971

PMID: 31711540

DOI: http://dx.doi.org/10.1186/s13326-019-0212-6

ISSN: 2041-1480

Registro completo

Servicios

Visualización o descarga de ficheros

J Biomed Semantics. 2019 Nov 12;10(Suppl 1):22. (3.365Mb)

Fecha de publicación

2019

Título de revista

J Biomed Semantics

Tipo de contenido

Artigo

DeCS

ontologías biológicas | bases del conocimiento

MeSH

Knowledge Bases | Biological Ontologies

Resumen

BACKGROUND: Deep Learning opens up opportunities for routinely scanning large bodies of biomedical literature and clinical narratives to represent the meaning of biomedical and clinical terms. However, the validation and integration of this knowledge on a scale requires cross checking with ground truths (i.e. evidence-based resources) that are unavailable in an actionable or computable form. In this paper we explore how to turn information about diagnoses, prognoses, therapies and other clinical concepts into computable knowledge using free-text data about human and animal health. We used a Semantic Deep Learning approach that combines the Semantic Web technologies and Deep Learning to acquire and validate knowledge about 11 well-known medical conditions mined from two sets of unstructured free-text data: 300 K PubMed Systematic Review articles (the PMSB dataset) and 2.5 M veterinary clinical notes (the VetCN dataset). For each target condition we obtained 20 related clinical concepts using two deep learning methods applied separately on the two datasets, resulting in 880 term pairs (target term, candidate term). Each concept, represented by an n-gram, is mapped to UMLS using MetaMap; we also developed a bespoke method for mapping short forms (e.g. abbreviations and acronyms). Existing ontologies were used to formally represent associations. We also create ontological modules and illustrate how the extracted knowledge can be queried. The evaluation was performed using the content within BMJ Best Practice. RESULTS: MetaMap achieves an F measure of 88% (precision 85%, recall 91%) when applied directly to the total of 613 unique candidate terms for the 880 term pairs. When the processing of short forms is included, MetaMap achieves an F measure of 94% (precision 92%, recall 96%). Validation of the term pairs with BMJ Best Practice yields precision between 98 and 99%. CONCLUSIONS: The Semantic Deep Learning approach can transform neural embeddings built from unstructured free-text data into reliable and reusable One Health knowledge using ontologies and content from BMJ Best Practice.

Excepto si se señala otra cosa, la licencia del ítem se describe como Atribución 4.0 Internacional

Repositorio digital RUNA