1.Improving accessibility and distinction between negative results in biomedical relation extraction
Diana SOUSA ; Andre LAMURIAS ; Francisco M. COUTO
Genomics & Informatics 2020;18(2):e20-
Accessible negative results are relevant for researchers and clinicians not only to limit their search space but also to prevent the costly re-exploration of research hypotheses. However, most biomedical relation extraction datasets do not seek to distinguish between a false and a negative relation among two biomedical entities. Furthermore, datasets created using distant supervision techniques also have some false negative relations that constitute undocumented/unknown relations (missing from a knowledge base). We propose to improve the distinction between these concepts, by revising a subset of the relations marked as false on the phenotype-gene relations corpus and give the first steps to automatically distinguish between the false (F), negative (N), and unknown (U) results. Our work resulted in a sample of 127 manually annotated FNU relations and a weighted-F1 of 0.5609 for their automatic distinction. This work was developed during the 6th Biomedical Linked Annotation Hackathon (BLAH6).
2.COVID-19 recommender system based on an annotated multilingual corpus
Márcia BARROS ; Pedro RUAS ; Diana SOUSA ; Ali Haider BANGASH ; Francisco M. COUTO
Genomics & Informatics 2021;19(3):e24-
Tracking the most recent advances in Coronavirus disease 2019 (COVID-19)‒related research is essential, given the disease's novelty and its impact on society. However, with the publication pace speeding up, researchers and clinicians require automatic approaches to keep up with the incoming information regarding this disease. A solution to this problem requires the development of text mining pipelines; the efficiency of which strongly depends on the availability of curated corpora. However, there is a lack of COVID-19‒related corpora, even more, if considering other languages besides English. This project's main contribution was the annotation of a multilingual parallel corpus and the generation of a recommendation dataset (EN-PT and EN-ES) regarding relevant entities, their relations, and recommendation, providing this resource to the community to improve the text mining research on COVID-19‒related literature. This work was developed during the 7th Biomedical Linked Annotation Hackathon (BLAH7).
3.Improving accessibility and distinction between negative results in biomedical relation extraction
Diana SOUSA ; Andre LAMURIAS ; Francisco M. COUTO
Genomics & Informatics 2020;18(2):e20-
Accessible negative results are relevant for researchers and clinicians not only to limit their search space but also to prevent the costly re-exploration of research hypotheses. However, most biomedical relation extraction datasets do not seek to distinguish between a false and a negative relation among two biomedical entities. Furthermore, datasets created using distant supervision techniques also have some false negative relations that constitute undocumented/unknown relations (missing from a knowledge base). We propose to improve the distinction between these concepts, by revising a subset of the relations marked as false on the phenotype-gene relations corpus and give the first steps to automatically distinguish between the false (F), negative (N), and unknown (U) results. Our work resulted in a sample of 127 manually annotated FNU relations and a weighted-F1 of 0.5609 for their automatic distinction. This work was developed during the 6th Biomedical Linked Annotation Hackathon (BLAH6).
4.The Value of Transcranial Doppler Sonography in Hyperperfusion Syndrome after Carotid Artery Stenting: A Nationwide Prospective Study
Francisco MONICHE ; Irene ESCUDERO-MARTÍNEZ ; Fernando MANCHA ; Alejandro TOMASELLO ; Marc RIBÓ ; Fernando DELGADO-ACOSTA ; Juán José OCHOA ; Joaquín GIL ; Rosario GIL ; Montserrat GONZÁLEZ-DELGADO ; Eduardo MURIAS ; Alain LUNA ; Alberto GIL ; Sonia MOSTEIRO ; María Dolores FERNÁNDEZ-COUTO ; Luis Fernández de ALARCÓN ; José M. RAMÍREZ-MORENO ; Joaquín ZAMARRO ; Guillermo PARRILLA ; José L. CANIEGO ; Gustavo ZAPATA-WAINBERG ; Andrés GONZÁLEZ-MANDLY ; José A. de las HERAS ; Luis LÓPEZ-MESONERO ; Joaquín ORTEGA ; Juan F. ARENILLAS ; Ernesto GARCÍA ; Pedro P. ALCÁZAR ; Elena ZAPATA-ARRIAZA ; Asier de ALBÓNIGA-CHINDURZA ; Juan Antonio CABEZAS ; Pilar ALGABA ; Aurelio CAYUELA ; Joan MONTANER ; Alejandro González GARCÍA
Journal of Stroke 2020;22(2):254-257