Building a PubMed knowledge graph

Full metadata record
DC Field | Value | Language
dc.contributor.author | Xu, Jian | -
dc.contributor.author | Kim, Sunkyu | -
dc.contributor.author | Song, Min | -
dc.contributor.author | Jeong, Minbyul | -
dc.contributor.author | Kim, Donghyeon | -
dc.contributor.author | Kang, Jaewoo | -
dc.contributor.author | Rousseau, Justin F. | -
dc.contributor.author | Li, Xin | -
dc.contributor.author | Xu, Weijia | -
dc.contributor.author | Torvik, Vetle I. | -
dc.contributor.author | Bu, Yi | -
dc.contributor.author | Chen, Chongyan | -
dc.contributor.author | Ebeid, Islam Akef | -
dc.contributor.author | Li, Daifeng | -
dc.contributor.author | Ding, Ying | -
dc.date.accessioned | 2021-08-30T20:36:44Z | -
dc.date.available | 2021-08-30T20:36:44Z | -
dc.date.created | 2021-06-18 | -
dc.date.issued | 2020-06-26 | -
dc.identifier.issn | 2052-4463 | -
dc.identifier.uri | https://scholar.korea.ac.kr/handle/2021.sw.korea/54966 | -
dc.description.abstract | PubMed® is an essential resource for the medical domain, but useful concepts are either difficult to extract or are ambiguous, which has significantly hindered knowledge discovery. To address this issue, we constructed a PubMed knowledge graph (PKG) by extracting bio-entities from 29 million PubMed abstracts, disambiguating author names, integrating funding data through the National Institutes of Health (NIH) ExPORTER, collecting affiliation history and educational background of authors from ORCID®, and identifying fine-grained affiliation data from MapAffil. Through the integration of these credible multi-source data, we could create connections among the bio-entities, authors, articles, affiliations, and funding. Data validation revealed that the BioBERT deep learning method of bio-entity extraction significantly outperformed the state-of-the-art models based on the F1 score (by 0.51%), with the author name disambiguation (AND) achieving an F1 score of 98.09%. PKG can trigger broader innovations, not only enabling us to measure scholarly impact, knowledge usage, and knowledge transfer, but also assisting us in profiling authors and organizations based on their connections with bio-entities. | -
dc.language | English | -
dc.language.iso | en | -
dc.publisher | NATURE PUBLISHING GROUP | -
dc.subject | DATABASE | -
dc.subject | DISAMBIGUATION | -
dc.subject | RECOGNITION | -
dc.subject | SYSTEM | -
dc.subject | GENES | -
dc.subject | CGRP | -
dc.title | Building a PubMed knowledge graph | -
dc.type | Article | -
dc.contributor.affiliatedAuthor | Kang, Jaewoo | -
dc.identifier.doi | 10.1038/s41597-020-0543-2 | -
dc.identifier.wosid | 000545966200002 | -
dc.identifier.bibliographicCitation | SCIENTIFIC DATA, v.7, no.1 | -
dc.relation.isPartOf | SCIENTIFIC DATA | -
dc.citation.title | SCIENTIFIC DATA | -
dc.citation.volume | 7 | -
dc.citation.number | 1 | -
dc.type.rims | ART | -
dc.type.docType | Article; Data Paper | -
dc.description.journalClass | 1 | -
dc.description.journalRegisteredClass | scie | -
dc.description.journalRegisteredClass | scopus | -
dc.relation.journalResearchArea | Science & Technology - Other Topics | -
dc.relation.journalWebOfScienceCategory | Multidisciplinary Sciences | -
dc.subject.keywordPlus | DATABASE | -
dc.subject.keywordPlus | DISAMBIGUATION | -
dc.subject.keywordPlus | RECOGNITION | -
dc.subject.keywordPlus | SYSTEM | -
dc.subject.keywordPlus | GENES | -
dc.subject.keywordPlus | CGRP | -
Appears in Collections
Graduate School > Department of Computer Science and Engineering > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Kang, Jae woo
Department of Computer Science and Engineering