Sequence tagging for biomedical extractive question answeringopen access
- Authors
- Yoon, Wonjin; Jackson, Richard; Lagerberg, Aron; Kang, Jaewoo
- Issue Date
- 2-8월-2022
- Publisher
- OXFORD UNIV PRESS
- Citation
- BIOINFORMATICS, v.38, no.15, pp.3794 - 3801
- Indexed
- SCIE
SCOPUS
- Journal Title
- BIOINFORMATICS
- Volume
- 38
- Number
- 15
- Start Page
- 3794
- End Page
- 3801
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/146612
- DOI
- 10.1093/bioinformatics/btac397
- ISSN
- 1367-4803
- Abstract
- Motivation: Current studies in extractive question answering (EQA) have modeled the single-span extraction setting, where a single answer span is a label to predict for a given question-passage pair. This setting is natural for general domain EQA as the majority of the questions in the general domain can be answered with a single span. Following general domain EQA models, current biomedical EQA (BioEQA) models utilize the single-span extraction setting with post-processing steps. Results: In this article, we investigate the question distribution across the general and biomedical domains and discover biomedical questions are more likely to require list-type answers (multiple answers) than factoid-type answers (single answer). This necessitates the models capable of producing multiple answers for a question. Based on this preliminary study, we propose a sequence tagging approach for BioEQA, which is a multi-span extraction setting. Our approach directly tackles questions with a variable number of phrases as their answer and can learn to decide the number of answers for a question from training data. Our experimental results on the BioASQ 7b and 8b list-type questions outperformed the best-performing existing models without requiring post-processing steps.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - Graduate School > Department of Computer Science and Engineering > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.