Multivariate approach for protein identification based on mass spectrometric data
- Authors
- Lee, Jung Bok; Lee, Jae Won
- Issue Date
- 12월-2013
- Publisher
- SAGE PUBLICATIONS LTD
- Keywords
- protein identification; mass spectrometry; generalized linear mixed model; two-part model
- Citation
- STATISTICAL METHODS IN MEDICAL RESEARCH, v.22, no.6, pp.553 - 566
- Indexed
- SCIE
SCOPUS
- Journal Title
- STATISTICAL METHODS IN MEDICAL RESEARCH
- Volume
- 22
- Number
- 6
- Start Page
- 553
- End Page
- 566
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/101399
- DOI
- 10.1177/0962280211434960
- ISSN
- 0962-2802
- Abstract
- Protein mass spectrometry provides a powerful tool for detecting and identifying proteins. Several database searching algorithms may be used for this purpose. However, most of them depend on the heuristic approaches and the use of probability-based or statistical approach was very restrictive in the current algorithms. In this study, we present a statistical modelling of scores based on a generalized linear mixed model and provide a feasible computation method using penalized generalized weighted least squares. This model incorporates the dependency among matches into a new statistical scoring function, and uses the beta-binomial distribution to derive the score. Based on simulation experiments and analysis using real examples, we have improved protein searching performance and provided feasible computation procedures to deal with very large datasets. In particular, our methods may significantly increase accuracy in identifying medium and small proteins.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Political Science & Economics > Department of Statistics > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.