Robust multivariate mixture regression models with incomplete data
- Authors
- Lim, Hwa Kyung; Narisetty, Naveen N.; Cheon, Sooyoung
- Issue Date
- Jan-2017
- Publisher
- TAYLOR & FRANCIS LTD
- Keywords
- Incomplete data; outlier; multivariate mixture regression models; multivariate t-distribution; robust clustering
- Citation
- JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, v.87, no.2, pp.328 - 347
- Indexed
- SCIE; SCOPUS
- Journal Title
- JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION
- Volume
- 87
- Number
- 2
- Start Page
- 328
- End Page
- 347
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/85142
- DOI
- 10.1080/00949655.2016.1209198
- ISSN
- 0094-9655
- Abstract
- Multivariate mixture regression models can be used to investigate the relationships between two or more response variables and a set of predictor variables while accounting for unobserved population heterogeneity. It is common to take multivariate normal distributions as mixing components, but this mixing model is sensitive to heavy-tailed errors and outliers. Although normal mixture models can approximate any distribution in principle, the number of components needed to account for heavy-tailed distributions can be very large. Mixture regression models based on multivariate t-distributions can be considered as a robust alternative. Missing data are inevitable in many situations, and parameter estimates can be biased if the missing values are not handled properly. In this paper, we propose a multivariate t mixture regression model with missing information to model heterogeneity in the regression function in the presence of outliers and missing values. Along with robust parameter estimation, our proposed method can be used for (i) visualization of the partial correlation between response variables across latent classes and heterogeneous regressions, and (ii) outlier detection and robust clustering even in the presence of missing values. We also propose a multivariate t mixture regression model using MM-estimation with missing information that is robust to high-leverage outliers. The proposed methodologies are illustrated through simulation studies and real data analysis.
- Appears in Collections
- Graduate School > Department of Applied Statistics > 1. Journal Articles
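
The abstract argues that t-distributed mixing components are less sensitive to outliers than normal ones, but this record does not include the estimation details. The sketch below is not the authors' algorithm. It only illustrates the standard EM weight for a single multivariate t component, u_i = (nu + p) / (nu + d_i^2), where d_i^2 is the squared Mahalanobis distance of the i-th residual. The function name `t_weights`, the toy residuals, the identity scatter matrix, and nu = 4 are all assumptions made for illustration, and missing values are not handled here.

```python
import numpy as np


def t_weights(residuals, scatter, nu):
    """Standard EM weights u_i = (nu + p) / (nu + d_i^2) for one multivariate t component.

    residuals: (n, p) array of response residuals y_i - fitted_i (hypothetical inputs)
    scatter:   (p, p) positive-definite scatter matrix
    nu:        degrees of freedom of the t-distribution
    """
    residuals = np.atleast_2d(np.asarray(residuals, dtype=float))
    p = residuals.shape[1]
    # Squared Mahalanobis distance d_i^2 = r_i^T Sigma^{-1} r_i, computed without an explicit inverse.
    solved = np.linalg.solve(scatter, residuals.T).T
    d2 = np.einsum("ij,ij->i", residuals, solved)
    return (nu + p) / (nu + d2)


# Toy data (assumed): two ordinary residuals and one gross outlier.
residuals = np.array([[0.1, -0.2], [0.0, 0.3], [8.0, -7.0]])
scatter = np.eye(2)
weights = t_weights(residuals, scatter, nu=4.0)
print(weights)
```

Observations far from the fitted regression receive small weights, so they contribute little to the weighted updates of the regression coefficients and scatter matrix. Under a normal component every observation would have weight 1, which is one way to see why the normal mixture is more sensitive to outliers.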