TY - JOUR
T1 - Modelling interval data with Normal and Skew-Normal distributions
AU - Brito, Paula
AU - Silva, A. Pedro Duarte
N1 - Copyright:
Copyright 2012 Elsevier B.V., All rights reserved.
PY - 2012/1
Y1 - 2012/1
N2 - A parametric modelling for interval data is proposed, assuming a multivariate Normal or Skew-Normal distribution for the midpoints and log-ranges of the interval variables. The intrinsic nature of the interval variables leads to special structures of the variance-covariance matrix, which is represented by five different possible configurations. Maximum likelihood estimation for both models under all considered configurations is studied. The proposed modelling is then considered in the context of analysis of variance and multivariate analysis of variance testing. To access the behaviour of the proposed methodology, a simulation study is performed. The results show that, for medium or large sample sizes, tests have good power and their true significance level approaches nominal levels when the constraints assumed for the model are respected; however, for small samples, sizes close to nominal levels cannot be guaranteed. Applications to Chinese meteorological data in three different regions and to credit card usage variables for different card designations, illustrate the proposed methodology.
AB - A parametric modelling for interval data is proposed, assuming a multivariate Normal or Skew-Normal distribution for the midpoints and log-ranges of the interval variables. The intrinsic nature of the interval variables leads to special structures of the variance-covariance matrix, which is represented by five different possible configurations. Maximum likelihood estimation for both models under all considered configurations is studied. The proposed modelling is then considered in the context of analysis of variance and multivariate analysis of variance testing. To access the behaviour of the proposed methodology, a simulation study is performed. The results show that, for medium or large sample sizes, tests have good power and their true significance level approaches nominal levels when the constraints assumed for the model are respected; however, for small samples, sizes close to nominal levels cannot be guaranteed. Applications to Chinese meteorological data in three different regions and to credit card usage variables for different card designations, illustrate the proposed methodology.
KW - ANOVA
KW - MANOVA
KW - Parametric modelling of interval data
KW - Skew-Normal distribution
KW - Statistical tests for interval data
KW - Symbolic data
UR - http://www.scopus.com/inward/record.url?scp=84858216480&partnerID=8YFLogxK
U2 - 10.1080/02664763.2011.575125
DO - 10.1080/02664763.2011.575125
M3 - Article
AN - SCOPUS:84858216480
SN - 0266-4763
VL - 39
SP - 3
EP - 20
JO - Journal of Applied Statistics
JF - Journal of Applied Statistics
IS - 1
ER -