Estimating Box-Cox power transformation parameter via goodness-of-fit tests


Creative Commons License

Asar O., İLK DAĞ Ö., DAĞ O.

COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, cilt.46, sa.1, ss.91-105, 2017 (SCI-Expanded) identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 46 Sayı: 1
  • Basım Tarihi: 2017
  • Doi Numarası: 10.1080/03610918.2014.957839
  • Dergi Adı: COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
  • Sayfa Sayıları: ss.91-105
  • Anahtar Kelimeler: Artificial covariate, Data transformation, Normality tests, Searching algorithms, Statistical software, FALSE DISCOVERY RATE, VARIANCE TEST, APPROXIMATE ANALYSIS, NORMALITY
  • Orta Doğu Teknik Üniversitesi Adresli: Evet

Özet

Box-Cox power transformation is a commonly used methodology to transform the distribution of the data into a normal distribution. The methodology relies on a single transformation parameter. In this study, we focus on the estimation of this parameter. For this purpose, we employ seven popular goodness-of-fit tests for normality, namely Shapiro-Wilk, Anderson-Darling, Cramer-von Mises, Pearson Chi-square, Shapiro-Francia, Lilliefors and Jarque-Bera tests, together with a searching algorithm. The searching algorithm is based on finding the argument of the minimum or maximum depending on the test, i.e., maximum for the Shapiro-Wilk and Shapiro-Francia, minimum for the rest. The artificial covariate method of Dag etal. (2014) is also included for comparison purposes. Simulation studies are implemented to compare the performances of the methods. Results show that Shapiro-Wilk and the artificial covariate method are more effective than the others and Pearson Chi-square is the worst performing method. The methods are also applied to two real-life datasets. The R package AID is proposed for implementation of the aforementioned methods.