Entropy based constrained inference for some HDLSS genomic models: UI tests in a Chen-Stein perspective |
| |
Authors: | Ming-Tien Tsai Pranab Kumar Sen |
| |
Affiliation: | a Institute of Statistical Science, Academia Sinica, Taipei 11529, Taiwan, ROC b Department of Biostatistics, University of North Carolina, Chapel Hill, NC 27599-7420, USA c Department of Statistics and Operations Research, University of North Carolina, Chapel Hill, NC 27599-7420, USA |
| |
Abstract: | For qualitative data models, Gini-Simpson index and Shannon entropy are commonly used for statistical analysis. In the context of high-dimensional low-sample size (HDLSS) categorical models, abundant in genomics and bioinformatics, the Gini-Simpson index, as extended to Hamming distance in a pseudo-marginal setup, facilitates drawing suitable statistical conclusions. Under Lorenz ordering it is shown that Shannon entropy and its multivariate analogues proposed here appear to be more informative than the Gini-Simpson index. The nested subset monotonicity prospect along with subgroup decomposability of some proposed measures are exploited. The usual jackknifing (or bootstrapping) methods may not work out well for HDLSS constrained models. Hence, we consider a permutation method incorporating the union-intersection (UI) principle and Chen-Stein Theorem to formulate suitable statistical hypothesis testing procedures for gene classification. Some applications are included as illustration. |
| |
Keywords: | 62F30 62H15 |
本文献已被 ScienceDirect 等数据库收录! |
|