One-Step Generalized Estimating Equations With Large Cluster Sizes
Authors: Stuart Lipsitz, Garrett Fitzmaurice, Debajyoti Sinha, Nathanael Hevelone, Jim Hu, Louis L. Nguyen
Affiliation: 1. Brigham & Women’s Hospital, Boston, MA; 2. Harvard Medical School, Boston, MA; 3. Florida State University, Tallahassee, FL; 4. Cornell Medical College, New York, NY
Abstract: Medical studies increasingly involve a large sample of independent clusters in which the cluster sizes are also large. Our motivating example, from the 2010 Nationwide Inpatient Sample (NIS), has 8,001,068 patients in 1,049 clusters, with an average cluster size of 7,627. Consistent parameter estimates can be obtained by naively assuming independence, but these estimates are inefficient when the intra-cluster correlation (ICC) is high. Efficient generalized estimating equations (GEE) incorporate the ICC and, when estimating it, sum over all pairs of observations within a cluster. For the 2010 NIS there are 92.6 billion such pairs, making the summation computationally prohibitive. We propose a one-step GEE estimator that (1) matches the asymptotic efficiency of the fully iterated GEE; (2) uses a simpler formula to estimate the ICC that avoids summing over all pairs; and (3) completely avoids matrix multiplications and inversions. These three features make the proposed estimator much less computationally intensive, especially with large cluster sizes. A unique contribution of this article is that it expresses the GEE estimating equations incorporating the ICC as a simple sum of vectors and scalars.
Keywords: Clustered data, Efficient estimation, Exchangeable correlation, Fully iterated, Intra-cluster correlation
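
The abstract's point about avoiding the sum over all within-cluster pairs can be illustrated with a standard algebraic identity: for residuals r_1, ..., r_n in one cluster, the sum of r_j * r_k over all pairs j < k equals ((sum of r_j)^2 - sum of r_j^2) / 2. The sketch below is not the article's estimator; it is a minimal illustration, assuming standardized residuals with unit dispersion, a simple moment estimator of an exchangeable correlation, and no degrees-of-freedom correction. The function names `icc_pairwise` and `icc_fast` are hypothetical.

```python
import numpy as np


def icc_pairwise(resid, cluster_ids):
    """Moment estimate of an exchangeable correlation by explicitly
    summing residual products over every within-cluster pair.
    Cost grows with the square of the cluster size."""
    num = 0.0
    n_pairs = 0
    for c in np.unique(cluster_ids):
        r = resid[cluster_ids == c]
        n = len(r)
        for j in range(n):
            for k in range(j + 1, n):
                num += r[j] * r[k]
        n_pairs += n * (n - 1) // 2
    return num / n_pairs


def icc_fast(resid, cluster_ids):
    """Same estimate computed from per-cluster sums only, using
    sum_{j<k} r_j r_k = ((sum r)^2 - sum r^2) / 2.
    Cost is linear in the total number of observations."""
    _, inv, counts = np.unique(
        cluster_ids, return_inverse=True, return_counts=True
    )
    s1 = np.bincount(inv, weights=resid)       # per-cluster sum of r
    s2 = np.bincount(inv, weights=resid ** 2)  # per-cluster sum of r^2
    num = 0.5 * np.sum(s1 ** 2 - s2)
    n_pairs = np.sum(counts * (counts - 1)) / 2
    return num / n_pairs
```

Both functions should return the same value up to floating-point rounding; only the second remains practical when clusters contain thousands of observations, because it needs just two running sums per cluster.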