首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Measuring Lineup Difficulty By Matching Distance Metrics With Subject Choices in Crowd-Sourced Data
Authors:Niladri Roy Chowdhury  Dianne Cook  Heike Hofmann  Mahbubul Majumder
Institution:1. Biometrics and Data Management, Novartis Oncology, Cambridge, MA;2. Department of Econometrics and Business Statistics, Monash University, Clayton, Australia;3. Department of Statistics and Statistical Laboratory, Iowa State University, Ames, IA;4. Department of Mathematics, University of Nebraska–Omaha, Omaha, NE
Abstract:Graphics play a crucial role in statistical analysis and data mining. Being able to quantify structure in data that is visible in plots, and how people read the structure from plots is an ongoing challenge. The lineup protocol provides a formal framework for data plots, making inference possible. The data plot is treated like a test statistic, and lineup protocol acts like a comparison with the sampling distribution of the nulls. This article describes metrics for describing structure in data plots and evaluates them in relation to the choices that human readers made during several large Amazon Turk studies using lineups. The metrics that were more specific to the plot types tended to better match subject choices, than generic metrics. The process that we followed to evaluate metrics will be useful for general development of numerically measuring structure in plots, and also in future experiments on lineups for choosing blocks of pictures. Supplementary materials for this article are available online.
Keywords:Cognitive perception  Data mining  Data science  Data visualization  Distance metrics  Exploratory data analysis  Information visualization  Statistical graphics  Visual inference
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号