Abstract
Box score statistics in the National Basketball Association are used to measure and evaluate player performance. Some of these statistics are subjective in nature, and because box score statistics are recorded by scorekeepers hired by the home team for each game, there is potential for inconsistency and bias. These inconsistencies can have far-reaching consequences, particularly with the rise in popularity of daily fantasy sports. Using box score data, we estimate models that quantify both the bias and the generosity of each scorekeeper for two of the most subjective statistics: assists and blocks. We then use optical player tracking data for the 2015–2016 season to improve the assist model by including contextual spatio-temporal variables such as time of possession, player locations, and distance traveled. From this model, we present results measuring the impact of the scorekeeper and of the other contextual variables on the probability of a pass being recorded as an assist. We also present results for adjusting season assist totals to remove scorekeeper influence.
Acknowledgements
This work was partially supported by U.S. National Science Foundation grant 1461435, by DARPA under Grant No. FA8750-14-2-0117, by ARO under Grant No. W911NF-15-1-0172, by Amazon, by NSERC, and by the National Association of Basketball Coaches.
Additional information
Responsible editors: Ulf Brefeld and Albrecht Zimmermann.
Appendix: Availability of data
Sections 3 and 4 use ESPN box score data from the 2015–2016 NBA season. This data is publicly available at http://www.espn.com/nba/scoreboard. The SportVU optical player tracking data from STATS LLC used in Sect. 5, which covers the 2013–2014, 2014–2015, and 2015–2016 NBA seasons, remains proprietary. However, to address concerns of reproducibility, our lab has released one full game of tracking data, available at https://github.com/dcervone/EPVDemo/blob/master/data/2013_11_01_MIA_BKN.csv.
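For readers who want to inspect the released game programmatically, the following is a minimal sketch rather than part of the original work. The link above points to a GitHub page, so the sketch first rewrites it to the corresponding raw-file address. The helper names (to_raw_url, summarize_csv, fetch_text, main) are hypothetical, and the sketch assumes that the first row of the file is a header; it makes no assumption about the column names.

```python
import csv
import io
import urllib.request


def to_raw_url(blob_url: str) -> str:
    """Convert a GitHub 'blob' page URL into the matching raw-file URL."""
    prefix = "https://github.com/"
    if not blob_url.startswith(prefix) or "/blob/" not in blob_url:
        raise ValueError("expected https://github.com/<owner>/<repo>/blob/... URL")
    # Split "<owner>/<repo>/blob/<branch>/<path>" around the "/blob/" marker.
    owner_repo, path = blob_url[len(prefix):].split("/blob/", 1)
    return f"https://raw.githubusercontent.com/{owner_repo}/{path}"


def summarize_csv(text: str) -> dict:
    """Return the first row (assumed to be a header) and the count of remaining rows."""
    reader = csv.reader(io.StringIO(text))
    header = next(reader, [])
    n_rows = sum(1 for _ in reader)
    return {"columns": header, "n_rows": n_rows}


def fetch_text(url: str) -> str:
    """Download a URL and decode it as UTF-8 (requires network access)."""
    with urllib.request.urlopen(url) as response:
        return response.read().decode("utf-8")


def main() -> None:
    """Download the released game and print its size and column names."""
    blob = (
        "https://github.com/dcervone/EPVDemo/blob/master/data/"
        "2013_11_01_MIA_BKN.csv"
    )
    summary = summarize_csv(fetch_text(to_raw_url(blob)))
    print(summary["n_rows"], "rows;", len(summary["columns"]), "columns")
    print(summary["columns"])
```

Calling main() downloads the whole file into memory, which is the simplest option for a single game; a streaming reader would be preferable for larger collections.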
Cite this article
van Bommel, M., Bornn, L. Adjusting for scorekeeper bias in NBA box scores. Data Min Knowl Disc 31, 1622–1642 (2017). https://doi.org/10.1007/s10618-017-0497-y
DOI: https://doi.org/10.1007/s10618-017-0497-y