Shapley curves, nonparametric statistics, explainable ML, additive models, smoothing techniques
Originating from cooperative game theory, Shapley values have become one of the most widely used measures for variable importance in applied Machine Learning. However, the statistical understanding of Shapley values is still limited. In this paper, we take a nonparametric (or smoothing) perspective by introducing Shapley curves as a local measure of variable importance. We propose two estimation strategies and derive the consistency and asymptotic normality both under independence and dependence among the features. This allows us to construct confidence intervals and conduct inference on the estimated Shapley curves. The asymptotic results are validated in extensive experiments. In an empirical application, we analyze which attributes drive the prices of vehicles.
Hey there! I am a PhD student in statistics at HU Berlin. Please feel free to contact me at any time via miftachr@hu-berlin.de. My research interests are explainability, nonparametric statistics and inference.