skip to main content
10.1145/3178876.3186151acmotherconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
research-article
Free Access

Learning Causal Effects From Many Randomized Experiments Using Regularized Instrumental Variables

Published:23 April 2018Publication History

ABSTRACT

Scientific and business practices are increasingly resulting in large collections of randomized experiments. Analyzed together multiple experiments can tell us things that individual experiments cannot. We study how to learn causal relationships between variables from the kinds of collections faced by modern data scientists: the number of experiments is large, many experiments have very small effects, and the analyst lacks metadata (e.g., descriptions of the interventions). We use experimental groups as instrumental variables (IV) and show that a standard method (two-stage least squares) is biased even when the number of experiments is infinite. We show how a sparsity-inducing l0 regularization can (in a reversal of the standard bias--variance tradeoff) reduce bias (and thus error) of interventional predictions. We are interested in estimating causal effects, rather than just predicting outcomes, so we also propose a modified cross-validation procedure (IVCV) to feasibly select the regularization parameter. We show, using a trick from Monte Carlo sampling, that IVCV can be done using summary statistics instead of raw data. This makes our full procedure simple to use in many real-world applications.

References

  1. Joshua D Angrist, Guido W Imbens, and Donald B Rubin. 1996. Identification of causal effects using instrumental variables. J. Amer. Statist. Assoc. Vol. 91, 434 (1996), 444--455.Google ScholarGoogle ScholarCross RefCross Ref
  2. Joshua D Angrist and Alan B Krueger. 1995. Split-sample instrumental variables estimates of the return to schooling. Journal of Business & Economic Statistics Vol. 13, 2 (1995), 225--235.Google ScholarGoogle Scholar
  3. Joshua D Angrist and Jörn-Steffen Pischke. 2008. Mostly Harmless Econometrics: An Empiricist's Companion. Princeton university press.Google ScholarGoogle Scholar
  4. Susan Athey and Guido Imbens. 2016. Recursive partitioning for heterogeneous causal effects. Proceedings of the National Academy of Sciences, Vol. 113, 27 (2016), 7353--7360.Google ScholarGoogle ScholarCross RefCross Ref
  5. E. Bakshy, D. Eckles, and M. S. Bernstein. 2014. Designing and Deploying Online Field Experiments. Proceedings of the 23rd ACM conference on the World Wide Web. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Abhijit Banerjee and Esther Duflo. 2012. Poor Economics: A Radical Rethinking of the Way to Fight Global Poverty. PublicAffairs.Google ScholarGoogle Scholar
  7. Paul A Bekker. 1994. Alternative approximations to the distributions of instrumental variable estimators. Econometrica: Journal of the Econometric Society (1994), 657--681.Google ScholarGoogle Scholar
  8. Alexandre Belloni, Daniel Chen, Victor Chernozhukov, and Christian Hansen. 2012. Sparse models and methods for optimal instruments with an application to eminent domain. Econometrica, Vol. 80, 6 (2012), 2369--2429.Google ScholarGoogle ScholarCross RefCross Ref
  9. Léon Bottou. 2014. From machine learning to machine reasoning. Machine Learning, Vol. 94, 2 (2014), 133--149. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Léon Bottou, Jonas Peters, Joaquin Quinonero Candela, Denis Xavier Charles, Max Chickering, Elon Portugaly, Dipankar Ray, Patrice Y Simard, and Ed Snelson. 2013. Counterfactual reasoning and learning systems: The example of computational advertising. Journal of Machine Learning Research Vol. 14, 1 (2013), 3207--3260. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Bob Carpenter, Andrew Gelman, Matt Hoffman, Daniel Lee, Ben Goodrich, Michael Betancourt, Michael A Brubaker, Jiqiang Guo, Peter Li, and Allen Riddell. 2016. Stan: A probabilistic programming language. Journal of Statistical Software (2016).Google ScholarGoogle Scholar
  12. Gary Chamberlain and Guido Imbens. 2004. Random effects estimators with many instrumental variables. Econometrica, Vol. 72, 1 (2004), 295--306.Google ScholarGoogle ScholarCross RefCross Ref
  13. Dean Eckles, René F Kizilcec, and Eytan Bakshy. 2016. Estimating peer effects in networks with peer encouragement designs. Proceedings of the National Academy of Sciences, Vol. 113, 27 (2016), 7316--7322.Google ScholarGoogle ScholarCross RefCross Ref
  14. Ziv Epstein, Alexander Peysakhovich, and David G Rand. 2016. The good, the bad, and the unflinchingly selfish: Cooperative decision-making can be predicted with high accuracy when using only three behavioral types Proceedings of the 2016 ACM Conference on Economics and Computation. ACM, 547--559. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. John C Gittins. 1979. Bandit processes and dynamic allocation indices. Journal of the Royal Statistical Society. Series B (Methodological) (1979), 148--177.Google ScholarGoogle Scholar
  16. Mathew Goldman and Justin M Rao. 2014. Experiments as Instruments: Heterogeneous Position Effects in Sponsored Search Auctions. Available at SSRN 2524688 (2014).Google ScholarGoogle Scholar
  17. Donald P Green, Shang E Ha, and John G Bullock. 2010. Enough already about “black box” experiments: Studying mediation is more difficult than most scholars suppose. The Annals of the American Academy of Political and Social Science, Vol. 628, 1 (2010), 200--208.Google ScholarGoogle ScholarCross RefCross Ref
  18. Justin Grimmer, Solomon Messing, and Sean J Westwood. 2014. Estimating heterogeneous treatment effects and the effects of heterogeneous treatments with ensemble methods. Unpublished manuscript, Stanford University, Stanford, CA (2014).Google ScholarGoogle Scholar
  19. Christian Hansen and Damian Kozbur. 2014. Instrumental variables estimation with many weak instruments using regularized JIVE. Journal of Econometrics Vol. 182, 2 (2014), 290--308.Google ScholarGoogle ScholarCross RefCross Ref
  20. Jason Hartford, Greg Lewis, Kevin Leyton-Brown, and Matt Taddy. 2016. Counterfactual Prediction with Deep Instrumental Variables Networks. arXiv preprint arXiv:1612.09596 (2016).Google ScholarGoogle Scholar
  21. Lars G Hemkens, Despina G Contopoulos-Ioannidis, and John PA Ioannidis. 2016. Agreement of treatment effects for mortality from routinely collected data and subsequent randomized trials: Meta-epidemiological survey. British Medical Journal Vol. 352 (2016).Google ScholarGoogle Scholar
  22. Kosuke Imai, Dustin Tingley, and Teppei Yamamoto. 2013. Experimental designs for identifying causal mechanisms. Journal of the Royal Statistical Society: Series A (Statistics in Society), Vol. 176, 1 (2013), 5--51.Google ScholarGoogle ScholarCross RefCross Ref
  23. Guido Imbens, Joshua Angrist, and Alan Krueger. 1999. Jackknife Instrumental Variables Estimation. Journal of Applied Econometrics Vol. 14, 1 (1999).Google ScholarGoogle Scholar
  24. Jongbin Jung, Connor Concannon, Ravi Shroff, Sharad Goel, and Daniel G Goldstein. 2017. Simple rules for complex decisions. arXiv preprint arXiv:1702.04690 (2017).Google ScholarGoogle Scholar
  25. Ron Kohavi, Alex Deng, Brian Frasca, Toby Walker, Ya Xu, and Nils Pohlmann. 2013. Online controlled experiments at large scale. In Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 1168--1176. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Robert J LaLonde. 1986. Evaluating the econometric evaluations of training programs with experimental data. The American Economic Review (1986), 604--620.Google ScholarGoogle Scholar
  27. Finnian Lattimore, Tor Lattimore, and Mark D Reid. 2016. Causal Bandits: Learning Good Interventions via Causal Inference Advances in Neural Information Processing Systems. 1181--1189. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Lihong Li, Wei Chu, John Langford, and Robert E Schapire. 2010. A contextual-bandit approach to personalized news article recommendation Proceedings of the 19th international conference on World wide web. ACM, 661--670. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Michelle N Meyer. 2015. Two cheers for corporate experimentation: The A/B illusion and the virtues of data-driven innovation. J. on Telecomm. & High Tech. L. Vol. 13 (2015), 273.Google ScholarGoogle Scholar
  30. Art B. Owen. 2016. Monte Carlo Theory, Methods and Examples. http://statweb.stanford.edu/ owen/mc/Google ScholarGoogle Scholar
  31. Judea Pearl. 2009. Causality. Cambridge University Press.Google ScholarGoogle Scholar
  32. Alexander Peysakhovich and Jeffrey Naecker. 2017. Using methods from machine learning to evaluate behavioral models of choice under risk and ambiguity. Journal of Economic Behavior & Organization Vol. 133 (2017), 373--384.Google ScholarGoogle ScholarCross RefCross Ref
  33. Olav Reiersöl. 1945. Confluence analysis by means of instrumental sets of variables. Ph.D. Dissertation. bibinfoschoolStockholm College.Google ScholarGoogle Scholar
  34. Uri Shalit, Fredrik Johansson, and David Sontag. 2016. Bounding and Minimizing Counterfactual Error. arXiv preprint arXiv:1606.03976 (2016).Google ScholarGoogle Scholar
  35. Douglas Staiger and James H Stock. 1997. Instrumental Variables Regression with Weak Instruments. Econometrica (1997), 557--586.Google ScholarGoogle Scholar
  36. James H Stock, Jonathan H Wright, and Motohiro Yogo. 2012. A survey of weak instruments and weak identification in generalized method of moments. Journal of Business & Economic Statistics (2012).Google ScholarGoogle Scholar
  37. James H Stock and Motohiro Yogo. 2005. Testing for weak instruments in linear IV regression. Identification and Inference for Econometric Models: Essays in Honor of Thomas Rothenberg. Cambridge University Press, 80--108.Google ScholarGoogle Scholar
  38. Richard S Sutton and Andrew G Barto. 1998. Reinforcement learning: An introduction. Vol. Vol. 1. MIT press Cambridge. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Hal Varian. 2016. Intelligent Technology. Finance and Development Vol. 53, 3 (2016).Google ScholarGoogle Scholar
  40. Jeffrey M Wooldridge. 2010. Econometric Analysis of Cross Section and Panel Data. MIT Press.Google ScholarGoogle Scholar
  41. Philip Green Wright. 1928. The Tariff on Animal and Vegetable Oils. The Macmillan Co.Google ScholarGoogle Scholar
  42. Ya Xu, Nanyu Chen, Addrian Fernandez, Omar Sinno, and Anmol Bhasin. 2015. From infrastructure to culture: A/B testing challenges in large scale social networks Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 2227--2236. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Learning Causal Effects From Many Randomized Experiments Using Regularized Instrumental Variables

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          WWW '18: Proceedings of the 2018 World Wide Web Conference
          April 2018
          2000 pages
          ISBN:9781450356398

          Copyright © 2018 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          International World Wide Web Conferences Steering Committee

          Republic and Canton of Geneva, Switzerland

          Publication History

          • Published: 23 April 2018

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          WWW '18 Paper Acceptance Rate170of1,155submissions,15%Overall Acceptance Rate1,899of8,196submissions,23%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        HTML Format

        View this article in HTML Format .

        View HTML Format