Inproceedings,

Graph cluster randomization: network exposure to multiple universes

J. Ugander, B. Karrer, L. Backstrom, and J. Kleinberg.
the 19th ACM SIGKDD international conference, page 329+. New York, New York, USA, ACM Press, (May 30, 2013)
DOI: 10.1145/2487575.2487695

Abstract

A/B testing is a standard approach for evaluating the effect of online experiments; the goal is to estimate the 'average treatment effect' of a new feature or condition by exposing a sample of the overall population to it. A drawback with A/B testing is that it is poorly suited for experiments involving social interference, when the treatment of individuals spills over to neighboring individuals along an underlying social network. In this work, we propose a novel methodology using graph clustering to analyze average treatment effects under social interference. To begin, we characterize graph-theoretic conditions under which individuals can be considered to be 'network exposed' to an experiment. We then show how graph cluster randomization admits an efficient exact algorithm to compute the probabilities for each vertex being network exposed under several of these exposure conditions. Using these probabilities as inverse weights, a Horvitz-Thompson estimator can then provide an effect estimate that is unbiased, provided that the exposure model has been properly specified. Given an estimator that is unbiased, we focus on minimizing the variance. First, we develop simple sufficient conditions for the variance of the estimator to be asymptotically small in n, the size of the graph. However, for general randomization schemes, this variance can be lower bounded by an exponential function of the degrees of a graph. In contrast, we show that if a graph satisfies a restricted-growth condition on the growth rate of neighborhoods, then there exists a natural clustering algorithm, based on vertex neighborhoods, for which the variance of the estimator can be upper bounded by a linear function of the degrees. Thus we show that proper cluster randomization can lead to exponentially lower estimator variance when experimentally measuring average treatment effects under interference.
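
The inverse-probability weighting described in the abstract can be illustrated with a small sketch. It is not the paper's algorithm: it assumes the simplest exposure condition (a vertex counts as network exposed only when it and all of its neighbors receive the same assignment) and assumes every cluster is treated independently with probability p, so that the exposure probability of a vertex is p raised to the number of distinct clusters in its closed neighborhood. The function names and the dictionary-based graph representation are hypothetical choices made for this illustration.

```python
def clusters_touched(adj, cluster_of, v):
    """Count the distinct clusters in the closed neighborhood of v."""
    return len({cluster_of[v]} | {cluster_of[u] for u in adj[v]})


def exposure_probs(adj, cluster_of, p):
    """Return {v: (Pr[exposed to treatment], Pr[exposed to control])}.

    Assumption: each cluster is treated independently with probability p,
    and exposure means the whole closed neighborhood shares one assignment.
    """
    probs = {}
    for v in adj:
        k = clusters_touched(adj, cluster_of, v)
        probs[v] = (p ** k, (1 - p) ** k)
    return probs


def horvitz_thompson(adj, cluster_of, treated, outcome, p):
    """Horvitz-Thompson style estimate of the average treatment effect.

    treated: {v: bool}, constant within each cluster.
    outcome: {v: float}, the response observed for each vertex.
    Vertices whose neighborhood is mixed contribute nothing.
    """
    probs = exposure_probs(adj, cluster_of, p)
    total = 0.0
    for v in adj:
        hood = [v, *adj[v]]
        p_treat, p_ctrl = probs[v]
        if all(treated[u] for u in hood):
            total += outcome[v] / p_treat   # exposed to treatment
        elif not any(treated[u] for u in hood):
            total -= outcome[v] / p_ctrl    # exposed to control
    return total / len(adj)


if __name__ == "__main__":
    # Path graph 0-1-2-3-4-5 split into two clusters of three vertices.
    adj = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3, 5], 5: [4]}
    cluster_of = {0: 0, 1: 0, 2: 0, 3: 1, 4: 1, 5: 1}
    treated = {v: cluster_of[v] == 0 for v in adj}   # treat cluster 0 only
    outcome = {v: 3.0 if treated[v] else 1.0 for v in adj}
    print(horvitz_thompson(adj, cluster_of, treated, outcome, p=0.5))
```

In this sketch the two boundary vertices of the path touch both clusters, so their exposure probability is p squared rather than p, and they receive a correspondingly larger weight whenever they are exposed. Averaging the estimate over all cluster assignments recovers the true effect when outcomes depend only on the exposure condition, which is the unbiasedness property the abstract refers to.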

BibTeX key: Ugander2013Graph
entry type: inproceedings
address: New York, New York, USA
booktitle: the 19th ACM SIGKDD international conference
year: 2013
month: may
day: 30
pages: 329+
publisher: ACM Press
citeulike-article-id: 12682616
isbn: 9781450321747
citeulike-linkout-2: http://arxiv.org/pdf/1305.6979
citeulike-linkout-1: http://arxiv.org/abs/1305.6979
priority: 2
posted-at: 2013-11-03 11:09:56
eprint: 1305.6979
citeulike-linkout-0: http://dx.doi.org/10.1145/2487575.2487695
archiveprefix: arXiv
location: Chicago, Illinois, USA
DOI: 10.1145/2487575.2487695
url: http://dx.doi.org/10.1145/2487575.2487695

Cite this publication

%0 Conference Paper
%1 Ugander2013Graph
%A Ugander, Johan
%A Karrer, Brian
%A Backstrom, Lars
%A Kleinberg, Jon
%B the 19th ACM SIGKDD international conference
%C New York, New York, USA
%D 2013
%I ACM Press
%K statistics social-networks algorithms k-core clustering
%P 329+
%R 10.1145/2487575.2487695
%T Graph cluster randomization: network exposure to multiple universes
%U http://dx.doi.org/10.1145/2487575.2487695
%@ 9781450321747
