copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Going beyond accuracy: estimating homophily in social networks using predictions

G. Berry, A. Sirianni, I. Weber, J. An, and M. Macy. (2020)cite arxiv:2001.11171Comment: 19 pages, 4 figures, 2 tables.

Abstract

In online social networks, it is common to use predictions of node categories to estimate measures of homophily and other relational properties. However, online social network data often lacks basic demographic information about the nodes. Researchers must rely on predicted node attributes to estimate measures of homophily, but little is known about the validity of these measures. We show that estimating homophily in a network can be viewed as a dyadic prediction problem, and that homophily estimates are unbiased when dyad-level residuals sum to zero in the network. Node-level prediction models, such as the use of names to classify ethnicity or gender, do not generally have this property and can introduce large biases into homophily estimates. Bias occurs due to error autocorrelation along dyads. Importantly, node-level classification performance is not a reliable indicator of estimation accuracy for homophily. We compare estimation strategies that make predictions at the node and dyad levels, evaluating performance in different settings. We propose a novel "ego-alter" modeling approach that outperforms standard node and dyad classification strategies. While this paper focuses on homophily, results generalize to other relational measures which aggregate predictions along the dyads in a network. We conclude with suggestions for research designs to study homophily in online networks. Code for this paper is available at https://github.com/georgeberry/autocorr.

Links and resources

BibTeX key: berry2020going
entry type: misc
year: 2020
url: http://arxiv.org/abs/2001.11171
note: cite arxiv:2001.11171Comment: 19 pages, 4 figures, 2 tables

Cite this publication

@misc{berry2020going, abstract = {In online social networks, it is common to use predictions of node categories to estimate measures of homophily and other relational properties. However, online social network data often lacks basic demographic information about the nodes. Researchers must rely on predicted node attributes to estimate measures of homophily, but little is known about the validity of these measures. We show that estimating homophily in a network can be viewed as a dyadic prediction problem, and that homophily estimates are unbiased when dyad-level residuals sum to zero in the network. Node-level prediction models, such as the use of names to classify ethnicity or gender, do not generally have this property and can introduce large biases into homophily estimates. Bias occurs due to error autocorrelation along dyads. Importantly, node-level classification performance is not a reliable indicator of estimation accuracy for homophily. We compare estimation strategies that make predictions at the node and dyad levels, evaluating performance in different settings. We propose a novel "ego-alter" modeling approach that outperforms standard node and dyad classification strategies. While this paper focuses on homophily, results generalize to other relational measures which aggregate predictions along the dyads in a network. We conclude with suggestions for research designs to study homophily in online networks. Code for this paper is available at https://github.com/georgeberry/autocorr.}, added-at = {2020-01-31T10:27:11.000+0100}, author = {Berry, George and Sirianni, Antonio and Weber, Ingmar and An, Jisun and Macy, Michael}, biburl = {https://www.bibsonomy.org/bibtex/2eb9ba12b192ca61bc8bbe3764b310cf0/mstrohm}, interhash = {5af436288197a771e1c418438b85cd1f}, intrahash = {eb9ba12b192ca61bc8bbe3764b310cf0}, keywords = {homophily uncertainty}, note = {cite arxiv:2001.11171Comment: 19 pages, 4 figures, 2 tables}, timestamp = {2020-01-31T10:27:11.000+0100}, title = {Going beyond accuracy: estimating homophily in social networks using predictions}, url = {http://arxiv.org/abs/2001.11171}, year = 2020 }

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Going beyond accuracy: estimating homophily in social networks using predictions

Abstract

Links and resources

Tags

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Going beyond accuracy: estimating homophily in social networks using predictions

Abstract

Links and resources

Tags

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Going beyond accuracy: estimating homophily in social networks using predictions

Comments and Reviews
(0)