copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Exploiting Hashtags for Adaptive Microblog Crawling

X. Wang, L. Tokarchuk, F. Cuadrado, and S. Poslad. Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, page 311--315. New York, NY, USA, ACM, (2013)
DOI: 10.1145/2492517.2492624

Abstract

Researchers have capitalized on microblogging services, such as Twitter, for detecting and monitoring real world events. Existing approaches have based their conclusions on data collected by monitoring a set of pre-defined keywords. In this paper, we show that this manner of data collection risks losing a significant amount of relevant information. We then propose an adaptive crawling model that detects emerging popular hashtags, and monitors them to retrieve greater amounts of highly associated data for events of interest. The proposed model analyzes the traffic patterns of the hashtags collected from the live stream to update subsequent collection queries. To evaluate this adaptive crawling model, we apply it to a dataset collected during the 2012 London Olympic Games. Our analysis shows that adaptive crawling based on the proposed Refined Keyword Adaptation algorithm collects a more comprehensive dataset than pre-defined keyword crawling, while only introducing a minimum amount of noise.

Description

Exploiting hashtags for adaptive microblog crawling

Links and resources

BibTeX key: Wang:2013:EHA:2492517.2492624
entry type: inproceedings
address: New York, NY, USA
booktitle: Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
year: 2013
pages: 311--315
publisher: ACM
series: ASONAM '13
acmid: 2492624
isbn: 978-1-4503-2240-9
location: Niagara, Ontario, Canada
numpages: 5
DOI: 10.1145/2492517.2492624
url: http://doi.acm.org/10.1145/2492517.2492624

@amitl3s's tags highlighted

Cite this publication

@inproceedings{Wang:2013:EHA:2492517.2492624, abstract = {Researchers have capitalized on microblogging services, such as Twitter, for detecting and monitoring real world events. Existing approaches have based their conclusions on data collected by monitoring a set of pre-defined keywords. In this paper, we show that this manner of data collection risks losing a significant amount of relevant information. We then propose an adaptive crawling model that detects emerging popular hashtags, and monitors them to retrieve greater amounts of highly associated data for events of interest. The proposed model analyzes the traffic patterns of the hashtags collected from the live stream to update subsequent collection queries. To evaluate this adaptive crawling model, we apply it to a dataset collected during the 2012 London Olympic Games. Our analysis shows that adaptive crawling based on the proposed Refined Keyword Adaptation algorithm collects a more comprehensive dataset than pre-defined keyword crawling, while only introducing a minimum amount of noise.}, acmid = {2492624}, added-at = {2016-06-29T14:41:19.000+0200}, address = {New York, NY, USA}, author = {Wang, Xinyue and Tokarchuk, Laurissa and Cuadrado, F{\'e}lix and Poslad, Stefan}, biburl = {https://www.bibsonomy.org/bibtex/281c4f98d2f8f3b94bf1ea902c386b2b2/amitl3s}, booktitle = {Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining}, description = {Exploiting hashtags for adaptive microblog crawling}, doi = {10.1145/2492517.2492624}, interhash = {af5fc3bdeb427b6e0def7883553c64bf}, intrahash = {81c4f98d2f8f3b94bf1ea902c386b2b2}, isbn = {978-1-4503-2240-9}, keywords = {Hashtag adaptive crawling exploiting for in streams twitter}, location = {Niagara, Ontario, Canada}, numpages = {5}, pages = {311--315}, publisher = {ACM}, series = {ASONAM '13}, timestamp = {2016-06-29T14:41:19.000+0200}, title = {Exploiting Hashtags for Adaptive Microblog Crawling}, url = {http://doi.acm.org/10.1145/2492517.2492624}, year = 2013 }

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Exploiting Hashtags for Adaptive Microblog Crawling

Abstract

Description

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Exploiting Hashtags for Adaptive Microblog Crawling

Abstract

Description

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Exploiting Hashtags for Adaptive Microblog Crawling

Comments and Reviews
(0)