copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Sensing Trending Topics in Twitter

L. Aiello, G. Petkos, C. Martin, D. Corney, S. Papadopoulos, R. Skraba, A. Goker, I. Kompatsiaris, and A. Jaimes. Trans. Multi., 15 (6): 1268--1282 (October 2013)
DOI: 10.1109/TMM.2013.2265080

Abstract

Online social and news media generate rich and timely information about real-world events of all kinds. However, the huge amount of data available, along with the breadth of the user base, requires a substantial effort of information filtering to successfully drill down to relevant topics and events. Trending topic detection is therefore a fundamental building block to monitor and summarize information originating from social sources. There are a wide variety of methods and variables and they greatly affect the quality of results. We compare six topic detection methods on three Twitter datasets related to major events, which differ in their time scale and topic churn rate. We observe how the nature of the event considered, the volume of activity over time, the sampling procedure and the pre-processing of the data all greatly affect the quality of detected topics, which also depends on the type of detection method used. We find that standard natural language processing techniques can perform well for social streams on very focused topics, but novel techniques designed to mine the temporal distribution of concepts are needed to handle more heterogeneous streams containing multiple stories evolving in parallel. One of the novel topic detection methods we propose, based on <formula formulatype="inline"> <tex Notation="TeX">$n$-grams cooccurrence and <formula formulatype="inline"> <tex Notation="TeX">$df-idf_t$ topic ranking, consistently achieves the best performance across all these conditions, thus being more reliable than other state-of-the-art techniques.

@jaeschke's tags highlighted

Cite this publication

%0 Journal Article %1 aiello2013sensing %A Aiello, Luca Maria %A Petkos, Georgios %A Martin, Carlos %A Corney, David %A Papadopoulos, Symeon %A Skraba, Ryan %A Goker, Ayse %A Kompatsiaris, Ioannis %A Jaimes, Alejandro %C Piscataway, NJ, USA %D 2013 %I IEEE Press %J Trans. Multi. %K event hashtag topic trend trending twitter %N 6 %P 1268--1282 %R 10.1109/TMM.2013.2265080 %T Sensing Trending Topics in Twitter %U http://dx.doi.org/10.1109/TMM.2013.2265080 %V 15 %X Online social and news media generate rich and timely information about real-world events of all kinds. However, the huge amount of data available, along with the breadth of the user base, requires a substantial effort of information filtering to successfully drill down to relevant topics and events. Trending topic detection is therefore a fundamental building block to monitor and summarize information originating from social sources. There are a wide variety of methods and variables and they greatly affect the quality of results. We compare six topic detection methods on three Twitter datasets related to major events, which differ in their time scale and topic churn rate. We observe how the nature of the event considered, the volume of activity over time, the sampling procedure and the pre-processing of the data all greatly affect the quality of detected topics, which also depends on the type of detection method used. We find that standard natural language processing techniques can perform well for social streams on very focused topics, but novel techniques designed to mine the temporal distribution of concepts are needed to handle more heterogeneous streams containing multiple stories evolving in parallel. One of the novel topic detection methods we propose, based on <formula formulatype="inline"> <tex Notation="TeX">$n$-grams cooccurrence and <formula formulatype="inline"> <tex Notation="TeX">$df-idf_t$ topic ranking, consistently achieves the best performance across all these conditions, thus being more reliable than other state-of-the-art techniques.

@article{aiello2013sensing, abstract = {Online social and news media generate rich and timely information about real-world events of all kinds. However, the huge amount of data available, along with the breadth of the user base, requires a substantial effort of information filtering to successfully drill down to relevant topics and events. Trending topic detection is therefore a fundamental building block to monitor and summarize information originating from social sources. There are a wide variety of methods and variables and they greatly affect the quality of results. We compare six topic detection methods on three Twitter datasets related to major events, which differ in their time scale and topic churn rate. We observe how the nature of the event considered, the volume of activity over time, the sampling procedure and the pre-processing of the data all greatly affect the quality of detected topics, which also depends on the type of detection method used. We find that standard natural language processing techniques can perform well for social streams on very focused topics, but novel techniques designed to mine the temporal distribution of concepts are needed to handle more heterogeneous streams containing multiple stories evolving in parallel. One of the novel topic detection methods we propose, based on <formula formulatype="inline"> <tex Notation="TeX">$n$-grams cooccurrence and <formula formulatype="inline"> <tex Notation="TeX">$df-idf_{t}$ topic ranking, consistently achieves the best performance across all these conditions, thus being more reliable than other state-of-the-art techniques.}, acmid = {2719514}, added-at = {2016-08-09T17:01:35.000+0200}, address = {Piscataway, NJ, USA}, author = {Aiello, Luca Maria and Petkos, Georgios and Martin, Carlos and Corney, David and Papadopoulos, Symeon and Skraba, Ryan and Goker, Ayse and Kompatsiaris, Ioannis and Jaimes, Alejandro}, biburl = {https://www.bibsonomy.org/bibtex/23a5a14f527226ebf23553e7dda86c3dd/jaeschke}, doi = {10.1109/TMM.2013.2265080}, interhash = {2c494571e4a6309d0ba0b49f98d3bedb}, intrahash = {3a5a14f527226ebf23553e7dda86c3dd}, issn = {1520-9210}, issue_date = {October 2013}, journal = {Trans. Multi.}, keywords = {event hashtag topic trend trending twitter}, month = oct, number = 6, numpages = {15}, pages = {1268--1282}, publisher = {IEEE Press}, timestamp = {2016-08-09T17:01:35.000+0200}, title = {Sensing Trending Topics in Twitter}, url = {http://dx.doi.org/10.1109/TMM.2013.2265080}, volume = 15, year = 2013 }

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Sensing Trending Topics in Twitter

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Sensing Trending Topics in Twitter

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Sensing Trending Topics in Twitter

Comments and Reviews
(0)