Entity-level stream classification: exploiting entity similarity to label the future observations referring to an entity

V. Unnikrishnan, C. Beyer, P. Matuszyk, U. Niemann, R. Pryss, W. Schlee, E. Ntoutsi, and M. Spiliopoulou. International Journal of Data Science and Analytics (2019)
DOI: 10.1007/s41060-019-00177-1

Abstract

Stream classification algorithms traditionally treat arriving instances as independent. However, in many applications, the arriving examples may depend on the “entity” that generated them, e.g. in product reviews or in the interactions of users with an application server. In this study, we investigate the potential of this dependency by partitioning the original stream of instances/“observations” into entity-centric substreams and by incorporating entity-specific information into the learning model. We propose a k-nearest-neighbour-inspired stream classification approach, in which the label of an arriving observation is predicted by exploiting knowledge on the observations belonging to this entity and to entities similar to it. For the computation of entity similarity, we consider knowledge about the observations and knowledge about the entity, potentially from a domain/feature space different from that in which predictions are made. To distinguish between cases where this knowledge transfer is beneficial for stream classification and cases where the knowledge on the entities does not contribute to classifying the observations, we also propose a heuristic approach based on random sampling of substreams using k Random Entities (kRE). Our learning scenario is not fully supervised: after acquiring labels for the initial m observations of each entity, we assume that no additional labels arrive and attempt to predict the labels of near-future and far-future observations from that initial seed. We report on our findings from three datasets.
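
The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the Euclidean distance, the function names, the `n_votes` parameter and the toy data are all assumptions made for illustration. The label of a new observation is predicted by a majority vote over the labelled seed observations of its own entity and of the entities most similar to it; a kRE-style baseline picks k random entities instead.

```python
import math
import random
from collections import Counter


def euclidean(a, b):
    """Euclidean distance between two equal-length numeric tuples (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def similar_entities(target, entity_features, k):
    """Return the k entities closest to `target` in entity-feature space."""
    others = [e for e in entity_features if e != target]
    others.sort(key=lambda e: euclidean(entity_features[target], entity_features[e]))
    return others[:k]


def random_entities(target, entity_features, k, rng):
    """kRE-style baseline: k entities drawn uniformly at random."""
    others = [e for e in entity_features if e != target]
    return rng.sample(others, min(k, len(others)))


def predict(target, observation, seeds, neighbours, n_votes=3):
    """Majority vote over the labelled seed observations nearest to `observation`,
    pooled from the target entity and the given neighbour entities."""
    pool = [pair for e in [target, *neighbours] for pair in seeds[e]]
    pool.sort(key=lambda pair: euclidean(pair[0], observation))
    votes = Counter(label for _, label in pool[:n_votes])
    return votes.most_common(1)[0][0]


# Hypothetical toy data: entity-level features and a labelled seed per entity.
entity_features = {"a": (0.0, 0.0), "b": (0.1, 0.0), "c": (5.0, 5.0)}
seeds = {
    "a": [((1.0,), "pos")],
    "b": [((1.1,), "pos"), ((0.9,), "pos")],
    "c": [((1.0,), "neg"), ((1.05,), "neg")],
}

nearest = similar_entities("a", entity_features, k=1)
label_knn = predict("a", (1.0,), seeds, nearest)
sampled = random_entities("a", entity_features, k=1, rng=random.Random(0))
label_kre = predict("a", (1.0,), seeds, sampled)
```

Comparing `label_knn` with `label_kre` over many observations gives a rough indication of whether similarity-based neighbours help more than randomly chosen ones, which is the role the abstract assigns to kRE.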

Links and resources

BibTeX key: noauthororeditor
entry type: article
year: 2019
journal: International Journal of Data Science and Analytics
DOI: 10.1007/s41060-019-00177-1
url: https://doi.org/10.1007/s41060-019-00177-1

Cite this publication

%0 Journal Article
%1 noauthororeditor
%A Unnikrishnan, Vishnu
%A Beyer, Christian
%A Matuszyk, Pawel
%A Niemann, Uli
%A Pryss, Rüdiger
%A Schlee, Winfried
%A Ntoutsi, Eirini
%A Spiliopoulou, Myra
%D 2019
%J International Journal of Data Science and Analytics
%K entity-centric-learning kmd stream-mining
%R 10.1007/s41060-019-00177-1
%T Entity-level stream classification: exploiting entity similarity to label the future observations referring to an entity
%U https://doi.org/10.1007/s41060-019-00177-1
%X Stream classification algorithms traditionally treat arriving instances as independent. However, in many applications, the arriving examples may depend on the “entity” that generated them, e.g. in product reviews or in the interactions of users with an application server. In this study, we investigate the potential of this dependency by partitioning the original stream of instances/“observations” into entity-centric substreams and by incorporating entity-specific information into the learning model. We propose a k-nearest-neighbour-inspired stream classification approach, in which the label of an arriving observation is predicted by exploiting knowledge on the observations belonging to this entity and to entities similar to it. For the computation of entity similarity, we consider knowledge about the observations and knowledge about the entity, potentially from a domain/feature space different from that in which predictions are made. To distinguish between cases where this knowledge transfer is beneficial for stream classification and cases where the knowledge on the entities does not contribute to classifying the observations, we also propose a heuristic approach based on random sampling of substreams using k Random Entities (kRE). Our learning scenario is not fully supervised: after acquiring labels for the initial m observations of each entity, we assume that no additional labels arrive and attempt to predict the labels of near-future and far-future observations from that initial seed. We report on our findings from three datasets.

@article{noauthororeditor,
  abstract = {Stream classification algorithms traditionally treat arriving instances as independent. However, in many applications, the arriving examples may depend on the “entity” that generated them, e.g. in product reviews or in the interactions of users with an application server. In this study, we investigate the potential of this dependency by partitioning the original stream of instances/“observations” into entity-centric substreams and by incorporating entity-specific information into the learning model. We propose a k-nearest-neighbour-inspired stream classification approach, in which the label of an arriving observation is predicted by exploiting knowledge on the observations belonging to this entity and to entities similar to it. For the computation of entity similarity, we consider knowledge about the observations and knowledge about the entity, potentially from a domain/feature space different from that in which predictions are made. To distinguish between cases where this knowledge transfer is beneficial for stream classification and cases where the knowledge on the entities does not contribute to classifying the observations, we also propose a heuristic approach based on random sampling of substreams using k Random Entities (kRE). Our learning scenario is not fully supervised: after acquiring labels for the initial m observations of each entity, we assume that no additional labels arrive and attempt to predict the labels of near-future and far-future observations from that initial seed. We report on our findings from three datasets.},
  added-at = {2020-01-13T10:02:22.000+0100},
  author = {Unnikrishnan, Vishnu and Beyer, Christian and Matuszyk, Pawel and Niemann, Uli and Pryss, Rüdiger and Schlee, Winfried and Ntoutsi, Eirini and Spiliopoulou, Myra},
  biburl = {https://www.bibsonomy.org/bibtex/2f6ac7592c590739cbe45412eb3736593/kmd-ovgu},
  doi = {10.1007/s41060-019-00177-1},
  interhash = {ea37ee3c9f955b7655a338e32a0cc4f0},
  intrahash = {f6ac7592c590739cbe45412eb3736593},
  journal = {International Journal of Data Science and Analytics},
  keywords = {entity-centric-learning kmd stream-mining},
  timestamp = {2020-01-13T10:08:14.000+0100},
  title = {Entity-level stream classification: exploiting entity similarity to label the future observations referring to an entity},
  url = {https://doi.org/10.1007/s41060-019-00177-1},
  year = 2019
}

BibSonomy
