Inproceedings,

Finding Person Relations in Image Data of News Collections in the Internet Archive

E. Müller-Budack, K. Pustu-Iren, S. Diering, and R. Ewerth.
Digital Libraries for Open Knowledge, page 229--240. Cham, Springer International Publishing, (2018)

Abstract

The amount of multimedia content in the World Wide Web is rapidly growing and contains valuable information for many applications in different domains. The Internet Archive initiative has gathered billions of time-versioned web pages since the mid-nineties. However, the huge amount of data is rarely labeled with appropriate metadata and automatic approaches are required to enable semantic search. Normally, the textual content of the Internet Archive is used to extract entities and their possible relations across domains such as politics and entertainment, whereas image and video content is usually disregarded. In this paper, we introduce a system for person recognition in image content of web news stored in the Internet Archive. Thus, the system complements entity recognition in text and allows researchers and analysts to track media coverage and relations of persons more precisely. Based on a deep learning face recognition approach, we suggest a system that detects persons of interest and gathers sample material, which is subsequently used to identify them in the image data of the Internet Archive. We evaluate the performance of the face recognition system on an appropriate standard benchmark dataset and demonstrate the feasibility of the approach with two use cases.

BibTeX key: 10.1007/978-3-030-00066-0_20
entry type: inproceedings
address: Cham
booktitle: Digital Libraries for Open Knowledge
year: 2018
pages: 229--240
publisher: Springer International Publishing
isbn: 978-3-030-00066-0

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

@inproceedings{10.1007/978-3-030-00066-0_20, abstract = {The amount of multimedia content in the World Wide Web is rapidly growing and contains valuable information for many applications in different domains. The Internet Archive initiative has gathered billions of time-versioned web pages since the mid-nineties. However, the huge amount of data is rarely labeled with appropriate metadata and automatic approaches are required to enable semantic search. Normally, the textual content of the Internet Archive is used to extract entities and their possible relations across domains such as politics and entertainment, whereas image and video content is usually disregarded. In this paper, we introduce a system for person recognition in image content of web news stored in the Internet Archive. Thus, the system complements entity recognition in text and allows researchers and analysts to track media coverage and relations of persons more precisely. Based on a deep learning face recognition approach, we suggest a system that detects persons of interest and gathers sample material, which is subsequently used to identify them in the image data of the Internet Archive. We evaluate the performance of the face recognition system on an appropriate standard benchmark dataset and demonstrate the feasibility of the approach with two use cases.}, added-at = {2024-03-04T15:25:37.000+0100}, address = {Cham}, author = {M{\"u}ller-Budack, Eric and Pustu-Iren, Kader and Diering, Sebastian and Ewerth, Ralph}, biburl = {https://www.bibsonomy.org/bibtex/236053935f80cad20f10ecebb9e4a506d/ericmb}, booktitle = {Digital Libraries for Open Knowledge}, editor = {M{\'e}ndez, Eva and Crestani, Fabio and Ribeiro, Cristina and David, Gabriel and Lopes, Jo{\~a}o Correia}, interhash = {251e8bf3341f6d0bb070524758d66815}, intrahash = {36053935f80cad20f10ecebb9e4a506d}, isbn = {978-3-030-00066-0}, keywords = {myown}, pages = {229--240}, publisher = {Springer International Publishing}, timestamp = {2024-03-04T15:25:37.000+0100}, title = {Finding Person Relations in Image Data of News Collections in the Internet Archive}, year = 2018 }

BibSonomy

Finding Person Relations in Image Data of News Collections in the Internet Archive

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on