Article,

Schema.org : evolution of structured data on the Web ; big data makes common schemas even more necessary.

, , and .
Queue, 13 (9): 10--37 (November 2015)
DOI: 10.1145/2857274.2857276

Abstract

Separation between content and presentation has always been one of the important design aspects of the Web. Historically, however, even though most Web sites were driven off structured databases, they published their content purely in HTML. Services such as Web search, price comparison, reservation engines, etc. that operated on this content had access only to HTML. Applications requiring access to the structured data underlying these Web pages had to build custom extractors to convert plain HTML into structured data. These efforts were often laborious and the scrapers were fragile and error-prone, breaking every time a site changed its layout.

Tags

Users

  • @lepsky

Comments and Reviews