tag :: web | BibSonomy

bookmarks (hide)8778
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

1Home · internetarchive/heritrix3 Wiki · GitHub
This is the public wiki for the Heritrix archival crawler project. Heritrix is the Internet Archive’s open-source, extensible, web-scale, archival-quality web crawler project. Heritrix (sometimes spelled heretrix, or misspelled or mis-said as heratrix/heritix/ heretix/heratix) is an archaic word for heiress (woman who inherits).
9 months ago by @astrupp
show all tags
archive
crawl
crawler
web
archivecrawlcrawlerweb
(0)
copydelete
- community post
- history of this post
4Web Data Commons
The Web Data Commons project extracts structured data from the Common Crawl, the largest web corpus available to the public, and provides the extracted data for public download in order to support researchers and companies in exploiting the wealth of information that is available on the Web.
9 months ago by @astrupp
show all tags
crawl
metadata
rdf
rdfa
semantic
web
crawlmetadatardfrdfasemanticweb
(0)
copydelete
- community post
- history of this post
1WDC - RDFa, Microdata, and Microformat Data Sets
More and more websites have started to embed structured data describing products, people, organizations, places, and events into their HTML pages using markup standards such as Microdata, JSON-LD, RDFa, and Microformats. The Web Data Commons project extracts this data from several billion web pages. So far the project provides 11 different data set releases extracted from the Common Crawls 2010 to 2022. The project provides the extracted data for download and publishes statistics about the deployment of the different formats.
9 months ago by @astrupp
show all tags
crawl
data
metadata
semantic
web
crawldatametadatasemanticweb
(0)
copydelete
- community post
- history of this post
1Meusel-etal-TheWDCMicrodataRdfaMicroformatsDataSeries-ISWC2014-rbds.pdf
Abstract. In order to support web applications to understand the content of HTML pages an increasing number of websites have started to annotate structured data within their pages using markup formats such as Microdata, RDFa, Microformats. The annotations are used by Google, Yahoo!, Yandex, Bing and Facebook to enrich search results and to display entity descriptions within their applications. In this paper, we present a series of publicly accessible Microdata, RDFa, Microformats datasets that we have extracted from three large web corpora dating from 2010, 2012 and 2013.
9 months ago by @astrupp
show all tags
data
metadata
paper
pdf
web
datametadatapaperpdfweb
(0)
copydelete
- community post
- history of this post
1Resource Description Framework (RDF): Concepts and Abstract Syntax
https://www.w3.org/TR/2004/REC-rdf-concepts-20040210/
9 months ago by @astrupp
show all tags
rdf
semantic
standard
syntax
web
rdfsemanticstandardsyntaxweb
(0)
copydelete
- community post
- history of this post

publications (hide)4906
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...

1The Semantics and Complexity of SPARQL
J. Perez, M. Arenas, and C. Gutierrez. (2006)
18 years ago by @wikier
show all tags
web
swaml
semantic
sparql
webswamlsemanticsparql
(0)
copydeleteadd this publication to your clipboard
6Explorer's Guide to the Semantic Web
T. Passin. Manning, (2004)
18 years ago by @wikier
show all tags
web
swaml
semantic
webswamlsemantic
(0)
copydeleteadd this publication to your clipboard
6Semantic Email
L. McDowell, O. Etzioni, A. Halevy, and H. Levy. Proceedings of the 13rd Internacional World Wide Web Conference, WWW2004, (2004)
18 years ago by @wikier
show all tags
web
swaml
semantic
mail
webswamlsemanticmail
(0)
copydeleteadd this publication to your clipboard
4SPHINX: A Framework for Creating Personal, Site-Specific Web Crawlers
R. Miller, and K. Bharat. Computer Network and ISDN Systems, (April 1998)
18 years ago by @lysander07
show all tags
web
crawler
webcrawler
(0)
copydeleteadd this publication to your clipboard
14Semantic Web Road Map
T. Berners-Lee. (1998)
18 years ago by @martinomy
show all tags
semanticWeb
Map
Semantic
Road
1998
Web
Tim
Berners-Lee
semanticWebMapSemanticRoad1998WebTimBerners-Lee
(0)
copydeleteadd this publication to your clipboard
3Microformats: a pragmatic path to the semantic web
R. Khare, and T. Celik. WWW '06: Proceedings of the 15th international conference on World Wide Web, page 865--866. New York, NY, USA, ACM Press, (2006)
18 years ago by @cedricmesnage
show all tags
web
engineering
microformats
sw
webengineeringmicroformatssw
(0)
copydeleteadd this publication to your clipboard
4Weaving the Web: The Original Design and Ultimate Destiny of the World Wide Web by its Inventor
T. Berners-Lee, M. Fischetti, and M. Dertouzos. Harper San Francisco, (1999)
18 years ago by @blaueasterpro
show all tags
web
book
(*)
Berners-Lee
read
1999
webbook(*)Berners-Leeread1999
(0)
copydeleteadd this publication to your clipboard
1Knowledge Retrieval and the Word Wide Web
P. Martin, and P. Eklund. IEEE Intelligent Systems: Special Issue on Knowledge Management and Knowledge Distribution Over the Internet, (2000)
18 years ago by @jonducrou
show all tags
web
IR
WebKB
KVO
webIRWebKBKVO
(0)
copydeleteadd this publication to your clipboard
1Swing Based Remote GUI Emulation
T. Tilley, and P. Eklund. Proceedings Evolve2000, page 45-52. DSTC Pty Ltd, (2000)
18 years ago by @jonducrou
show all tags
web
UI
KVO
webUIKVO
(0)
copydeleteadd this publication to your clipboard
1GUI Framework Communication via the WWW
T. Tilley, and P. Eklund. Asia Pacific web Conference, in ``World Wide Web: Technologies and Applications for the New Millenium'', page 297-302. Computer Science Research, Education, and Applications Press, (2000)
18 years ago by @jonducrou
show all tags
web
UI
KVO
webUIKVO
(0)
copydeleteadd this publication to your clipboard
2XML-Based Offline Website Generation
P. Becker, and F. Amardeilh. Australian Document Computing Symposium (ADCS02), page 149-152. University of Sydney, School of Information Technologies, (2002)
18 years ago by @jonducrou
show all tags
web
XML
KVO
webXMLKVO
(0)
copydeleteadd this publication to your clipboard
1Web-based Collaborative Multi-criteria Decision Making
T. Tilley, P. Deer, and F. Modave. Proceedings the IConIT'2001 Conference, page 203-212. (2001)
18 years ago by @jonducrou
show all tags
web
decision
KVO
webdecisionKVO
(0)
copydeleteadd this publication to your clipboard
1Manageable Approaches to the Semantic Web
P. Martin, and P. Eklund. Practice and Experience" Track of WWW 2002, 11th International World Wide Web Conference, 2393, (2002)
18 years ago by @jonducrou
show all tags
web
KVO
webKVO
(0)
copydeleteadd this publication to your clipboard
1Browsing Semi-Structured Texts on the Web Using Formal Concept Analysis
R. Cole, P. Eklund, and F. Amardeilh. page 243--264. Springer, (2004)
18 years ago by @jonducrou
show all tags
web
FCA
KVO
webFCAKVO
(0)
copydeleteadd this publication to your clipboard
1Knowledge Representation, Sharing and Retrieval on the Web
P. Martin, and P. Eklund. Springer Verlag, (2002)
18 years ago by @jonducrou
show all tags
web
KVO
webKVO
(0)
copydeleteadd this publication to your clipboard
2ProntoRama: Web-based Delivery of Ontological Data
J. Ducrou. (2002)
18 years ago by @jonducrou
show all tags
web
ontology
KVO
webontologyKVO
(0)
copydeleteadd this publication to your clipboard
1Collaborative development of ontologies in a Peer-to-Peer environment
H. Akerstrom, and J. Grondahl. (2002)
18 years ago by @jonducrou
show all tags
web
ontology
Ontorama
KVO
webontologyOntoramaKVO
(0)
copydeleteadd this publication to your clipboard
2RDF-based Peer-to-Peer Based Ontology Editing
P. Becker, P. Eklund, and N. Roberts. Journal of Digital Information Management, 4 (1): 50-55 (2006)
18 years ago by @jonducrou
show all tags
web
ontology
KVO
RDF
webontologyKVORDF
(0)
copydeleteadd this publication to your clipboard
1Peer-to-Peer Based Ontology Editing
P. Becker, P. Eklund, and N. Roberts. International Conference on Next Generation Web Services Practices (NWeSP'05), page 259-264. IEEE Press, (2005)
18 years ago by @jonducrou
show all tags
web
ontology
KVO
webontologyKVO
(0)
copydeleteadd this publication to your clipboard
1Creating a Planet Me Blog Aggregator
B. Martin. (February 2006)
18 years ago by @jonducrou
show all tags
web
KVO
Linux
webKVOLinux
(0)
copydeleteadd this publication to your clipboard

BibSonomy

bookmarks (hide)8778
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

1Home · internetarchive/heritrix3 Wiki · GitHub

4Web Data Commons

1WDC - RDFa, Microdata, and Microformat Data Sets

1Meusel-etal-TheWDCMicrodataRdfaMicroformatsDataSeries-ISWC2014-rbds.pdf

1Resource Description Framework (RDF): Concepts and Abstract Syntax

publications (hide)4906
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...

1The Semantics and Complexity of SPARQL

6Explorer's Guide to the Semantic Web

6Semantic Email

4SPHINX: A Framework for Creating Personal, Site-Specific Web Crawlers

14Semantic Web Road Map

3Microformats: a pragmatic path to the semantic web

4Weaving the Web: The Original Design and Ultimate Destiny of the World Wide Web by its Inventor

1Knowledge Retrieval and the Word Wide Web

1Swing Based Remote GUI Emulation

1GUI Framework Communication via the WWW

2XML-Based Offline Website Generation

1Web-based Collaborative Multi-criteria Decision Making

1Manageable Approaches to the Semantic Web

1Browsing Semi-Structured Texts on the Web Using Formal Concept Analysis

1Knowledge Representation, Sharing and Retrieval on the Web

2ProntoRama: Web-based Delivery of Ontological Data

1Collaborative development of ontologies in a Peer-to-Peer environment

2RDF-based Peer-to-Peer Based Ontology Editing

1Peer-to-Peer Based Ontology Editing

1Creating a Planet Me Blog Aggregator

browse

related tags

similar tags

bookmarks (hide)8778 displayallbookmarks onlybookmarks per page5102050100 sort byadded attitle RSSBibTeXXML

publications (hide)4906 displayallpublications onlypublications per page5102050100 sort byadded attitleauthorpublication dateentry typehelp for advanced sorting... RSSBibTeXRDFmore...

browse

related tags

similar tags

bookmarks (hide)8778
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

publications (hide)4906
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...