tag :: metadata pdf

bookmarks (hide)11
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

1ldow2012-inv-paper-1.pdf
2012. Metadata Statistics for a Large Web Corpus ABSTRACT We provide an analysis of the adoption of metadata standards on the Web based a large crawl of the Web. In particular, we look at what forms of syntax and vocabularies publishers are using to mark up data inside HTML pages. We also describe the process that we have followed and the difficulties involved in web data extraction.
a year ago by @astrupp
show all tags
archive
crawl
crawler
metadata
paper
pdf
standard
archivecrawlcrawlermetadatapaperpdfstandard
(0)
copydelete
- community post
- history of this post
1Meusel-etal-TheWDCMicrodataRdfaMicroformatsDataSeries-ISWC2014-rbds.pdf
Abstract. In order to support web applications to understand the content of HTML pages an increasing number of websites have started to annotate structured data within their pages using markup formats such as Microdata, RDFa, Microformats. The annotations are used by Google, Yahoo!, Yandex, Bing and Facebook to enrich search results and to display entity descriptions within their applications. In this paper, we present a series of publicly accessible Microdata, RDFa, Microformats datasets that we have extracted from three large web corpora dating from 2010, 2012 and 2013.
a year ago by @astrupp
show all tags
data
metadata
paper
pdf
web
datametadatapaperpdfweb
(0)
copydelete
- community post
- history of this post
1XMP metadata support in JabRef
http://jabref.sourceforge.net/help/XMPHelp.php
11 years ago by @jil
show all tags
jabref
java
metadata
pdf
xmp
jabrefjavametadatapdfxmp
(0)
copydelete
- community post
- history of this post
2http://www.niso.org/.../resources/UnderstandingMetadata.pdf
Handreichung über Metadaten. Was ist das, was kann man damit machen, etc.
12 years ago by @mruhl
show all tags
dissertation
dublincore
ead
filetype:pdf
media:document
metadata
niso
pdf
standard
dissertationdublincoreeadfiletype:pdfmedia:documentmetadatanisopdfstandard
(0)
copydelete
- community post
- history of this post
3Apache Tika - Apache Tika
http://tika.apache.org/
12 years ago by @nosebrain
show all tags
detection
extraction
java
lang
language
metadata
pdf
text
tika
detectionextractionjavalanglanguagemetadatapdftexttika
(0)
copydelete
- community post
- history of this post
1http://www.linhelp.com/pdfmeta.pl
http://www.linhelp.com/pdfmeta.pl
13 years ago by @draganigajic
show all tags
gui
metadata
pdf
pdftk
guimetadatapdfpdftk
(0)
copydelete
- community post
- history of this post
5cb2Bib: Overview
The cb2Bib is a free, open source, and multiplatform application for rapidly extracting unformatted, or unstandardized bibliographic references from email alerts, journal Web pages, and PDF files. The cb2Bib facilitates the capture of single references from unformatted and non standard sources. Output references are written in BibTeX. Article files can be easily linked and renamed by dragging them onto the cb2Bib window. Additionally, it permits editing and browsing BibTeX files, citing references, searching references and the full contents of the referenced documents, inserting bibliographic metadata to documents, and writing short notes that interrelate several references.
13 years ago by @draganigajic
show all tags
bibliography
bibtex
extraction
metadata
pdf
software
tools
bibliographybibtexextractionmetadatapdfsoftwaretools
(0)
copydelete
- community post
- history of this post
4iText, a F/OSS Java-PDF library: Product
http://itextpdf.com/
13 years ago by @draganigajic
show all tags
c#
java
library
metadata
pdf
c#javalibrarymetadatapdf
(0)
copydelete
- community post
- history of this post
1Catalogue files metadata miner software for file properties
http://peccatte.karefil.com/software/Catalogue/CatalogueENG.htm
13 years ago by @draganigajic
show all tags
$
extraction
metadata
pdf
software
$extractionmetadatapdfsoftware
(0)
copydelete
- community post
- history of this post
9Metadata Extraction Tool - Introduction
http://meta-extractor.sourceforge.net/
13 years ago by @draganigajic
show all tags
extraction
java
metadata
pdf
extractionjavametadatapdf
(0)
copydelete
- community post
- history of this post
100Connotea
Connotea é um serviço em linha livre da gerência da referência para cientistas, criado em dezembro 2004 perto Grupo Publicando Da Natureza. É uma de uma raça nova de social, similar a del.icio.nós e Flickr, onde os usuários podem conservar as liga
19 years ago by @kleverson
show all tags
backpack
Science
web2.0
bibliography
metadata
pdf
bookmarking
organization
backpackScienceweb2.0bibliographymetadatapdfbookmarkingorganization
(0)
copydelete
- community post
- history of this post

⟨⟨
⟨
1
⟩
⟩⟩

publications (hide)2
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...

1Vision and natural language for metadata extraction from scientific PDF documents
Z. Boukhers, and A. Bouabdallah. Proceedings of the 22nd ACM/IEEE Joint Conference on Digital Libraries, ACM, (June 2022)
2 years ago by @jaeschke
show all tags
citation
extraction
metadata
nlp
pdf
research
science
social
unknowndata
citationextractionmetadatanlppdfresearchsciencesocialunknowndata
(0)
copydeleteadd this publication to your clipboard
1brianc/node-pdf-text
Brian C. (2014)
11 years ago by @maxirichter
show all tags
documents
extraction
javascript
metadata
nodejs
pdf
text
documentsextractionjavascriptmetadatanodejspdftext
(0)
copydeleteadd this publication to your clipboard

⟨⟨
⟨
1
⟩
⟩⟩

BibSonomy

bookmarks (hide)11
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

1ldow2012-inv-paper-1.pdf

1Meusel-etal-TheWDCMicrodataRdfaMicroformatsDataSeries-ISWC2014-rbds.pdf

1XMP metadata support in JabRef

2http://www.niso.org/.../resources/UnderstandingMetadata.pdf

3Apache Tika - Apache Tika

1http://www.linhelp.com/pdfmeta.pl

5cb2Bib: Overview

4iText, a F/OSS Java-PDF library: Product

1Catalogue files metadata miner software for file properties

9Metadata Extraction Tool - Introduction

100Connotea

publications (hide)2
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...

1Vision and natural language for metadata extraction from scientific PDF documents

1brianc/node-pdf-text

browse

related tags

bookmarks (hide)11 displayallbookmarks onlybookmarks per page5102050100 sort byadded attitle RSSBibTeXXML

publications (hide)2 displayallpublications onlypublications per page5102050100 sort byadded attitleauthorpublication dateentry typehelp for advanced sorting... RSSBibTeXRDFmore...

browse

related tags

bookmarks (hide)11
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

publications (hide)2
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...