jaj > ocr | BibSonomy

bookmarks (7)

DocHive
raleighpublicrecord/dochive · GitHub. DocHive has two prerequisites, ImageMagick and Tesseract. It converts PDF pages to images and then OCRs each image. Its purpose is to extract numeric statistical tables from PDFs for import into spreadsheets.
12 years ago by @jaj
tags: ocr, pdf, statistics, tools

GOCR, JOCR
Optical character recognition software. Links to related software.
12 years ago by @jaj
tags: ocr, tools

ocropus - The OCRopus(tm) open source document analysis and OCR system - Google Project Hosting
OCRopus is an OCR system written in Python, NumPy, and SciPy that focuses on large-scale machine learning for document analysis problems. It formerly used Tesseract as its recognition engine.
12 years ago by @jaj
tags: ocr, tools

Impact | Improving access to text : Home
IMPACT is a Centre of Competence that makes digitisation of historical printed text in Europe faster, cheaper and better, and provides tools, services and facilities for further advancement of the State of the Art in this field.
12 years ago by @jaj
tags: digitization, ocr

The hOCR Embedded OCR Workflow and Ou... - Google Drive
The purpose of this document is to define an open standard, XML-like format for representing OCR results.
12 years ago by @jaj
tags: ocr, xml

ALTO: Technical Metadata for Optical Character Recognition (Standards, Library of Congress)
ALTO (Analyzed Layout and Text Object) is an XML Schema that details technical metadata for describing the layout and content of physical text resources, such as pages of a book or a newspaper. It most commonly serves as an extension schema used within the administrative metadata section of the Metadata Encoding and Transmission Schema (METS). However, ALTO instances can also exist as standalone documents, independent of METS.
12 years ago by @jaj
tags: digitization, metadata, ocr

OCR Data - Chronicling America (The Library of Congress)
Chronicling America provides bulk access to its OCR data. Each file decompresses into a directory structure that lets you easily map an OCR file to the URL identifier for its page. Historic American Newspapers.
12 years ago by @jaj
tags: digitization, history, newspapers, ocr


publications

No matching posts.
