jaeschke > extraction

bookmarks (hide)6
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

1GIANT: The 1-Billion Annotated Synthetic Bibliographic-Reference-String Dataset for Deep Citation Parsing [pre-print] – ISG Siegen
https://isg.beel.org/blog/2019/12/10/giant-the-1-billion-annotated-synthetic-bibliographic-reference-string-dataset-for-deep-citation-parsing-pre-print/
4 years ago by @jaeschke
show all tags
bibliography
bibtex
citation
dataset
extraction
giant
reference
bibliographybibtexcitationdatasetextractiongiantreference
(0)
copydelete
- community post
- history of this post
2Simone Teufel's Thesis work: Argumentative Zoning
http://www.cl.cam.ac.uk/~sht25/az.html
10 years ago by @jaeschke
show all tags
argument
extraction
ie
information
nlp
text
zone
argumentextractionieinformationnlptextzone
(0)
copydelete
- community post
- history of this post
1Apache UIMA - Apache UIMA Ruta
http://uima.apache.org/ruta.html
11 years ago by @jaeschke
show all tags
annotation
extraction
information
language
rule
ruta
text
uima
annotationextractioninformationlanguagerulerutatextuima
(0)
copydelete
- community post
- history of this post
350,000 Lessons on How to Read: a Relation Extraction Corpus
To help researchers investigate relation extraction, we’re releasing a human-judged dataset of two relations about public figures on Wikipedia: nearly 10,000 examples of “place of birth”, and over 40,000 examples of “attended or graduated from an institution”. Each of these was judged by at least 5 raters, and can be used to train or evaluate relation extraction systems. We also plan to release more relations of new types in the coming months.
12 years ago by @jaeschke
show all tags
dataset
extraction
ie
information
ner
relation
datasetextractionieinformationnerrelation
(1)
copydelete
- community post
- history of this post
1Twitter Calendar
http://statuscalendar.cs.washington.edu/
12 years ago by @jaeschke
show all tags
calendar
entity
extraction
information
named
ner
twitter
calendarentityextractioninformationnamednertwitter
(0)
copydelete
- community post
- history of this post
5SecondString Project Page
This is the project page for SecondString, an open-source Java-based package of approximate string-matching techniques. This code was developed by researchers at Carnegie Mellon University from the Center for Automated Learning and Discovery, the Department of Statistics, and the Center for Computer and Communications Security. SecondString is intended primarily for researchers in information integration and other scientists. It does or will include a range of string-matching methods from a variety of communities, including statistics, artificial intelligence, information retrieval, and databases. It also includes tools for systematically evaluating performance on test data. It is not designed for use on very large data sets.
16 years ago by @jaeschke
show all tags
extraction
information
java
matching
secondstring
string
text
extractioninformationjavamatchingsecondstringstringtext
(0)
copydelete
- community post
- history of this post

⟨⟨
⟨
1
⟩
⟩⟩

publications (hide)52
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...

1»Who is the Madonna of Italian-American Literature?«: Extracting and Analyzing Target Entities of Vossian Antonomasia
M. Schwab, R. Jäschke, and F. Fischer. Proceedings of the 7th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, page 110--115. Association for Computational Linguistics, (2023)
a year ago by @jaeschke
show all tags
2023
entity
extraction
myown
ner
nlp
vossanto
2023entityextractionmyownnernlpvossanto
(0)
copydeleteadd this publication to your clipboard
2»Der Frank Sinatra der Wettervorhersage« – Cross-Lingual Vossian Antonomasia Extraction
M. Schwab, R. Jäschke, and F. Fischer. Proceedings of the 5th International Conference on Natural Language and Speech Processing, page 282--287. Association for Computational Linguistics, (2022)
2 years ago by @jaeschke
show all tags
2022
cross-lingual
extraction
fewshot
myown
nlp
vossanto
2022cross-lingualextractionfewshotmyownnlpvossanto
(0)
copydeleteadd this publication to your clipboard
2Dataset or Not? A Study on the Veracity of Semantic Markup for Dataset Pages
T. Alrashed, D. Paparas, O. Benjelloun, Y. Sheng, and N. Noy. The Semantic Web -- ISWC 2021, page 338--356. Cham, Springer International Publishing, (2021)
2 years ago by @jaeschke
show all tags
dataset
extraction
markup
semantics
semanticweb
unknowndata
web
datasetextractionmarkupsemanticssemanticwebunknowndataweb
(0)
copydeleteadd this publication to your clipboard
1Vision and natural language for metadata extraction from scientific PDF documents
Z. Boukhers, and A. Bouabdallah. Proceedings of the 22nd ACM/IEEE Joint Conference on Digital Libraries, ACM, (June 2022)
2 years ago by @jaeschke
show all tags
citation
extraction
metadata
nlp
pdf
research
science
social
unknowndata
citationextractionmetadatanlppdfresearchsciencesocialunknowndata
(0)
copydeleteadd this publication to your clipboard
2A Game with Complex Rules: Literature References in Literary Studies
F. Arnold, and R. Jäschke. Proceedings of the Workshop Understanding LIterature references in academic full TExt at JCDL 2022, volume 3220 of ULITE-ws '22, page 7--15. CEUR Workshop Proceedings, (2022)
2 years ago by @jaeschke
show all tags
2022
citation
extraction
literature
myown
reference
2022citationextractionliteraturemyownreference
(0)
copydeleteadd this publication to your clipboard
2WebFormer: The Web-page Transformer for Structure Information Extraction
Q. Wang, Y. Fang, A. Ravula, F. Feng, X. Quan, and D. Liu. Proceedings of the ACM Web Conference 2022, ACM, (April 2022)
2 years ago by @jaeschke
show all tags
deeplearning
extraction
html
ie
information
page
plk
transformer
web
webformer
deeplearningextractionhtmlieinformationpageplktransformerwebwebformer
(0)
copydeleteadd this publication to your clipboard
5Following the Footnotes : A Bibliometric Analysis of Citation Patterns in Literary Studies
B. Hammarfelt. Uppsala universitet, Institutionen för ABM, (2012)© Björn Hammarfelt 2012.
3 years ago by @jaeschke
show all tags
analysis
bibliometrics
citation
extraction
humanities
ssh
analysisbibliometricscitationextractionhumanitiesssh
(0)
copydeleteadd this publication to your clipboard
3Referencing in the humanities and its implications for citation analysis
B. Hellqvist. Journal of the American Society for Information Science and Technology, 61 (2): 310--318 (2010)
3 years ago by @jaeschke
show all tags
analysis
bibliometrics
citation
extraction
humanities
ssh
analysisbibliometricscitationextractionhumanitiesssh
(0)
copydeleteadd this publication to your clipboard
3An End-to-End Approach for Extracting and Segmenting High-Variance References from PDF Documents
Z. Boukhers, S. Ambhore, and S. Staab. 2019 ACM/IEEE Joint Conference on Digital Libraries (JCDL), page 186-195. (June 2019)
4 years ago by @jaeschke
show all tags
citation
extraction
citationextraction
(0)
copydeleteadd this publication to your clipboard
2An Evaluation of the Effect of Reference Strings and Segmentation on Citation Matching
B. Ghavimi, W. Otto, and P. Mayr. Digital Libraries for Open Knowledge, page 365--369. Cham, Springer International Publishing, (2019)
4 years ago by @jaeschke
show all tags
citation
extraction
segmentation
citationextractionsegmentation
(0)
copydeleteadd this publication to your clipboard
2Neural ParsCit: a deep learning-based reference string parser
A. Prasad, M. Kaur, and M. Kan. International Journal on Digital Libraries, 19 (4): 323--337 (Nov 1, 2018)
4 years ago by @jaeschke
show all tags
citation
deep
deeplearning
extraction
learning
lstm
network
neural
citationdeepdeeplearningextractionlearninglstmnetworkneural
(0)
copydeleteadd this publication to your clipboard
2Deep Reference Mining From Scholarly Literature in the Arts and Humanities
D. Alves, G. Colavizza, and F. Kaplan. Frontiers in Research Metrics and Analytics, (July 2018)
4 years ago by @jaeschke
show all tags
bibtex
citation
deep
deeplearning
dh
extraction
learning
mining
reference
bibtexcitationdeepdeeplearningdhextractionlearningminingreference
(0)
copydeleteadd this publication to your clipboard
2Using BibTeX to Automatically Generate Labeled Data for Citation Field Extraction
D. Thai, Z. Xu, N. Monath, B. Veytsman, and A. McCallum. (2020)cite arxiv:2006.05563.
4 years ago by @jaeschke
show all tags
bibliographic
bibtex
citation
extraction
ie
information
metadata
publication
reference
bibliographicbibtexcitationextractionieinformationmetadatapublicationreference
(0)
copydeleteadd this publication to your clipboard
3A Two-stage Sieve Approach for Quote Attribution
G. Muzny, M. Fang, A. Chang, and D. Jurafsky. Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers, page 460--470. Valencia, Spain, Association for Computational Linguistics, (April 2017)
4 years ago by @jaeschke
show all tags
acl
citation
extraction
language
natural
nlp
processing
quotation
quote
aclcitationextractionlanguagenaturalnlpprocessingquotationquote
(0)
copydeleteadd this publication to your clipboard
3Model Architectures for Quotation Detection
C. Scheible, R. Klinger, and S. Padó. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), page 1736--1745. Berlin, Germany, Association for Computational Linguistics, (August 2016)
4 years ago by @jaeschke
show all tags
citation
detection
extraction
language
natural
nlp
processing
quotation
quote
citationdetectionextractionlanguagenaturalnlpprocessingquotationquote
(0)
copydeleteadd this publication to your clipboard
2Extracting and Aggregating Temporal Events from Text
L. Döhling, and U. Leser. Proceedings of the 23rd International Conference on World Wide Web, page 839--844. New York, NY, USA, ACM, (2014)
5 years ago by @jaeschke
show all tags
archive
estimate
event
extraction
information
temporal
text
time
web
archiveestimateeventextractioninformationtemporaltexttimeweb
(0)
copydeleteadd this publication to your clipboard
16CiteSeer: An Automatic Citation Indexing System
C. Giles, K. Bollacker, and S. Lawrence. Proceedings of the Third ACM Conference on Digital Libraries, page 89--98. New York, NY, USA, ACM, (1998)
6 years ago by @jaeschke
show all tags
citation
citeseer
dl
extraction
identification
indexing
citationciteseerdlextractionidentificationindexing
(0)
copydeleteadd this publication to your clipboard
3Evidence-based Information Extraction for High Accuracy Citation and Author Name Identification
B. Powley, and R. Dale. Large Scale Semantic Access to Content (Text, Image, Video, and Sound), page 618--632. Paris, France, France, LE CENTRE DE HAUTES ETUDES INTERNATIONALES D'INFORMATIQUE DOCUMENTAIRE, (2007)
6 years ago by @jaeschke
show all tags
citation
extraction
identification
ie
information
citationextractionidentificationieinformation
(0)
copydeleteadd this publication to your clipboard
2Potential and Pitfalls of Domain-Specific Information Extraction at Web Scale
A. Rheinländer, M. Lehmann, A. Kunkel, J. Meier, and U. Leser. Proceedings of the 2016 International Conference on Management of Data, page 759--771. New York, NY, USA, ACM, (2016)
6 years ago by @jaeschke
show all tags
extraction
ie
information
web
extractionieinformationweb
(0)
copydeleteadd this publication to your clipboard
1GERBIL – General Entity Annotator Benchmarking Framework
R. Usbeck, M. Röder, and A. Ngonga. Proc. WWW, (2015)
10 years ago by @jaeschke
show all tags
annotation
benchmark
entity
extraction
framework
gerbil
named
ner
annotationbenchmarkentityextractionframeworkgerbilnamedner
(0)
copydeleteadd this publication to your clipboard

⟨⟨
⟨
1
2
3
⟩
⟩⟩

BibSonomy

bookmarks (hide)6
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

1GIANT: The 1-Billion Annotated Synthetic Bibliographic-Reference-String Dataset for Deep Citation Parsing [pre-print] – ISG Siegen

2Simone Teufel's Thesis work: Argumentative Zoning

1Apache UIMA - Apache UIMA Ruta

350,000 Lessons on How to Read: a Relation Extraction Corpus

1Twitter Calendar

5SecondString Project Page

publications (hide)52
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...

1»Who is the Madonna of Italian-American Literature?«: Extracting and Analyzing Target Entities of Vossian Antonomasia

2»Der Frank Sinatra der Wettervorhersage« – Cross-Lingual Vossian Antonomasia Extraction

2Dataset or Not? A Study on the Veracity of Semantic Markup for Dataset Pages

1Vision and natural language for metadata extraction from scientific PDF documents

2A Game with Complex Rules: Literature References in Literary Studies

2WebFormer: The Web-page Transformer for Structure Information Extraction

5Following the Footnotes : A Bibliometric Analysis of Citation Patterns in Literary Studies

3Referencing in the humanities and its implications for citation analysis

3An End-to-End Approach for Extracting and Segmenting High-Variance References from PDF Documents

2An Evaluation of the Effect of Reference Strings and Segmentation on Citation Matching

2Neural ParsCit: a deep learning-based reference string parser

2Deep Reference Mining From Scholarly Literature in the Arts and Humanities

2Using BibTeX to Automatically Generate Labeled Data for Citation Field Extraction

3A Two-stage Sieve Approach for Quote Attribution

3Model Architectures for Quotation Detection

2Extracting and Aggregating Temporal Events from Text

16CiteSeer: An Automatic Citation Indexing System

3Evidence-based Information Extraction for High Accuracy Citation and Author Name Identification

2Potential and Pitfalls of Domain-Specific Information Extraction at Web Scale

1GERBIL – General Entity Annotator Benchmarking Framework

browse

related tags

concepts

tags

bookmarks (hide)6 displayallbookmarks onlybookmarks per page5102050100 sort byadded attitle RSSBibTeXXML

publications (hide)52 displayallpublications onlypublications per page5102050100 sort byadded attitleauthorpublication dateentry typehelp for advanced sorting... RSSBibTeXRDFmore...

browse

related tags

tags

bookmarks (hide)6
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

publications (hide)52
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...