group :: regio | BibSonomy

bookmarks (hide)508
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

1GitHub - open-webui/open-webui: ChatGPT-Style WebUI for LLMs (Formerly Ollama WebUI)
ChatGPT-Style WebUI for LLMs (Formerly Ollama WebUI) - open-webui/open-webui
2 months ago by @hotho
show all tags
web
chat
ollama
gui
chatgpt
interface
llm
open
webchatollamaguichatgptinterfacellmopen
copydelete
- community post
- history of this post
1WACZ Format - ReplayWeb.Page
Serverless Web Archive Replay directly in the browser
2 years ago by @jaeschke
show all tags
unknowndata
web
file
wacz
format
archive
warc
unknowndatawebfilewaczformatarchivewarc
copydelete
- community post
- history of this post
1Web Semantics
https://www.scimagojr.com/journalsearch.php?q=14879&tip=sid&clean=0
2 years ago by @hotho
show all tags
journal
web
jws
ranking
semantics
journalwebjwsrankingsemantics
copydelete
- community post
- history of this post
2Hypertext Style: Cool URIs don't change.
https://www.w3.org/Provider/Style/URI
3 years ago by @jaeschke
show all tags
web
rot
uri
link
url
webroturilinkurl
copydelete
- community post
- history of this post
1chatnoir-eu/chatnoir-resiliparse: A robust web archive analytics toolkit
A robust web archive analytics toolkit. Contribute to chatnoir-eu/chatnoir-resiliparse development by creating an account on GitHub.
3 years ago by @jaeschke
show all tags
code
web
analytics
python
toolkit
archive
analysis
warc
programming
codewebanalyticspythontoolkitarchiveanalysiswarcprogramming
copydelete
- community post
- history of this post
1GERiT: German Research Institutions | DFG
GERiT ist ein Informationsportal zu deutschen Forschungseinrichtungen. GERiT richtet sich an Studierende und Forschende aus dem In- und Ausland.
3 years ago by @jaeschke
show all tags
german
web
dataset
university
academic
institution
research
germanwebdatasetuniversityacademicinstitutionresearch
copydelete
- community post
- history of this post
1Linked Document Embedding for Classification | Proceedings of the 25th ACM International on Conference on Information and Knowledge Management
https://dl.acm.org/doi/abs/10.1145/2983323.2983755
4 years ago by @parismic
show all tags
web
clustering
document
linked
classification
webclusteringdocumentlinkedclassification
copydelete
- community post
- history of this post
1Natural Language Processing (NLP) Techniques for Extracting Information | Search Technologies
This tutorial introduces the methodology and essential tools and techniques for a successful Natural Language Processing (NLP) project.
4 years ago by @parismic
show all tags
NLP
web
information
extraction
NLPwebinformationextraction
copydelete
- community post
- history of this post
1Visibility of disciplines in academic web space - Yang - 2018 - Proceedings of the Association for Information Science and Technology - Wiley Online Library
https://asistdl.onlinelibrary.wiley.com/doi/full/10.1002/pra2.2018.14505501185
4 years ago by @parismic
show all tags
web
hierarchical
extraction
webhierarchicalextraction
copydelete
- community post
- history of this post
1Mapping the “Worlds” of the World Wide Web: (Re)Structuring Global Commerce through Hyperlinks - STANLEY D. BRUNN, MARTIN DODGE, 2001
https://journals.sagepub.com/doi/abs/10.1177/0002764201044010011?casa_token=G1zSZ85oN4YAAAAA:XoQwHQTlTfFINF1sMPVKtTGscdhWu2fdH6cfR3Y88bZ76omHBf3pawChw6BR1P_8ObwjVv_Y-0Wl
4 years ago by @parismic
show all tags
hyperlink
web
structure
hyperlinkwebstructure
copydelete
- community post
- history of this post
3WDC - Hyperlink Graphs
http://webdatacommons.org/hyperlinkgraph/
4 years ago by @parismic
show all tags
web
graph
webgraph
copydelete
- community post
- history of this post
1Coauthorship Networks — tethne 0.6.1-beta documentation
https://diging.github.io/tethne/doc/0.6.1-beta/tutorial.coauthors.html
4 years ago by @parismic
show all tags
web
coauthor
wos
graph
webcoauthorwosgraph
copydelete
- community post
- history of this post
1entropy.pdf
https://mccurley.org/papers/entropy.pdf
4 years ago by @parismic
show all tags
overview
web
hierarchical
link
overviewwebhierarchicallink
copydelete
- community post
- history of this post
1OpenCitations - Home
https://opencitations.net
4 years ago by @hotho
show all tags
semantic
web
ref
citation
api
open
semanticwebrefcitationapiopen
copydelete
- community post
- history of this post
1What’s New on the Web? The Evolution of the Web from a Search Engine Perspective
http://cs.brown.edu/courses/cs253/papers/www04-ntoulas.pdf
4 years ago by @parismic
show all tags
discover
comparability
web
crawl
discovercomparabilitywebcrawl
copydelete
- community post
- history of this post
1Identifying named entities in academic biographies with supervised learning | SpringerLink
Personal webpages of researchers or faculty members make up a percentage of the academic web. These webpages contain semi-structured or plain text information, and research has shown the importance...
4 years ago by @parismic
show all tags
identifying
web
named
supervised
bio
page
entity
identifyingwebnamedsupervisedbiopageentity
copydelete
- community post
- history of this post
1Wimmics Research Team Overview 2017
Overview of the research domain and works of the joint research team Wimmics (UCS, Inria, CNRS, I3S) in Sophia Antipolis, France
5 years ago by @hotho
show all tags
overview
slides
semantic
web
gandon
social
fabien
*****
overviewslidessemanticwebgandonsocialfabien*****
copydelete
- community post
- history of this post
1Materials
https://materials.dagstuhl.de/index.php?semnr=18262
5 years ago by @hotho
show all tags
science
web
seminar
dagstuhl
sciencewebseminardagstuhl
copydelete
- community post
- history of this post
1LOD-a-lot
LOD-a-lot democratizes access to the Linked Open Data (LOD) Cloud by serving more than 28 billion unique triples from 650K datasets from a single self-indexed file. This corpus can be queried online with a sustainable Linked Data Fragments interface, or it can be downloaded and consumed locally: LOD-a-lot is easy to deploy and only requires limited resources (524 GB of disk space and 15.7 GB of RAM), enabling web-scale repeatable experimentation and research from a high-end laptop.
7 years ago by @hotho
show all tags
lod
semantic
web
ref
access
fast
storage
backend
lodsemanticwebrefaccessfaststoragebackend
copydelete
- community post
- history of this post
1Indego web interface
http://grauonline.de/alexwww/indego/indego.html
7 years ago by @hotho
show all tags
web
bosch
indego
rasenmäher
roboter
webboschindegorasenmäherroboter
copydelete
- community post
- history of this post
2Webmention.io
Webmention.io is a hosted service created to easily handle webmentions (and legacy pingbacks) on any web page.
7 years ago by @jaeschke
show all tags
web
trackback
pingback
mention
webmention
webtrackbackpingbackmentionwebmention
copydelete
- community post
- history of this post
1Social Media statt Web 2.0 | Henning Schürig
http://www.henningschuerig.de/2010/social-media-statt-web-20/
7 years ago by @hotho
show all tags
social_web
web
2.0
social
wj2017
social_webweb2.0socialwj2017
copydelete
- community post
- history of this post
1German Academic Web | SoBigData.eu
http://www.sobigdata.eu/dataset/german-academic-web
7 years ago by @jaeschke
show all tags
myown
german
web
dataset
sobigdata
academic
gaw
myowngermanwebdatasetsobigdataacademicgaw
copydelete
- community post
- history of this post
1Home · springernature/scigraph Wiki · GitHub
https://github.com/springernature/scigraph/wiki
7 years ago by @hotho
show all tags
science
lod
semantic
owl
web
dataset
bibliographic
scigraph
research
sciencelodsemanticowlwebdatasetbibliographicscigraphresearch
copydelete
- community post
- history of this post
1How OpenWayback handles revisit records in WARC files
https://github.com/iipc/openwayback/wiki/How-OpenWayback-handles-revisit-records-in-WARC-files
7 years ago by @jaeschke
show all tags
web
openwayback
wayback
revisit
archive
duplicate
warc
webopenwaybackwaybackrevisitarchiveduplicatewarc
copydelete
- community post
- history of this post
4Web Data Commons
http://webdatacommons.org/
8 years ago by @hotho
show all tags
semantic
rdf
web
dataset
common
data
relations
crawl
semanticrdfwebdatasetcommondatarelationscrawl
copydelete
- community post
- history of this post
2WebGraph
http://webgraph.di.unimi.it/
8 years ago by @jaeschke
show all tags
framework
library
web
webgraph
analysis
network
graph
compression
programming
frameworklibrarywebwebgraphanalysisnetworkgraphcompressionprogramming
copydelete
- community post
- history of this post
2RecSys Challenge 2015 - Challenge
http://2015.recsyschallenge.com/challenge.html
8 years ago by @hotho
show all tags
web
dataset
session
2015
challenge
recsys
webdatasetsession2015challengerecsys
copydelete
- community post
- history of this post
1Web of Science [v.5.22.3] - Web of Science Core Collection Home
https://webofknowledge.com/
8 years ago by @hotho
show all tags
science
scholarly
web
paper
citation
collection
ranking
hindex
wos
sciencescholarlywebpapercitationcollectionrankinghindexwos
copydelete
- community post
- history of this post
3Grafana - Beautiful Metrics Dashboards, Data Visualization and Monitoring
Grafana is the leading open source project for visualizing metrics. Supporting rich integration for every popular database like Graphite, Prometheus and InfluxDB.
8 years ago by @hotho
show all tags
grafana
web
visualization
data
metrics
grafanawebvisualizationdatametrics
copydelete
- community post
- history of this post
2Net Data Directory
The Net Data Directory collects and shares information on different sources of data about the Internet. For more about the project, see our about page. To get started, use the search box below, or check out our quick start guide.
8 years ago by @jaeschke
show all tags
web
dataset
data
monitor
directory
net
internet
webdatasetdatamonitordirectorynetinternet
copydelete
- community post
- history of this post
1Call for Papers: Special Issue on Mining Social Semantics on the Social Web | www.semantic-web-journal.net
http://www.semantic-web-journal.net/blog/call-papers-special-issue-mining-social-semantics-social-web
9 years ago by @jaeschke
show all tags
special
myown
semantic
journal
web
social
mining
issue
semantics
semanticweb
specialmyownsemanticjournalwebsocialminingissuesemanticssemanticweb
copydelete
- community post
- history of this post
18th International ACM Web Science Conference
The 8th International ACM Web Science Conference 2016 will be held from May 22 to May 25, 2016 in Hannover, Germany.
9 years ago by @jaeschke
show all tags
science
myown
hannover
web
conference
webscience
l3s
sciencemyownhannoverwebconferencewebsciencel3s
copydelete
- community post
- history of this post
1webarchive-commons/ResourceRecordReader.java at master · internetarchive/webarchive-commons · GitHub
This line needs to be removed with code that copies the HTTP payload into a byte array and returns it to Pig.
9 years ago by @jaeschke
show all tags
web
payload
record
archive
warc
body
content
webpayloadrecordarchivewarcbodycontent
copydelete
- community post
- history of this post
4Web Data Commons
http://webdatacommons.org/
9 years ago by @jaeschke
show all tags
lod
semantic
rdf
web
dataset
commoncrawl
data
microformat
open
crawl
linked
lodsemanticrdfwebdatasetcommoncrawldatamicroformatopencrawllinked
copydelete
- community post
- history of this post
1WikiReverse - reverse links to Wikipedia articles
https://wikireverse.org/
9 years ago by @jaeschke
show all tags
web
commoncrawl
archive
wikipedia
link
analysis
webcommoncrawlarchivewikipedialinkanalysis
copydelete
- community post
- history of this post
1YaCy - Freie Suchmaschinensoftware und dezentrale Websuche
http://yacy.net/
9 years ago by @jaeschke
show all tags
web
yacy
engine
free
p2p
search
open
webyacyenginefreep2psearchopen
copydelete
- community post
- history of this post
1Click Dataset | Center for Complex Networks and Systems Research
http://cnets.indiana.edu/groups/nan/webtraffic/click-dataset/
9 years ago by @hotho
show all tags
web
click
dataset
stream
indiana
traffic
webclickdatasetstreamindianatraffic
copydelete
- community post
- history of this post
1hawarp
http://hawarp.openpreservation.org/
9 years ago by @jaeschke
show all tags
web
scape
hawarp
archive
warc
internet
webscapehawarparchivewarcinternet
copydelete
- community post
- history of this post
2What the Web Said Yesterday - The New Yorker
http://www.newyorker.com/magazine/2015/01/26/cobweb
9 years ago by @jaeschke
show all tags
web
archive
internet
webarchiveinternet
copydelete
- community post
- history of this post
1perma.cc
Perma.cc helps scholars, journals and courts create permanent links to the online sources cited in their work.
9 years ago by @jaeschke
show all tags
bookmark
web
citation
permanent
perma
archive
link
internet
bookmarkwebcitationpermanentpermaarchivelinkinternet
copydelete
- community post
- history of this post
1Host Link Graph JISC UK Web Domain Dataset (1996-2010)
UK Web Archive Open Data
9 years ago by @jaeschke
show all tags
web
dataset
uk
data
host
archive
link
graph
jisc
webdatasetukdatahostarchivelinkgraphjisc
copydelete
- community post
- history of this post
1JISC UK Web Domain Dataset (1996-2013)
UK Web Archive Open Data
9 years ago by @jaeschke
show all tags
domain
web
dataset
uk
data
archive
open
jisc
domainwebdatasetukdataarchiveopenjisc
copydelete
- community post
- history of this post
3HTTP Archive
The HTTP Archive tracks how the Web is built.
9 years ago by @jaeschke
show all tags
web
http
archive
internet
webhttparchiveinternet
copydelete
- community post
- history of this post
3Raw
http://raw.densitydesign.org/
10 years ago by @jaeschke
show all tags
diagram
web
plot
visualization
alluvial
graphics
diagramwebplotvisualizationalluvialgraphics
copydelete
- community post
- history of this post
1Oxford Internet Institute - Research - Projects - Wikipedia's Networks and Geographies: Representation and Power in Peer-Produced Content
This project brings together OII research fellows and doctoral students to shed light on the incorporation of new users and information into the Wikipedia community.
10 years ago by @jaeschke
show all tags
web
webscience
oxford
project
wikipedia
institute
internet
research
webwebscienceoxfordprojectwikipediainstituteinternetresearch
copydelete
- community post
- history of this post
1Web Science
http://eprints.soton.ac.uk/262615/1/Web%20Science.htm
10 years ago by @jaeschke
show all tags
science
web
webscience
sciencewebwebscience
copydelete
- community post
- history of this post
1WDC - Hyperlink Graphs
This page provides two large hyperlink graph for public download. The graphs have been extracted from the 2012 and 2014 versions of the Common Crawl web corpera. The 2012 graph covers 3.5 billion web pages and 128 billion hyperlinks between these pages. To the best of our knowledge, the graph is the largest hyperlink graph that is available to the public outside companies such as Google, Yahoo, and Microsoft. The2014 graph covers 1.7 billion web pages connected by 64 billion hyperlinks. Below we provide instructions on how to download the graphs as well as basic statistics about their topology.
10 years ago by @jaeschke
show all tags
web
dataset
link
graph
webdatasetlinkgraph
copydelete
- community post
- history of this post
1WIRE Workshop | Cambridge, MA: June 17 – 18, 2014
http://wp.comminfo.rutgers.edu/nsfia/
10 years ago by @jaeschke
show all tags
web
wire
workshop
archive
internet
webwireworkshoparchiveinternet
copydelete
- community post
- history of this post
1ACM Web Science Conference 2014 (WebSci14)
http://www.websci14.org/
10 years ago by @hotho
show all tags
science
pc
web
conference
2014
acm
sciencepcwebconference2014acm
copydelete
- community post
- history of this post
3WebCite
http://webcitation.org/
10 years ago by @jaeschke
show all tags
science
web
citation
webcite
sciencewebcitationwebcite
copydelete
- community post
- history of this post
3Web Data Mining, book by Bing Liu
Web data mining techniques and algorithm
10 years ago by @jaeschke
show all tags
web
mining
book
data
algorithm
webminingbookdataalgorithm
copydelete
- community post
- history of this post
6Speaking JavaScript
http://speakingjs.com/es5/
10 years ago by @jaeschke
show all tags
reference
web
book
tutorial
manual
javascript
programming
referencewebbooktutorialmanualjavascriptprogramming
copydelete
- community post
- history of this post
1Ian Milligan | A Digital, Public, and Youth Historian of 20th-Century Canada
A Digital, Public, and Youth Historian of 20th-Century Canada (by Ian Milligan)
10 years ago by @jaeschke
show all tags
humanities
web
archive
history
warc
digital
humanitieswebarchivehistorywarcdigital
copydelete
- community post
- history of this post
1Welcome to iamResearcher
The Open Knowledge Network
10 years ago by @jaeschke
show all tags
web
social
researcher
network
gaw
research
websocialresearchernetworkgawresearch
copydelete
- community post
- history of this post
1Web Archive Analysis Workshop - Internet Research - IA Webteam Confluence
https://webarchive.jira.com/wiki/display/Iresearch/Web+Archive+Analysis+Workshop
11 years ago by @jaeschke
show all tags
web
wat
archive
hadoop
analysis
pig
warc
internet
webwatarchivehadoopanalysispigwarcinternet
copydelete
- community post
- history of this post
1internetarchive/ia-web-commons · GitHub
Contribute to ia-web-commons development by creating an account on GitHub.
11 years ago by @jaeschke
show all tags
web
archive
hadoop
warc
webarchivehadoopwarc
copydelete
- community post
- history of this post
2CLARIN-NL | CLARIN-NL
The CLARIN infrastructure is a research infrastructure intended for humanities researchers that work with language data and tools.
11 years ago by @hotho
show all tags
web
data
infrastructure
text
tools
webdatainfrastructuretexttools
copydelete
- community post
- history of this post
3WDC - Hyperlink Graph
This page provides a large hyperlink graph for public download. The graph has been extracted from the Common Crawl 2012 web corpus and covers 3.5 billion web pages and 128 billion hyperlinks between these pages. To the best of our knowledge, this graph is the largest hyperlink graph that is available to the public outside companies such as Google, Yahoo, and Microsoft. Below we provide instructions on how to download the graph as well as basic statistics about its topology.
11 years ago by @hotho
show all tags
hyperlink
web
dataset
graph
hyperlinkwebdatasetgraph
copydelete
- community post
- history of this post
1ia-web-commons/src/main/java/org/archive/hadoop/ResourceRecordReader.java at master · internetarchive/ia-web-commons
https://github.com/internetarchive/ia-web-commons/blob/master/src/main/java/org/archive/hadoop/ResourceRecordReader.java
11 years ago by @jaeschke
show all tags
bigdata
web
archive
crawling
hadoop
analysis
warc
programming
bigdatawebarchivecrawlinghadoopanalysiswarcprogramming
copydelete
- community post
- history of this post
1Archival Web Graphs Workshop
https://home.archive.org/~vinay/archive-web-graphs-workshop/
11 years ago by @jaeschke
show all tags
web
workshop
wat
archive
analysis
pagerank
warc
graph
degree
webworkshopwatarchiveanalysispagerankwarcgraphdegree
copydelete
- community post
- history of this post
2Web Archive Transformation (WAT) Specification, Utilities, and Usage Overview - Internet Research - IA Webteam Confluence
https://webarchive.jira.com/wiki/display/Iresearch/Web+Archive+Transformation+(WAT)+Specification,+Utilities,+and+Usage+Overview
11 years ago by @jaeschke
show all tags
bigdata
web
wat
archive
crawling
hadoop
analysis
warc
bigdatawebwatarchivecrawlinghadoopanalysiswarc
copydelete
- community post
- history of this post
1A Linked-Data-driven and Semantically-enabled Journal Portal for Scientometrics | www.semantic-web-journal.net
http://www.semantic-web-journal.net/blog/linked-data-driven-and-semantically-enabled-journal-portal-scientometrics
11 years ago by @hotho
show all tags
semantic
journal
web
dataset
paper
data
linked
statistics
semanticjournalwebdatasetpaperdatalinkedstatistics
copydelete
- community post
- history of this post
1Semantic Pingback — Agile Knowledge Management and Semantic Web (AKSW)
http://aksw.org/Projects/SemanticPingback.html#./SemanticPingback.html?&_suid=137777581008007069462047300046
11 years ago by @jaeschke
show all tags
semantic
web
service
pingback
semanticwebservicepingback
copydelete
- community post
- history of this post
1Semantic Pingback Service
http://pingback.aksw.org/
11 years ago by @jaeschke
show all tags
semantic
web
service
pingback
semanticwebservicepingback
copydelete
- community post
- history of this post
1Elsevier Editorial SystemTM
Full-Function Web-Enabled Manuscript Submission and Tracking System for Peer Review
11 years ago by @hotho
show all tags
editor
area
semantic
journal
web
jws
review
editorareasemanticjournalwebjwsreview
copydelete
- community post
- history of this post
1Internet Census 2012
http://internetcensus2012.bitbucket.org/paper.html
11 years ago by @stumme
show all tags
science
web
L3S
Vermessung
internet
sciencewebL3SVermessunginternet
copydelete
- community post
- history of this post
3Carna-Botnet: Internet-Zensus mit Hacker-Methoden - SPIEGEL ONLINE
http://www.spiegel.de/netzwelt/web/carna-botnet-internet-zensus-mit-hacker-methoden-a-890225.html
11 years ago by @stumme
show all tags
science
web
L3S
Vermessung
internet
sciencewebL3SVermessunginternet
copydelete
- community post
- history of this post
1Journal of Web Semantics
http://journalofwebsemantics.blogspot.de/
11 years ago by @hotho
show all tags
semantic
journal
web
jws
semanticjournalwebjws
copydelete
- community post
- history of this post
1ESWC 2013 - NEWS | 10th ESWC 2013
http://2013.eswc-conferences.org/
11 years ago by @hotho
show all tags
pc
web
semsntic
chair
2013
track
pcwebsemsnticchair2013track
copydelete
- community post
- history of this post
1DGI-Konferenz 2012
Die Deutsche Gesellschaft für Informationswissenschaft und Informationspraxis e.V. (DGI) fördert die Entwicklungen der Informationswissenschaft und Informationspraxis durch die Beobachtung und Vermittlung von Grundlagen, Arbeitsmethoden und technischen Hilfsmitteln.
11 years ago by @hotho
show all tags
science
pc
web
conference
social
dgi
2012
member
sciencepcwebconferencesocialdgi2012member
copydelete
- community post
- history of this post
2Web Observatory Wiki
http://wow.west.webobservatory.org/index.php/Main_Page
11 years ago by @jaeschke
show all tags
science
semantic
web
dataset
observatory
wiki
sciencesemanticwebdatasetobservatorywiki
copydelete
- community post
- history of this post
1blekko donates search data to Common Crawl | blekko
Blekko Blog | get the Latest Updates On SEO, Search Engines, SEO Tools, SEO Tutorials, SEO techniques, SEO APIs and much more
12 years ago by @jaeschke
show all tags
web
dataset
search
crawl
blekko
webdatasetsearchcrawlblekko
copydelete
- community post
- history of this post
13Flare | Data Visualization for the Web
http://flare.prefuse.org/
12 years ago by @jaeschke
show all tags
web
plot
visualization
data
javascript
webplotvisualizationdatajavascript
copydelete
- community post
- history of this post
2JavaScript InfoVis Toolkit
JavaScript InfoVis Toolkit, Meaningful Visualizations
12 years ago by @jaeschke
show all tags
web
plot
visualization
graphics
javascript
webplotvisualizationgraphicsjavascript
copydelete
- community post
- history of this post
1GIScience Uni Heidelberg - WebGL
http://webgl.uni-hd.de/
12 years ago by @hotho
show all tags
3d
gis
web
everyaware
3dgiswebeveryaware
copydelete
- community post
- history of this post
1Collective Awareness
http://ec.europa.eu/information_society/activities/collectiveawareness/links/index_en.htm
12 years ago by @jaeschke
show all tags
web
social
awareness
collective
caps
intelligence
platform
websocialawarenesscollectivecapsintelligenceplatform
copydelete
- community post
- history of this post
4oEmbed
oEmbed is a format for allowing an embedded representation of a URL on third party sites. The simple API allows a website to display embedded content (such as photos or videos) when a user posts a link to that resource, without having to parse the resource directly.
12 years ago by @jaeschke
show all tags
web
oembed
api
open
weboembedapiopen
copydelete
- community post
- history of this post
4VIVO | connect - share - discover
Enabling collaboration and discovery among scientists across all disciplines. The network of scientists will facilitate scholarly discovery. Institutions will participate in the network by installing VIVO, or by providing semantic web-compliant data to the network.
12 years ago by @jaeschke
show all tags
science
semantic
web
collaboration
scientist
vivo
network
sciencesemanticwebcollaborationscientistvivonetwork
copydelete
- community post
- history of this post
1Welcome to Altmetric
Our mission is to track and analyse the online activity around scholarly literature.
12 years ago by @jaeschke
show all tags
literature
science
web
scientometrics
altmetric
literaturesciencewebscientometricsaltmetric
copydelete
- community post
- history of this post
1Emerald | Library and Information Science | UNIVERSITIES: INTERNATIONAL LINKS
http://www.emeraldinsight.com/books.htm?chapterid=1838910&show=pdf
12 years ago by @jaeschke
show all tags
science
web
toread
scientometrics
university
sciencewebtoreadscientometricsuniversity
copydelete
- community post
- history of this post
1Emerald | Library Hi Tech | Measuring the institution's footprint in the web
http://www.emeraldinsight.com/journals.htm?articleid=1812469&show=abstract
12 years ago by @jaeschke
show all tags
science
web
toread
scientometrics
university
sciencewebtoreadscientometricsuniversity
copydelete
- community post
- history of this post
6Netspeak – Wortsuchmaschine
Netspeak helps you to search for words you don't know, yet. It is a new kind of dictionary that contains everything that has ever been written on the web.
12 years ago by @jaeschke
show all tags
web
netspeak
language
search
word
webnetspeaklanguagesearchword
copydelete
- community post
- history of this post
1The ClueWeb09 Dataset
http://www.lemurproject.org/clueweb09.php/
12 years ago by @jaeschke
show all tags
web
dataset
data
big
webdatasetdatabig
copydelete
- community post
- history of this post
4| CommonCrawl
http://commoncrawl.org/
12 years ago by @jaeschke
show all tags
web
dataset
data
crawling
webdatasetdatacrawling
copydelete
- community post
- history of this post
2WebBase Project
http://dbpubs.stanford.edu:8091/~testbed/doc2/WebBase/
12 years ago by @jaeschke
show all tags
web
dataset
stanford
webbase
data
webdatasetstanfordwebbasedata
copydelete
- community post
- history of this post
2Public Data Sets : Amazon Web Services
https://aws.amazon.com/datasets
12 years ago by @jaeschke
show all tags
web
dataset
amazon
data
webdatasetamazondata
copydelete
- community post
- history of this post
4ICWSM Datasets
http://icwsm.cs.mcgill.ca/
12 years ago by @hotho
show all tags
web
dataset
social
webdatasetsocial
copydelete
- community post
- history of this post
4ICWSM Datasets
http://icwsm.cs.mcgill.ca/
12 years ago by @jaeschke
show all tags
icwsm
web
dataset
twitter
social
icwsmwebdatasettwittersocial
copydelete
- community post
- history of this post
8jekyll
http://jekyllrb.com/
12 years ago by @jaeschke
show all tags
offline
web
cms
markdown
jekyll
ruby
offlinewebcmsmarkdownjekyllruby
copydelete
- community post
- history of this post
1Institut für Informatik: BAföG-Bescheinigungen
http://www.informatik.uni-wuerzburg.de/studium/studienfachberatung_informatik/ansprechpartner/bafoeg_bescheinigungen/
12 years ago by @hotho
show all tags
web
informatik
bafög
institut
bescheinigungen
webinformatikbaföginstitutbescheinigungen
copydelete
- community post
- history of this post
3TileMill | Fast and beautiful maps
Design beautiful maps
12 years ago by @hotho
show all tags
beautiful
web
fast
maps
tools
beautifulwebfastmapstools
copydelete
- community post
- history of this post
1LUC 2012: International Workshop on Learning from User-generated Content
http://www.cp.jku.at/conferences/luc2012/
12 years ago by @hotho
show all tags
pc
web
social
workshop
2012
learning
pcwebsocialworkshop2012learning
copydelete
- community post
- history of this post
1Truthy
Truthy is a research project that helps you understand how memes spread online. We collect tweets from Twitter and analyze them. With our statistics, images, movies, and interactive data, you can explore these dynamic networks. Our first application was the study of astroturf campaigns in elections. Currently, we're extending our focus to several themes. Browse the collection on the Memes page. Check out the Movie tool to browse and create animations of meme networks.
12 years ago by @hotho
show all tags
truthy
web
twitter
analysis
trends
truthywebtwitteranalysistrends
copydelete
- community post
- history of this post
1Extension Factory
Convert your Chrome extension into a Firefox or Safari one! This service converts Chrome Apps and Extensions to a Firefox and Safari version. This is a beta test and we offer it with no guarantees. If you are interested in distributing a converted extension, have a problem with a converted extension, if you want to provide feedback or have any question please Contact us! You can either upload your own package, in crx or zip format; or use the url or ID of an extension on the Chrome WebStore (click to search an extension!)
12 years ago by @hotho
show all tags
web
extension
browser
tools
webextensionbrowsertools
copydelete
- community post
- history of this post
1‘Nobody wants to do council estates’ – digital divide, spatial justice and outliers – AAG 2012 « Po Ve Sham – Muki Haklay’s personal blog
http://povesham.wordpress.com/2012/03/05/nobody-wants-to-do-council-estates-digital-divide-spatial-justice-and-outliers-aag-2012/
12 years ago by @jaeschke
show all tags
politics
science
web
divide
society
digital
politicssciencewebdividesocietydigital
copydelete
- community post
- history of this post
2ACM Recommender Systems 2012
http://recsys.acm.org/2012/
12 years ago by @hotho
show all tags
pc
systems
web
conference
2012
acm
recommender
pcsystemswebconference2012acmrecommender
copydelete
- community post
- history of this post
1Read the Web :: Carnegie Mellon University
http://rtw.ml.cmu.edu/rtw/index.php
12 years ago by @stumme
show all tags
read
the
NELL
web
cmu
readtheNELLwebcmu
copydelete
- community post
- history of this post
1Machine Learning :: Text feature extraction (tf-idf) – Part I | Pyevolve
Short introduction to Vector Space Model (VSM) In information retrieval or text mining, the term frequency - inverse document frequency also called tf-idf, is
12 years ago by @jaeschke
show all tags
document
ir
frequency
learning
tfidf
space
vsm
retrieval
web
machine
model
vector
search
information
term
documentirfrequencylearningtfidfspacevsmretrievalwebmachinemodelvectorsearchinformationterm
copydelete
- community post
- history of this post
3Javascript Closures
http://jibbering.com/faq/notes/closures/
12 years ago by @jaeschke
show all tags
web
html
closures
javascript
programming
webhtmlclosuresjavascriptprogramming
copydelete
- community post
- history of this post

⟨⟨
⟨
1
2
3
⟩
⟩⟩

publications (hide)549
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...

3Click Models for Web Search
A. Chuklin, I. Markov, and M. de Rijke. Springer International Publishing, (2015)
15 days ago by @jaeschke
show all tags
web
click
book
model
search
webclickbookmodelsearch
copydeleteadd this publication to your clipboard
2Effects of European Union Funding and International Collaboration on Estonian Scientific Impact
T. Hirv. Journal of Scientometric Research, (January 2018)
a month ago by @tobias.koopmann
show all tags
Estonia,
Web
impact,
Collaboration,
Funding
sources,
of
Scientific
Research
diss
social-science-related-work
Science
Estonia,Webimpact,Collaboration,Fundingsources,ofScientificResearchdisssocial-science-related-workScience
copydeleteadd this publication to your clipboard
2A Comprehensive Study of Features and Algorithms for URL-Based Topic Classification
E. Baykan, M. Henzinger, L. Marian, and I. Weber. Transactions on the Web, 5 (3): 1--29 (July 2011)
6 months ago by @jaeschke
show all tags
web
link
classification
url
weblinkclassificationurl
copydeleteadd this publication to your clipboard
1Blocking and Filtering Techniques for Entity Resolution
G. Papadakis, D. Skoutas, E. Thanos, and T. Palpanas. ACM Computing Surveys, 53 (2): 1--42 (March 2020)
7 months ago by @jaeschke
show all tags
semantic
web
data
blocking
resolution
ner
knowledge
graph
open
filtering
linked
entity
semanticwebdatablockingresolutionnerknowledgegraphopenfilteringlinkedentity
copydeleteadd this publication to your clipboard
2Construction of Knowledge Graphs: State and Challenges
M. Hofer, D. Obraczka, A. Saeedi, H. Köpcke, and E. Rahm. (2023)cite arxiv:2302.11509Comment: 43 pages, 5 figures, 3 tables.
7 months ago by @jaeschke
show all tags
lod
semantic
web
data
survey
knowledge
graph
open
linked
lodsemanticwebdatasurveyknowledgegraphopenlinked
copydeleteadd this publication to your clipboard
3What's Really New on the Web?: Identifying New Pages from a Series of Unstable Web Snapshots
M. Toyoda, and M. Kitsuregawa. Proceedings of the 15th International Conference on World Wide Web, page 233--241. New York, NY, USA, ACM, (2006)
11 months ago by @tobias.koopmann
show all tags
web
web
copydeleteadd this publication to your clipboard
4Focused Crawl of Web Archives to Build Event Collections
M. Klein, L. Balakireva, and H. Van de Sompel. Proceedings of the 10th ACM Conference on Web Science, page 333--342. New York, NY, USA, ACM, (2018)
11 months ago by @tobias.koopmann
show all tags
web
web
copydeleteadd this publication to your clipboard
2CopyCat: Near-Duplicates Within and Between the ClueWeb and the Common Crawl
M. Fröbe, J. Bevendorff, L. Gienapp, M. Völske, B. Stein, M. Potthast, and M. Hagen. Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM, (July 2021)
a year ago by @jaeschke
show all tags
web
detection
common
copycat
duplicate
crawl
webdetectioncommoncopycatduplicatecrawl
copydeleteadd this publication to your clipboard
2DSDD: Domain-Specific Dataset Discovery on the Web
H. Zhang, A. Santos, and J. Freire. Proceedings of the 30th ACM International Conference on Information &amp$\mathsemicolon$ Knowledge Management, ACM, (October 2021)
2 years ago by @jaeschke
show all tags
unknowndata
web
dataset
data
discovery
crawling
unknowndatawebdatasetdatadiscoverycrawling
copydeleteadd this publication to your clipboard
2Analyzing the Web: Are Top Websites Lists a Good Choice for Research?
T. Alby, and R. Jäschke. Proceedings of the International Conference on Theory and Practice of Digital Libraries, page 11--25. Cham, Springer, (2022)
2 years ago by @jaeschke
show all tags
science
myown
web
tpdl
commoncrawl
archive
2022
alexa
crawl
research
sciencemyownwebtpdlcommoncrawlarchive2022alexacrawlresearch
copydeleteadd this publication to your clipboard
2Dataset or Not? A Study on the Veracity of Semantic Markup for Dataset Pages
T. Alrashed, D. Paparas, O. Benjelloun, Y. Sheng, and N. Noy. The Semantic Web -- ISWC 2021, page 338--356. Cham, Springer International Publishing, (2021)
2 years ago by @jaeschke
show all tags
unknowndata
web
dataset
semantics
markup
semanticweb
extraction
unknowndatawebdatasetsemanticsmarkupsemanticwebextraction
copydeleteadd this publication to your clipboard
3Where are the Datasets? A case study on the German Academic Web Archive
Y. Younes, S. Tiesler, R. Jäschke, and B. Mathiak. Proceedings of the Web Archiving and Digital Libraries Workshop at JCDL 2022, (2022)
2 years ago by @jaeschke
show all tags
myown
german
unknowndata
web
dataset
academic
2022
gaw
crawl
myowngermanunknowndatawebdatasetacademic2022gawcrawl
copydeleteadd this publication to your clipboard
3WebFormer: The Web-page Transformer for Structure Information Extraction
Q. Wang, Y. Fang, A. Ravula, F. Feng, X. Quan, and D. Liu. Proceedings of the ACM Web Conference 2022, ACM, (April 2022)
2 years ago by @jaeschke
show all tags
web
deeplearning
transformer
page
html
ie
webformer
information
extraction
plk
webdeeplearningtransformerpagehtmliewebformerinformationextractionplk
copydeleteadd this publication to your clipboard
1ArchiveSpark
H. Holzmann, V. Goel, and A. Anand. Proceedings of the 16th ACM/IEEE-CS on Joint Conference on Digital Libraries, ACM, (June 2016)
2 years ago by @jaeschke
show all tags
archivespark
web
spark
archive
warc
archivesparkwebsparkarchivewarc
copydeleteadd this publication to your clipboard
4Googleology is Bad Science
A. Kilgarriff. Computational Linguistics, 33 (1): 147--151 (March 2007)
2 years ago by @jaeschke
show all tags
science
web
google
sciencewebgoogle
copydeleteadd this publication to your clipboard
3The Semantic Web - ISWC 2021 - 20th International Semantic Web Conference, ISWC 2021, Virtual Event, October 24-28, 2021, Proceedings
A. Hotho, E. Blomqvist, S. Dietze, A. Fokoue, Y. Ding, P. Barnaghi, A. Haller, M. Dragoni, and H. Alani (Eds.) volume 12922 of Lecture Notes in Computer Science, Springer, (2021)
2 years ago by @hotho
show all tags
myown
semantic
web
conference
2021
myownsemanticwebconference2021
copydeleteadd this publication to your clipboard
2Improving Relevance Prediction for Focused Web Crawlers
M. Safran, A. Althagafi, and D. Che. 2012 IEEE/ACIS 11th International Conference on Computer and Information Science, page 161-166. (May 2012)
3 years ago by @parismic
show all tags
web
crawler
unkowndata
relevance
webcrawlerunkowndatarelevance
copydeleteadd this publication to your clipboard
2Archiving information from geotagged tweets to promote reproducibility and comparability in social media research
K. Kinder-Kurlanda, K. Weller, W. Zenk-Möltgen, J. Pfeffer, and F. Morstatter. Big Data & Society, 4 (2): 205395171773633 (November 2017)
3 years ago by @jaeschke
show all tags
web
twitter
archive
tweets
webtwitterarchivetweets
copydeleteadd this publication to your clipboard
4AggregateRank: Bringing order to web sites
G. Feng, T. Liu, Y. Wang, Y. Bao, Z. Ma, X. Zhang, and W. Ma. Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR \textquotesingle06, ACM Press, (2006)
3 years ago by @jaeschke
show all tags
retrieval
web
ir
aggregaterank
ranking
search
information
pagerank
retrievalwebiraggregaterankrankingsearchinformationpagerank
copydeleteadd this publication to your clipboard
2IRLbot: : Scaling to 6 billion pages and beyond
H. Lee, D. Leonard, X. Wang, and D. Loguinov. Transactions on the Web, 3 (3): 1--34 (June 2009)
3 years ago by @jaeschke
show all tags
bigdata
web
crawer
irlbot
crawling
bigdatawebcrawerirlbotcrawling
copydeleteadd this publication to your clipboard

⟨⟨
⟨
1
2
3
⟩
⟩⟩