The objective of the ACE Program is to develop extraction technology to support automatic processing of source language data (in the form of natural text, and as text derived from ASR and OCR). This includes classification, filtering, and selection based on the language content of the source data, i.e., based on the meaning conveyed by the data. Thus the ACE program requires the development of technologies that automatically detect and characterize this meaning. The ACE research objectives are viewed as the detection and characterization of Entities, Relations, and Events.
Trying to find a name for a company, project, algorithm, product? Acronym Creator helps you generate a name that is an acronym or abbreviation. With this acronym builder, abbreviation maker, name generator, label finder - whatever you call it - you can make your own acronyms and have fun!
This workshop will gather researchers in a variety of fields that contribute to the automated construction of knowledge bases. It will be held at Xerox Research Centre Europe, near Grenoble (France), May 17-19, 2010.
andLinux runs Linux natively inside Windows. It is a complete Ubuntu Linux system running seamlessly in Windows 2000 based systems (2000, XP, 2003, Vista, 7; 32-bit versions only).
The POI project consists of APIs for manipulating various file formats based upon Microsoft's OLE 2 Compound Document format and the Office Open XML format, using pure Java. In short, you can read and write MS Excel files using Java. In addition, you can read and write MS Word and MS PowerPoint files using Java.
Solr is an open source enterprise search server based on the Lucene Java search library, with XML/HTTP and JSON APIs, hit highlighting, faceted search, caching, replication, a web administration interface and many more features. It runs in a Java servlet container such as Tomcat.
With proper mark-up/logic separation, a POJO data model, and a refreshing lack of XML, Apache Wicket makes developing web-apps simple and enjoyable again.
The professional, open source development tool for the open web. Develop and test your entire web application using a single environment, with support for the latest browser technology specs such as HTML5, CSS3, and JavaScript, as well as Ruby, Rails, PHP, and Python on the server side. We've got you covered!
ASV Toolbox is a modular collection of tools for the exploration of written language data. They work on either word lists or text and solve several linguistic classification and clustering tasks. The topics covered include language detection, POS tagging, base-form reduction, named entity recognition, and terminology extraction.
AWS Elastic Beanstalk is an even easier way for developers to quickly deploy and manage applications in the AWS cloud without having to worry about the physical infrastructure or the resource configuration that accompanies setting up that infrastructure. You simply upload your application and AWS Elastic Beanstalk automatically handles the deployment details of capacity provisioning, load balancing, auto-scaling, and application health monitoring, while allowing you to change configuration settings and deploy new versions.
Cibyl is a programming environment and binary translator that allows compiled C programs to execute on J2ME-capable phones. Cibyl uses GCC to compile the C programs to MIPS binaries, and these are then recompiled into Java bytecode.
CLEANEVAL is a shared task and competitive evaluation on the topic of cleaning arbitrary web pages, with the goal of preparing web data for use as a corpus, for linguistic and language technology research and development.
ConceptNet represents data in the form of a semantic network, and makes it available to be used in natural language processing and intelligent user interfaces.
Das Fußball Studio is a freeware application for managing and evaluating football leagues and tournaments. It comes with the Bundesliga database, containing complete data for the 1st and 2nd Bundesliga.
The Mozenda Scraper provides web data extraction software and web screen scraping tools that make it easy to capture nearly any content from the web. See how you can start getting data from the web in minutes.
In this paper, we present recent research on internet threats aiming at fraud or at hampering critical information infrastructure. One approach concentrates on the rapid detection of phishing email, designed to make it next to impossible for attackers to obtain financial resources or commit identity theft in this way. Then we address how another type of internet fraud, the violation of the rights of trademark owners by faked merchandise, can be semi-automatically addressed with text mining methods. Thirdly, we report on two projects that are designed to prevent fraud in business processes in public administrations, namely in the healthcare sector and in customs administrations. Finally, we focus on the issue of critical infrastructures and describe our approach towards protecting them using a specific middleware architecture.
We have developed a system that enables the detection of certain common salting tricks that are employed by criminals. Salting is the intentional addition or distortion of content. In this paper we describe a framework to identify email messages that might contain new, previously unseen tricks. To this end, we compare the simulated perceived email message text generated by our hidden salting simulation system to the OCRed text we obtain from the rendered email message. We present robust text comparison techniques and train a classifier based on the differences between these two texts. In simulations we show that we can detect suspicious emails with a high level of accuracy.
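The comparison step in the abstract above can be illustrated in miniature. The sketch below (function names and the similarity threshold are my own assumptions, not the paper's actual method) aligns the two text versions of an email and flags a message when they diverge strongly:

```python
import difflib

def similarity(simulated_text: str, ocr_text: str) -> float:
    """Ratio in [0, 1] of how similar the two token sequences are."""
    return difflib.SequenceMatcher(
        None, simulated_text.split(), ocr_text.split()
    ).ratio()

def looks_salted(simulated_text: str, ocr_text: str,
                 threshold: float = 0.8) -> bool:
    """Flag an email as suspicious when the text the user would perceive
    diverges strongly from the OCRed rendering."""
    return similarity(simulated_text, ocr_text) < threshold

# A clean email: both text versions agree.
print(looks_salted("cheap watches buy now",
                   "cheap watches buy now"))                    # False
# A salted email: hidden content distorts one of the versions.
print(looks_salted("cheap watches buy now",
                   "ch3ap w4tches xq buy zz now qq"))           # True
```

The paper's robust comparison techniques are more involved than a single similarity ratio; this only shows where the classifier's feature (the difference between the two texts) comes from.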
This DVD-ROM from the German National Library (Deutsche Nationalbibliothek) contains the name authority file for persons (Personennamendatei, PND) as well as the subject headings authority file (Schlagwortnormdatei, SWD) and the corporate bodies authority file (Gemeinsame Körperschaftsdatei, GKD), and can be obtained directly from the Deutsche Nationalbibliothek.
Enunciate is a Web service deployment framework. It is not another Web service stack implementation; rather, Enunciate leverages existing Web service technologies to provide a mechanism to build, package, deploy, and clearly and accurately deliver your Web service API on the Java platform.
Lately I’ve been working on evaluating and comparing algorithms capable of extracting useful content from arbitrary HTML documents. I have made a feature-wise comparison of related software and APIs.
Extensible Dependency Grammar (XDG) is a general framework for dependency grammar, with multiple levels of linguistic representations called dimensions, e.g. grammatical function, word order, predicate-argument structure, scope structure, information structure and prosodic structure. It is articulated around a graph description language for multi-dimensional attributed labeled graphs.
An XDG grammar is a constraint that describes the valid linguistic signs as n-dimensional attributed labeled graphs, i.e. n-tuples of graphs sharing the same set of attributed nodes, but having different sets of labeled edges. All aspects of these signs are stipulated explicitly by principles: the class of models for each dimension, additional properties that they must satisfy, how one dimension must relate to another, and even lexicalization.
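The notion of an n-dimensional attributed labeled graph — one shared set of attributed nodes with a separate labeled edge set per dimension — can be made concrete with a minimal data structure. This is a hypothetical illustration of the representation only, not of XDG's constraint-based machinery:

```python
class MultiDimGraph:
    """Shared attributed nodes; one labeled edge set per dimension."""
    def __init__(self, nodes):
        self.nodes = dict(nodes)      # node id -> attribute dict
        self.dimensions = {}          # dimension -> set of (head, dep, label)

    def add_edge(self, dimension, head, dep, label):
        assert head in self.nodes and dep in self.nodes
        self.dimensions.setdefault(dimension, set()).add((head, dep, label))

# The same two words carry edges on several dimensions at once.
g = MultiDimGraph({1: {"word": "Mary"}, 2: {"word": "sleeps"}})
g.add_edge("grammatical_function", 2, 1, "subject")
g.add_edge("word_order", 2, 1, "left")
print(sorted(g.dimensions))  # ['grammatical_function', 'word_order']
```

In XDG, principles would then constrain each dimension's edge set and the relations between dimensions; here the structure is merely stored.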
Freebase is an open database with an API that holds all kinds of data. Because it is open, anyone can enter new data into Freebase. An example page in the Freebase database looks quite similar to a Wikipedia page. When you enter new data, the app can make suggestions about content. The topics in Freebase are organized by type, and you can connect pages with links and semantic tagging. In summary, Freebase is all about shared data and what you can do with it.
Quantitative funds work with sophisticated computational models and dispense with subjective stock picking by human managers. But these products have their pitfalls, as the recent crisis demonstrates.
The person database of the Munzinger-Archiv comprises more than 20,000 biographies of prominent people and is continuously updated. It contains portraits of politicians and business leaders, but also of artists and scientists.
Emacs is the extensible, customizable, self-documenting real-time display editor. This Info file describes how to edit with Emacs and some of how to customize it; it corresponds to GNU Emacs version 23.1.
SmartGWT is a GWT based framework that allows you to not only utilize its comprehensive widget library for your application UI, but also tie these widgets in with your server-side for data management. SmartGWT is based on the powerful and mature SmartClient library.
Despite the many JavaScript libraries that are available today, I cannot find one that makes it easy to add keyboard shortcuts (or accelerators) to your JavaScript app. This is because keyboard shortcuts were only used in JavaScript games: no serious web application used keyboard shortcuts to navigate around its interface. But Google apps like Google Reader and Gmail changed that. So, I have created a function to make adding shortcuts to your application much easier.
An example of a toy spelling corrector that achieves 80 or 90% accuracy at a processing speed of at least 10 words per second, in less than a page of Python code.
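The description matches Peter Norvig's well-known essay on spelling correction. Its core idea — generate all candidate strings one edit away and rank the known ones by corpus frequency — compresses to the sketch below (variable names are mine, and the tiny inline word counts stand in for a real corpus):

```python
from collections import Counter

LETTERS = "abcdefghijklmnopqrstuvwxyz"
# A real corrector counts words in a large corpus; this stands in for it.
WORD_COUNTS = Counter({"spelling": 10, "corrected": 5, "the": 100, "of": 80})

def edits1(word: str) -> set:
    """All strings one edit (delete, transpose, replace, insert) away."""
    splits = [(word[:i], word[i:]) for i in range(len(word) + 1)]
    deletes = [l + r[1:] for l, r in splits if r]
    transposes = [l + r[1] + r[0] + r[2:] for l, r in splits if len(r) > 1]
    replaces = [l + c + r[1:] for l, r in splits if r for c in LETTERS]
    inserts = [l + c + r for l, r in splits for c in LETTERS]
    return set(deletes + transposes + replaces + inserts)

def correct(word: str) -> str:
    """Prefer the word itself, then known words one edit away, by frequency."""
    if word in WORD_COUNTS:
        return word
    candidates = edits1(word) & WORD_COUNTS.keys()
    return max(candidates, key=WORD_COUNTS.__getitem__) if candidates else word

print(correct("speling"))   # -> "spelling"
```

The full essay also handles words two edits away and weighs the error model against the language model; this keeps only the single-edit core.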
HTML Parser is a Java library used to parse HTML in either a linear or nested fashion. Primarily used for transformation or extraction, it features filters, visitors, custom tags and easy to use JavaBeans. It is a fast, robust and well tested package.
HTMLParser is a fast, real-time parser for real-world HTML. What has attracted most developers to it has been its simplicity of design, its speed, and its ability to handle streaming real-world HTML.
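The "linear" parsing style mentioned above — reacting to tags as they stream past instead of building a nested tree — can be illustrated with Python's stdlib html.parser. This is a generic sketch of the event-driven style, not the Java HTMLParser API itself:

```python
from html.parser import HTMLParser

class LinkExtractor(HTMLParser):
    """Collect href values as start tags stream past; no tree is built."""
    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for this tag.
        if tag == "a":
            self.links.extend(v for k, v in attrs if k == "href")

parser = LinkExtractor()
# Tolerates sloppy real-world markup such as the unclosed <b> here.
parser.feed('<p><b>See <a href="http://example.org">this</a> page.</p>')
print(parser.links)  # ['http://example.org']
```

A nested-fashion parser would instead hand back a document tree to walk; the linear style is what makes streaming over large, messy pages cheap.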
We investigate the statistical filtering of phishing emails, where a classifier is trained on characteristic features of existing emails and subsequently is able to identify new phishing emails with different contents. We propose advanced email features generated by adaptively trained Dynamic Markov Chains and by novel latent Class-Topic Models. On a publicly available test corpus, classifiers using these features are able to reduce the number of misclassified emails by two thirds compared to previous work. Using a recently proposed, more expressive evaluation method we show that these results are statistically significant. In addition, we successfully tested our approach on a non-public email corpus with a real-life composition.
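As a toy illustration of the Markov-chain idea behind the features above, the sketch below trains a fixed-order character bigram model per class and scores a message by log-likelihood under each. This is far simpler than the adaptively trained Dynamic Markov Chains of the paper, and the training strings and add-one smoothing are made up for the example:

```python
import math
from collections import Counter

def train_bigram_model(texts):
    """Character bigram and unigram counts over a class's training texts."""
    pairs, singles = Counter(), Counter()
    for t in texts:
        for a, b in zip(t, t[1:]):
            pairs[(a, b)] += 1
            singles[a] += 1
    return pairs, singles

def log_likelihood(model, text, vocab=256):
    """Log-probability of text under the bigram model, add-one smoothed."""
    pairs, singles = model
    return sum(math.log((pairs[(a, b)] + 1) / (singles[a] + vocab))
               for a, b in zip(text, text[1:]))

phish = train_bigram_model(["verify your account now",
                            "confirm your password"])
ham = train_bigram_model(["meeting moved to friday",
                          "lunch at noon today"])

msg = "please verify your account"
# The message is more plausible under the phishing model.
print(log_likelihood(phish, msg) > log_likelihood(ham, msg))  # True
```

In the paper, such per-class likelihoods (from adaptive, variable-order models) become features for a trained classifier rather than a decision rule on their own.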
Semantic MediaWiki (SMW) is a free extension of MediaWiki that helps to search, organise, tag, browse, evaluate, and share the wiki's content. While traditional wikis contain only texts which computers can neither understand nor evaluate, SMW adds semantic annotations that bring the power of the Semantic Web to the wiki.
JADE (Java Agent DEvelopment Framework) is a software framework fully implemented in the Java language. It simplifies the implementation of multi-agent systems through a middleware that complies with the FIPA specifications and through a set of graphical tools that support the debugging and deployment phases.
This is an overview of open source NLP and machine learning tools for text mining, information extraction, text classification, clustering, approximate string matching, language parsing and tagging, and more.
PojoCache is an in-memory, transactional, and replicated POJO (plain old Java object) cache system that allows users to operate on a POJO transparently, without active user management of either replication or persistence aspects. This tutorial focuses on the usage of the PojoCache API.
Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
This software is an extension of the SVMlight software. It provides an interface to kernel functions that are implemented in Java by means of the Java Native Interface (JNI) Invocation API.
Joda-Time provides a quality replacement for the Java date and time classes. The design allows for multiple calendar systems, while still providing a simple API. The 'default' calendar is the ISO8601 standard which is used by XML. The Gregorian, Julian, Buddhist, Coptic, Ethiopic and Islamic systems are also included, and we welcome further additions. Supporting classes include time zone, duration, format and parsing.
It contains a web crawler, an HTML parser, and ("in the near future") NER and REX components. It also includes JWikiDocs, a Java tool for crawling and downloading Wikipedia documents.
jWebSocket is a pure Java/JavaScript high-speed bidirectional communication solution for the Web - secure, reliable and fast. It provides easy integration into existing Tomcat web applications.
This web page provides information and errata, as well as about a third of the chapters, for the book Learning with Kernels, written by Bernhard Schölkopf and Alex Smola (MIT Press, Cambridge, MA, 2002).
LIBSVM is an integrated software package for support vector classification (C-SVC, nu-SVC), regression (epsilon-SVR, nu-SVR), and distribution estimation (one-class SVM). It supports multi-class classification.
All programs and resources on the list are free, i.e. available at no cost (for research purposes), applicable to German-language texts, and ready to run out of the box, i.e. they do not first have to be trained on, for example, annotated corpora. The list is, of course, incomplete (as of 22 May 2007).
MegaMap is a Java implementation of a map (or hashtable) that can store an unbounded amount of data, limited only by the amount of disk space available. Objects stored in the map are persisted to disk. Good performance is achieved by an in-memory cache. The MegaMap can, for all practical purposes, be thought of as a map implementation with unlimited storage space.
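The design described — every write persisted to disk, with a small in-memory cache in front for speed — is easy to sketch in miniature. This uses Python's stdlib `shelve` and a tiny LRU as illustrative stand-ins for MegaMap's own Java persistence layer:

```python
import os
import shelve
import tempfile
from collections import OrderedDict

class DiskBackedMap:
    """Unbounded map: a small in-memory LRU cache over an on-disk store."""
    def __init__(self, path, cache_size=2):
        self.store = shelve.open(path)
        self.cache = OrderedDict()
        self.cache_size = cache_size

    def __setitem__(self, key, value):
        self.store[key] = value              # persist every write to disk
        self.cache[key] = value
        self.cache.move_to_end(key)
        if len(self.cache) > self.cache_size:
            self.cache.popitem(last=False)   # evict least recently used

    def __getitem__(self, key):
        if key in self.cache:                # fast path: served from memory
            self.cache.move_to_end(key)
            return self.cache[key]
        return self.store[key]               # slow path: read from disk

    def close(self):
        self.store.close()

m = DiskBackedMap(os.path.join(tempfile.mkdtemp(), "mega"))
for i in range(5):
    m[str(i)] = i * i
print(m["0"], m["4"])   # 0 16  ("0" was evicted from the cache, read from disk)
m.close()
```

The map's capacity is bounded only by disk space, while recently touched entries stay cheap to read; MegaMap packages the same trade-off behind a plain map interface.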
MSTParser is a non-projective dependency parser that searches for maximum spanning trees over directed graphs. Models of dependency structure are based on large-margin discriminative training methods. Projective parsing is also supported.
MuNPEx is a multi-lingual noun phrase (NP) extraction component developed for the GATE architecture, implemented in JAPE. It currently supports English, German, French, and Spanish (in beta).
MuNPEx requires a part-of-speech (POS) tagger to work and can additionally use detected named entities (NEs) to improve chunking performance. Please read the documentation (or source code) for more details.