- Pegasus: An award-winning, open-source graph-mining system with massive scalability. Analyze petabytes of graph data with ease.
- DMA 2012 : Workshop on Data Mining in Agriculture
- SUBDUE is a graph-based knowledge discovery system that finds structural, relational patterns in data representing entities and relationships. SUBDUE represents data using a labeled, directed graph in which entities are represented by labeled vertices or subgraphs, and relationships are represented by labeled edges between the entities. SUBDUE uses the minimum description length (MDL) principle to identify patterns that minimize the number of bits needed to describe the input graph after being compressed by the pattern. SUBDUE can perform several learning tasks, including unsupervised learning, supervised learning, clustering and graph grammar learning.
- RNNLIB - A recurrent neural network library for sequence learning problems. As published by Marcus Liwicki
- Mloss is a community effort at producing reproducible research via open source software, open access to data and results, and open standards for interchange.
- Markov Logic Networks (MLNs) are a powerful framework that combines statistical and logical reasoning; they have been applied to many data-intensive problems including information extraction, entity resolution, text mining, and natural language processing. Based on principled data management techniques, Tuffy is an MLN inference engine that achieves scalability and orders-of-magnitude speedup compared to prior-art implementations. It is written in Java and relies on PostgreSQL. For a brief introduction to MLNs and the technical details of Tuffy, please see our technical report.
- Local Outlier Factor (LOF) is an anomaly detection algorithm presented as "LOF: Identifying Density-based Local Outliers" by Markus M. Breunig, Hans-Peter Kriegel, Raymond T. Ng and Jörg Sander[1]. The key idea of LOF is comparing the local density of a point's neighborhood with the local density of its neighbors.
- In computer science, a kd-tree (short for k-dimensional tree) is a space-partitioning data structure for organizing points in a k-dimensional space. kd-trees are a useful data structure for several applications, such as searches involving a multidimensional search key (e.g. range searches and nearest neighbour searches).
- A great deal of research has focused on algorithms for learning features from unlabeled data. Indeed, much progress has been made on benchmark datasets like NORB and CIFAR by employing increasingly complex unsupervised learning algorithms and deep models. In this paper, however, we show that several very simple factors, such as the number of hidden nodes in the model, may be as important to achieving high performance as the choice of learning algorithm or the depth of the model. Specifically, we will apply several off-the-shelf feature learning algorithms (sparse auto-encoders, sparse RBMs and K-means clustering, Gaussian mixtures) to NORB and CIFAR datasets using only single-layer networks. We then present a detailed analysis of the effect of changes in the model setup: the receptive field size, number of hidden nodes (features), the step-size (“stride”) between extracted features, and the effect of whitening. Our results show that large numbers of hidden nodes and dense feature extraction are as critical to achieving high performance as the choice of algorithm itself, so critical, in fact, that when these parameters are pushed to their limits, we are able to achieve state-of-the-art performance on both CIFAR and NORB using only a single layer of features. More surprisingly, our best performance is based on K-means clustering, which is extremely fast, has no hyper-parameters to tune beyond the model structure itself, and is very easy to implement. Despite the simplicity of our system, we achieve performance beyond all previously published results on the CIFAR-10 and NORB datasets (79.6% and 97.0% accuracy respectively).
- The workshop aims to discuss key issues and practices of semantic mining. Thanks to Linked Open Data initiatives and robust techniques for semantic annotation of Web, social, and sensor data, more semantic data is available. Many research efforts have been directed toward demonstrating semantic techniques to analyze and mine this growing resource. The workshop will provide a cross-disciplinary forum for researchers to showcase their innovation and efforts, to further enhance existing bonds, and to create new connections among different communities. Here we solicit contributions on the research and practice of mining data semantics, including theory, algorithms, and applications from computer science, life science, healthcare and other domains.
- The Ninth Workshop on Mining and Learning with Graphs will be held in conjunction with the 17th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, which will take place August 21-24, 2011 in San Diego, CA.
- Elefant (Efficient Learning, Large-scale Inference, and Optimisation Toolkit) is an open source library for machine learning licensed under the Mozilla Public License (MPL). We develop an open source machine learning toolkit which provides algorithms for machine learning utilising the power of multi-core/multi-threaded processors/operating systems (Linux, Windows, Mac OS X), a graphical user interface for users who want to quickly prototype machine learning experiments, tutorials to support learning about Statistical Machine Learning (Statistical Machine Learning at The Australian National University), and detailed and precise documentation for each of the above.
- RDF data can be analyzed with various query languages such as SPARQL or SeRQL. Due to their nature these query languages do not support fuzzy queries. In this paper we present a new method that transforms the information presented by subject-relation-object relations within RDF data into Activation Patterns. These patterns represent a common model that is the basis for a number of sophisticated analysis methods such as semantic relation analysis, semantic search queries, unsupervised clustering, supervised learning or anomaly detection. In this paper, we explain the Activation Patterns concept and apply it to an RDF representation of the well known CIA World Factbook.
- C++ library for RL.
- PyBrain is a modular Machine Learning Library for Python. Its goal is to offer flexible, easy-to-use yet still powerful algorithms for Machine Learning Tasks and a variety of predefined environments to test and compare your algorithms. PyBrain is short for Python-Based Reinforcement Learning, Artificial Intelligence and Neural Network Library.
- CIKM '08: Proceeding of the 17th ACM conference on Information and knowledge management, page 1221--1230. New York, NY, USA, ACM, (2008)
- Advances in neural information processing systems (2003)
- ACL, page 793-803. The Association for Computer Linguistics, (2011)
- Collaborative Web Tagging Workshop at WWW 2006, Edinburgh, Scotland, (May 2006)
- Proceedings of the 17th international conference on Computational linguistics, page 768--774. Morristown, NJ, USA, Association for Computational Linguistics, (1998)
- (March 2005), v2.
- ACL '01: Proceedings of the 39th Annual Meeting on Association for Computational Linguistics, page 26--33. Morristown, NJ, USA, Association for Computational Linguistics, (2001)
- NEURAL NETWORKS FOR SIGNAL PROCESSING XII, PROCEEDINGS OF THE 2002 IEEE WORKSHOP, page 747--756. IEEE, (2002)
- Proceedings of the ACL-2008 Workshop on Mobile Language Processing, Association for Computational Linguistics, (2008)
- Proceeding of the International Conference on Knowledge Discovery and Information Retrieval KDIR 2009, INSTICC, (Oct 6, 2009)
- Proceedings of the 9th International Semantic Web Conference ISWC2010, Berlin / Heidelberg, Springer, (2010)
- 10th International Conference on Knowledge Management and Knowledge Technologies 1–3 September 2010, Messe Congress Graz, Austria, page 18 - 18. (2010)
- ACM SIGKDD Explorations Newsletter 7(2):3--12 (2005)
- Machine Learning 73(1):3-23 (2008)
- Proceedings of the 18th international conference on Inductive Logic Programming, page 3--3. Berlin, Heidelberg, Springer-Verlag, (2008)
- (2008)
- KDD, page 623-631. ACM, (2008)
- University of Pennsylvania, Philadelphia, PA, (1998)
- AAAI'07: Proceedings of the 22nd national conference on Artificial intelligence, page 913--918. AAAI Press, (2007)
- page 92--100. Morgan Kaufmann Publishers, (1998)
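
The Local Outlier Factor entry above describes comparing a point's local density with the local density of its neighbours. A minimal sketch of that idea in plain Python follows. It is an illustration under stated assumptions, not the authors' implementation: the function names `k_nearest` and `lof_scores` are hypothetical, each neighbourhood is truncated to exactly `k` points (ties at the k-distance are ignored), and the input is assumed to contain no duplicate points.

```python
import math


def k_nearest(points, i, k):
    """Return the k nearest neighbours of points[i] as (distance, index) pairs."""
    dists = sorted(
        (math.dist(points[i], q), j) for j, q in enumerate(points) if j != i
    )
    return dists[:k]


def lof_scores(points, k=2):
    """Return one LOF-style score per point (about 1 = inlier, much larger = outlier).

    Simplifying assumptions: exactly k neighbours per point, no duplicates.
    """
    n = len(points)
    neighbours = [k_nearest(points, i, k) for i in range(n)]
    # k-distance: distance from each point to its k-th nearest neighbour.
    k_dist = [nb[-1][0] for nb in neighbours]

    def local_reachability_density(i):
        # Reachability distance of i from neighbour j is max(k_dist[j], d(i, j)).
        reach = [max(k_dist[j], d) for d, j in neighbours[i]]
        return len(reach) / sum(reach)

    lrd = [local_reachability_density(i) for i in range(n)]
    # LOF: mean density of the neighbours divided by the point's own density.
    return [
        sum(lrd[j] for _, j in neighbours[i]) / (k * lrd[i]) for i in range(n)
    ]
```

Calling `lof_scores([(0, 0), (0, 1), (1, 0), (1, 1), (10, 10)], k=2)` gives scores close to 1 for the four clustered points and a much larger score for the isolated point `(10, 10)`.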
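
The kd-tree entry above mentions nearest neighbour searches over points in a k-dimensional space. The sketch below shows one common way to build such a tree and query it. The dictionary-based node layout and the names `build_kdtree` and `nearest` are assumptions made for illustration, not taken from any particular library.

```python
import math


def build_kdtree(points, depth=0):
    """Recursively split the points at the median, cycling through the axes."""
    if not points:
        return None
    axis = depth % len(points[0])
    points = sorted(points, key=lambda p: p[axis])
    mid = len(points) // 2
    return {
        "point": points[mid],
        "axis": axis,
        "left": build_kdtree(points[:mid], depth + 1),
        "right": build_kdtree(points[mid + 1:], depth + 1),
    }


def nearest(node, target, best=None):
    """Return (distance, point) for the stored point closest to target."""
    if node is None:
        return best
    d = math.dist(node["point"], target)
    if best is None or d < best[0]:
        best = (d, node["point"])
    # Descend first into the half-space that contains the target.
    diff = target[node["axis"]] - node["point"][node["axis"]]
    near, far = (
        (node["left"], node["right"]) if diff < 0 else (node["right"], node["left"])
    )
    best = nearest(near, target, best)
    # Visit the other side only if the splitting plane is closer than the best hit.
    if abs(diff) < best[0]:
        best = nearest(far, target, best)
    return best
```

For example, after `tree = build_kdtree([(2, 3), (5, 4), (9, 6), (4, 7), (8, 1), (7, 2)])`, the call `nearest(tree, (9, 2))` returns the distance to, and the coordinates of, the closest stored point. The pruning test on the splitting plane is what lets the search skip whole subtrees instead of scanning every point.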

