< >
- Mahout currently has Collaborative Filtering User and Item based recommenders K-Means, Fuzzy K-Means clustering Mean Shift clustering...Mahout currently has Collaborative Filtering User and Item based recommenders K-Means, Fuzzy K-Means clustering Mean Shift clustering Dirichlet process clustering Latent Dirichlet Allocation Singular value decomposition Parallel Frequent Pattern mining Complementary Naive Bayes classifier Random forest decision tree based classifier High performance java collections (previously colt collections) A vibrant community and many more cool stuff to come by this summer thanks to Google summer of code
- The Knowledge Discovery Machine Learning (KDML) group focuses on the neighboring subfields of computer science known as knowledge discovery in databases (K...The Knowledge Discovery Machine Learning (KDML) group focuses on the neighboring subfields of computer science known as knowledge discovery in databases (KDD, sometimes referred to simply as data mining) and machine learning (ML). For us, these fields include on the one hand the automated analysis of large data sets using intelligent algorithms that are capable of extracting from the collected data hidden knowledge in order to produce models that can be used for prediction and decision making. On the other hand, they also include algorithms and systems that are capable of learning from experience and adapting to their environment or their users.
- Data Mining, Analytics, and Databases Databases are the workhorse of the enterprise today. Searching through databases and finding useful informat...Data Mining, Analytics, and Databases Databases are the workhorse of the enterprise today. Searching through databases and finding useful information has become a big computational challenge. Researchers from academia and Microsoft, Oracle, SAP, and many other corporations are looking to CUDA-enabled GPUs to find a scalable solution.
- LIBLINEAR is a linear classifier for data with millions of instances and features. It supports L2-regularized logistic regression (LR), L2-loss linear SVM,...LIBLINEAR is a linear classifier for data with millions of instances and features. It supports L2-regularized logistic regression (LR), L2-loss linear SVM, and L1-loss linear SVM. Main features of LIBLINEAR include * Same data format as LIBSVM, our general-purpose SVM solver, and also similar usage * Multi-class classification: 1) one-vs-the rest, 2) Crammer & Singer * Cross validation for model selection * Probability estimates (logistic regression only) * Weights for unbalanced data * MATLAB/Octave, Java interfaces
- Social Spam Detection
< >
- Proceedings of the ACM WebSci'11, (June 2011)
- (2011)cite arxiv:1111.3689 .
- MLDM, volume 2123 of Lecture Notes in Computer Science, page 184-195. Springer, (2001)
- Science 327(5968):1018-1021 (2010)
- Data & Knowledge Engineering 68(11):1271 - 1288 (2009)Including Special Section: Conference on Privacy in Statistical Databases PSD 2008 - Six selected and extended papers on Database Privacy .
- (2010)
- (2002)
- ICDM, page 503-512. IEEE Computer Society, (2009)
- International Semantic Web Conference, volume 4273 of Lecture Notes in Computer Science, page 640-653. Springer, (2006)
- SIGIR, page 179-186. ACM, (2008)
- PAKDD, volume 5476 of Lecture Notes in Computer Science, page 628-636. Springer, (2009)
- Data Mining and Knowledge Discovery 2(2):169--194 (June 1998)
- 7th Conference on Computer Methods and Systems, Krakow, Poland, (November 2009)ISBN 83-916420-5-4 .
- (2010)cite arxiv:1004.3539 .
- International Journal of Intelligent Systems 23(6):654-679 (2008)
- Wiley & Sons, 1., Auflage edition, (1999)
- Proc. LeGo-09: From Local Patterns to Global Models, Workshop at the 2009 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, (2009)accepted .
- IEEE Transactions on Knowledge and Data Engineering (1999)
- AIRWeb '08: Proceedings of the 4th international workshop on Adversarial information retrieval on the web, page 61--68. New York, NY, USA, ACM, (2008)
- Advances in Knowledge Discovery and Data Mining, (1996)


concept