Other articles where Thesaurus is discussed: library: Thesauri: A new use of the term thesaurus, now widespread, dates from the early 1950s in the work of H.P. Luhn, at International Business Machines Corporation (IBM), who was searching for a computer process that could create a list of authorized terms for the indexing…
The aim of the International Journal of Advances in Internet of Things is to provide a forum for scientists and social workers to present and discuss issues in the impact of the Internet to the society and disseminate findings in scientific research on related subjects.
The Cataloger's Reference Shelf is based on 21 MARC manuals and other reference works published by The Library of Congress and frequently accessed by technical services staff. A must see for catalogers!
Database of animal natural history, distribution, classification, and conservation biology. Contains species accounts about individual animal species and descriptions of levels of organization above the species level, especially phyla, classes, and in some cases, orders and families.
Provides taxonomic, conservation status and distribution information on plants and animals that are extinct, at risk of extinction, or near threatened.
In this post you will see 5 recipes of supervised classification algorithms applied to small standard datasets that are provided with the scikit-learn library.
Advantages and drawbacks of data organisation in hierarchies, facets and with tags. Problems with finding the needed data without exact knowledge about it.
"by letting users tag (...), we're (building) systems that, like the Web itself, do a better job of letting individuals create value for one another, often without realizing it."
Ein englischer Text von Adam Mathes mit den Themen:The Creation of Metadata, Tagging Content in Del.icio.us and Flickr, From Tags to Folksonomy, Why Folksonomies Work and Areas For Further Research
While professionally created metadata are often considered of high quality, it is costly in terms of time and effort to produce. User created metadata is a third approach, and this paper focuses on grassroots community classification of digital assets.
This introductory course on machine learning will give an overview of many concepts, techniques, and algorithms in machine learning, beginning with topics such as classification and linear regression and ending up with more recent topics such as boosting,
Concept mining is a discipline at the nexus of data mining, text mining, and linguistics, drawing on artificial intelligence and statistics. It aims to extract concepts from documents.
My diploma thesis about a system to automatically build a multilingual thesaurus from wikipedia, "WikiWord", is finally done. I handed it in yesterday. My research will hopefully help to make Wikipedia more accessible for automatic processing
Scalable and Efficient Data Streaming Algorithms for Detecting Common Content in Internet Traffic. Minho Sung, Abhishek Kumar, Li Li, Jia Wang, Jun Xu. To appear in the Proc. of 2nd IEEE International Workshop on Networking Meets Databases (NetDB'06), April 2006. Sketch Guided Sampling -- Using On-Line Estimates of Flow Size for Adaptive Data Collection. Abhishek Kumar, Jun (Jim) Xu. To appear in the proceedings of IEEE Infocom'06, Barcelona, Spain, April 2006.
LIBLINEAR is a linear classifier for data with millions of instances and features. It supports L2-regularized logistic regression (LR), L2-loss linear SVM, and L1-loss linear SVM.
Main features of LIBLINEAR include
* Same data format as LIBSVM, our general-purpose SVM solver, and also similar usage
* Multi-class classification: 1) one-vs-the rest, 2) Crammer & Singer
* Cross validation for model selection
* Probability estimates (logistic regression only)
* Weights for unbalanced data
* MATLAB/Octave, Java interfaces
Dewey.info is an experimental space for linked DDC data. The initial data set available is a
linked data version of the DDC Summaries in nine languages. The intention of the dewey.info prototype
is to be a platform for Dewey data on the Web.
M. Sandler. KDD '07: Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, page 580--589. New York, NY, USA, ACM, (2007)
A. Sapkal, S. Nemade, N. Mohadikar, and P. Gosavi. International Journal on Recent and Innovation Trends in Computing and Communication, 3 (4):
2171--2173(April 2015)
L. Satheesh, P. Prabhakar, and A. P. International Journal on Recent and Innovation Trends in Computing and Communication, 3 (1):
394--400(January 2015)