<rdf:RDF xmlns:community="http://www.bibsonomy.org/ontologies/2008/05/community#" xmlns:foaf="http://xmlns.com/foaf/0.1/" xmlns:owl="http://www.w3.org/2002/07/owl#" xmlns:admin="http://webns.net/mvcb/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:syn="http://purl.org/rss/1.0/modules/syndication/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" xmlns:cc="http://web.resource.org/cc/" xmlns:xsd="http://www.w3.org/2001/XMLSchema#" xmlns:swrc="http://swrc.ontoware.org/ontology#" xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xmlns="http://purl.org/rss/1.0/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xml:base="http://www.bibsonomy.org/user/jil/naive"><owl:Ontology rdf:about=""><rdfs:comment>BibSonomy publications for /user/jil/naive</rdfs:comment><owl:imports rdf:resource="http://swrc.ontoware.org/ontology/portal"/></owl:Ontology><rdf:Description rdf:about="http://www.bibsonomy.org/bibtex/2b8f819dc681e76ee9723c72a859dff3c/jil"><owl:sameAs rdf:resource="http://www.bibsonomy.org/uri/bibtex/2b8f819dc681e76ee9723c72a859dff3c/jil"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#Misc"/><owl:sameAs rdf:resource="http://citeseer.ist.psu.edu/kim02effective.html"/><swrc:date>Tue May 06 02:13:03 CEST 2008</swrc:date><swrc:title>Effective methods for improving Naive Bayes text classifiers</swrc:title><swrc:year>2002</swrc:year><swrc:keywords>naive learning bayes length multinomial machine normalization </swrc:keywords><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="S. Kim"/></rdf:_1><rdf:_2><swrc:Person swrc:name="H. Rim"/></rdf:_2><rdf:_3><swrc:Person swrc:name="D. Yook"/></rdf:_3><rdf:_4><swrc:Person swrc:name="H. 
Lim"/></rdf:_4></rdf:Seq></swrc:author></rdf:Description><rdf:Description rdf:about="http://www.bibsonomy.org/bibtex/22896eb9538a6ee34f8e6c6757bdcf99e/jil"><owl:sameAs rdf:resource="http://www.bibsonomy.org/uri/bibtex/22896eb9538a6ee34f8e6c6757bdcf99e/jil"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#Misc"/><owl:sameAs rdf:resource="http://people.csail.mit.edu/~jrennie/papers/sm-thesis.pdf"/><swrc:date>Mon May 05 19:34:57 CEST 2008</swrc:date><swrc:school><swrc:University swrc:name="Massachusetts Institute of Technology"/></swrc:school><swrc:title>Improving Multi-class Text Classification with Naive Bayes</swrc:title><swrc:year>2001</swrc:year><swrc:keywords>deduction thesis naive komplett estimation map prior bayes mle exhaustive likelihood multinomial herleitung maximum </swrc:keywords><swrc:abstract>There are numerous text documents available in electronic form. More and more
are becoming available every day. Such documents represent a massive amount of
information that is easily accessible. Seeking value in this huge collection requires
organization; much of the work of organizing documents can be automated through
text classification. The accuracy and our understanding of such systems greatly
influence their usefulness. In this paper, we seek 1) to advance the understanding
of commonly used text classification techniques, and 2) through that understanding,
improve the tools that are available for text classification. We begin by clarifying
the assumptions made in the derivation of Naive Bayes, noting basic properties and
proposing ways for its extension and improvement. Next, we investigate the quality
of Naive Bayes parameter estimates and their impact on classification. Our analysis
leads to a theorem which gives an explanation for the improvements that can be
found in multiclass classification with Naive Bayes using Error-Correcting Output
Codes. We use experimental evidence on two commonly-used data sets to exhibit an
application of the theorem. Finally, we show fundamental flaws in a commonly-used
feature selection algorithm and develop a statistics-based framework for text feature
selection. Greater understanding of Naive Bayes and the properties of text allows us
to make better use of it in text classification.</swrc:abstract><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="Jason D. M. Rennie"/></rdf:_1></rdf:Seq></swrc:author></rdf:Description><rdf:Description rdf:about="http://www.bibsonomy.org/bibtex/2fa46d1cc0dd56ab40a7f722e569a1fd3/jil"><owl:sameAs rdf:resource="http://www.bibsonomy.org/uri/bibtex/2fa46d1cc0dd56ab40a7f722e569a1fd3/jil"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#InProceedings"/><owl:sameAs rdf:resource="http://www.kamalnigam.com/papers/multinomial-aaaiws98.pdf"/><swrc:date>Mon May 05 19:02:36 CEST 2008</swrc:date><swrc:booktitle>Learning for Text Categorization: Papers from the 1998 AAAI Workshop </swrc:booktitle><swrc:pages>41--48</swrc:pages><swrc:title>A Comparison of Event Models for Naive Bayes Text Classification</swrc:title><swrc:year>1998</swrc:year><swrc:keywords>bernoulli naive model ereignis classification event text multinomial bayes vergleich </swrc:keywords><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="Andrew McCallum"/></rdf:_1><rdf:_2><swrc:Person swrc:name="Kamal Nigam"/></rdf:_2></rdf:Seq></swrc:author></rdf:Description><rdf:Description rdf:about="http://www.bibsonomy.org/bibtex/2e290abb350b7aa09a412c1dddac55cd6/jil"><owl:sameAs rdf:resource="http://www.bibsonomy.org/uri/bibtex/2e290abb350b7aa09a412c1dddac55cd6/jil"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#InProceedings"/><owl:sameAs rdf:resource="http://citeseer.ist.psu.edu/lewis98naive.html"/><swrc:date>Mon May 05 18:53:49 CEST 2008</swrc:date><swrc:address>Chemnitz, DE</swrc:address><swrc:booktitle>Proceedings of ECML-98, 10th European Conference on Machine Learning</swrc:booktitle><swrc:number>1398</swrc:number><swrc:pages>4--15</swrc:pages><swrc:publisher><swrc:Organization swrc:name="Springer Verlag, Heidelberg, DE"/></swrc:publisher><swrc:title>Naive (Bayes) at forty: The independence assumption in information retrieval.</swrc:title><swrc:year>1998</swrc:year><swrc:keywords>representation overview forty text naive bayes ir </swrc:keywords><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="David D. Lewis"/></rdf:_1></rdf:Seq></swrc:author><swrc:editor><rdf:Seq><rdf:_1><swrc:Person swrc:name="Claire Nédellec"/></rdf:_1><rdf:_2><swrc:Person swrc:name="Céline Rouveirol"/></rdf:_2></rdf:Seq></swrc:editor></rdf:Description><rdf:Description rdf:about="http://www.bibsonomy.org/bibtex/2b4e1a9d4635a9fb1f11a947f1ab3618a/jil"><owl:sameAs rdf:resource="http://www.bibsonomy.org/uri/bibtex/2b4e1a9d4635a9fb1f11a947f1ab3618a/jil"/><rdf:type rdf:resource="http://swrc.ontoware.org/ontology#Misc"/><owl:sameAs rdf:resource="http://citeseer.ist.psu.edu/757874.html"/><swrc:date>Mon May 05 18:50:15 CEST 2008</swrc:date><swrc:title>Spam Filtering with Naive Bayes -- Which Naive Bayes?</swrc:title><swrc:year>2006</swrc:year><swrc:keywords>metsis multivariate naive multinomial spam bayes </swrc:keywords><swrc:author><rdf:Seq><rdf:_1><swrc:Person swrc:name="Vangelis Metsis"/></rdf:_1><rdf:_2><swrc:Person swrc:name="Ion Androutsopoulos"/></rdf:_2><rdf:_3><swrc:Person swrc:name="Georgios Paliouras"/></rdf:_3></rdf:Seq></swrc:author></rdf:Description></rdf:RDF>