Evolving Lucene search queries for text
classification
L. Hirsch, R. Hirsch, and M. Saeedi. GECCO '07: Proceedings of the 9th annual conference on
Genetic and evolutionary computation, 2, page 1604--1611. London, ACM Press, (7-11 July 2007)
Abstract
We describe a method for generating accurate, compact,
human understandable text classifiers. Text datasets
are indexed using Apache Lucene and Genetic Programs
are used to construct Lucene search queries. Genetic
programs acquire fitness by producing queries that are
effective binary classifiers for a particular category
when evaluated against a set of training documents. We
describe a set of functions and terminals and provide
results from classification tasks.
GECCO '07: Proceedings of the 9th annual conference on
Genetic and evolutionary computation
year
2007
month
7-11 July
pages
1604--1611
publisher
ACM Press
volume
2
organisation
ACM SIGEVO (formerly ISGEC)
publisher_address
New York, NY, USA
isbn13
978-1-59593-697-4
notes
GECCO-2007 A joint meeting of the sixteenth
international conference on genetic algorithms
(ICGA-2007) and the twelfth annual genetic programming
conference (GP-2007).
ACM Order Number 910071
%0 Conference Paper
%1 1277279
%A Hirsch, Laurence
%A Hirsch, Robin
%A Saeedi, Masoud
%B GECCO '07: Proceedings of the 9th annual conference on
Genetic and evolutionary computation
%C London
%D 2007
%E Thierens, Dirk
%E Beyer, Hans-Georg
%E Bongard, Josh
%E Branke, Jurgen
%E Clark, John Andrew
%E Cliff, Dave
%E Congdon, Clare Bates
%E Deb, Kalyanmoy
%E Doerr, Benjamin
%E Kovacs, Tim
%E Kumar, Sanjeev
%E Miller, Julian F.
%E Moore, Jason
%E Neumann, Frank
%E Pelikan, Martin
%E Poli, Riccardo
%E Sastry, Kumara
%E Stanley, Kenneth Owen
%E Stutzle, Thomas
%E Watson, Richard A
%E Wegener, Ingo
%I ACM Press
%K algorithms, apache classification genetic lucene, programming, text
%P 1604--1611
%T Evolving Lucene search queries for text
classification
%U http://doi.acm.org/10.1145/1276958.1277279
%V 2
%X We describe a method for generating accurate, compact,
human understandable text classifiers. Text datasets
are indexed using Apache Lucene and Genetic Programs
are used to construct Lucene search queries. Genetic
programs acquire fitness by producing queries that are
effective binary classifiers for a particular category
when evaluated against a set of training documents. We
describe a set of functions and terminals and provide
results from classification tasks.
@inproceedings{1277279,
abstract = {We describe a method for generating accurate, compact,
human understandable text classifiers. Text datasets
are indexed using Apache Lucene and Genetic Programs
are used to construct Lucene search queries. Genetic
programs acquire fitness by producing queries that are
effective binary classifiers for a particular category
when evaluated against a set of training documents. We
describe a set of functions and terminals and provide
results from classification tasks.},
added-at = {2008-06-19T17:35:00.000+0200},
address = {London},
author = {Hirsch, Laurence and Hirsch, Robin and Saeedi, Masoud},
biburl = {https://www.bibsonomy.org/bibtex/215321960377bb90f91096c5592989adb/brazovayeye},
booktitle = {GECCO '07: Proceedings of the 9th annual conference on
Genetic and evolutionary computation},
editor = {Thierens, Dirk and Beyer, Hans-Georg and Bongard, Josh and Branke, Jurgen and Clark, John Andrew and Cliff, Dave and Congdon, Clare Bates and Deb, Kalyanmoy and Doerr, Benjamin and Kovacs, Tim and Kumar, Sanjeev and Miller, Julian F. and Moore, Jason and Neumann, Frank and Pelikan, Martin and Poli, Riccardo and Sastry, Kumara and Stanley, Kenneth Owen and Stutzle, Thomas and Watson, Richard A and Wegener, Ingo},
interhash = {a261595e779acc4fa922c206449150f0},
intrahash = {15321960377bb90f91096c5592989adb},
isbn13 = {978-1-59593-697-4},
keywords = {algorithms, apache classification genetic lucene, programming, text},
month = {7-11 July},
notes = {GECCO-2007 A joint meeting of the sixteenth
international conference on genetic algorithms
(ICGA-2007) and the twelfth annual genetic programming
conference (GP-2007).
ACM Order Number 910071},
organisation = {ACM SIGEVO (formerly ISGEC)},
pages = {1604--1611},
publisher = {ACM Press},
publisher_address = {New York, NY, USA},
timestamp = {2008-06-19T17:41:31.000+0200},
title = {Evolving Lucene search queries for text
classification},
url = {http://doi.acm.org/10.1145/1276958.1277279},
volume = 2,
year = 2007
}