Simulation of the Evolution of Single Celled Organisms
with Genome, Metabolism and Time-Varying Phenotype
P. Kennedy. University of Technology, Sydney, Australia, (July 1999)
Abstract
A novel model of a biological cell is presented.
Primary features in the cell are a genome and
metabolism. Pairs of genome and metabolism coevolve
with a genetic algorithm (GA) to produce cells that can
survive in simple environments. Evolution of the genome
is Darwinian, whereas evolution of the metabolism has
Lamarckian features through acquired chemical
concentrations being inherited. Fitness is more closely
correlated with the mother cell than with the father. A
biologically inspired double-strand genome model is
presented. Double-stranded genomes admit a large
increase in the number of schemata represented by each
genome compared to single-strand encodings. This gives
GAs more information to use and allows faster search.
Simple implementation of a biologically inspired
algorithm for inversion also becomes possible, as well
as a compression of data on the genome. Increased rates
of inversion showed an increase in population
convergence. Double-stranded genomes impose constraints
between strands that decrease the overall rate of
population convergence. Four-bit bases from a parallel
genomic language are encoded on the genome. The
parallel genomic language, following the operon model
of Jacob and Monod, allows genes to be placed on the
genome at any loci and allows easy implementation of an
inversion operator. The genome and chemical metabolism
of a cell in our model have a close relationship.
Genomes specify allowable families of enzyme-catalysed
chemical reactions and families of chemicals that may
diffuse through the cell membrane at increased rate.
Chemicals produced from metabolic processes regulate
genes and allow expression of proteins from the genome.
We introduce the "bootstrapping" problem: evolution
of cells stable in simple environments from random
genomes and initial simple metabolic conditions.
Experiments show that solution of the
"bootstrapping" problem is much easier with
coevolution than when the initial metabolic conditions
remain fixed. A gallery of cellular survival strategies
is given. Genes in the population are diverse because
there is a variety of equally valid solutions to the
problem posed by the environment. Solution to the
"bootstrapping" problem is hindered because fitness
functions cannot differentiate between cells using
myopic solutions rather than long-term strategies.
Cells with myopic strategies attain high fitness but
produce offspring with high probability of cell death
(ie, when the myopic solution begins to fail). A novel
solution, where fitness of parents is retroactively
modified when the fitness of offspring becomes known,
reduces the number of cells exhibiting myopic
strategies.
Fri, 15 Jun 2001 12:16:19 +1000 To:
genetic-programming@cs.stanford.edu
I applied some more biological notions to GAs in my PhD
thesis. In that, I built a model of single-celled
organisms and bred populations of them to live in
simple environments. The cell models had a
double-stranded (DNA inspired) genome and a
chemical-kinetic metabolism. Operons on the genome
encoded enzymes to control reactions in the metabolism
and the metabolism itself instantiated the simulated
enzyme molecules from the genome template.
Some of the complexities I added from biology were:
- operons (for a model of gene regulation)
- a gene expression algorithm (transcription and
translation algorithms)
- a double stranded genome (not diploidy)
- the inversion genetic operator
- a language that allows genes to appear at any locus
on the genome
- a phenotype that interacts with the genome for its
lifetime rather than just at the start.
The simulation was interesting but big and slow. It
generated so much information that it was a bit
difficult to work out how it was solving a problem.
PhD thesis is here:
http://zahir.socs.uts.edu.au:9673/Paul/research_html
Since then I've looked at abstracting the biological
concepts out of the big simulation into simpler models
(with no differential equations!). This work has
focused on the double-stranded genome and inversion
operator. I have a paper at GECCO about this
work.
Currently I'm interested in the (overly simplified)
idea that biological cells exist as systems in
isolation to their genome. (That's not to say that a
cell can exist without its genome). Without the genome
the cell is a sort of "default" system. As you add
genes to the genome you add enzymes to the system which
kicks the metabolism into different areas of (chemical)
reaction space. I see this kind of phenotype as a
"tempered" phenotype - tempered by genes rather
than completely specified. I'm looking forward to
discussing some of these ideas at the gene expression
workshop at GECCO next month.
Cheers, Paul.
%0 Thesis
%1 kennedy:thesis
%A Kennedy, Paul Joseph
%C Australia
%D 1999
%K algorithms genetic
%T Simulation of the Evolution of Single Celled Organisms
with Genome, Metabolism and Time-Varying Phenotype
%U http://zahir.socs.uts.edu.au:9673/Paul/Papers/Thesis.zip
%X A novel model of a biological cell is presented.
Primary features in the cell are a genome and
metabolism. Pairs of genome and metabolism coevolve
with a genetic algorithm (GA) to produce cells that can
survive in simple environments. Evolution of the genome
is Darwinian, whereas evolution of the metabolism has
Lamarckian features through acquired chemical
concentrations being inherited. Fitness is more closely
correlated with the mother cell than with the father. A
biologically inspired double-strand genome model is
presented. Double-stranded genomes admit a large
increase in the number of schemata represented by each
genome compared to single-strand encodings. This gives
GAs more information to use and allows faster search.
Simple implementation of a biologically inspired
algorithm for inversion also becomes possible, as well
as a compression of data on the genome. Increased rates
of inversion showed an increase in population
convergence. Double-stranded genomes impose constraints
between strands that decrease the overall rate of
population convergence. Four-bit bases from a parallel
genomic language are encoded on the genome. The
parallel genomic language, following the operon model
of Jacob and Monod, allows genes to be placed on the
genome at any loci and allows easy implementation of an
inversion operator. The genome and chemical metabolism
of a cell in our model have a close relationship.
Genomes specify allowable families of enzyme-catalysed
chemical reactions and families of chemicals that may
diffuse through the cell membrane at increased rate.
Chemicals produced from metabolic processes regulate
genes and allow expression of proteins from the genome.
We introduce the "bootstrapping" problem: evolution
of cells stable in simple environments from random
genomes and initial simple metabolic conditions.
Experiments show that solution of the
"bootstrapping" problem is much easier with
coevolution than when the initial metabolic conditions
remain fixed. A gallery of cellular survival strategies
is given. Genes in the population are diverse because
there is a variety of equally valid solutions to the
problem posed by the environment. Solution to the
"bootstrapping" problem is hindered because fitness
functions cannot differentiate between cells using
myopic solutions rather than long-term strategies.
Cells with myopic strategies attain high fitness but
produce offspring with high probability of cell death
(ie, when the myopic solution begins to fail). A novel
solution, where fitness of parents is retroactively
modified when the fitness of offspring becomes known,
reduces the number of cells exhibiting myopic
strategies.
@phdthesis{kennedy:thesis,
abstract = {A novel model of a biological cell is presented.
Primary features in the cell are a genome and
metabolism. Pairs of genome and metabolism coevolve
with a genetic algorithm (GA) to produce cells that can
survive in simple environments. Evolution of the genome
is Darwinian, whereas evolution of the metabolism has
Lamarckian features through acquired chemical
concentrations being inherited. Fitness is more closely
correlated with the mother cell than with the father. A
biologically inspired double-strand genome model is
presented. Double-stranded genomes admit a large
increase in the number of schemata represented by each
genome compared to single-strand encodings. This gives
GAs more information to use and allows faster search.
Simple implementation of a biologically inspired
algorithm for inversion also becomes possible, as well
as a compression of data on the genome. Increased rates
of inversion showed an increase in population
convergence. Double-stranded genomes impose constraints
between strands that decrease the overall rate of
population convergence. Four-bit bases from a parallel
genomic language are encoded on the genome. The
parallel genomic language, following the operon model
of Jacob and Monod, allows genes to be placed on the
genome at any loci and allows easy implementation of an
inversion operator. The genome and chemical metabolism
of a cell in our model have a close relationship.
Genomes specify allowable families of enzyme-catalysed
chemical reactions and families of chemicals that may
diffuse through the cell membrane at increased rate.
Chemicals produced from metabolic processes regulate
genes and allow expression of proteins from the genome.
We introduce the {"}bootstrapping{"} problem: evolution
of cells stable in simple environments from random
genomes and initial simple metabolic conditions.
Experiments show that solution of the
{"}bootstrapping{"} problem is much easier with
coevolution than when the initial metabolic conditions
remain fixed. A gallery of cellular survival strategies
is given. Genes in the population are diverse because
there is a variety of equally valid solutions to the
problem posed by the environment. Solution to the
{"}bootstrapping{"} problem is hindered because fitness
functions cannot differentiate between cells using
myopic solutions rather than long-term strategies.
Cells with myopic strategies attain high fitness but
produce offspring with high probability of cell death
(ie, when the myopic solution begins to fail). A novel
solution, where fitness of parents is retroactively
modified when the fitness of offspring becomes known,
reduces the number of cells exhibiting myopic
strategies.},
added-at = {2008-06-19T17:35:00.000+0200},
address = {Australia},
author = {Kennedy, Paul Joseph},
biburl = {https://www.bibsonomy.org/bibtex/2ddbda1c217ec29e1b089d89433721f86/brazovayeye},
interhash = {2314afab123ab1b8b4bd480bd901edae},
intrahash = {ddbda1c217ec29e1b089d89433721f86},
keywords = {algorithms genetic},
month = {July},
notes = {Fri, 15 Jun 2001 12:16:19 +1000 To:
genetic-programming@cs.stanford.edu
I applied some more biological notions to GAs in my PhD
thesis. In that, I built a model of single-celled
organisms and bred populations of them to live in
simple environments. The cell models had a
double-stranded (DNA inspired) genome and a
chemical-kinetic metabolism. Operons on the genome
encoded enzymes to control reactions in the metabolism
and the metabolism itself instantiated the simulated
enzyme molecules from the genome template.
Some of the complexities I added from biology were:
- operons (for a model of gene regulation)
- a gene expression algorithm (transcription and
translation algorithms)
- a double stranded genome (not diploidy)
- the inversion genetic operator
- a language that allows genes to appear at any locus
on the genome
- a phenotype that interacts with the genome for its
lifetime rather than just at the start.
The simulation was interesting but big and slow. It
generated so much information that it was a bit
difficult to work out how it was solving a problem.
PhD thesis is here:
http://zahir.socs.uts.edu.au:9673/Paul/research_html
Since then I've looked at abstracting the biological
concepts out of the big simulation into simpler models
(with no differential equations!). This work has
focused on the double-stranded genome and inversion
operator. I have a paper at GECCO about this
work.
Currently I'm interested in the (overly simplified)
idea that biological cells exist as systems in
isolation to their genome. (That's not to say that a
cell can exist without its genome). Without the genome
the cell is a sort of {"}default{"} system. As you add
genes to the genome you add enzymes to the system which
kicks the metabolism into different areas of (chemical)
reaction space. I see this kind of phenotype as a
{"}tempered{"} phenotype - tempered by genes rather
than completely specified. I'm looking forward to
discussing some of these ideas at the gene expression
workshop at GECCO next month.
Cheers, Paul.},
school = {University of Technology, Sydney},
size = {278 250 pages},
timestamp = {2008-06-19T17:43:08.000+0200},
title = {Simulation of the Evolution of Single Celled Organisms
with Genome, Metabolism and Time-Varying Phenotype},
url = {http://zahir.socs.uts.edu.au:9673/Paul/Papers/Thesis.zip},
year = 1999
}