Evolutionary and Developmental Genetics

Penn State University

Research

Our lab is interested in the evolution of complex traits, evolutionary principles generally, and issues in the nature of our knowledge in the life sciences, including bioethical and societal aspects. We are also interested in human variation, how it got here, how much there is, and what it means in regard to complex traits. Our work is largely in genetics and involves studies of human polymorphisms and the amount of variation in genes related to human phenotypes, including disease-related traits, but with a concentration on the evolutionary processes that generated it, and issues involved in finding it.

A major thrust of my work is in developmental genetics and the evolution of developmental processes that control complex traits. We are currently working on the genetic basis of morphological traits that have been important in vertebrate evolution and constitute the fossil record, with particular attention on the patterning of the vertebrate dentition, and craniofacial morphology and its evolution. Our current model systems include collaborative work with a baboon genealogy in San Antonio (Southwest Foundation for Biomedical Research), and various experimental mouse models. This is a collaborative project with Drs Richtmeier, Jablonski, Walker, and Buchanan in our Department, and with Jeff Rogers at the Southwest Foundation and Jim Cheverud at Washington University, Fred Spoor in London, and other colleagues and consultants.

In addition, I am interested in bioinformatic approaches to understand the complex regulation of olfactory receptor genes and gene expression generally. This research involves searching DNA sequences for various species in and around the olfactory receptor gene clusters and searching for sequence 'signatures' that might be responsible for controling gene expression. At the moment, it isn't clear what bioinformatic tools or measures can be used in this quest.

Tying these areas together is an interest in an over-arching view of evolution as we know it from the science itself, as well as the way that the post-Darwin world has adopted Darwinian concepts. These concepts connect genes, history, origins, and the process of change as well as biological development. But the same general ideas of competitive natural selection are extended to many other areas of science and philosophy, including physics, the assumed nature of extra-terrestrial life, and human societies generally (see Bioethics, below). Often this is done uncritically or even based on misunderstandings about what we know about life and evolution. That can lead to misunderstandings both within biology itself and in the general public. Darwinian ideas have also often been upsetting to some religions, as we know, and we're in a time when that sensitivity is high in this country.

I also do collaborative work with Joe Terwilliger at Columbia University, whose slightly wacky web page (http://linkage.cpmc.columbia.edu/index2.html) shows why that is interesting to do. We co-teach a mini-course called Logical Reasoning in Human Genetics in which we discuss the amount and evolutionary origin of human variation in the context of human disease, and approaches to find and characterize that variation. Joe also has many tools for genetic inference in the disease context. Other co-instructors are also occasionally involved. See my Courses page or Joe's web page for more details and the next scheduled offering.

A major collaboration with Joe, and with a research scientist/programmer, Brian Lambert, here is that we're undertaking a broad-purpose, forward evolutionary simulation program package, to examine how evolution works and the issues we face in inferring the underlying genetic architecture of complex traits. In real life, we can never be sure we know the whole truth, but you can be when you have simulated data. Our program, called ForSim, is being developed for this purpose. More details are given below.

You can generally find our published papers by searching on PubMed or this lab web page.

::::Please note that while I try to keep this webpage reasonably current, I can't guarantee its precise accuracy. Things change as grants, interests, and personnel come and go. I do my best to keep it updated, however.::::::::::::::::::

FOR QUESTIONS ABOUT MY RESEARCH, PLEASE CONTACT ME: kenweiss@psu.edu

Development and patterning of the mammalian dentition

Human variation and disease

Olfactory Receptor Gene Regulation

Craniofacial development and evolution

ForSim: Forward Evolutionary Simulation

Bioethics and Biocosmology


Development and patterning of the mammalian dentition

Recumbinant Inbred Mice

Evolution of Complex Traits

We are searching for candidate genes involved in tooth development by comparing morphological differences in the teeth of inbred strains of mice. The photographs on the left show a comparison of the lower second molars of two strains of inbred mice (Balb/cJ (left) and SJL/J (right)). The SJL/J mice lack the fifth cusp on their lower second molars. If this trait segregates in CXJ mice (inbred strains made by crossing Balb/cJ and SJL/J) we may be able to locate segments of the genome that segregate with the trait and then, candidate genes.

 

 

 

 

Another major project here is being led mainly by Kazuhiko Kawasaki in my lab. See the Publications tab on my web page for publication references, of which there are several.
Gene duplication creates evolutionary novelties by using older tools in new ways. We have identified evidence that the genes for enamel matrix proteins (EMPs), milk caseins, and salivary proteins comprise a family called the SCPP genes Image(secretory calcium phospho-proteins) that are descended from a common ancestral gene called SPARCL1 by tandem gene duplication. These genes remain linked, except for one EMP gene, amelogenin. The SCPP genes show common structural features and are expressed in ontogenetically similar tissues. Many of these genes encode secretory Ca-binding phosphoproteins, which regulate the Ca-phosphate concentration of the extracellular environment. By exploiting this fundamental property, these genes have subsequently diversified to serve specialized adaptive functions. Casein makes milk supersaturated with Ca-phosphate, which was critical to the successive mammalian divergence. The innovation of enamel led to mineralized feeding apparatus, which enabled active predation of early vertebrates. The EMP genes comprise a subfamily not identified previously. A set of genes for dentine and bone extracellular matrix proteins constitutes an additional cluster distal to the EMP gene cluster, with similar structural features to EMP genes. The duplication and diversification of the primordial genes for enamel/dentine/bone extracellular matrix may have been important in core vertebrate feeding adaptations, the mineralized skeleton, the evolution of saliva, and, eventually, lactation. The order of duplication events may help delineate early events in mineralized skeletal formation, which is a major characteristic of vertebrates.

The SCPP family of genes provides and example of phenogenetic drift in which a trait (the mineralized nature of teeth, for example) is retained by natural selection, while its genetic basis changes. We are currently concentrating on early vertebrate evolution, including sharks, agnaths (lamprey), and amphibians. My Publications page lists other SCPP papers by Kazz and myself that are out or forthcoming.
[PDF]

Craniofacial growth and evolution

In collaboration with Joan Richtsmeier, Nina Jablonski, Anne Buchanan, and Alan Walker in our Department, Jim Cheverud in Washington University, and Jeffrey Rogers at the Southwest Foundation for Biomedical Research in San Antonio, Texas. I am involved in studies of the genetic basis and evolution of the shape of the head in primate evolution. This is part of the NSF Hominid, or Human Origins, major-project program (see their web page at nsf.org). Our project involves gene mapping in baboons and mice and various aspects of informatics and experimental mouse genetics, based on morphometric findings on CT-scanned mice and baboons. In addition, we are using the human, chimpanzee, macacaque, and mouse (and other) whole-genome sequences, and bioinformatic (comparative DNA sequence analysis) approaches, to identify genes that might have undergone natural selection or similar evolutionary processes during the evolution of the uniquely shaped human head and face. We have already identified some very interesting candidate regions in the baboon data, and mapping studies of a large sample of mice will occur sometime in 2008, along with scaling up and sharpening of the baboon reults. Candidate regions will be followed up in various ways, including gene expression and other experimental studies of the role of candidate genes in the craniofacial develoment in mouse embryos. This work has much overlap with our work on dental patterning and on mineralization, described elsewhere on this page. For more information about this project and its many facets, go here: http://www.getahead.psu.edu/hominid_index.html

Olfactory Gene Regulation

Chemodetection--smell and pheromones--is one of the most central of the senses in many animals and though less prominent in primates it is still important.   A mammal has about 1000 olfactory receptor genes (and as diploids, two copies of each).  This is the largest gene family in our genomes, and they are located in about 25 clusters spread across our genome.  Olfactory receptors sit in the cell membrane of our many millions of olfactory neurons, waiting to detect an odorant that they can bind to and send a 'received!' signal to the brain.  Each olfactory neuron expresses on one of these genes, so an odorant triggers a set of neurons that bear receptors that can bind to the odorant.  This cell/receptor specificity enables the brain to learn and remember patterns of signal that represent each odorant.  But how does this single-gene per cell expression that happen?  I am interested in applying bioinformatics (computational biological) methods to search between species, among gene clusters, and among genes for the DNA sequence patterns or signals that are used to bring about this remarkable expression pattern.  The methods will eventually involve experimental approaches to gene expression, but at present are restricted to computer-based pattern searching that I am trying in many different ways. I have ideas about how this might work....but until I can find good evidence and design experiments it remains in the dream and struggle stage. Students interested in programming, evolutionary or developmental modeling, or bioinformatics might find opportunities here.

Human Variation and Disease

We’re interested in the molecular genetic investigation of the amount of human variation, its geographic distribution, and the relationship of that variation to risk of common, complex, conditions like cardiovascular disease (CVD) and diabetes. The first main aim of our work has been to elucidate the full nature of standing variation as it occurs within and among human populations, both to relate observed variation to the effects of demographic factors and, where applicable the effects of natural selection, to define more clearly the ‘normal’ variation whose perturbations may be associated with disease. Currently my group is not directly working with subjects affected by disease. Rather, we're interested in the evolutionary and population processes that generate the variation in our species that includes disease-associated variation. From time to time we do become involved in studies of genetic varition more directly in relation to disease, but that isn't a main project at present. My lab is not currently suitable for students who wish to do research in disease as a major focus, unless it would be to work with bioinformatic techniques or with projects indirectily related to disease. For those interests see my work in forward evolutionary simulation (see next section), Olfactory gene regulation, dental patterning and evolution, and craniofacial development, or the genetic basis of vertebrate mineralization and its evolution, all on this page.

ForSim: A Forward Evolutionary Computer Simulation

Population genetics theory provides vital tools to understand many aspects of genetic change over the long and the short term.  There are many excellent backward (coalescent) simulation programs, that take a set of sequences sampled today and work backwards in time to reconstruct their common ancestral sequence.  Some of these programs can handle recombination, and natural selection in rather restricted ways. But life is really lived forward, and to be able to understand many aspects of evolution we need to be able to simulate the actual processes of genetic change as they happen forward in time.  Forward evolutionary simulations work the way nature does, screening on phenotypes and only indirectly on genotypes. Forward simulation is brute-force simulation, rather than resting on elegant theory, but for the same reason it is much more flexible. Weiss Lab Image

With Brian Lambert, we have developed a forward evolutionary simulation program called ForSim, that is phenogenetic rather than genetic in nature, an attempt at full-fledged evolutionary simulation. A phenogenetic simulation generates genetic variation, but then rather than having that evolve directly, translates that variation (as real organisms do) into phenotypes, and then those are subject to various modes of natural selection, in finite populations of various types, sizes, structures, and dynamics.  This kind of approach is important for understanding the genetic architecture that results from the evolution of real biological traits.  The figure above shows basic logical flow of the program's components. Below we present results that suggest just a taste of what the program package can do. ForSim is exceedingly flexible, and allows users to specify many different aspects of a simulation. Weiss Lab

 

The figure on the left shows results of a case-control comparison among simulated individuals, as reflected in the haplotypes at a simulaed gene. White (controls) and grey (cases) are given as haplotypes, one row to each. The haplotype's frequency is given in the bar graph at the left, and its net effect on a simulated quantitative phenotype on the right (red, negative effect, blue positive effect). Each contributing SNP is shown (green: no phenotype effect). These are arranged in haplotype-similarity order based on ClustalW clustering. You can see the multiallelic haplotypes and their relationships, as well as the small differences between what makes a case vs a control.

The next figure shows the haplotypes and linkage disequilibrium in two source populations (A and B) that evolved separately for 2,500 generations, after splitting from a single population that had evolved for 7,500 generations, and thenformed an admixed population (C), that then evolved for 10 generations to the 'present' (this is like many admixed human populations in the US today). The figure shows the LD pattern differences between the populations as resolved by the Haploview program (on which HapMap was based, but applied to our simulated data). . Of the 10 simulated genes shown here, 5 affected a trait under weak selection, identified by arrows, whle the other genes did not affect a trait and evolved neutrally. The admixture and selection affects are not great, as is usually the case, but can be seen: and these data are like real human data on most genome regions in admixed populations.

Weiss Lab Image

It is clear that human geneticists are having difficulty understanding the genetic architecture and specific contributing genes that underlie variation in important complex trait like those responsible for human diseases. This is a major impetus for the program, and ForSim is being written in collaboration with Joe Terwilliger at Columbia University, who is interested in how study design may be optimized in searches for disease genes by mapping and other means.  The examples shown here reflect that biomedical interest. We are funded by an NIH grant for this purpose. Similar problems face anthropologists interested in the genetic basis of normal traits, with focus on how they evolve, such as cranial shape, dental patterning, and stature, and their variation among primates including ourselves.  ForSim is designed to address these problems. At the end of a simulation pedigree data are saved, that can be tested in biostatistical inference packages. But the applications are not limited to humans, nor to very short-term population history, but can be applied to longer-term evolution as well.

ForSim accommodates multiple loci, chromosomes, and populations, and a range of mating, gene flow, phenogenetic, and selection models with parameters that can be changed at any point during a simulation, and copious output data during the run (as specified by the user) and at the end.

ForSim is intended for simulation of the generation of genetic variation within a fixed genetic etiology (number of genes and their general mode of action and interaction, etc.), on a scale of up to millions of years, so that we can better understand the amount and nature of causal variation for complex traits today. Other useful forward simulations have been written for various purposes, including by Jody Hey (see his Rutgers web page), SelSim by Spencer and Coop (Google it), and a technical report documenting Fregene by David Balding and colleagues. SimUPop, by Peng, Amos, and Kimmel (Google or PubMed search). Many population geneticists have written ad hoc forward simulations of one kind or another, and I am sorry if I have failed to mention, or even to be aware of them.

ForSim has been submitted to a journal and is in review. When it is accepted, we will release version 1.0 ( beta-test). The program (C++ source code and other wrapper scripts and resources) will be available at no cost (conditional on agreeing to an open-source non-commercialization license), including a detailed operating Manual describing the program and how to set up its input file that specifies the many different user-variable run conditions, aloong with samples of output text and graphics. We will want users to register with us so we can notify of bugs and changes, and agree how to credit the program and describe any changes you make to the code in any resulting publications, watermarking the modified source code appropriately. The current manual will be posted here (link to be added) so anyone using the program can have the clearest instructions we can provide.

Bioethics and biocosmology

With the completion of the Human Genome Project (HGP) and many other genomic resources rapidly expanding, leading to accelerating use of genetic information in diagnosis and risk assessment, it is important to take stock of the larger social and ethical implications of population genetics research. Several relevant areas are currently of interest to me and my group. We explore the empirical and epistemological limits of genetic information, both to highlight the current tendency toward what I think is excessive genetic determinism that goes beyond what we can actually say with data (or, often, directly against what we already know). It is important that people have a fuller understanding of the genetic and environmental contributions to complex traits and how difficult such traits, and their evolution, are to understand. Another major research interest is on the implications of population-specific (or as some would insist on calling it, 'race' specific) analysis, attempting to balance the likely benefits of increased knowledge regarding disease etiology and risk, against potential harms of stigmatization, discrimination, and categorical treatment of quantitative variation. I am also interested in the ways in which vested interests, ranging from private biotechnology firms to representatives of communities being investigated, interact to determine research priorities and hence, scientific outcomes and even our views of the science itself.

These subjects are connected through the over-arching theory of evolution, and genes as the underlying 'atoms' of this worldview. The powerful and perceptive ideas of genes and natural selection and historical evolution are extended, sometimes uncritically but with important implications for society and human well-being, to economics, politics, behavior, social relations and many other areas. We're interested in how this happens and what it means--especially because even biologists seem sometimes unaware of its limitations within biology itself.

We work closely with faculty and staff in Penn State's Rock Ethics Institute and department of Science, Technology, and Society (and others) in our bioethics work, and we have recently helped for an undergraduate minor in the subject....with future developments in the works. Post-docs or graduate students with interests in the history, philosophy, or societal asepcts of relevant areas of biology or biological anthropology might find our program interesting.

Bioethics Links

Rock Ethics Institute at Penn State:
http://rockethics.psu.edu/

Science, Medicine, and Technology in Culture program at Penn State:
http://rockethics.psu.edu/smtc/

ELSI Research Program of the National Human Genome Research Institute:
http://www.genome.gov/10001618

Bioethics Resources (from the NIH):
http://www.nih.gov/sigs/bioethics/index.html

Bioethics.net:
http://www.ajobonline.com/

New Mouse Under Development

"The Kawasaki"

"Easy to handle, but extremely difficult to breed" says Chief of Mouse Development Kazu Kawasaki of his latest mouse.

Consisting of only a head and tail, the Kawasaki Mouse, eliminates the fuss and muss of internal organs* for researchers interested in tooth, brain, or tail development. An ancillary advantage is the ability to study flagellar locomotion in mammals. That work is in progress.

Look for this handy research tool to be available soon!


Mouse

* NOTE: Requires special "pre-digested" feed (also under development)