Archive for July 7th, 2009

Am I getting too old?

Tuesday, July 7th, 2009

…for completely relating to the beginning of this song?

My second computer was a Vic 20.  I know that that puts me in the young category amongst most of my colleagues, but it certainly puts me in the “older than dirt” category amongst my students.

Oh well, enough time wasted on youtube tonight.  I have to finish two papers this week, so I have to get up early tomorrow, so off to bed with me…

187-195=-8

I don’t hate Canada

Tuesday, July 7th, 2009

Just re the last post: I don’t Canada…

No, seriously, I have yet to meet an unpleasant Canadian… but I love this song, so I couldn’t help myself.

Oh, yeah, and just to show that I am an equal opportunity bigot:

187-194=-7

Is free speech a human right?

Tuesday, July 7th, 2009

Maybe, and then maybe not…

This is a few days old, but I didn’t see it until now, so sorry for sharing it a bit late.

More here.

For more on Ezra Levant, follow the videos you find after this one:

or check out his blog.

Now, I try to keep my political belief out of my blog and focus on science, but any attack on free speech – especially based on religion as in the case of Ezra Levant – I just cannot ignore it.

Is free speech a human right? It may not be, but it certainly should be.

And remember, only unpopular speech needs to be protected!  That mean Nazis, Fascists and Comunists; Islamists and Pro Life extremeists should have the right to free speech!  Thought crimes and “hate crimes” laws are just plain wrong!  Punish actions – and only actions – and do not, ever, punish thoughs and beliefs.

Attack them when you disagree them, with dialogue and words, but do not try to silence them.

Here in Denmark, Nazies are free to publish and have their own radio station.  The result is that they are generally ridiculed.  We listen to them and disagree.  If there is no public debate about their views, how can we educate people on just how wrong they are?

(We do have a blasphemy law – that last time it was used in court was in 1938 – and I strongly think it should be removed from our laws… who get’s to choose which superstition should be valued over which other superstition?)

Do not silence.  Confront!

187-193=-6

Ancestral Population Genomics: The Coalescent Hidden Markov Model Approach

Tuesday, July 7th, 2009

We just got a new paper out – in “Advanced Access” at least – on coalescent hidden Markov models:

Ancestral population genomics: the coalescent hidden Markov approach

J. Dutheil et al.

Genetics

Abstract

With incomplete lineage sorting (ILS), the genealogy of closely related species differs along their genomes. The amount of ILS depends on population parameters such as the ancestral effective population sizes and the recombination rate, but also on the number of generations between speciation events. We use a hidden Markov model parametrized according to coalescent theory in order to infer the genealogy along a four-species genome alignment of closely related species, and estimate population parameters. We analyze a basic, panmictic demographic model and study its properties using an extensive set of coalescent simulations. We assess the effect of the model assumptions, and demonstrate that the Markov property provides a good approximation to the ancestral recombination graph. Using a too restricted set of possible genealogies, necessary to reduce the computational load, can bias parameter estimates. We propose a simple correction for this bias, and suggest directions for future extensions of the model. We show that the patterns of ILS along a sequence alignment can be recovered efficiently together with the ancestral recombination rate. Finally, we introduce an extension of the basic model that allows for mutation rate heterogeneity, and reanalyze Human-Chimpanzee-Gorilla-Orangutan alignments using the new models. We expect that this framework will prove useful for population genomics and provide exciting insights into genome evolution.

This paper has been a long time underway.  Pretty much Julien’s entire post doc, actually, but there are some upcoming application papers that still makes all this work worthwhile.

There are two main results in the paper.

First, a new parameterisation of the hidden Markov model that directly parameterises the HMM in terms of population genetic parameters such as effective population size and recombination rate.  This is mainly the work of Ganesh and Marcy, we collaborated with on this paper.  In our 2007 paper, we parameterised the model just like any hidden Markov model but then extracted population genetics parameters from estimated transition and emission probabillities; in this paper we can do maximum likelihood parameter estimation directy from the coalescent process.

Second, we have a much more detailed simulation validation of the model.  From extensive simulations we have validated the model and discovered its strengths and weaknesses.  Of the latter, especially of importance is various biases in parameter estimates.  We discovered some systematic biases in estimates of speciation time and, especially, recombination rate.  The latter we didn’t even consider in the first paper, but the bias in the former probably means that the speciation time estimate of human and chimp in our 2007 paper was biased and somewhat more recent that the real speciation time.

Julien came up with a simulation approach to alleviate the biases, and although this approach is somewhat time consuming it does seem to improve the estimates.

While the paper was in review, we identified some of the sources of the biases, and we now have a model that looks much less biased than the one in the paper.  It doesn’t completely remove the bias on the recombination rate, but is much better at estimating the other parameters. It is based on the continuous time Markov models I have described here and here, but results are still somewhat preliminary and the model can only deal with two genomes and not incomplete lineage sorting, so it is still a long way from handling data like that in this new Genetics paper.

We have a draft of a paper describing the new method, and some results for the orangutang genome project that will probably be out later this year or early next year, so I will not go into details about it here.  There are still a lot of details to work out on that model before we know exactly how well it performs compared to the old one.

Anyway, the work we did on this paper told us a lot about the coalescent hidden Markov model approach.  Mainly good stuff at that, the biases aside.  It is a very fast method – fast enough to analyze full genomes – and is pretty good at estimating speciation times.  The latter is somewhat problematic when the average genomic divergence time varies significantly from the speciation time due to large effective population sizes, so the new model should be much better at it than plain old “molecular clock” estimates.

  • Dutheil, J., Ganapathy, G., Hobolth, A., Mailund, T., Uyenoyama, M., & Schierup, M. (2009). Ancestral Population Genomics: The Coalescent Hidden Markov Model Approach Genetics DOI: 10.1534/genetics.109.103010
  • Hobolth, A., Christensen, O., Mailund, T., & Schierup, M. (2007). Genomic Relationships and Speciation Times of Human, Chimpanzee, and Gorilla Inferred from a Coalescent Hidden Markov Model PLoS Genetics, 3 (2) DOI: 10.1371/journal.pgen.0030007

187-192=-5

New web pages

Tuesday, July 7th, 2009

We spent today porting BiRC’s old web pages to our new university wide CMS and integrating the pages to the university personnel and publicaiton database.  The latter should make it easier to automatically keep publication lists and such updated – if that database was actually kept updated which it doesn’t really seem to be, but that is a rant for another day.

You can see the new pages here.

187-191=-4