Oh my, has it really been that long?
It is ten years we got the first complete piece of the human genome, chromosome 22.
We’ve got tons of genomes now, but of course the human genome was the first large one, and it is worth remembering, in these “slash-and-burn” genome days where we get a new large genome every other month, that most of those genomes are actually only at a (rough) draft level. Only the human and mouse genome are really considered “complete”, the rest are just drafts and in many cases there are still many issues to work out with over-collapsed assemblies and such and with all the troublesome regions missing.
Hopefully, we will go back to many of the genomes to fix them and complete them, when we are done creating drafts of all the interesting species.
Anyway, I have some more to say on this issue but I have to wait for a paper that I want to review, but a paper that is held back right now waiting for the orangutan genome paper to be out… so stay tuned.
–
337-335=+2
If you enjoyed this post, make sure you subscribe to my RSS feed!
December 3rd, 2009 at 9:23 pm
I do not know if you count them as large, but I would argue that the C Elgans and Arabidobsis genomes are also finished.
December 3rd, 2009 at 9:36 pm
They are ~100Mbp and ~250Mbp in size, as far as I remember off the top of my head, so perhaps semi-large ;-) An order of magnitude smaller than what I had in mind, though, thinking of the human genome.
They are considered completed, though, so you are right.
For a list of genome projects with status, see http://www.ncbi.nlm.nih.gov/genomes/static/gpstat.html There’s four animal genomes completed, C elegans, D melanogaster (short-ish animal genomes) in addition to man and mouse. There’s also four plan genomes completed, with rice being the other one in addition to A thaliana.
December 3rd, 2009 at 10:34 pm
Only the human and mouse genome are really considered “complete”, the rest are just drafts and in many cases there are still many issues to work out with over-collapsed assemblies and such and with all the troublesome regions missing.
Er…human, mouse, and over 1000 complete prokaryotic genomes. You know, the genomes that actually contain novel genes. Euks (especially the mammals) are all just basically minor variations of each other.
December 4th, 2009 at 8:13 am
Jonathan: Sorry, I thought it was clear from context that I wasn’t referring to prokaryotes where, you are right, we have quite a few more completed genomes. 979 completed genomes, according to NCBI.
I’m not saying that the prokaryotic genomes are not interesting or anything — they definitely are — but it is not the genomes that show up in the news, usually.
December 4th, 2009 at 8:14 am
I guess what matters with eukaryotes is that large genomes are mainly a consequence of many repeats, so one could say that there are only two really complex repeat-rich genomes that have been finished to a satisfying degree. That is really quite discouraging, but one could hope that new technologies, especially very long read sequencing and new technological alternatives to BAC cloning, will remedy this. Otherwise we will continue to see a flurry of poorly finished and poorly annotated genomes coming out.
It would be especially nice to see a polyploid genome finished to a satisfying degree.
December 4th, 2009 at 8:25 am
Yes, it is indeed the repetitive regions that are problematic, and with the current technology it is probably too expensive to complete the genomes…