The problem of genome assembly is basically that we can only read small bits of the genome, and the way we do it is we'll typically just read a random small bit. And so, to see the whole thing, we've got to stitch it together from these random little reads. And in principle this wouldn't be too hard except for the fact that the genome is not really like a book, it's not like prose, it's more like poetry. It has a lot of repeating elements. And when you get in a repeating elements you can actually get a bit lost when you're trying to stitch it together.
jim kent,assembly program,interviewee,small pieces,prose,sequences,principle,elements,poetry