How much of their DNA do humans share with humans?

I saw this cited in another SDMB thread and it made me curious. This site says,

Pardon my skepticisim, but I would have an easier time believing this if a chimpanzee said it. I have heard this figure cited before, and it has always made me wonder whether (a) it is perfectly clear but doesn’t make sense to me because genetics is just not my strong suit or (b) it actually has a basis in fact but it has been watered down so much for the hoi polloi that it has lost most of its meaning. I, for one, have never shared any of my DNA with a chimpanzee.

Does the 99.4% figure imply that humans share 100% of their DNA with other humans? I thought that the difference between identical and fraternal twins was that identical twins shared 100% of their DNA with each other whereas fraternal twins shared no more than average siblings. How much of their DNA would two random humans be expected to have in common? 99.9%?

I have a distinct and slightly Kafkaesque feeling that there are some words missing here. Do people mean that chimps and humans could share 99.4% of their DNA? Does the fact that we and the chimps have diffferent numbers of chromosomes factor in at all? Or do I have a completely whacked definition of DNA, thinking incorrectly that DNA is actual genetic information when actually it just contains genetic information?

If my questions aren’t clear enough, maybe you can complete these sentences:
A. Identical twins are two individuals that share 100% of their ____.
B. Two members of the same species share 100% of their ____.

If the answer to both of these is “DNA” I am going to be very sad.

I think you may be misunderstanding the perspective of how much is in the genome and how much potential for variety there Chimps may seem more different than the slim margin you quote but when you compare humans to sea urchins it looks a little different.

Why would the fact that chimps and humans share 99.4% of their DNA automatically imply that 2 humans share 100% of their DNA? There are a HUGE number of nucleotide gbase pairs in human DNA, so two people could share 99.9999999% of their DNA and still differ by several hundred genes.

[nitpick]Given that there are about 3 billion base pairs in the human genome, if two humans shared 99.9999999% of those pairs, they’d only differ by about…3 base pairs.[/nitpick]

Consider, though, that Trisomy 21 (aka Down Syndrome) represents an increase of 1.9% of the base pairs compared to a “normal” human genome. Yet, those with Down Syndrome are undeniably human.

Also, on average, “between any two unrelated individuals, geneticists estimate there are three million differences in DNA sequences”, which would yield a similarity of about 99.9% (3 million being only .1% of 3 billion).

There are an estimated 3 billion base pairs in the human genome, so using Q.E.D.'s percentage gives 3 base pairs different - got carried away with the 9s, I think. Still, the point is correct - two genomes can be very, very similar as a percentage, but still differ in many ways.

Of course, a certain amount of those base pairs don’t appear to code for any specific protiens. They may have more subtle purposes, or they may be `noise’ introduced by our evolutionary past. Which makes those base pairs that do determine protiens all the more important, making variations in them more significant.

But even sharing junk' is significant from an evolutionary standpoint. Perhaps even more significant than sharing non-junk,’ given that `junk’ wouldn’t feel much evolutionary pressure to change over time.

Check out this thread.

To summarize a couple of points I made in that thread:

  1. To say that humans are 99.9% genetically identical to one another indicates that they differ on average by one out of every 1000 base pairs (not genes) in the genome. (I’m not going to go into here the differences in percentages between coding and non-coding - so-called “junK” DNA. Suffice it to say that coding regions, being constrained by natural selection, show less variability than non-coding regions.)

  2. The average gene consists of 2000-3000 base pairs, so one may expect about 2-3 differences per gene. However, some differences in base pairs are “synonomous,” or redundant, and have no effect at all on the gene product produced. Others may result in a difference in the amino acid sequence in the enzyme produced, but have no effect on the actual enzyme function.

  3. The upshot of all this is, that a 0.1% difference in base pairs will result in (very approximately) a 10% difference in gene products; that is, one out of every 10 genes will be functionally different in some way.

  4. It should be evident from this, that a 0.6% difference between chimps and humans will produce a correspondingly greater difference in gene products.

The 99.whatever measurement is outdated. I forget what more recent measures are, but they’re significantly less – though still within the mid-90s.

Colibri: Can you clarify the math in #3? Are you makeing an estimate based on the fact that even though every gene is likely to have some base pair differences, only about 10% will have base pair difference that exhibit themselves in the phenotype? Is the 10% number your guess, or is it generally accepted by geneticists? Any actually data to confirm the number?

BTW, I have referenced your explanation (back in my “99.9% thread”) quite frequently in the last few weeks. It’s been very helpful in the GD forum even. Not sure if you visit there often.

I’ll refer you to this article that I cited in the other thread. The figure is based on both empirical observations and calculations based on observed substitution rates.

And:

Bolding mine.

I sometimes read the threads but rarely post. If I started to get involved in debates I’d never get anything done.

Actually, I just submitted a paper to Journal of Biological Chemistry that touches on this. His estimation is excessive. In general, looking at codon to amino acid correspondences, a broad cross section of coding sequences (a gene is really more than just the protein coding sequence) could come up to about 50% “silent mutation” tolerance. Thus, roughly 50% of the basepair mutants would have a different amino acid sequence.

HOWEVER, unless the amino acid sequence is the “phenotype”, it should also be noted that a “non-silent” mutation at the amino acid sequence level could still end up being functionally identical at the folded protein level.

Originally posted by **Q.E.D. **

I don’t know why it would, I was asking if it did.