I’ve got a whack of ignorance that needs fightin’… Genetics. My ignorance is preventing me from appreciating the reports of genetic research projects, esp. genome projects:
Here’s what I think I know - please tell me where I’m wrong…
DNA is where information is stored. This information is encoded into DNA using the four “letters of the genetic alphabet” G-T-A-C (these letters representing names I forget, ending in -ine I think). Pairs of these letters form rungs on a twisted ladder - that ladder being a DNA molecule.
The ladder is “understood” by grouping the rungs. Each sequence of rungs (base-pairs) forms a word? paragraph? called a gene. So one DNA molecule contains many genes. Each gene describes a particular aspect of the species, i.e. this gene describes hair colour, that gene describes liver size, etc.
The DNA molecules are housed in a structure called a chromosome, being an organized collection of DNA molecules along with associated proteins (whatever they’re for - maybe the glue that holds it all together). Each species has a set of chromosome pairs (duplicated for redundancy?). Humans have 23 pairs, dogs have 39 pairs, horses 32, cows 30, etc.
I’m not exactly sure what a genome is, but I believe it’s an encompassing term that applies to all of the above on a species per species basis, i.e. the human genome is such that 23 pairs of chromosomes are organized in this way, having this gene which does this at this location, and that gene which does that at that location, etc.
With the cite at this site in sight, what does it mean, exactly, that a chimpanzee’s DNA differs from human DNA by 1.2% - 2.7%? The article mentions base-pair comparisons vs. large sequence comparisons. I don’t grok it. Does base-pair analysis mean they lined up all of the rungs in the ladder of each species side by side like this, then counted the differences?
123456789ABCDEF
---------------
Human GGATTCAATGAGGCT
ATCGATTTAGCAGGC
| |
Chimp GGATTCGATGATGCT
ATCGATCTAGCCGGC
Assume for illustration that the entire sequence for each species is exactly 15 base-pairs. The two differences shown above (rung 7 and rung C) amount to a 2/15 = 13% difference. Is this, essentially, the type of analysis that gives us the 1.2%?
Lets say that base-pairs are grouped into the following genes (this makes sense only if what I think I know is correct):
For the human:
1-3 - height (3 pairs)
4-6 - ability to blush[sup]1[/sup] (3 pairs)
7-9 - eye colour (3 pairs)
A-F - everything else (6 pairs)
And for the chimp
1-2 - banana detection (2 pairs)
3-4 - height (2 pairs)
5-9 - eye colour (5 pairs)
A-F - everything else (6 pairs)
Here the genes governing common traints represent 12 and 13 base-pairs respectively. The 2 to 3 base-pairs left over map to unique traits, hence an approximate 20% difference (let’s say).
These examples do not parse because the second analysis such trait mapping disparities would necessarily result in higher base-pair disparity. Obviously there’s much I don’t understand.
I have other questions, but I’ll stop here so I can benefit from the corrections you will surely provide to my understanding…
[sup]1[/sup] “Man is the only animal that blushes - or needs to.” - Mark Twain