The cigarette studies were touted at proof long before they had sufficient real evidence that cigarette smoking did indeed cause cancer. Dozens of additional studies all showed the same correlation, even after removing other factors. But the mechanism of causation was not known, and still remains incompletely revealed. The evidence was sufficient to convince a lot of people, but it falls short of proof. The reason is that it is insufficient that the correlation exists, the mechanism of causation must be revealed. That takes a different level of examination than simple statistics.
Statistical analysis is a slippery devil. Cancer is caused by exposure to graduate students. You can prove it, statistically. Control populations in laboratory cancer studies have a higher incidence of cancer than do captured members of the same species. The lab animals are deliberately not exposed to carcinogens in the test. Yet they have a higher rate of cancers found in autopsy than are found in animals taken from the wild. One factor that laboratories have that wild places do not have is higher levels of population of graduate students.
The problem is that laboratories themselves are a factor. So too is the limited genetic diversity of lab animals. So, you don’t rely on the statistical data to prove anything, you use that data to indicate what you should study. If you really are worried about the graduate students, you higher illiterate migrants to care for your animals, and keep the grad students out of the building. If your results indicate a drop in cancer rates, you may be on to something. But probably, it won’t change. Not graduate students. Maybe fluorescent lights? Gary Larson cartoons?
If you base your entire proof on statistics, you have proven nothing but you own ability to assemble statistics. Post hoc ergo propter hoc is not true, even when the antecedent did cause the result. The contention is that ‘therefore’ the antecedent is causal. Simple correlation does not even examine the mechanism of causation.
Statistics are a good tool, for determining a direction for investigation. But there are other areas than science in which they are valuable. Insurance companies don’t care why male drivers between 16 and 19 are more likely to cause expensive damages with automobiles. They know it happens. They charge rates based on that knowledge. They are not wrong.
Red cars go faster than brown cars. It certainly is ridiculous to think that color affects the speed of an automobile. But the fact is that if you stand on the edge of the highway with a radar gun for a week you will find out that red cars go faster than brown cars. Why? So far, every thing I have read on the subject is speculation on psychology. But they do go faster, even so. Now, is it wrong to say that the car goes faster because it is red? Yes. Implying that I understand the mechanism of causation is incorrect. I know that they go faster. I don’t know why.
The error is in the implication of understanding cause.
Tris
“Criticism comes easier than craftsmanship.” ~ Zeuxis ~ (400 BC)



