The OP is conflating two issues: a rise in I.Q. scores, and an actual increase in average intelligence.
James R. Flynn, for whom the “Flynn Effect” is named, is quite outspoken in saying he does not believe that the population is becoming markedly smarter on average, only that the average scores rise over time. These can be two entirely separate issues, as one deals with the thing measured, and the other deals with the thing doing the measuring.
An I.Q. test is supposed to measure intelligence. Even if we grant that it does this in some fair and meaningful way, and that is subject to enormous argument, it should be acknowledged that it can only do so in general, indirect terms. Intelligence is an abstraction. It is not a concrete thing that can be measured absolutely, in the same way, say, that the length of a piece of metal or the weight of a piece of produce can be measured.
But, to simplify matters, suppose for a moment that I.Q. tests really were as precise as a weight scale or a calipers.
Anyone who has worked for long in a machine shop or similar facility knows that measuring devices have to be recalibrated from time to time to assure their accuracy.
Imagine you have a dial calipers and a block of steel which you know for a fact is an exact one-inch cube. The first day that you use your calipers you measure the block and it measures out at exactly one inch on all sides. The second day that you measure the block it measures at exactly one inch on all sides. And so on, for days on end.
Gradually, though, something seems to be gradually going out of whack. After months of using the calipers without ever testing it or servicing it, it now tells you that the cube measures 1.001 inch on each side. And then, months later, the measurement has crept up, so that now the cube measures 1.002 inches on each side.
If a gradual increase in the average I.Q. test score using the same test over a period of years means that people are, on average, getting smarter, then, by the same logic, a steel cube which gradually measures larger and larger using the same untested calipers must be growing.
Here’s another example: suppose you have a produce scale, the sort of device one sees in grocery stores all the time, where there is a basket dangling underneath in which people can stack carrots, potatoes, etc. Suppose that people throw their produce into the basket roughly all the time, so that the basket sags and bounces back, and the needle bounces wildly before settling down. Gradually things start weighing more and more on average when placed in the scale. In fact, even the surrounding air weighs more: at the start the scale showed a weight of zero whenever the basket was empty, but now it shows a half ounce when at rest.
Using the logic of the OP, we can only conclude that either the dangling basket, or the air that sets on it, is getting heavier, along with everything that is measured on the scale.
The inference here is that I.Q. tests, calipers and scales don’t simply attempt to measure things. Rather, they define them infallibly. The I.Q. test, the calipers and the scale each take their turn at being God.
If there is no distinction between the results of an attempt at measurement and the underlying reality that one is attempting to measure, then the universe is a whole lot cooler and more accomodating that I, for one, ever realized. Like millions of people, I’ve thought about losing weight in the new year. Instead of dieting or exercise, all I have to do is set the needle back on my bathroom scale so that it shows less than zero before I step on it.
Half of the questions on an I.Q. test measure general knowledge. Before an I.Q. test is put into general use, it is administered to a grat many test subjects. In this way, the testers are able to determine how many correct answers the ideal average person at the center of a bell curve is “supposed” to know. Neither the test nor the people who administer it, though, have any way of controlling whether the answers to some of those questions will become more generally well-known over time.
Suppose they do.
Does this mean that the population is becoming smarter on average? Only if knowing the answers to those particular questions, and only those questions, in and of themselves are an absolute definition of intelligence. Then, by definition, the opulation at large is getting smarter, and simply because the answers to those particular questions are gradually becoming common knowledge. If so, then a person who gets the answer key to the I.Q. test and memorizes it is, by definition, incredibly smart, even if he doesn’t know another thing in the world.
Alternatively, one could argue that if test scores are rising on average, it is only because the I.Q. test being used is losing its accuracy, a bit like a dial calipers or a weight scale with a bent needle.