Thanks to everyone for their replies.
Topologist is indeed correct: I had in mind bias as it relates to estimators, not samples. Still, WillGolfForFood is correct in pointing out that biased samples are in general A Bad Thing and at the very least call for some sort of reweighting.
Some points
-> Just to make it clearer, the Mean Square Error criterion reflects both the bias and variance of a given estimator: it reflects the average squared error (relative to the true value) of a given estimator.
-> I am not alone in having problems with bias as a criterion for what makes a good estimator. Kennedy (1998) quotes Savage (1954), “A serious reason to prefer unbiased estimates seems never to have been proposed”. Kennedy continues, “None the less (sic), unbiasedness has enjoyed remarkable popularity among practitioners. Part of the reason for this may be due to the emotive content of the terminology: who can stand up in public and state that they prefer biased estimators?”.
-> More Kennedy: “The main objection to the unbiasedness criterion is summarized nicely by the story of the 3 econometricians who go duck hunting. The first shoots about a foot in front of the duck, the second about a foot behind; the third yells, ‘We got him!’”
-> I don’t know whether Maximum Likelihood invariably has lower Mean Square Errors in general. (Really! I lack knowledge.)
Still, I’m not posing a GD; rather I’m asking what the justification for the unbiasedness criterion is, relative to MSE. After posting, I thought of a possible explanation which may or may not be confused. [1]
A given estimating approach will produce a range of mean square errors depending upon the sample size or particular model that is being estimated. In contrast, an unbiased estimator will always be unbiased.
Now, if a given estimator gave the minimum mean square error under all circumstances, that of course would be a compelling attribute. But that criteria seems a little stringent.
Alternatively, we could ask whether an estimator had on average a lower MSE, relative to another estimator. But I speculate that such a calculation may be difficult.
Tentative WAG: It’s easier to make categorical statements about bias (or consistency for that matter) than it is about MSE.
[1] [sub]Actually, I read this in Harvey (1990), but fear that I may have completely misunderstood the point the author was making. So this isn’t really a cite.[/sub]