Actually, there are a whole passel of assumptions here. Please indulge me as I toy with some of them. If you just want the “meat” [genuine objection] skip to the bolded “Final Analysis”.
ISSUE #1: you presume “Criminals” are always guilty, and “Honest men” never are - all criminals are born guilty, and all honest men are forever sainted. This is necessarily false, and weakens the derivation above. In real life, of course, all men are born innocent, and some become guilty at some point. Without knowing the rate of (convicted) first offenses - the rate at which honest men become criminals, we can only guess (or approximate) the actual probabilities
ISSUE #2: what do real cops think if they see two men running away from the scene of a crime? They think both are guilty! I’ll get back to this point
ISSUE #3: If all guilty men are “always guilty” then we get some truly funky situations. If a penny is stolen from the till, every criminal in the vicinity must be guilty - a neat trick, and a ludicrous assumption for the real world.
You may argue that the unstated principle (‘assumption’ is more like it) that only one is guilty is implicit in the formulation of this problem (and similar problems of its class) But that forces a revision of the math.
You may say “the crime was murder, and only one gunshot was heard” (we’ll ignore the crime of ‘fleeing the scene’- which only casts more doubt on the presumption that “only criminals commit crimes”) We still have to refine the derivation.
If both men were chosen at random, then a Real Cop’s initial assumption (both are criminals for fleeing the scene) is correct 1% of the time, and the “reader’s assumption” (there is only one criminal) is correct 23% of the time - but 76% of the time there’d be no victim at all!
The apparently relevant denominator ambiguous. it’s either
A) “all cases where at least one man is a criminal” (which I think is the only physically arguable case); or
b) “all cases where only one man is a criminal” (whose only merit is conforming to a common presumption)
Pa = 450/9000 + 200/1000 - [(450/9000) * (200/1000)] = 24%
(the subtracted term removes the ‘double-counted’ overlap)
Pb = 450/9000 + 200/1000 - 2*[(450/9000) * (200/1000)] = 23%
(the subtracted term removes both counts of the overlap)
In either case, we can only rely on the prevalence of “guilty” men in each racial subgroup 10% (B) vs 5% (W) - but the analysis is completely flawed, because in a random matching of candidates the actual murderer would be White roughly 450/650 of the time. Why were we wrong?
**
FINAL ANALYSIS
The flaw is: crimes are committed solely by criminals. Any number which includes innocent people merely obfuscates the issue. This includes statistics like “percentage of criminals” or “total population”, which are affected by the number of innocents. changing the number of innocents does not affect the probability that a man is guilty.
The statistically valid denominator is the number of potential murderers in town **. All the Chinese in China, or all the innocent Chinese in town are irrelevant.
Issue 4: the problem was set up to demand a black man and a white man at the scene.The universe of random pairings, howeverdoes not reflect the underlying events each black man, guilty or innocent, is “forced to flee” 9-10.6875 times as many hypothetical crime scenes as each white man. This dramatically overestimates the possibility of black guilt
EFFECT 1: In the universe of of cases where a black man is guilty, the analysis provided pairs him against 9000 white men [case A] or 8550 innocent white men [case B] while each white man is mathematically paired against only 1000 or 800. If you wrote a chart every "random pairing, each black man’s name would appear either 9000 or 8550 times, while each white man’s name would only appear on the chart 1000 or 800 times. Throw a dart at this chart, and the result isn’t “fair” - the black men have 9000/1000 or 8550/800 times as many slips in the hat.
**To illustrate this,make the numbers more extreme. Say that there are only 10 blacks in the city (and 1 black criminal) while there are 10 million whites (and 50,000 white criminals). By (improper) Bayesian analysis, that black man must commit virtually all the crime that occurs in his vicinity, while the 50,000 white criminals sit on their hands. With a year (before new stats can be issued, every black person would be shot many times, and no white man would be caught in this situation. **
EFFECT 2: Relying on prevalence in subpopulations will skew all future statistics, even if the population sizes are equal. In cases where men of both races are suspected, the black criminals will always be caught [and be counted], and the white criminals will always escape [and not be counted]. This effect is strongest as the population with the highest pre-existing
Effect 1 increases as the black fraction of the total population decreases. Effect 2 increases as the black fraction increases. Racial profiling is a Big Lose for blacks, guilty or innocent.