My second post in this thread quite directly took issue with your contention that:
You absolutely can. Most potential discriminating factors will of course prove to have no influence on a person’s trustworthiness, and I would be very surprised if ethnicity proved to, but your bald statement that statistics can not be applied to this problem is simply false. Statistics will mostly prove that these factors are irrelevant, but it is most certainly statistics proving this, not some arbitrary notion of irrelevance as dictated by an observer.
We are, therefore, at opposite ends on the “other factors” question. I believe that any and all factors are fair game for study, while you appear to believe none of them are. Hence your confusion about my apparent agreement, I s’pose. You and you with the face appear to me to be making categorical statements that can only be proven by examining the numbers. You’ll notice that I’ve stated that most factors will prove to be useless to examine; where we part ways is that you insist that this means they shouldn’t be.
But I think you’re effectively assuming the answer, in that case. In the post linked in the other thread, you stated as your extra information only the knowledge that the absolute rates of the respective crimes are very different. This gives you no information about the claim rates, and if you assume these to be equal, then you are effectively saying “if black people’s claims are less reliable, then their claims are less reliable”.
Basically, there is no predictive value in assuming the value for a node in a directly dependent chain of events. You’re positing a possible posterior outcome, assuming it is the case, then applying it back to make a predictive statement. You just can’t do this. We agree, there are lots of approximations involved, but the lack of data for the number of claims relative to the number of occurrences is an absolute dealbreaker when making predictions. You simply can’t assume your way around that. At heart, we are making the following calculations (I won’t duplicate them for both ethnicities):
P(black claim valid) = ( number of times victim actually raped ) / ( number of total claims )
We approximate this as:
P(black claim valid) ~= ( number of convictions ) / ( number of total claims )
Now, this is a nasty approximation, but there is clearly some dependency between the number of actual rapes and the number of convictions, so we proceed. But we still need to calculate that ratio, and to do that we need some indication of the number of claims. Not only do we not have that in this case, we have no approximating factor to substitute it with. We’ve therefore got two fractions without denominators. Sure, you can say that if both denominators are the same then the black claims are less valid, but that’s begging the question. So we might do the following substitution, and say
P(black claim valid) ~= ( number of convictions ) / ( total number of actual w-o-b rapes * black reporting rate + number of false accusations )
(where reporting rate is the likelihood of a genuine rape victim reporting the crime)
This is really where it breaks down. I think it’s a supportable assumption that, given a rape, whites and blacks are equally likely to report it. So let’s assume that’s equal. And again, we can approximate conviction rates to the total number of w-o-b rapes, but that still leaves us with the false accusation rate. At this point I think it becomes clear that further assumptions are truly begging the question, since you’re simply plugging in an arbitrary false accusation rate, which is the very thing you’re trying to assess in the first place.
Some gaps in knowledge can be filled with approximation. Some, you have to admit that this is a hole that cannot be filled with anything but data. Data on claim rates is most certainly the latter, IMO.