Is this mathematically/statistically possible?

So if all 4 groups have zero members, their agregate has any percentage of 1s that you like, including less than 50%.

Say a room has 103 people, 51 of them men, and 52 women. As a whole 49.5% are men. Number those people so that number 1 through 51 are the men, and number 52 to 103 are the women.

Group A has all people wearing yellow shoes and contains #1 and #100: 50% men.
Group B has everyone who speaks English and incudes everyone but #101, #102, and #103: 51% men
Group C has everyone with a pierced nose and consists of #2 and #99: 50% men
Group D has everyone who fluent in Russian and contains #3 and #98: 50% men

Manlob, I think there’s an assumption in the OP that the four groups are disjoint, i.e., no two of them have any members in common. Without that assumption, as you demonstrate, the average of the aggregate need not be between the top and bottom average.

Exactly. This is what’s known as Simpson’s paradox in statistics.

I’m waiting for someone to address this.

What’s to address? It’s just correct.

So the groups A, B, C, D aggregated can have less than 50% 1s then. This was contraindicated earlier in the thread.

Alright, good call. It always has to be the case that the aggregate percentage is at least 50%; it just may also happen (if, and only if, there are 0 people) that the aggregate percentage is also, separately, a value less than 0%. This is the problem with division by zero.

Depends on whether your percentage numbers are accurate or rounded.

If your real, perfectly accurate, percentages are 49.5, 50.5, 49.5 and 49.5 (rounded to 50, 51, 50 and 50), the aggregate percentage will be 49.75%. However, even if this is less than 50%, you should report the result with the same precision as your other numbers, i.e. as 50%.

That’s not what Simpson’s paradox describes. It describes relative percentages, yes, but not aggregate percentages. Note that in the Wiki examples, all the aggregate percentages are still between the two original values. In other words, when a and b are averaged to x, it’s still true that a<x<b.

At any rate, though, it is true that with overlapping groups, the aggregate percentage need not be a weighted average of the in-group percentages. For example, consider Barack Obama, Hilary Clinton, and Mitt Romney. Out of the two men, the percentage of Obamas is 1/2, and out of the two Democrats, the percentage of Obamas is 1/2. Overall, however, the percentage of Obamas is 1/3.

Are the groups disjoint or not?

Group A: Alex and Bill are 1, Cathy is 0 (67%)
Group B: Alex and Bill are 1, Damian is 0 (67%)
Group C: Alex and Bill are 1, Elizabeth is 0 (67%)
Group D: Alex and Bill are 1, Frank is 0 (67%)

But the percentage of 1’s in the aggregate is 33%.

Even more simply, and in keeping with the OP:

Group A: Obama is 1, Alex is 0 (50%)
Group B: Obama is 1, Bill is 0 (50%)
Group C: Obama is 1, Cathy is 0 (50%)
Group D: Obama is 1, Damien is 0 (50%)

Yet the percentage of 1s in the aggregate is 1/5 = 20%.

But as said before, if the groups are disjoint and exhaustive (so everyone is counted just once as we go through each group), and there’s at least one person around (so we don’t worry about division by zero), then the groups cannot all have percentage 50% or greater while the aggregate percentage is less than 50%. [And same for any other number in place of 50%]