What's wrong with AMD?

That’s the multithread score though. Look at how abysmally low the single thread score is.

348.25M per quarter

It wasn’t so much the Athlon being spectacular as it was the Pentium 4 being a huge dud. At a given clock speed the Athlon chips handily outperformed their Intel counterparts, and the P4 chips required a new kind of RAM. Hopefully, someone with a better technical background than me can tell you why, from a hardware perspective, the Athlon performed better.

Are there applications where it would make sense to use a CPU with multiple cores and threads rather than a GPGPU?

That’s surprisingly low.

Unless you’re making a “3 fiddy” joke.

Can anyone explain (to non-CS people) what made the Pentium 4 a dud?

What else can Intel do?

Could the ARM chipmakers enter markets like gaming GPGPUs? Could computing as a whole switch to the ARM instruction set?

The architecture had low IPC (instructions per clock), but they expected to make up for it with very high clock rates. At one point they were talking about 10 GHz chips. Obviously, this never happened–they never got much above 4 GHz. Silicon just doesn’t scale very well to these high clocks. Better off sacrificing a few hundred MHz and going for high IPC.

The current lineup of chips (Core series) has its architectural heritage from a line that was intended for low-power mobile chips (out of their Israeli office). The P4 line was essentially abandoned. As I’ve said earlier, power efficiency is really where its at, so it’s no coincidence that their current line is based on one where power efficiency was a goal from the very start.

I understand that a complete answer to the following may require a lot more background knowledge than I have:

Earlier, you talked about how the GTX 1060 does less work per clock but can be clocked 40%+ higher than the RX 480. Here, you say that more IPC may be preferable. I don’t see a contradiction but it does make me curious about the principles and trade-offs involved. What governs whether there should be more instructions or more clocks? Can you give an example?

Roughly speaking do power & heat increase linearly with instructions but quadratically with clocks?

Also, am I mistaken in thinking that coming up with middleware and specialized hardware combinations like the different GameWorks suites makes a lot of sense? Any major downsides? If it does make sense, any guesses as to why it took until 2016 for AMD to start offering it?

GPUs are a different animal, in that the job they’re designed for is “embarrassingly parallel.” GPUs basically carry out the same instructions/calculations over and over, so you can get more performance by either increasing the clock or throwing more execution units in it. More units means the chip can have more pixels in the pipeline.

For a gamer ideally you want both. Good games support multiple cores, but not all do it or do it well.

AMD market is different than Intel market.

AMD more for gamers and people building their own computer.

Well Intel more for businesses and servers.

Likewise a Intel Xeon E5-2697 12 core would not perform as well as gaming computer.

A a Intel Xeon E5-2697 12 core would be better for businesses, servers, CAD or video editing.

I don’t see market changing any time soon.