875
- 09 Feb 2005 (3 years ago)
Dodgy Intel: The Pentium M fiasco part 1
- by Lurks
He was talking about a range of delays and failures but I want to concentrate on one in particular. Potentially the biggest one Intel has ever made and one which is mired in internal politics and single monumental bad decision from the marketing arm which dictating the entire direction of product development in their largest business unit. We're talking about the Pentium 4 here.
You might recall that the Pentium III was topping out at around 1.2GHz or so. Some place inside Intel, a meeting went on where what was discussed was the need to deliver CPUs that were clocked faster. Not that performed faster, but were clocked faster. It turns out that the team working on a new processor, the Pentium 4, were going to crazy lengths to get the speed up. It would have a 20-stage pipeline rather than the 10-stages of the Pentium III. The reason for a boosted number of stages is that at faster speeds, it's progressively harder to coordinate clock signals. It's a little bit like taking smaller, faster, steps because you can be more sure of your footing.
Meanwhile in Israel, Intel's design team had been working on a notebook CPU for el-cheapo notebooks called Timna. They concluded that the Pentium 4 Netburst architecture (the crazy 20-stage long one) was hopeless for power conservation. They wanted to get more work done per clock, not less. Timna was also to have a built-in memory controller and graphics functionality. The project was cancelled however, and some people say this experience is why Intel was never again open to an on-board memory controller such as that found on AMD's Athlon 64 range. Another bad decision.
The Israeli team were then tasked to develop a new processor dedicated to mobiles. They knew the Pentium III architecture very well and so worked with that to reduce power and improve performance. The pipeline was stretched a little to around 12-stages (no one actually knows outside of Intel) and an improved branch prediction unit installed. This component minimizes the effect of a mispredicted branch - a problem that gets worse the longer a pipeline is. Another analogy, if you've guessed at your turn off and you've driven for 20 miles instead of 10, before you realised, your journey time is hurt more. You can minimize that by getting a smarter navigator.
What the Israeli team were working on came to be known as the Bania core, aka the Pentium M processor. The Centrino branded notebooks have Pentium M processors in them and it's clear how well they've done - they've all but kicked AMD out of the market in thin and light notebooks.
At the same time, Intel was ramping up the clock speed on the Pentium 4 yet further. Up to 3GHz and beyond. The AMD Athlon 64 series has a 12-stage pipeline and is clocked at much lower speeds to get the same work done and hence they introduced the numbering scheme again. Eg the Athlon 64 4000+ processor which actually runs at 2400MHz. Obviously consumers aren't savvy enough to know the difference between these clock speed apples and oranges so the marketing departments have rolled up a number that makes more sense.
The trouble is, about the time of Barrett's announcement to the company, that the Pentium 4 'Northwood' core was out of steam and the replacement core, the 'Prescott' was delayed. This would have an absolutely insane 31-stage pipeline offset by a branch prediction unit virtually self-aware in it's complexity and to compensate for reduced performance across the board, Intel spent the spare die space (smaller transistors) on a big whack of cache - 2MB of it. It was, just like all other Intel cores in this lunatic escapade, slower at the same clock speed than the previous unit. Also despite the fact it's fabricated on the bleeding-edge 90nm production process (very very small transistors, which normally make things go faster and use less power), Prescott in fact used lots more power and was slower. But hey, you can buy it in faster MHz so all is good right?
At this stage, people in Intel must really have been feeling the heat - literally in Prescott's case. AMD wasn't showing signs of slowing down and the firm had managed to get Microsoft onboard their 64-bit x86 instructions and the mighty AMD Athlon FX series CPUs were basically faster processors in everything except for media encoding type applications. The FX series was notably a lot faster for gaming than Intel CPUs and that's a bad thing because hard core gamers are generally the consumers of the highest end most expensive chips.
Desperation began to show. Back in the Northwood era Intel fabricated a ridiculous CPU, essentially a Pentium 4 Xeon with 2MB of cache, and called it the 'Extreme Edition' and charged more money for this CPU than you can go down PC World and pick up an entire computer system for. Just to be known as king of the hill once more. It didn't work, it still wasn't faster than AMD. Then things got so bad that Intel couldn't shore things up easily any more.
4GHz. AMD had Athlon 64 4000+ on the roadmap. In fact you can buy those now and the bigger brother, the FX-55 CPU. Intel was forced to admit that even Prescott couldn't do 4GHz without heroic measures. What heroic measures were those? Well, basically redesigning the entire ATX PC form factor to place a hoover over the CPU and blow air directly over the CPU from outside of the case. The industry largely greeted this development with howls of derision and eventually Intel cancelled the 4GHz Pentium 4.
My God, now we have Intel which has no answer to the AMD's finest and certainly no answer to AMD's next generation goodies. The 4200+ and the FX-57, due out shortly. What the hell were they going to do?
During this end-game, Intel went back on the whole marketing philosophy which had lead them down this road. Their branch prediction unit had failed in the most spectacular way and sent the bulk of their desktop CPU development down a dead end path. Like their outrageous Prescott CPU, they had to lumber back a long way to come up with an alternative.
Suddenly clock speeds disappeared from Pentium 4 chips when the move to the LGA socket 775 happened. On the market now; the Pentium 4 530 is a 3.0GHz unit and the 540 is a 3.2GHz CPU. You've really got to ask why they moved to another core designed to ramp MHz up and then promptly remove the MHz ratings? Crazy stuff indeed.
What are they going to do in the future? Firstly, it's widely known that the next desktop processor, Smithfield, is going to be dual core. Some people had speculated, mne included, that this was going to be two Pentium M cores on a single die. That's not correct. In fact Smithfield is a fairly typical Intel stopgap measure. It appears most likely to be two Pentium 4 Prescott cores in a single package (the physical thing with all the pins) wired together internally. Heh, that means it's going to be even hotter than Prescott is and the initial figures of 130W of power appears to confirm that - being about 15% more than the current Prescott. These will most likely be clocked slower than the single Prescott CPUs available today.
So ironically once again, Intel will launch a CPU which will be slower than the one it replaces. Only this time you'll have to re-write your software to be multi-threaded, or run multiple programs, to gain the performance boost. In fairness, that's not so bad because this change was coming anyway and pretty much everyone designing CPUs is looking at multi-cores for the future.
Intel will really only finally get back on track when they bring out Yonah, a good year away according to the current roadmap. This is a dual core processor based on Dothan-like (the latest Pentium M, more in part 2) Pentium M cores. This is the culmination in Intel cancelling an entire architecture, dumping the marketing reliance on clock rates and admitting that the Israeli team have produced a significantly superior architecture. One which you can actually buy today in laptops but no one thought to ask how it might run on the desktop.
Or did they? Intel would have thought about it but imagine letting those pesky Israeli's show up their entire North American engineering operations? Unthinkable. It's just as well that no one would get to compare the processors side by side. Just as well no one would benchmark the Pentium 4 Prescott versus the Pentium M Dothan right?
Then the unthinkable happened; some Taiwanese bastards went and made a desktop motherboard that took a Pentium M CPU.
Stay tuned for part 2.

Irrespective of performance the price delta between a 770 Pentium CPU and a Pentium 4 3.4GHz 550 is still 200-300$ AND you have to overclock it to hit the benchmark numbers. Smithfield will as you say be not too successful performance wise but what else are they supposed to do? From my point of view you are getting to Pentium 4 processors for the price of 1, as the L2 Cache is split 50/50 between the 2. All it takes is for some one to tweak Doom3/HL2 to support it and there'll be a significant performance boost, but without the price that goes with it. as for Dothan - it will only ramp so far...as far as I can see they are topping out at 780 (2.2GHz) before Yonah in Q1 next year, which will only have 1MB L2 cache per core instead of Dothan's 2MB.
Hmm I can counter a lot, well pretty much all, of what you've said. First of all, it's true there is a big price difference between those CPUs but that has very little to do with the actual cost in producing those cores - purely that high end Pentium Ms go in more expensive products. It's a price the market will bare. The Dothan core is 84mm squared versus the 112mm square core of Prescott. That means that Prescott is much more expensive to make.
What are Intel supposed to do instead of the ridiculous Smithfield core? Well how about manufacturing a Dothan-like core with a built-in memory controller and a modern chipset with PCI-Express etc to support it. This would be remarkably faster than anything Prescott could do.
'All it takes' is a tweak to convert games to being multi-threaded? I think you've got a perilously shady grasp of software engineering. Id Software has tried for years to make their games SMP compatibile and it's never worked. Valve have yet to even try. Furthermore, these games aren't really CPU limited anyway.
I'm not entirely sure I buy Dothan topping out at 2.2GHz. Why am I not entirely sure? Because my fucking lounge server is a Dothan running at 2.5GHz, that's why. The heatsink is barely warm to touch and it's absolutely annihilating my P4 3.6GHz Prescott in pretty much every test other than memory bandwidth. A limitation which we know has nothing to do with the core and everything to do with the architecture around it. 2.2GHz is before the shift to 65nm too.
It's just because the core is so small, and because with the move to 65 it will be even smaller, putting two on a single die is feasible. Then Intel will have something really kick arse on the table.
Finally, all of your comments so far seem to be defending the use of the Pentium 4 in practical terms today - on the desktop. That is not the thrust of my argument here. My argument is that Intel have had a higher performance, highly efficient core in production which makes the Prescott look deeply silly.
To say nothing of the fact that Intel have virtually created an industry in CPU cooling which did not, when you examine the engineering, actually need to exist. If you need further proof, consider this. The Prescott CPU I bought for around about £160 needs this to cool it silently while the Dothan CPU I bought for £143 only needs this to cool it silently (please note the parallel socket as a size comparison) and what's more; overclocking my Prescott as far as it will go ends up encoding a song in LAME in 49 seconds and on the Dothan it takes 42 seconds. That's on lousy slow single channel memory too.
It doesn't take a genius to realise we've been had.
The DFI Dothan board "from Taiwan.".
You don't say...