Quote:
Originally Posted by
BizzareRide 
there is massive overhead.
It's not
too high for Intel, they added the uop cache in order to help with that and pipeline length, I think the majority of instructions can be cached in the uop cache so it's only the rarer ones that have the full decoding time, others have a much reduced one.
Quote:
Originally Posted by
james8 
Quote:
Originally Posted by
hajile 
It can't be stressed enough that a 40W, 1GHz, 65nm, 128GFLOPS Loongson beats a 77W, 3.9GHz,
22nm, 50GFLOPS 3770K ivy bridge (having 2.5x the GFLOPS of float performance)
some unknown architecture being able to beat intel's lastest creation while running at 1/4 the frequency and consuming ~1/2 the power and being 4 nodes bigger?

sounds too good to be true
Just because you don't know about MIPS, doesn't mean it's unknown. MIPS powered the PS1 and PSP among other things and x86 is hardly the fastest or most efficient of architectures...Hence why AMD and Intel both moved to having an entirely different architecture internally with decoders to still "talk" x86, SSE, SSE2, etc all help as well.
Quote:
Originally Posted by
SCollins 
Quote:
Originally Posted by
crust_cheese 

Assembly is just mnemonics replacing strings of ones and zeroes. It
is programmed directly to hardware.
assembly is between binary and c, programming in binary is going direct to the hardware level, Assembly is as close as you can get and still have reasonable inteligibility.Also, with todays c/c++/c++99 and other language and modern compilers. The performance benefits of coding in assembler are largely gone. The only real savings might "might" be space in certain circumstances. Like thats a issue these days for code, its not.
It's not as high as it was, but there's still significant performance gains from bypassing programming languages, but I'd imagine it'd be horribly confusing and difficult to get those improvements on the average modern application too, simply due to the size.
Quote:
Originally Posted by
SCollins 
Quote:
Originally Posted by
james8 
some unknown architecture being able to beat intel's lastest creation while running at 1/4 the frequency and consuming ~1/2 the power and being 4 nodes bigger?

sounds too good to be true
Loongson mips cpu's are not this good, though MIPS used to stomp a mudhole in x86 performance, way back in the day. It likely still could. One of the interesting benefits of MIPS, is that it would fit far better with hsa, then x86
I'd hazard a guess ARM, PPC and MIPS would be taking x86 down if they were more popular/had someone with x86s level of R&D funding.
It all comes down to R&D and use, most people run x86 and x86 has Intel pouring a metric tonne of R&D money into it, hence why it'd have faster performance..Think of it like theoretical performance vs real world performance, x86 might have lower theoretical performance than MIPS, but it's real world performance is still higher because it's say, using 90% of its theoretical performance while MIPS is still at 50% or 75%. (Not real numbers)