The new 6800 series has seen hardware architecture changes, with a focus on strengthening tessellation and geometry throughput. Avid readers of KitGuru will have seen over recent months that nVidia Fermi hardware has been leading the way in titles such as UniGine Heaven Benchmark, a synthetic test which relies heavily on Tessellation performance.
The 6800 series has a reconfigured core design which offers up to 2.0 TeraFLOPS and 24 Gigapixels per second performance. The command processor is linked to the graphics engine as seen in the diagram above, with the new generation 7 Tessellator. The Dual rasterizers and 12-14 SIMD engines tied to a 256 bit GDDR5 memory interface help to improve performance beyond the levels of the last generation, while using 25% less silicon.
AMD HD5850
|
AMD HD6870
|
|
Die Size |
334 mm2
|
255mm2
|
Transistors |
2.15 billion
|
1.7 billion
|
Memory Bandwidth |
153.6GB/sec
|
134.4 GB/sec
|
Geometry Throughput |
725 million polygons/sec
|
900 million polygons/sec
|
SIMD Engines |
18
|
14
|
Stream Processors |
1440
|
1120
|
Texture Units |
72
|
56
|
Z/Stencil ROPs |
128
|
128
|
Colour ROPs |
32
|
32
|
Max Board Power |
151W
|
151W
|
Idle Power |
27W
|
19W
|
The table above shows a direct comparison against the HD5850 and HD6870. The HD6870 has less texture units, SIMD engines, Stream Processors with a lower memory bandwidth but the Geometry throughput has been significantly increases from 725 million polygons per second to 900 million polygons per second. The new design is more efficient and with the die shrink requires less power at idle.
Hardware Tessellator Progression
|
||
Generation 1 |
ATI Radeon 8500
|
Fixed Function PN Triangles (TRUFORM)
|
Generation 2 |
Microsoft Xbox 360
|
Displacement mapping, adaptive tessellation
|
Generation 3 |
AMD Radeon HD2000 family
|
Dirext X 10 compatibility
|
Generation 4 |
AMD Radeon HD 3000 family
|
Direct X 10.1 compatibility
|
Generation 5 |
AMD Radeon HD 4000 family
|
Performance Enhancements
|
Generation 6 |
AMD Radeon HD 5000 family
|
Direct X 11 compatibility
|
Generation 7 |
AMD Radeon HD 6800 Series
|
Improved Thread Management and buffering
|
The updated Tessellation unit, called ‘Generation 7' brings new thread management capabilities as well as buffering enhancements to the table. This means that performance is increased by up to two times when directly compared to the HD5000 series. This should mean that AMD are closely competitive with similarly priced nVidia Fermi hardware, we will look at performance later in this article. AMD's internal testing shows that the HD6870 achieves twice the tessellation performance of the HD5870.
Tessellation is a hotly discussed topic right now, and it is a bone of contention between nVidia and AMD. Our recent interview with Richard Huddy opened a few eyes, as he said “nVidia is pushing a single message and that’s tessellation. Tessellation is about enriching detail, and that’s a good thing, but nVidia is pushing to get as much tessellation as possible into everything. With artificial tests like Stone Giant, which was paid for by nVidia, tessellation can be done down to the single pixel level. Even though that pixel can’t be broken away from the 3 other pixels in its quad. Doing additional processing for each pixel in a group of 4 and then throwing 75% of that work away is just sad.”
AMD's stance on this is “Tessellating the Right Way” – which means a focus on the most efficient tessellation usage models with 16 pixels per polygon combining solid image quality with high levels of performance. Adaptive Tessellation is how they want to approach it, using high levels for objects close to the front of the screen (and in eye shot of the gamers) and then switch to lower levels for distant and simple objects to help improve performance while avoiding geometry aliasing problems.
The image above was captured from Alien V Predator and showcases the new Morphological Anti-Aliasing technique from AMD which is a post process filtering technique accelerated with DirectCompute. It delivers full scene anti Aliasing and it is not limited to polygon edges or alpha tested surfaces. The system is faster than super sampling with similar performance levels to edge detect CFAA, but it applies to all edges. This can be enabled from Catalyst Control Center and is compatible with any Direct X 9/10/11 supported application.
Anistrophic Filtering has also been fine tuned and enhanced with a newly refined algorithm in place. It addresses visible discontinuities in very noisy textures offering smoother transitions between filter levels. It also maintains full performance and angle independence.
Interesting times ahead over the next year. I seriously dont know how they are giong to get much more power out of the new range. I would guess they will need to do more with power management too
They need to work on tesselation. the GTX460 is basically delivering the same tesselation performance as a high end 5870 right now. so I can see a huge focus on this.
Yeah it has to be a focus on tesselation , its their real weakness right now compared to nvidia.
even though I know this article will get some stick from people. its interesting guess work really and it stirs debate. I would say this wont be that far out.
Well going on the past. 3dmark vantage scores are generally slightly below dual card high end configurations of the generation before, so id say this would be close to accurate.
These are mostly just bad assumptions with nothing to back them up. While ATIs next gen gpus will come Q4, there’s no performance data available. Intels’ tic-tacs has nothing to do with ATI gpus and you won’t be seeing 28nm gpus until next summer at the earliest. The next gen gpus were meant to be done on 32nm, but that was cancelled, and it obviously changed plans, so basically we get a new 40nm gen that’s somewhat late considering.
1, Whom do we imagine will be making a 28nm part? There are no fabs with a 28nm mass production node ready for high end GPU parts this year.
2. Northern Islands has been cancelled, Southern Islands is now the codename series for next generation of AMD ATI Radeon cards.
3. Work on the design began far longer ago than the tape-out of Evergreen and Cypress. The lead time on these designs is at least 18-24months before tape-out.
4. ATI’s GPU ‘tock’ team are working with the CPU guys on Fusion. That’s why Evergreen was an incremental improvement of the Terascale SP design in the RV770, and not a brand new architecture. Southern Islands is the same – incremental, with a boost in ‘uncore’ – which can mean boosts in Tessellation performance (not that it means anything outside of benchmarks).
Double performance of 5870? thats not likely, a next gen of gpus never gives double performance from previous gen. maybe 50% over 5870 (6870) in vantage or 20-30% over gtx 480 but i think more like 15-20% sounds reasonable. Don’t forget boys and girls gtx 485 is coming soon (q3 2010) because its a 40nm chip the performance can only come from better clock speeds, power efficiency, drivers, cooling, and improved engineering
Funny 🙂 Never ceases to amaze me how many people take these weekend discussion topics on KG ;’seriously’ and then proceed to slander the whole site. saw rage3d posted about it and some guy was saying how crap everything is here. He clearly never read the 44 page HQV 2.0 benchmark analysis or the power supply article posted a few days ago. (amongst other things). seems its really easy on the net to just take an opinion without any basis for it and then continue slandering something 🙂
Good read this btw, will be interesting to see atis new range of graphics cards, I think tesselation will become a big part of future games, as its such a strong aspect of DX11
Considering current gens are generally more or less twice as fast as the previous generation of cards as death dealer said, 5000 gives twice the performance as 4000, eg 5870 2x~ 4870 more or less. If this is set to be more then just a refresh then that will be true, if theyve just been lazy and not really done much more then a refresh in all this time then probably more like ~1.3x time current performance
Interesting conjecture, and I’m not going to discount it, let’s face it Nvidia and AMD are simply not going to tell us what their strategies are and I’m sure all of the noted factors are very well noted by both teams.
I can’t call it from a technology stand point, although I’m a self confessed Nvidia fan, but I won’t let that colour my view.
It would be interesting if you did a follow up article on the state of play in with 28nm fabrication. Do AMD have the edge here with global foundries? We’ve seen Nvidia struggle with the 40 nm process, would AMD be faced with the same challenges or do they have an edge in tape out technologies and testing. That I’d be interested to read.
It will, in any event, be very interesting, and I’m waiting until after the summer and those cold dark evenings for a PC rebuild and cheaper cards.
@James: Great questions ! Let us try and respond…
1) TSMC completed work on Fab 12 in January this year. They have 28nm capability and, when they opened the Fab, announced that they expected production runs to begin Q4 2010. KitGuru says that a high-end 6000 series part in only likely to move to 28nm IF one of the simpler 5000-series parts moves first. So if there’s an announcement at the next quarterly briefing, we’ll all know for sure!
(For reference, check Anton Shilov’s article on Xbit back on 20th Jan 2010)
2) Not sure where your info comes from. We believe that Northern Islands is this year’s respin and a full update will happen around 12-18 months later. Please reveal your sauces 😉
3) We said that a lot of work happens in parallel. However, you can make changes to the final performance characteristics IF your opponent has already launched all of its key SKUs. That’s a big advantage for AMD. They KNOW where the nVidia products are. Remember, the GTS450, 440, 430 and 420 parts have not launched yet, while the high end parts are already several versions of the silicon down the line. If Fermi had launched in the Sep-Dec time frame, then the GTX460 etc would have been the respins. They are not respins now. They have become v1 launch silicon. Which means the respin for nVidia could be a lot further away. If that makes sense.
4) Possible, but not what we meant. What’s we’re saying is that Intel uses a Tick-Tock as a concept and we feel it’s unlikely for AMD to move a new product (6870) to a new process at the same time. We might be wrong, of course, and we’ll take our lumps if we are. But we believe a smaller product will ‘Tick’ its way to 28nm ahead of 6870.
@Aberkae: What’s your guess ?
@PineappleExpress: Wise words from down under methinks 🙂
No not going to happen this way AMD would not take the risk.
Not now that Nvidia has essentially closed the gap with the GF104 chip and forthcoming GF106 and 108 chips. Nvidia has retaken the $200 mark and with the possibility of a higher power GF104 chip using all 8 SM’s not just the 7 used on the current gtx460 and dual GF104 cards Nvidia is set to sweep across the board.
Both Ati and Nvidia were stung by the 40nm yield issues, ATi will now be looking for a dead certainty to counter Nvidias latest releases and will not risk its now required performance boost on a 28nm process which isn’t even in being yet particularly when they would also be making a new architecture on it without trialing it with a mid range gpu first(just like the mistake Nvidia made with Fermi on the 40nm process).
No Southern Islands on 40nm will be released in Q4 and we will probably see a mid range Southern Island card released in H1 2011 using 28nm process very much like they did with the 4770 on the 40nm process.
How long the Southern Island range will last is another question, it was only a back up plan used becasue of the cancellation of 32nm by the fabs, I don’t think Ati will wait a year from Southern Islands release to release Northern Islands and as soon as they are happy with the 28nm process it will be out , they don’t want to give Nvidia any breathing space just to stretch out the life cycle of Southern Islands which after all was just a back up plan.
http://www.fudzilla.com/graphics/graphics/radeon-hd-6000-is-a-minor-improvement
Fudzilla just posted this, so im guessing were looking at more minor improvements. Such as the uncore part as stated on wiki which improves tessellation performance as someone said above. Which probably means something like 1.3 times evergreen. So im guessing its the phenom x6 to the phenom x4 (without extra cores), just little things like improved process and stuff like that to improve efficency
Also read a comment saying that northern islands has been cancelled, its hasnt been cancelled, only delayed due to glofo/TSMC cancelling its 32nm process and jumping straight to 28nm. So ATI is using southern islands as a filler until northern islands comes around next year. Dunno if anyone else said this didnt read all the comments…
Hopefully southern islands will be good, and create ALOT of competition with GF104 as im looking to get my new build some time around the holiday season
I think the majority of it will end when we have real life simulation of environments and all the in betweens on our screen.
But with the release of Kinect, Wii, Move etc and their successors… you have to wonder if we’ll ever reach the end of the on screen gaming potential as the battle between mouse, keyboard, controller and the hands free experience intensifies.
The transition from whats traditional and the alternatives offered by the consoles may prevent us from reaching the pinnacle of gaming on screen more than any hardware or software bottleneck. \
so let me get this straight they did a benchmark on the 6870 3-6 months before its release date? seems fabricated to me.
Thats why its called a ‘prediction’. You can hardly fabricate a prediction. its guesswork and the reasons why surrounding it.
Haha)) According to the leaked benchmarks of HD6870 (here: http://we.pcinlife.com/thread-1500103-1-1.html ) Your predictions were pretty damn close! 6870 GPU Score in Performance benchmark: 24056.
kitguru prediction ? WRONG ! bench has been leaked !
GTX480 score heaven 2.1 = 970
ATI6718 score heaven 2.1 = 912
LOL – the truth is out there !
I realize some may not like the former post so here are the links !
http://www.geeks3d.com/20100525/quick-test-unigine-heaven-2-1-gtx-480-vs-gtx-470-vs-hd-5870-in-opengl-4-0-and-direct3d-11-in-extreme-tessellation/
http://futuremark.yougamers.com/forum/showthread.php?t=133775
Read it and weep waiters !
SiliconDoc
The prediction of “performance” score of HD6870 in 3DMark Vantage by KitGuru is almost correct! Only of course – If leaked benchmarks are true.
And considering the leaked Heaven 2.1 score – the GTX480 (at stock clock) with the same settings actually scores less then leaked 6870 – somewhere around 750 points. Check this: http://tof.canardpc.com/view/186b5b87-44ec-4000-b8a2-31789867a501.jpg
Ah yes, the anonymous photo shopped pic job with the low 480 score, rofl. The 480 users already busted that ruse.
SiliconDoc
I do not own GTX480, so the score could be photoshopped. But after some searching done on internet – I’m sure that stock GTX480 (without overclock) at 1920×1200 4xAA and extreme tesselation scores LOWER THAN 900 points in Heaven 2.1 (a screenshot here in the end of the page (GTX480 Possibly OVERCLOCKED): http://we.pcinlife.com/thread-1500103-3-1.html). So It’s weaker than 6870 in tesselation according to the leaked tests.
Personally I still hardly believe leaked info, because the increase is so huge it would need some massive arhitecture improvements. Just can’t wait to see the official release to make things clear.
The thank you Daimler, more canned links that link the link I linked in th relinked links.
I already went to ORB and looked up GTX480 scores and saw X15,000+ not uncommon – beating the 11,000+ leaked for 6000 – in xtreme with 1 480 card….
So anyway…
People tell me with a straight internet face the 480 is only 10% faster than the 5870, the 470 is the same as 5850, the 5830 matches the 460, the 5770 beats the 260 and 275 and matches the 285, and on and on and on.
What I’ve been doing for a couple years now is immediately downgrading whatever gigantic full tier lie the ati fans spew as “their take”, a full tier.
To copy the greatest spew heard over and over again for years already (it seems) ” THEY HAVE HAD 8 MONTHS to release and that late it had better be a card that can beat the Nvidia 480 !!!!” ( imagine a raging face and screeching anger).
Can it do Physx and play Cuda ? (not Crysis)
SiliconDoc
I’m not an ATI fan. I’m technology fan. But you are obviously an nVidia fanboy and thats a shame. 🙁
As for xtreme benchmark score in Vantage – make sure you looked not at total score, but GPU Score ONLY! Because Total Score is affected by CPU and Use of PhysX in CPU Tests. Stock clocked GTX480 would never 15,000 mark in GPU Score.