![]() |
|
|
#126 |
![]() Join Date: Mar 2009
Location: Internet Heaven
Posts: 1,058 (4.15/day)
Thanks: 158
Thanked 60 Times in 54 Posts
|
Wow guys!
you go ahead and figure this out![]() I'll buy the revision with the better memoryThanks for the heads up...Btw is 2 5770's worth it or is 2 5750's good
__________________
If you are someone who plays a lot of multiplayer games, consider putting TPU somewhere in your name so other members can recgonize that you're a member of the forums... |
|
|
|
|
|
#127 | |
![]() |
Quote:
|
|
|
|
|
|
|
#128 |
![]() Join Date: Sep 2009
Location: Reaching your left retina.
Posts: 435 (5.94/day)
Thanks: 22
Thanked 76 Times in 53 Posts
|
I have said many times and still mantain that the problem is IMO in the thread dispatch processor/setup engine.
1- Both RV770 and RV870 tout the same peak execution of 32k (kilo) threads, so probably the TP/SE has not been changed. 2- It's been said that RV870 is the exact same architecture as RV770 + DX11 support on the shaders, so probably only the ISA on the shaders have changed, if at all. 3- I know comparing different architectures is kinda stupid, but it can be valid as a guideline. Nvidia's GT200 had 32k peak threads too, but they have already said (I think it was on Fermi white paper) that in reality it could only do 10-12k and that was part of the reason for the "lacking" performance of GT200, at least at launch. Fermi will have 24k peak only, but thanks to 16 kernels and 2 different dispatch processors they think they will be able to max it out. SO even if we can't compare architectures directly, we do know that one of the companies did a thorought study on their hardware to test usage and saw that their 32k thread processor (12k in ractice) would not cut it, so they decided to put two, a different/weaker ones, but two. We could speculate wether AMD's dispatch processor was more efficient or not, but given the performance similarity it most probably had a similar one + the advantage of higher clocks if at all. Now imagine it was indeed a little bit more efficient so that that thread dispatch processor was excessive for RV770, with a heavy overhead they could not really test, because it was the rest of the chip that was holding it down. Imagine that RV770 could only do 10-12k on the shader side of things, just like GT200 did as a whole* and that AMD thought that in theory the DP/SE could really do 24k. In order to realease Evergreen as fast as they did, they probably didn't touch the DP at all, being that in theory it could handle 32k and 24k according to their estimates, plenty. But what if the DP can't do 20k and it only does 16k, for example? Then you have a bottleneck where you didn't thought you would have one. It's not as if you could do anything without a complete redesing so you release that, because, in the end it still is a fast card (the fastest), because you will release much sooner and because you expect to improve the efficiency of usage with future drivers. My two cents. |
|
|
|
|
|
#129 | |
![]() |
Quote:
Game A might render using only 2 threads, and only use 1 or 2 of the shaders per cluster. next throw in the CPU performing the physics and some minor setup information. So in a crossfire setup card 1 is generating frame 1, it has been handed the setup and physics information from the CPU, the CPU is then unbound to start working on the next setup as that card is busy, card 2 receives data from the CPU and starts generating frame 2. Card 1 is now done and it is sent to the display, during that time the CPU has generated the next physics and other data for card 1.....so on and so forth.....each card is provided data regardless of what the other card is doing. In the 5870 until that frame is done no other information is dispatched to the GPU, so when it is done it must wait on information from the CPU, not alot of info, but the basics of movement from the mouse, physics, and other user and game thread input must be sent to determine WHAT to render. So we have alot of underutilized GPU power, and even if one shader is being used per cluster it will still report that as activity for the cluster. So long story short, until game devs learn to use shaders and move data processing to the GPU this card is stuck.
__________________
If I wanted your thanks I would have asked for it, asshole. “it would have been perfect....its got trains and the line"tech your kids not to do what iv done"(or similar) because i had obviously done something to warrent 2 e-thugs to come 4000miles out of their way and kill me.” -Solaris17 “yeah i failed. i noticed the "coming soon" part after i posted.” -Mussels
|
|
|
|
|
|
|
#130 | ||
![]() Join Date: Sep 2009
Location: Reaching your left retina.
Posts: 435 (5.94/day)
Thanks: 22
Thanked 76 Times in 53 Posts
|
Quote:
It's not that. Quote:
Last edited by Benetanegia; 11-03-2009 at 10:42 PM. |
||
|
|
|
|
|
#131 | |
![]() Join Date: Sep 2008
Location: PA, USA
Posts: 4,266 (9.81/day)
Thanks: 536
Thanked 1,023 Times in 861 Posts
|
Quote:
I never had that issue with my i7 at stock with several SLI configs and even 295s in single & SLI.
__________________
*Peripheral- noun - computer peripheral, peripheral device ((computer science) electronic equipment connected by cable to the CPU of a computer) "disk drives and printers are important peripherals" Read my latest peripheral review and please join/support the TPU Peripherals Club. |
|
|
|
|
| The Following User Says Thank You to Binge For This Useful Post: |
|
|
#132 |
![]() |
Then the geme knows your every move, and that turn you made to the right is preprogramed into the game? Probably not. That would mean the game has preprocessed every option, every possible physics situation, and every possible pixel from every possible angle, with every possible show or light.
The CPU still has to handle the game thread, and the game thread still has to generate positional (vertex) information to send to the GPU as fast as possible. games run differently based on their software threads and how they approach the handoff between the two. Thus the different performance in games as will as in architecture of systems. The GPU currently is not responsible for generating more than the pretties on top of the basic information handed to it, GPGPU or OPENCL is the beginning of the GPU doing more of the work for faster framerates, and better physics. No latency introduced by the CPU and communications layers. So again think about the step by step process a frame takes as you turn to the left, the CPU is responsible for generating the movement from the mouse/controller input, then hands that to the game thread, which runs on the CPU, that then translates that into character movement, then generates a new set of locations for the GPU to act upon. If the GPU thread generated by the game doesn't utilize all the shader hardware then it creates a artificial bottleneck. Either way the game threads are the holdup, not the GPU core.
__________________
If I wanted your thanks I would have asked for it, asshole. “it would have been perfect....its got trains and the line"tech your kids not to do what iv done"(or similar) because i had obviously done something to warrent 2 e-thugs to come 4000miles out of their way and kill me.” -Solaris17 “yeah i failed. i noticed the "coming soon" part after i posted.” -Mussels
|
|
|
|
|
|
#133 | |
![]() Join Date: Sep 2009
Location: Reaching your left retina.
Posts: 435 (5.94/day)
Thanks: 22
Thanked 76 Times in 53 Posts
|
Quote:
The rendering is the final step of the process, once when all the data for one frame is sent it starts with the next, whether the next set of data is sent to another GPU or to the same GPU that has already finished the work* is irrelevant. * A card with half the execution units will take twice the time to render the same frame, but since there is two cards one does the odd frames and the other one the even ones, with a 50% offset in the period. The result is the same. |
|
|
|
|
|
|
#134 |
![]() |
So there is no latenty introduced byt the time the GPU reports the frame done sending that back to the CPU, and the CPU sending the next instruction set? There is. Even if it is only rendering with 70% of the GPU hardware, there is still wait time. Wait time the drive in a crossfire diminishes by allowing the next frame to start rendering before the current one is finished, so there is your incremental speedup of over 100% scaling.
Why doesn't a old game get some absurd FPS that is linearly incremental to the hardware? Latentcy. We are at that point, the GPU needs to be handling these calculations on board, or the game DEV's/DX needs a override for frames being rendered in order by sending a new packet without the wait flushing the buffer and starting execution on the next relevant frame, perhaps they do and this is the issue, frames are being dumped by the wayside and not counting.
__________________
If I wanted your thanks I would have asked for it, asshole. “it would have been perfect....its got trains and the line"tech your kids not to do what iv done"(or similar) because i had obviously done something to warrent 2 e-thugs to come 4000miles out of their way and kill me.” -Solaris17 “yeah i failed. i noticed the "coming soon" part after i posted.” -Mussels
|
|
|
|
|
|
#135 | |
![]() Join Date: Nov 2007
Location: Miami
Posts: 2,812 (3.79/day)
Thanks: 927
Thanked 495 Times in 438 Posts
|
Quote:
I highly doubt that they accidentally didn't give it enough bandwidth or decided to go with a single operation (didnt know that) tesselator. More likely is that these are corners which were chosen to be cut for whatever reasons. Ones that are unknown to us. Perhaps by cutting these, they were able to get the cards out faster and thus made them more profitable. Maybe they took shortcuts that enabled them to make an X2 card almost simultaneously with the single GPU. Who knows. Point is, they have a card out and its double ready to be released at any given moment. Is it below expectations? Well if you read the specs and assumed a linear increase in performance then yes. If you expected a kickass card within +/_ 20% of the current dual GPU options then no.
__________________
“In this economy. Scott. It's almost feels like you are lapping your ass when using it.” -erocker
PC modding >>>http://www.mypimpedpc.co.uk/ |
|
|
|
|
| The Following User Says Thank You to phanbuey For This Useful Post: |
|
|
#136 | |
![]() Join Date: Sep 2009
Location: Reaching your left retina.
Posts: 435 (5.94/day)
Thanks: 22
Thanked 76 Times in 53 Posts
|
Quote:
Old games don't get absurdly high frames because they are CPU limited, limited by their ability to calculate physics, AI and geometry and the result is a bottleneck that affects every card configuration: every combination of GPUs give the exact same fps. That's not the case here, in fact is quite the opposite, because in a CPU bottlenecked scenario the multi-gpu setup would suffer lower fps, because a lot of data must be sent twice, occupying CPU clocks. http://www.techpowerup.com/reviews/HIS/HD_5770/18.html - 1024x768, that is a CPU bottleneck, in that situation yes latencies do matter a bit, although sincronization of different clock domains plays a much more important role. In fact here http://www.techpowerup.com/reviews/A...ssFire/15.html you can see better how crossfire works out to be slower than single HD5870. Last edited by Benetanegia; 11-04-2009 at 12:26 AM. |
|
|
|
|
| The Following User Says Thank You to Benetanegia For This Useful Post: |
|
|
#137 | |
![]() Join Date: Sep 2008
Location: PA, USA
Posts: 4,266 (9.81/day)
Thanks: 536
Thanked 1,023 Times in 861 Posts
|
Quote:
__________________
*Peripheral- noun - computer peripheral, peripheral device ((computer science) electronic equipment connected by cable to the CPU of a computer) "disk drives and printers are important peripherals" Read my latest peripheral review and please join/support the TPU Peripherals Club. |
|
|
|
|
|
|
#138 |
![]() Join Date: Sep 2009
Location: Reaching your left retina.
Posts: 435 (5.94/day)
Thanks: 22
Thanked 76 Times in 53 Posts
|
@ Binge and phanbuey
maybe I failed to make that point clear, but when I was talking about the DP and setup engine, I meant that. That they knew it would affect somehow, but they decided it would pay off not to redesign the whole architecture. Although I don't think they knew it would affect so much (whatever the problem is) or they would have put less SPs on the chip to make it cheaper. |
|
|
|
|
|
#139 | |
![]() Join Date: Sep 2008
Location: PA, USA
Posts: 4,266 (9.81/day)
Thanks: 536
Thanked 1,023 Times in 861 Posts
|
Quote:
__________________
*Peripheral- noun - computer peripheral, peripheral device ((computer science) electronic equipment connected by cable to the CPU of a computer) "disk drives and printers are important peripherals" Read my latest peripheral review and please join/support the TPU Peripherals Club. |
|
|
|
|
| The Following User Says Thank You to Binge For This Useful Post: |
|
|
#140 |
![]() |
5770 pixels per second. W X H X FPS
1024X768 174,587,904 1680X1050 313,639,904 2560X1600 386,252,800 There is only two reasons the ramp would have not stayed the same between the last two, memory bandwidth limit, and that is not plausable as others have already done tests to confirm memory clock has little to do with performance. PCIe bandwidth as that has little to do with performance. And the cards being underutilized by the software threads controlling it. Wether or not due to latentcy constraints, the hardware should have a linear rate of descent, minus a bit of overhead. The CPU can supply data at a given rate for the current frame to be rendered. i will run some numbers tonight when I get back and try a couple games on my system at different resolutions and GPU loads. I still believe the latentcy even at higher frame rates is what is causing the questions/issues for some.
__________________
If I wanted your thanks I would have asked for it, asshole. “it would have been perfect....its got trains and the line"tech your kids not to do what iv done"(or similar) because i had obviously done something to warrent 2 e-thugs to come 4000miles out of their way and kill me.” -Solaris17 “yeah i failed. i noticed the "coming soon" part after i posted.” -Mussels
|
|
|
|
| The Following User Says Thank You to Steevo For This Useful Post: |
|
|
#141 | |
![]() Join Date: May 2007
Location: Perth, Australia
Posts: 3,014 (3.23/day)
Thanks: 570
Thanked 372 Times in 325 Posts
|
Quote:
They did what they did because they were able to take the crown for single GPU, and beat Nvidia to the cake. I'm pretty sure it's that simple.
__________________
Wolf. Nothing is unmoddable, nothing remains stock. “my goal is speed, full utra, and extreme gaming
” -Ephraim |
|
|
|
|
|
|
#142 |
![]() Join Date: Nov 2007
Location: elkins, wv
Posts: 1,928 (2.63/day)
Thanks: 245
Thanked 187 Times in 173 Posts
|
gotta say you guys are bringing alot of my thoughts and wondering with your posts and its a great discussion
, Though it seems most of us agree there's more potential for the HD 5870. Another reason i think they may have cut corners is bc as said, it allowed them to get control of the market before nvidia released. By releasing this early compared to nvidia, they'll have a good headstart for their next architecture and by cutting these corners it is probably helping them determine what really is going to be a factor come DX11 titles so they'll have a better idea of how to design their next chip. IMO this gen's launch is a big win for AMD and i'd like to see them design a new architecture instead of building on the current, which has been used since RV6X0 days(or was it RV5XX?). One thing i'd like to see and idk much bout this so idk if it'd add too much complexity or not, but i'd like to see ATI unlock their shader clock from core. I mean think if that was the factor with the next gen. Even with only 1200 shaders but clocked at say 1500 with oc headroom that would boost ATI's performance tremendously...i think .keep it up guys, this discussion is very interesting.
__________________
VISIT Chenowith Creek AND HELP IT GROW MORE THAN CDAWALL's “Yeah crysis is what real life was supposed to look like but god messed up
” -DrPepper“MRCL:What if Jesus came by and Apocalypse actually would happen?
Weer:Then I'd be more afraid that other works of fiction would come true, such as Harry Potter.”
|
|
|
|
|
|
#143 | |
|
Outdated Meme Moderator
Join Date: Oct 2004
Location: Bendigo, Australia (NOT THE USA)
Posts: 17,944 (9.57/day)
Thanks: 899
Thanked 2,888 Times in 2,506 Posts
|
Quote:
Disable Vsync and watch the FPS soar.
__________________
![]() AMD tech support - see how bad it really is 5 port E-sata cage. 4x samsung 1TB + 2x Seagate 1.5TB = 7 TB external storage 32 Bit OS vs 64 bit OS information 48x0 card support Getting DXVA working in windows 7! |
|
|
|
|
|
|
#144 |
![]() Join Date: Oct 2009
Location: Milwaukee Wisconsin
Posts: 206 (4.19/day)
Thanks: 19
Thanked 12 Times in 12 Posts
|
It should on paper but in real life nothing is certain. Also A_ump you have to remember that the Drivers for the 5870 are still really young. As the card matures the performance will definitely increase.
Like it was stated ..... Last Gen cards were really powerful and that a Single GPU eve comes close to beating a Duel GPU from last Gen is impressive. I own a Diamond 5870. Before I bought it I worried about the same thing you just commented on. With it's performance. But you know after I saw how much a 4870 improved after the driver updates came out. I calmed all my my worries. I sold my EVGA GTX 285 FTW edition to get this card. That's how sure I am after all is said and done.... there will be nothing that comes close to this card from last GEN when all the updates BIOS flashes and tweaks are done. |
|
|
|
|
|
#145 |
![]() Join Date: Nov 2007
Location: elkins, wv
Posts: 1,928 (2.63/day)
Thanks: 245
Thanked 187 Times in 173 Posts
|
very true. and there's that lovely suspicion among some of us that ATI is intentially holding back the HD 5870's performance onpurpoase as it currently selling fine and is has the single gpu performance crown.
__________________
VISIT Chenowith Creek AND HELP IT GROW MORE THAN CDAWALL's “Yeah crysis is what real life was supposed to look like but god messed up
” -DrPepper“MRCL:What if Jesus came by and Apocalypse actually would happen?
Weer:Then I'd be more afraid that other works of fiction would come true, such as Harry Potter.”
|
|
|
|
|
|
#146 |
|
Moderator
Join Date: Jul 2006
Posts: 15,960 (13.05/day)
Thanks: 1,113
Thanked 3,360 Times in 2,681 Posts
|
|
|
|
|
|
|
#147 |
![]() Join Date: Sep 2009
Location: Reaching your left retina.
Posts: 435 (5.94/day)
Thanks: 22
Thanked 76 Times in 53 Posts
|
That's what I said. Look, that there's something "wrong" with the card is clear, that they released the best card this quarter is clear too. That they didn't care because they'd have to go back to the drawing board otherwise, is not so clear, but we are all saying that, and since it is an improvement over previous cards it doesn't matter anyway. They wanted the crown and they got it, but at the expense of doing a less efficient design. Who cares? Well when it comes to market reality, no one, I don't, but I am a tech yonkie and I like discussing architectures and how they affect performance, etc. So in that sense I care, it's not performing as it should, I just want to know why.
|
|
|
|
| The Following User Says Thank You to Benetanegia For This Useful Post: |
|
|
#148 |
![]() Join Date: May 2007
Location: Perth, Australia
Posts: 3,014 (3.23/day)
Thanks: 570
Thanked 372 Times in 325 Posts
|
I missed the bit where you said you liaise with their R&D, not to mention I'm allowed my opinion in not believing you
![]() You've also managed to restate the same point over and over and over, we do get it brah.
__________________
Wolf. Nothing is unmoddable, nothing remains stock. “my goal is speed, full utra, and extreme gaming
” -Ephraim |
|
|
|
|
|
#149 | |
![]() Join Date: Nov 2007
Location: elkins, wv
Posts: 1,928 (2.63/day)
Thanks: 245
Thanked 187 Times in 173 Posts
|
Quote:
__________________
VISIT Chenowith Creek AND HELP IT GROW MORE THAN CDAWALL's “Yeah crysis is what real life was supposed to look like but god messed up
” -DrPepper“MRCL:What if Jesus came by and Apocalypse actually would happen?
Weer:Then I'd be more afraid that other works of fiction would come true, such as Harry Potter.”
|
|
|
|
|
|
|
#150 |
![]() Join Date: May 2007
Location: Perth, Australia
Posts: 3,014 (3.23/day)
Thanks: 570
Thanked 372 Times in 325 Posts
|
I think that is true of most people who frequent the video cards section of TPU, GPU architecture is far more interesting than CPU architecture to me, especially how both camps continue to have such vastly different approaches, yet end up in roughly the same spot, its an amazing race to take part in.
__________________
Wolf. Nothing is unmoddable, nothing remains stock. “my goal is speed, full utra, and extreme gaming
” -Ephraim |
|
|
|
![]() |
| Currently Active Users Viewing This Thread: 1 (0 members and 1 guests) | |
| Thread Tools | |
|
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| SLi expectations | SK-1 | NVIDIA | 6 | 07-06-2008 08:39 AM |
| ATI RV770 'On Par' With Expectations | zekrahminator | News | 57 | 02-20-2008 06:29 AM |
| New Computer doesn't meet expectations | wildmonkeys77 | General Hardware | 45 | 01-04-2007 07:26 AM |
| Microsoft DirectX 10 expectations | malware | News | 0 | 09-04-2005 10:54 AM |
| X850XT PE bios expectations | Synetic | Graphics Cards | 36 | 07-09-2005 10:27 PM |