News - NVIDIA details next-generation Fermi GPU architecture

**HEXUS** · 01-10-2009, 10:36 AM

NVIDIA spills the beans on Fermi. Good enough to take down the Radeon HD 5870? We take a first look at the architecture.

**kalniel** · 01-10-2009, 11:09 AM

They seem incredibly reluctant to even talk about gaming, let alone give realistic hints to performance. It's almost like they are counting on HPC sales to grow exponentially to make up for a poor forcast on gaming profit.

I think AMD will be satisfied.

**borandi** · 01-10-2009, 11:20 AM

Gaming drives the HPC market in terms of tech, so nVidia has to be within a smidgen of ATI for gaming to compete in both spaces.

**GheeTsar** · 01-10-2009, 11:36 AM

It's very hard to see how, from a gaming perspective, nVidia will be able to match ATI on a price per performance basis. I hope I'm wrong as the price of 5870s could stay high for quite some time if this does come to pass.

**kingpotnoodle** · 01-10-2009, 11:54 AM

They have more than doubled the SP count, 512 is more than a GTX295, so this should be much faster given the inevitable tweaks under the hood...

I reckon it'll compete with a 5870 fairly evenly, and for me the value proposition of NVidia with CUDA, PhysX, the potential for flash acceleration etc is better than ATI (guess it depends on your viewpoint though) so assuming the power/efficiency are good I'm looking forward to this, going to be hard to choose...

**shaithis** · 01-10-2009, 12:08 PM

Originally Posted by kingpotnoodle

They have more than doubled the SP count, 512 is more than a GTX295, so this should be much faster given the inevitable tweaks under the hood...

No mention of speeds though, so it's possible that the speeds have been greatly reduced to fit the SPs in.

I doubt it but it's a possibility.

**dangel** · 01-10-2009, 12:21 PM

Happily I can afford to wait - my current card is fast enough and directx 11 brings speed improvements for all cards this time round so..

**chrestomanci** · 01-10-2009, 01:44 PM

Originally Posted by HEXUS

NVIDIA says that the GPU will also run the likes of Python and Java, although just how much use that will be is debatable.

I think it will be a lot of use because there are loads of programmers out there who prefer to program in Python or Java, and don't like C. It is also a lot quicker to write useful programs in high level languages, than in C.

Suppose you have an existing program written in Java. It currently takes an hour to run, and because it gets run a great deal you have a bussness need for it to run faster.

You could re-write the time crital sections in C, which will make the program about 50% faster (40 minutes), but to do so you would need to learn C, and the resultant code would be more bug prone.

Alternatively you could ask your Boss for £1000 for an nVidai CUDA card that will run the code 100 times faster (4 seconds), with only minor tweaks to the code in a language you are already familiar with.

Even if the program is not yet written, it is often still better to write in a high level language than a low level one as development will be faster. If that last bit of performance is still needed then the critical sections can still be re-written in C, but most of the time the 100x speedup from using CUDA will be good enough.

**Tarinder** · 01-10-2009, 02:28 PM

Originally Posted by chrestomanci

I think it will be a lot of use because there are loads of programmers out there who prefer to program in Python or Java, and don't like C. It is also a lot quicker to write useful programs in high level languages, than in C.

Suppose you have an existing program written in Java. It currently takes an hour to run, and because it gets run a great deal you have a bussness need for it to run faster.

You could re-write the time crital sections in C, which will make the program about 50% faster (40 minutes), but to do so you would need to learn C, and the resultant code would be more bug prone.

Alternatively you could ask your Boss for £1000 for an nVidai CUDA card that will run the code 100 times faster (4 seconds), with only minor tweaks to the code in a language you are already familiar with.

Even if the program is not yet written, it is often still better to write in a high level language than a low level one as development will be faster. If that last bit of performance is still needed then the critical sections can still be re-written in C, but most of the time the 100x speedup from using CUDA will be good enough.

It's the implementation that I'm querying rather than the use, I suppose. Python won't run natively on a GPU, and the 'translation' would hinder performance.

**kalniel** · 01-10-2009, 02:33 PM

Exactly.

Originally Posted by RealWorldTechnologies

Nvidia's marketing is makinig ridiculous claims that they will eventually have Python and Java support, but the reality is that neither language can run natively on a GPU. An interpreted language, such as Python would kill performance, and so what is likely meant is that Python and Java can call libraries which are written to take advantage of CUDA.

**Tarinder** · 01-10-2009, 02:48 PM

For anyone interested in the architecture to a greater degree, NVIDIA's released a whitepaper to the press a few days ago. It's now on the site, so read away (PDF).

http://www.nvidia.com/content/PDF/fe...Whitepaper.pdf

**scaryjim** · 01-10-2009, 03:03 PM

Originally Posted by Tarinder

Python won't run natively on a GPU, and the 'translation' would hinder performance.

not only that, but surely to make effective use of a GPU with that many stream processors your code would already have to be written to be massively multithreaded. Having done an MSc which taught Java as its principal language, and therefore knowing the coding skills of many professional Java developers, the concept of them trying to develop a massively-multithreaded software architecture to take advantage of this leaves me shivering in terror...

**Steve** · 01-10-2009, 05:51 PM

Originally Posted by Tarinder

It's the implementation that I'm querying rather than the use, I suppose. Python won't run natively on a GPU, and the 'translation' would hinder performance.

I don't think the language you write in is that big a deal if it comes with a decent library for this sort of stuff, or a good compiler (the world needs more compiler writers).

Originally Posted by chrestomanci

I think it will be a lot of use because there are loads of programmers out there who prefer to program in Python or Java, and don't like C. It is also a lot quicker to write useful programs in high level languages, than in C.

The problem is, most workloads just aren't written to do SIMD. OK, so new CUDA can run multiple kernels, but I doubt you can run as many kernels as you have streams (I guess I should read the whitepaper!).

If you want to make a GPGPU run fast, you need to take a lot of data, chop it up, and apply the same operations to each chunk - which is why you can dunk it through something massively parallel.

As soon as the operations you need to perform vary between each chunk (e.g. you have branches) the whole thing breaks down. Now, assuming you've got data that lends itself to parallel processing, there are ways of dealing with conditionals that don't involve branching.

Indeed, the reason GPUs have turned into the parallelised beasts that they are, is that graphics shaders and the data they work on are perfect for such situations.

There are a lot of workloads that can have multiple things happening at once, but that's not the same as doing the same thing to lots of data elements at once, which is why we don't have 512-core CPUs (yet...).

**Hicks12** · 18-01-2010, 12:17 PM

Originally Posted by kingpotnoodle

They have more than doubled the SP count, 512 is more than a GTX295, so this should be much faster given the inevitable tweaks under the hood...

I reckon it'll compete with a 5870 fairly evenly, and for me the value proposition of NVidia with CUDA, PhysX, the potential for flash acceleration etc is better than ATI (guess it depends on your viewpoint though) so assuming the power/efficiency are good I'm looking forward to this, going to be hard to choose...

Yes they also increased the bus to 300bit and more onboard ram, this all adds to a huge expense and so it will probably offer better performance than the 5870 but will also cost a HUGE amount more

.
edit: Also the amount of R&D thats gone into this project, it wont be healthy for Nvidia to compete with AMD's 5000 series (a lot less R&D costs etc) on price, either they sell at a loss or sell at a huge price that people wont buy. Although its a start of what could be an amazing platform/design its just going to be a profitless technology unless some serious cost cutting and developments can be made.

Doesnt matter anyway, by the time fermi is out AMD will already have its 6000 series out or a few months away i reckon.

**scaryjim** · 18-01-2010, 01:13 PM

Originally Posted by Hicks12

Yes they also increased the bus to 300bit and more onboard ram...

A 384bit memory bus is actually smaller than the GTX285, which interfaced using a 512bit bus, so there will be a small cost saving in using less memory chips. It does mean it'll end up shipping with one of those odd-sounding memory buffers though - 1.5GB most likely (I can't see them bothering with a 768MB version of the top end card).

Of course, ATI only use a 256bit bus, so unless Nvidia run their DDR5 @ < 3200 effective they're going to have more memory bandwidth on tap. On the other hand, it's debatable whether ATIs top end cards are bandwidth limited anyway, and therefore whether that extra bandwidth will boost performance at all...

**Hicks12** · 18-01-2010, 01:20 PM

Sorry i havent looked much into nvidia top end cards, only focused on the gtx 260 :L.

Thanks for informing me of that, however the rest still is right isnt it? R&D needs to be recopurated from somewhere and its going to be in the price of the cards, amd just tweaked there design and added more which is great (it shows) and in the end costs a lot less than changing the whole design!.

We will only know if bandwidth helps more when fermi is released, im betting Q2 release now tbh.

Thread: News - NVIDIA details next-generation Fermi GPU architecture

LinkBack

Thread Tools

News - NVIDIA details next-generation Fermi GPU architecture

Re: News - NVIDIA details next-generation Fermi GPU architecture

Re: News - NVIDIA details next-generation Fermi GPU architecture

Re: News - NVIDIA details next-generation Fermi GPU architecture

Re: News - NVIDIA details next-generation Fermi GPU architecture

Re: News - NVIDIA details next-generation Fermi GPU architecture

Re: News - NVIDIA details next-generation Fermi GPU architecture

Re: News - NVIDIA details next-generation Fermi GPU architecture

Re: News - NVIDIA details next-generation Fermi GPU architecture

Re: News - NVIDIA details next-generation Fermi GPU architecture

Re: News - NVIDIA details next-generation Fermi GPU architecture

Re: News - NVIDIA details next-generation Fermi GPU architecture

Re: News - NVIDIA details next-generation Fermi GPU architecture

Re: News - NVIDIA details next-generation Fermi GPU architecture

Re: News - NVIDIA details next-generation Fermi GPU architecture

Received thanks from:

Re: News - NVIDIA details next-generation Fermi GPU architecture

Thread Information

Users Browsing this Thread

Similar Threads

News - NVIDIA to immortalise G98 GPU inside a keychain

Catalyst 7.5 out

IDF 2006 :: Intel's next generation Core architecture

HEXUS.beans :: Fanny's pillow talk reveals next gen NVIDIA product details

Posting Permissions