DirectCompute Lecture Series 210: GPU Optimizations and Performance
- Posted: Aug 04, 2010 at 8:31 AM
- 25,368 Views
- 5 Comments
Download
How do I download the videos?
- To download, right click the file type you would like and pick “Save target as…” or “Save link as…”
Why should I download videos from Channel9?
- It's an easy way to save the videos you like locally.
- You can save the videos in order to watch them offline.
- If all you want is to hear the audio, you can download the MP3!
Which version should I choose?
- If you want to view the video on your PC, Xbox or Media Center, download the High Quality WMV file (this is the highest quality version we have available).
- If you'd like a lower bitrate version, to reduce the download time or cost, then choose the Medium Quality WMV file.
- If you have a Zune, WP7, iPhone, iPad, or iPod device, choose the low or medium MP4 file.
- If you just want to hear the audio of the video, choose the MP3 file.
Right click “Save as…”
- High Quality WMV (PC, Xbox, MCE)
- MP3 (Audio only)
- MP4 (iPod, Zune HD)
- Mid Quality WMV (Lo-band, Mobile)
Join James Fung from Developer Technology at NVIDIA as he covers the following advanced topics for DirectCompute:
Work distribution best practices
Compute shader code best practices
Algorithm selection best practices
Multi-GPU best practices
He also gives an example problem, scan/reduction, and then goes through various optimizations for that problem.
For more information on DirectCompute, download the PDC 2009 DirectCompute HOL, watch the DirectCompute Roundtable discussion, see the full DirectCompute lecture series, and download the slides for this lecture.
Comments Closed
Comments have been closed since this content was published more than 30 days ago, but if you'd like to continue the conversation,
please create a new thread in our Forums,
or
Contact Us and let us know.
Follow the Discussion
Where we can find the presentations?
The code is not visible in the movies
The relevant links should be in the description to the right of the video (http://code.msdn.microsoft.com/Project/Download/FileDownload.aspx?ProjectName=DirectComputeLecture&DownloadId=12929) is the link to the slide deck. The code that he uses is similar to that included in the DirectCompute Hands on Lab (HOL) which is also linked in the description. Hope that helps!
Here's an interesting question:
Wont we get the same problem in the GPU as in the CPU when all applications start using the GPU ? ( contention, GPU gets overloaded, which means delays, etc... )
So in other words we are just moving/postponing the problem.
Do you have any thoughts about this ?
I think overall its letting the system make use of all resources, GPU and CPU. Certainly its possible that down the line, applications could begin to contend for GPU resources. But, there will be tasks that remain on the CPU and many that can make new use of the GPU (and often need it intermittently) so overall it'll make better use of whats available.
Some slides (for example, #35 and 36) were apparently taken as-is from the older presentation by Mark Harris (http://developer.download.nvidia.com/compute/cuda/1_1/Website/projects/reduction/doc/reduction.pdf) and contain code from CUDA version of the reduction optimization experiment.
In my opinion, description of algorithm cascading along with Brent’s theorem is pretty important, but unfortunately it is missing in the current talk. Otherwise, great lecture, thank you James!
Remove this comment
Remove this thread
close