Please use this identifier to cite or link to this item:
http://hdl.handle.net/1942/10907
Title: | A High-level Kernel Transformation Rule Set for Efficient Caching on Graphics Hardware - Increasing Streaming Execution Performance with Minimal Design Effort | Authors: | ROGMANS, Sammy BEKAERT, Philippe LAFRUIT, Gauthier |
Issue Date: | 2009 | Publisher: | INSTICC Press | Source: | Proceedings of the International Conference on Signal Processing and Multimedia Applications. p. 38-43. | Abstract: | This paper proposes a high-level rule set that allows algorithmic designers to optimize their implementation on graphics hardware, with minimal design effort. The rules suggest possible kernel splits and merges to transform the kernels of the original design, resulting in an inter-kernel rather then low-level intra-kernel optimization. The rules consider both traditional texture caches and next-gen shared memory – which are used in the abstract stream-centric paradigms such as CUDA and Brook+ – and can therefore be implicitly applied in most generic streaming applications on graphics hardware. | Document URI: | http://hdl.handle.net/1942/10907 | ISBN: | 978-989-674-005-4 | ISI #: | 000282246300011 | Category: | C1 | Type: | Proceedings Paper | Validations: | ecoom 2011 |
Appears in Collections: | Research publications |
Show full item record
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.