Please use this identifier to cite or link to this item: http://hdl.handle.net/1942/10907
Title: A High-level Kernel Transformation Rule Set for Efficient Caching on Graphics Hardware - Increasing Streaming Execution Performance with Minimal Design Effort
Authors: ROGMANS, Sammy 
BEKAERT, Philippe 
LAFRUIT, Gauthier 
Issue Date: 2009
Publisher: INSTICC Press
Source: Proceedings of the International Conference on Signal Processing and Multimedia Applications. p. 38-43.
Abstract: This paper proposes a high-level rule set that allows algorithmic designers to optimize their implementation on graphics hardware, with minimal design effort. The rules suggest possible kernel splits and merges to transform the kernels of the original design, resulting in an inter-kernel rather then low-level intra-kernel optimization. The rules consider both traditional texture caches and next-gen shared memory – which are used in the abstract stream-centric paradigms such as CUDA and Brook+ – and can therefore be implicitly applied in most generic streaming applications on graphics hardware.
Document URI: http://hdl.handle.net/1942/10907
ISBN: 978-989-674-005-4
ISI #: 000282246300011
Category: C1
Type: Proceedings Paper
Validations: ecoom 2011
Appears in Collections:Research publications

Show full item record

Page view(s)

82
checked on Aug 6, 2023

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.