Abstract:
This paper proposes a highly parallel variable block size full search motion estimation algorithm with concurrent parallel reduction (CPR) on the graphics processing unit (GPU) using the compute unified device architecture (CUDA). The approach minimizes memory access latency by using the high-speed on-chip memory of the GPU. By applying parallel reductions concurrently, scheduled according to the amount of data and the data dependencies, the proposed approach increases thread utilization and decreases the number of latency-inducing synchronization points. Experimental results show that the proposed approach achieves a speedup of up to 92 times over its central processing unit (CPU)-only counterpart.
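The paper's kernels are not reproduced in this abstract; the following is only a minimal sketch of the underlying technique the abstract refers to: accumulating per-pixel absolute differences for one candidate motion vector in on-chip shared memory through a tree-shaped parallel reduction, rather than through global memory. All names (sadReduceKernel, BLOCK_PIXELS), the 16x16 macroblock size, and the toy search window are illustrative assumptions, not the authors' implementation.

// Minimal sketch (assumed names, not the authors' code): one thread block
// computes the SAD of a 16x16 block for one candidate motion vector,
// reducing partial sums in shared memory.
#include <cstdio>
#include <cuda_runtime.h>

#define BLOCK_PIXELS 256   // 16x16 macroblock -> one thread per pixel

__global__ void sadReduceKernel(const unsigned char* cur,
                                const unsigned char* ref,
                                int* sadOut)
{
    __shared__ int partial[BLOCK_PIXELS];

    int tid = threadIdx.x;
    int idx = blockIdx.x * BLOCK_PIXELS + tid;

    // Per-pixel absolute difference, loaded with coalesced global reads.
    partial[tid] = abs((int)cur[idx] - (int)ref[idx]);
    __syncthreads();

    // Tree-style reduction in on-chip shared memory; each step halves
    // the number of active threads and costs one synchronization point.
    for (int stride = BLOCK_PIXELS / 2; stride > 0; stride >>= 1) {
        if (tid < stride)
            partial[tid] += partial[tid + stride];
        __syncthreads();
    }

    if (tid == 0)
        sadOut[blockIdx.x] = partial[0];   // SAD for this candidate
}

int main()
{
    const int numCandidates = 4;                 // toy search window
    const int n = numCandidates * BLOCK_PIXELS;

    unsigned char *dCur, *dRef;
    int *dSad;
    cudaMalloc(&dCur, n);
    cudaMalloc(&dRef, n);
    cudaMalloc(&dSad, numCandidates * sizeof(int));
    cudaMemset(dCur, 7, n);                      // dummy current-frame pixels
    cudaMemset(dRef, 3, n);                      // dummy reference-frame pixels

    sadReduceKernel<<<numCandidates, BLOCK_PIXELS>>>(dCur, dRef, dSad);

    int hSad[numCandidates];
    cudaMemcpy(hSad, dSad, sizeof(hSad), cudaMemcpyDeviceToHost);
    for (int i = 0; i < numCandidates; ++i)
        printf("candidate %d: SAD = %d\n", i, hSad[i]);  // |7-3| * 256 = 1024

    cudaFree(dCur); cudaFree(dRef); cudaFree(dSad);
    return 0;
}

The CPR idea described in the abstract goes further than this single-block sketch: reductions for multiple blocks and block sizes are run concurrently so that threads stay busy and fewer __syncthreads() barriers are needed overall.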
Date of Conference: 11-14 January 2013
Date Added to IEEE Xplore: 28 March 2013