Skip to main content

Advertisement

Log in

A detailed study of ray tracing performance: render time and energy cost

  • Original Article
  • Published:
The Visual Computer Aims and scope Submit manuscript

Abstract

Optimizations for ray tracing have typically focused on decreasing the time taken to render each frame. However, in modern computer systems it may actually be more important to minimize the energy used, or some combination of energy and render time. Understanding the time and energy costs per ray can enable the user to make conscious trade-offs between image quality and time/energy budget in a complete system. To facilitate this, in this paper we present a detailed study of per-ray time and energy costs for ray tracing. Specifically, we use path tracing, broken down into distinct kernels, to carry out an extensive study of the fine-grained contributions in time and energy for each ray over multiple bounces. As expected, we have observed that both the time and energy costs are highly correlated with data movement. Especially in large scenes that do not mostly fit in on-chip caches, accesses to DRAM not only account for the majority of the energy use, but also the corresponding stalls dominate the render time.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

References

  1. Aila, T., Karras, T.: Architecture considerations for tracing incoherent rays. In: Proceedings of HPG (2010)

  2. Aila, T., Laine, S.: Understanding the efficiency of ray traversal on GPUs. In: Proceedings of HPG (2009)

  3. Arnau, J.M., Parcerisa, J.M., Xekalakis, P.: Eliminating redundant fragment shader executions on a mobile GPU via hardware memoization. In: Proceedings of ISCA (2014)

  4. Bakhoda, A., Yuan, G.L., Fung, W.W.L., Wong, H., Aamodt, T.M.: Analyzing CUDA workloads using a detailed GPU simulator. In: ISPASS (2009)

  5. Barringer, R., Akenine-Möller, T.: Dynamic ray stream traversal. ACM TOG 33(4), 33 (2014)

    Article  MATH  Google Scholar 

  6. Binkert, N., Beckmann, B., Black, G., Reinhardt, S.K., Saidi, A., Basu, A., Hestness, J., Hower, D.R., Krishna, T., Sardashti, S., et al.: The gem5 simulator. ACM SIGARCH Comp Arch News 39(2), 1–7 (2011)

    Article  Google Scholar 

  7. Boulos, S., Edwards, D., Lacewell, J.D., Kniss, J., Kautz, J., Shirley, P., Wald, I.: Packet-based Whitted and distribution ray tracing. In: Proceedings of Graphics Interface (2007)

  8. Brunvand, E., Kopta, D., Chatterjee, N.: Why graphics programmers need to know about DRAM. In: ACM SIGGRAPH 2014 Courses (2014)

  9. Budge, B., Bernardin, T., Stuart, J.A., Sengupta, S., Joy, K.I., Owens, J.D.: Out-of-core Data Management for Path Tracing on Hybrid Resources. CGF (2009)

  10. Chatterjee, N., Balasubramonian, R., Shevgoor, M., Pugsley, S., Udipi, A., Shafiee, A., Sudan, K., Awasthi, M., Chishti, Z.: USIMM: the Utah SImulated Memory Module. Technical Report UUCS-12-02, U. of Utah (2012)

  11. Chatterjee, N., OConnor, M., Lee, D., Johnson, D.R., Keckler, S.W., Rhu, M., Dally, W.J.: Architecting an energy-efficient DRAM system for GPUs. In: HPCA (2017)

  12. Christensen, P.H., Laur, D.M., Fong, J., Wooten, W.L., Batali, D.: Ray differentials and multiresolution geometry caching for distribution ray tracing in complex scenes. In: Eurographics (2003)

  13. Dally, B.: The challenge of future high-performance computing. Uppsala University, Uppsala, Sweden, Celsius Lecture (2013)

  14. Gribble, C., Ramani, K.: Coherent ray tracing via stream filtering. In: IRT (2008)

  15. Hapala, M., Davidovic, T., Wald, I., Havran, V., Slusallek, P.: Efficient stack-less BVH traversal for ray tracing. In: SCCG (2011)

  16. HWRT: SimTRaX a cycle-accurate ray tracing architectural simulator and compiler. http://code.google.com/p/simtrax/ (2012). Utah Hardware Ray Tracing Group

  17. Johnsson, B., Akenine-Mller, T.: Measuring per-frame energy consumption of real-time graphics applications. JCGT 3, 60–73 (2014)

    Google Scholar 

  18. Johnsson, B., Ganestam, P., Doggett, M., Akenine-Möller, T.: Power efficiency for software algorithms running on graphics processors. In: HPG (2012)

  19. Kajiya, J.T.: The rendering equation. In: Proceedings of SIGGRAPH (1986)

  20. Karras, T., Aila, T.: Fast parallel construction of high-quality bounding, vol. hierarchies. In: Proceedings of HPG (2013)

  21. Kopta, D., Shkurko, K., Spjut, J., Brunvand, E., Davis, A.: An energy and bandwidth efficient ray tracing architecture. In: Proceedings of HPG (2013)

  22. Kopta, D., Shkurko, K., Spjut, J., Brunvand, E., Davis, A.: Memory considerations for low energy ray tracing. CGF 34(1), 47–59 (2015)

    Google Scholar 

  23. Lee, W.J., Shin, Y., Hwang, S.J., Kang, S., Yoo, J.J., Ryu, S.: Reorder buffer: an energy-efficient multithreading architecture for hardware MIMD ray traversal. In: Proceedings of HPG (2015)

  24. Liktor, G., Vaidyanathan, K.: Bandwidth-efficient BVH layout for incremental hardware traversal. In: Proceedings of HPG (2016)

  25. Mansson, E., Munkberg, J., Akenine-Moller, T.: Deep coherent ray tracing. In: IRT (2007)

  26. Moon, B., Byun, Y., Kim, T.J., Claudio, P., Kim, H.S., Ban, Y.J., Nam, S.W., Yoon, S.E.: Cache-oblivious ray reordering. ACM Trans. Graph. 29(3), 28:1–28:10 (2010)

  27. Muralimanohar, N., Balasubramonian, R., Jouppi, N.: Optimizing NUCA organizations and wiring alternatives for large caches with CACTI 6.0. In: MICRO (2007)

  28. Navrátil, P., Fussell, D., Lin, C., Mark, W.: Dynamic ray scheduling to improve ray coherence and bandwidth utilization. In: IRT (2007)

  29. Navrátil, P.A., Mark, W.R.: An analysis of ray tracing bandwidth consumption. University of Texas at Austin, Computer Science Department (2006)

  30. Overbeck, R., Ramamoorthi, R., Mark, W.R.: Large ray packets for real-time Whitted ray tracing. In: IRT (2008)

  31. Pool, J.: Energy-precision tradeoffs in the graphics pipeline. Ph.D. thesis, UNC, Chapel Hill (2012)

  32. Pool, J., Lastra, A., Singh, M.: An energy model for graphics processing units. In: ICCD (2010)

  33. Shkurko, K., Grant, T., Kopta, D., Mallett, I., Yuksel, C., Brunvand, E.: Dual streaming for hardware-accelerated ray tracing. In: Proceedings of HPG (2017)

  34. Smits, B.: Efficiency issues for ray tracing. In: SIGGRAPH Courses, SIGGRAPH ’05 (2005)

  35. Spjut, J., Kensler, A., Kopta, D., Brunvand, E.: TRaX: A multicore hardware architecture for real-time ray tracing. IEEE Trans. CAD 28(12), 1802–1815 (2009)

  36. Spjut, J., Kopta, D., Boulos, S., Kellis, S., Brunvand, E.: TRaX: A multi-threaded architecture for real-time ray tracing. In: SASP (2008)

  37. Tsakok, J.A.: Faster incoherent rays: Multi-BVH ray stream tracing. In: Proceedings of HPG (2009)

  38. Vogelsang, T.: Understanding the energy consumption of dynamic random access memories. In: MICRO ’43 (2010)

  39. Wang, R., Yu, B., Marco, J., Hu, T., Gutierrez, D., Bao, H.: Real-time rendering on a power budget. ACM TOG 35(4), 111:1–111:11 (2016)

  40. Whitted, T.: An improved illumination model for shaded display. Com. ACM 23(6), 343–349 (1980)

Download references

Acknowledgements

Crytek Sponza is from Frank Meinl at Crytek and Marko Dabrovic, Dragon is from the Stanford Computer Graphics Laboratory, Hairball is from Samuli Laine, and San Miguel is from Guillermo Leal Laguno.

Funding This material is supported in part by the National Science Foundation under Grant No. 1409129.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Elena Vasiou.

Ethics declarations

Conflict of interest

The authors, Elena Vasiou, Konstantin Shkurko, Ian Mallett, Erik Brunvand, and Cem Yuksel, declare that they have no conflict of interest relating to this work and publication.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 1101 KB)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Vasiou, E., Shkurko, K., Mallett, I. et al. A detailed study of ray tracing performance: render time and energy cost. Vis Comput 34, 875–885 (2018). https://doi.org/10.1007/s00371-018-1532-8

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00371-018-1532-8

Keywords

Navigation