ABSTRACT
This paper presents a new graphics architecture enabling content-rich applications for the embedded space by extending hardware architecture in two main areas - geometry processing and configurable per-fragment shading. Our first contribution combines vertex cache and a programmable geometry engine that handles both fixed and variable size geometrical primitives completely on-chip. It enables subdivision surface tessellation, silhouette rendering and other geometry processing algorithms to be implemented in one pass and without external memory access. Our second contribution is in configurable per-fragment shading that is mainly a dot product + lookup table machine being versatile enough to realize Cook-Torrance shading, Schlick anisotropy model and others.
Memory storage and memory bandwidth are reduced in proposed architecture as both compact geometry and material descriptions are possible, enabling complex shapes and sophisticated shading models in embedded space.
The architecture has complete HDL and ASIC implementations and was demonstrated during the ESEC 2008 exhibition in Japan. Exposing all the features of our architecture via OpenGL ES 1.X and 2.0 API enabled extended OpenGL ES engines from Rightware Oy to run on our ASIC implementations.
- Andrews, J., and Baker, N. 2006. Xbox 360 system architecture. IEEE Micro 26, 2, 25--37. Google ScholarDigital Library
- Arvo, J., and Westerholm, J. 2004. Hardware accelerated soft shadows using penumbra quads. Journal of WSCG 12, 1, 11--18.Google Scholar
- Blythe, D. 2006. The direct3d 10 system. In SIGGRAPH '06: ACM SIGGRAPH 2006 Papers, ACM Press, New York, NY, USA, 724--734. Google ScholarDigital Library
- Bolz, J., and Schröder, P. 2002. Rapid evaluation of catmull-clark subdivision surfaces. In Web3D '02: Proceeding of the seventh international conference on 3D Web technology, ACM Press, New York, NY, USA, 11--17. Google ScholarDigital Library
- Cook, R. L., and Torrance, K. E. 1982. A reflectance model for computer graphics. ACM Transactions on Graphics 1, 1 (January), 7--24. Google ScholarDigital Library
- Donghyun Kim, e. a. 2005. An soc with 1.3gtexels/s 3d graphics full pipeline engine for consumer applications. In IEEE International Solid-State Circuits Conference, vol. XLVIII, 190--.Google Scholar
- Hast, A. 2005. Shading by quaternion interpolation. In WSCG (Short Papers), 53--56.Google Scholar
- Heidrich, W., and Seidel, H.-P. 1999. Realistic, hardware-accelerated shading and lighting. In Proceedings of SIGGRAPH 99, ACM SIGGRAPH/Addison Wesley Longman, Los Angeles, California, Computer Graphics Proceedings, Annual Conference Series, 171--178. ISBN 0-20148-560-5. Google ScholarDigital Library
- Imai, M., Nagasaki, T., Sakamoto, J., Takeuchi, H., Nagano, H., Iwasaki, S., Hatakenaka, M., Fujita, J., Keino, K., Motomura, T., Ueda, T., Niki, T., and Tomikawa, H. 2004. A 109.5mw 1.2v 600m texels/s 3-d graphics engine. In IEEE International Solid-State Circuits Conference, vol. XVII, 332--333.Google Scholar
- Kameyama, M., Kato, Y., Fujimoto, H., Negishi, H., Kodama, Y., Inoue, Y., and Kawai, H. 2003. 3d graphics lsi core for mobile phone "z3d". In HWWS '03: Proceedings of the ACM SIGGRAPH/EUROGRAPHICS conference on Graphics hardware, Eurographics Association, Aire-la-Ville, Switzerland, Switzerland, 60--67. Google ScholarDigital Library
- Kautz, J., and McCool, M. D. 1999. Interactive rendering with arbitrary brdfs using separable approximations. In Eurographics Rendering Workshop 1999, Springer Wein/Eurographics, Granada, Spain. Google ScholarDigital Library
- Kazakov, M. 2007. Catmull-clark subdivision for geometry shaders. In Proceedings of the 5th international conference on Computer graphics, virtual reality, visualisation and interaction in Africa, ACM, New York, NY, USA, AFRIGRAPH '07, 77--84. Google ScholarDigital Library
- Lawrence, J., Rusinkiewicz, S., and Ramamoorthi, R. 2004. Efficient brdf importance sampling using a factored representation. In SIGGRAPH '04: ACM SIGGRAPH 2004 Papers, ACM Press, New York, NY, USA, 496--505. Google ScholarDigital Library
- Lawrence, J., Ben-Artzi, A., DeCoro, C., Matusik, W., Pfister, H., Ramamoorthi, R., and Rusinkiewicz, S. 2006. Inverse shade trees for non-parametric material representation and editing. ACM Trans. Graph. 25, 3, 735--745. Google ScholarDigital Library
- Loop, C., Schaefer, S., Ni, T., and Castaño, I. 2009. Approximating subdivision surfaces with gregory patches for hardware tessellation. In ACM SIGGRAPH Asia 2009 papers, ACM, New York, NY, USA, SIGGRAPH Asia '09, 151:1--151:9. Google ScholarDigital Library
- Mochocki, B., Lahiri, K., and Cadambi, S. 2006. Power analysis of mobile 3d graphics. In DATE '06: Proceedings of the conference on Design, automation and test in Europe, European Design and Automation Association, 3001 Leuven, Belgium, Belgium, 502--507. Google ScholarDigital Library
- Ohbuchi, E., and Unno, H. 2002. A real-time configurable shader based on lookup tables. Proceeding of Cyberworld 00, 0507. Google ScholarDigital Library
- Pedersen, M., 2004. Patent wo2003/081528: Method and system for generating subdivision surfaces in real-time.Google Scholar
- Rost, R., Bishop, L., Hall, C., Simpson, R., Rice, D., Murray, B., Barnes, M., and Suoranta, T. 2006. Openkode: an open mobile media development environment. In SIGGRAPH '06: ACM SIGGRAPH 2006 Courses, ACM Press, New York, NY, USA, 1. Google ScholarDigital Library
- Rusinkiewicz, S. 1998. A new change of variables for efficient BRDF representation. In Rendering Techniques '98 (Proceedings of Eurographics Rendering Workshop '98), Springer Wien, New York, NY, G. Drettakis and N. Max, Eds., 11--22.Google ScholarCross Ref
- Schröder, P. 1998. Subdivision for modeling and animation. Course notes of Siggraph 98. ACM SIGGRAPH..Google Scholar
- Shiue, L.-J., Jones, I., and Peters, J. 2005. A realtime gpu subdivision kernel. ACM Trans. Graph. 24, 3, 1010--1015. Google ScholarDigital Library
- Ström, J., and Akenine-Möller, T. 2005. ipackman: high-quality, low-complexity texture compression for mobile phones. In HWWS '05: Proceedings of the ACM SIGGRAPH/EUROGRAPHICS conference on Graphics hardware, ACM Press, New York, NY, USA, 63--70. Google ScholarDigital Library
- Szirmay-Kalos, L., and Kelemen, C. 2001. A microfacet based coupled specular-matte brdf model with importance sampling. In Eurographics Conference, Short presentations, 25--34.Google Scholar
- Uesaki, A., Kobayashi, T., Nishio, K., Nishimura, A., and Mochizuki, Y., 2004. Us patent 7,015,917: Curved surface subdivision apparatus.Google Scholar
- Watt, A., and Watt, M. 1991. Advanced animation and rendering techniques. ACM Press, New York, NY, USA. Google Scholar
- Woo, R., Choi, S., Sohn, J.-H., Song, S.-J., and Yoo, H.-J. 2004. A 210-mw graphics lsi implementation full 3-d pipeline with 264 mtexels/s texturing for mobile multimedia applications. 358--367.Google Scholar
Index Terms
- Primitive processing and advanced shading architecture for embedded space
Recommendations
A lazy object-space shading architecture with decoupled sampling
HPG '10: Proceedings of the Conference on High Performance GraphicsWe modify the Reyes object-space shading approach to address two inefficiencies that result from performing shading calculations at micropolygon grid vertices prior to rasterization. Our system samples shading of surface sub-patches uniformly in the ...
Poxels: polygonal voxel environment rendering
VRST '14: Proceedings of the 20th ACM Symposium on Virtual Reality Software and TechnologyWe present efficient rendering of opaque, sparse, voxel environments with data amplified in local graphics memory with stream-out from a geomery shader to a cached vertex buffer pool. We show that our Poxel rendering primitive aligns with optimized ...
Comments