Abstract
The technological developments in the areas of computer hardware and software resulted in a wide range of fast and cheap single- or multi-core processors, compilers, operating systems and programming languages, each with its own benefits and drawbacks, but with the ultimate goal to increase overall computer system performances. Although the number of transistors on a chip continues to double roughly every two years, there is still difficult to improve the performance of sequential processors, and even of the parallel multi-core and multi-processor shared-memory systems. The main reason for this resides in the ever-increasing gap between processor and memory speeds in the classical Von Neumann’s computer model. Therefore in this paper we propose a novel memory-centric approach of computing in a RISC-modified processor core that includes on-chip memory, which can be directly accessed, without the use of general-purpose registers (GPRs) and cache memory. Considering that the proposed RISC-modified core allows for a high on-chip memory bandwidth and low latency, we examine its performances in applications with different arithmetical intensity (dense matrix multiplication, Fast Fourier Transform - FFT, Partial Differential Equations - PDEs), according to the Roofline model. The results show that the proposed memory-centric RISC-modified core outperforms the initial RISC-based MIPS processor core for problems with medium or large arithmetical intensity.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Patterson, D.A., Hennessy, J.L.: Computer Organization and Design: The Hardware/Software Interface. Elsevier, USA (2014)
Wulf, W.A., McKee, S.A.: Hitting the memory wall: implications of the obvious. ACM SIGARCH Comput. Archit. News 23(1), 20–24 (1995)
Hennessy, J.L., Patterson, D.A.: Computer Architecture: A Quantitative Approach. Morgan Kaufmann, USA (2012)
Borkar, S., Chien, A.A.: The future of microprocessors. Commun. ACM 54(5), 67–77 (2011)
Intel Corporation: New Microarchitecture for 4th Gen. Intel Core Processor Platforms. Product Brief (2013)
Carvalho, C.: The gap between processor and memory speeds. In: ICCA 2002, Portugal (2002)
Machanick, P.: Approaches to addressing the memory wall. Technical report. University of Queensland Brisbane, Australia (2002)
Smotherman, M.: Understanding EPIC architectures and implementations. In: ACM Southeast Conference, Atlanta (2002)
Jakimovska, D., et al.: Modern processor architectures overview. In: XVIII ICEST Conference, Bulgaria, pp. 239–242 (2012)
Eigenmann, R., et al.: Von Neumann computers. Wiley Encyclopedia Electr. Electron. Eng. 23, 387–400 (1998)
Saulsbury, A., Pong, F., Nowatzyk, A.: Missing the memory wall: the case for processor/memory integration. In: 23rd International Symposium on Computer Architecture, USA (1996)
Suresh, P.: PERL - a register-less processor. Ph.D. Thesis. Indian Institute of Technology, Kanpur (2004)
Wang, P.: Designing scratchpad memory architecture with emerging STT-RAM memory technologies. In: IEEE International Symposium on Circuits and Systems (2013)
Cojocaru, C.: Computational RAM: implementation and bit-parallel architecture. Master Thesis. Carletorn University, Ottawa (1995)
Tsubota, H., et al.: The M32R/D, a 32b RISC microprocessor with 16Mb embedded DRAM. Technical report (1996)
Draper, J., et al.: A prototype processing-in-memory (PIM) chip for the data-intensive architecture (DIVA) system. J. VLSI Sig. Process. Syst. 40(1), 73–84 (2005)
Gokhale, M., Holmes, B., Jobst, K.: Processing in memory: the Terasys massively parallel PIM array. IEEE Comput. J. 28(4), 23–31 (1995)
Gebis, J., et al.: VIRAM1: a mediaoriented vector processor with embedded DRAM. In: 41st Design Automation Student Design Contest, San Diego (2004)
Murakami, K., Shirakawa, S., Miyajima, H.: Parallel processing RAM chip with 256 Mb DRAM and quad processors. In: Solid-State Circuits Conference (1997)
Kaxiras, S., Burger, D., Goodman, R.: DataScalar: a memory-centric approach to computing. J. Syst. Archit. 45, 1001–1022 (1999)
Oskin, M., Chong, F.T., Sherwood, T.: Active pages a computation model for intelligent memory. In: 25th Annual International Symposium on Computer Architecture, pp. 192–203 (1998)
Azarkhish, E., Rossi, D., Loi, I., Benini, L.: Design and evaluation of a processing-in-memory architecture for the smart memory cube. In: 29th International Conference Architecture of Computing Systems, Germany (2016)
Moore’s Law is dead - long live Moore’s Law. IEEE Spectr. Mag. (2015)
Bakshi, A., et al.: Memory latency: to tolerate or to reduce? In: 12th Symposium on Computer Architecture and High Performance Computing (2000)
Li, S., et al.: Performance impacts of non-blocking caches in out-of-order processors. Technical paper (2011)
Hewlett Packard Labs: The Machine: The future of technology. Technical Paper (2016)
Vollmar, K., Sanderson, P.: MARS: an education-oriented MIPS assembly language simulator. In: 37th SIGCSE Technical Symposium on Computer Science Education, USA (2007)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Efnusheva, D., Tentov, A. (2017). Design of Processor in Memory with RISC-modified Memory-Centric Architecture. In: Silhavy, R., Senkerik, R., Kominkova Oplatkova, Z., Prokopova, Z., Silhavy, P. (eds) Cybernetics and Mathematics Applications in Intelligent Systems. CSOC 2017. Advances in Intelligent Systems and Computing, vol 574. Springer, Cham. https://doi.org/10.1007/978-3-319-57264-2_7
Download citation
DOI: https://doi.org/10.1007/978-3-319-57264-2_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-57263-5
Online ISBN: 978-3-319-57264-2
eBook Packages: EngineeringEngineering (R0)