skip to main content
research-article

Virtualizing a Post-Moore’s Law Analog Mesh Processor: The Case of a Photonic PDE Accelerator

Published: 24 January 2023 Publication History

Abstract

Innovative processor architectures aim to play a critical role in future sustainment of performance improvements under severe limitations imposed by the end of Moore’s Law. The Reconfigurable Optical Computer (ROC) is one such innovative, Post-Moore’s Law processor. ROC is designed to solve partial differential equations in one shot as opposed to existing solutions, which are based on costly iterative computations. This is achieved by leveraging physical properties of a mesh of optical components that behave analogously to lumped electrical components. However, virtualization is required to combat shortfalls of the accelerator hardware. Namely, (1) the infeasibility of building large photonic arrays to accommodate arbitrarily large problems and (2) underutilization brought about by mismatches in problem and accelerator mesh sizes due to future advances in manufacturing technology. In this work, we introduce an architecture and methodology for lightweight virtualization of ROC that exploits advantages borne from optical computing technology. Specifically, we apply temporal and spatial virtualization to ROC and then extend the accelerator scheduling tradespace with the introduction of spectral virtualization. Additionally, we investigate multiple resource scheduling strategies for a system-on-chip (SoC)-based PDE acceleration architecture and show that virtual configuration management offers a speedup of approximately 2×. Finally, we show that overhead from virtualization is minimal, and our experimental results show two orders of magnitude increased speed as compared to microprocessor execution while keeping errors due to virtualization under 10%.

References

[1]
Anant Agarwal and Jeffrey Lang. 2007. Course materials for 6.002 Circuits and Electronics. Retrieved from http://ocw.mit.edu/courses/electrical-engineering-and-computer-science/6-002-circuits-and-electronics-spring-2007/video-lectures/6002_l1.pdf.
[2]
Massimo Alioto, Vivek De, and Andrea Marongiu. 2018. Energy-quality scalable integrated circuits and systems: Continuing energy scaling in the twilight of moore’s law. IEEE J. Emerg. Select. Topics Circ. Syst. 8, 3 (2018).
[4]
Jeff Anderson, Engin Kayraklioglu, Hamid Reza Imani, Mario Miscuglio, Volker J. Sorger, and Tarek El-Ghazawi. 2020. Virtualizing analog mesh computers: The case of a photonic PDE solving accelerator. In Proceedings of the International Conference on Rebooting Computing (ICRC). 133–142. DOI:
[5]
Jeff Anderson, Engin Kayraklioglu, Shuai Sun, Joseph Crandall, Yousra Alkabani, Vikram Narayana, Volker Sorger, and Tarek El-Ghazawi. 2020. ROC: A reconfigurable optical computer for simulating physical processes. ACM Trans. Parallel Comput. 7, 1 (2020).
[6]
Sanghun Bang, Jeonghyun Kim, Gwanho Yoon, Takuo Tanaka, and Junsuk Rho. 2018. Recent advances in tunable and reconfigurable metamaterials. Micromachines 9, 11 (2018).
[7]
Henri Berestycki and Yves Pomeau. 2002. Nonlinear PDE’s in Condensed Matter and Reactive Flows. Springer.
[8]
Marsha J. Berger and Joseph Oliger. 1984. Adaptive mesh refinement for hyperbolic partial differential equations. J. Comput. Phys. 53, 3 (1984), 484–512.
[9]
Katherine Compton, Zhiyuan Li, James Cooley, Stephen Knol, and Scott Hauck. 2002. Configuration relocation and defragmentation for run-time reconfigurable computing. IEEE Trans. Very Large Scale Integr. Syst. 10, 3 (June2002), 209–220. DOI:
[10]
CPUID. 2020. HWMonitor: volate, temperatures and fan speed monitoring. Retrieved 2020 from https://www.cpuid.com/softwares/hwmonitor.html.
[11]
Jack Dongarra. 2017. Current Trends in High Performance Computing and Challenges for the Future. Retrieved from https://www.acm.org/binaries/content/assets/education/lc-monthly-bulletins/january2017.html.
[12]
Esam El-Araby, Vikram K. Narayana, and Tarek El-Ghazawi. 2010. Space and time sharing of reconfigurable hardware for accelerated parallel processing. In Reconfigurable Computing: Architectures, Tools and Applications. ARC 2010. Lecture Notes in Computer Science 5992 (2010). Springer. DOI:
[13]
Tarek El-Ghazawi. 2009. Virtual configuration management for efficient use of reconfigurable hardware. Patent No. US20090187733A1, Filed March 3rd, 2007, Issued July. 23rd, 2009.
[14]
Tarek El-Ghazawi, Prachya Chalermwat, and Jacqueline Le Moigne. 1997. Wavelet-based image registration on parallel computers. In Proceedings of the ACM/IEEE Conference on Supercomputing (SC’97).
[15]
Tarek El-Ghazawi, Volker J. Sorger, Shuai Sun, Abdel-Hameed A. Badawy, and Vikram K. Narayana. 2019. Reconfigurable optical computer. Patent No. US10318680B2, Filed December 5th, 2017, Issued June. 8th, 2019.
[16]
Nader Engheta. 2007. Circuits with light at nanoscales: Optical nanocircuits inspired by metamaterials. Science 317, 5845 (2007), 1698–1702. DOI:
[17]
Nader Engheta, Alessandro Salandrino, and Andrea Alù. 2005. Circuit elements at optical frequencies: Nanoinductors, nanocapacitors, and nanoresistors. Phys. Rev. Lett. 95, 9 (Aug.2005), 095504. DOI:
[18]
Rolf Enzler, Christian Plessl, and Marco Platzner. 2003. Virtualizing hardware with multi-context reconfigurable arrays. In Proceedings of the International Conference on Field Programmable Logic and Applications (FPL’03).
[19]
Suhaib A. Fahmy, Kizheppatt Vipin, and Shanker Shreejith. 2015. Virtualized FPGA accelerators for efficient cloud computing. In Proceedings of the IEEE 7th International Conference on Cloud Computing Technology and Science (ICCTS’15), Vol. 1. IEEE.
[20]
Rich Goldman. 2020. Lumerical 2020a Release Speeds Photonic Design through High Performance Computing. Retrieved 2020 from https://www.prweb.com/releases/prweb16705671.htm.
[21]
Yaliang Gui, Mario Miscuglio, Zhizhen Ma, Mohammad H. Tahersima, Shuai Sun, Rubab Amin, Hamed Dalir, and Volker J. Sorger. 2019. Towards integrated metatronics: A holistic approach on precise optical and electrical properties of indium tin oxide. Sci. Rep. 11279, 9 (Aug.2019).
[22]
Isom Herron and Michael R. Foster. 2008. Partial Differential Equations in Fluid Dynamics (1st ed.). Cambridge University Press.
[23]
Hui Kang, Michael Le, and Shu Tao. 2016. Container and microservice driven design for cloud infrastructure DevOps. In Proceedings of the IEEE International Conference on Cloud Engineering (IC2E’16). IEEE.
[24]
Engin Kayraklioglu, Jeff Anderson, Hamid Reza-Imani, Volker Sorger, and Tarek El-Ghazawi. 2020. Software stack for an analog mesh computer: The case of a nanophotonic PDE accelerator. In Proceedings of the International Conference on Computing Frontiers (CF’20). ACM.
[25]
Thor Labs. 2021. NIR Product Page. Retrieved from https://www.thorlabs.com/newgrouppage9.cfm?objectgroup_id=4737.
[26]
H. J. Lee and William Schiesser. 2003. Ordinary and Partial Differential Equation Routines in C, C++, Fortran, Java, Maple and MATLAB. CRC Press, Boca Raton, FL.
[27]
George Liebmann. 1950. Solution of partial differential equations with a resistance network analogue. Brit J. Appl. Phys. 1, 4 (1950).
[28]
Leibo Liu, Jianfeng Zhu, Zhaoshi Li, Yanan Lu, Yangdong Deng, Jie Han, Shouyi Yin, and Shaojun Wei. 2019. A survey of coarse-grained reconfigurable architecture and design: Taxonomy, challenges, and applications. Comput. Surv. 52, 6 (Oct.2019).
[29]
Zhizhen Ma, Zhuoran Li, Ke Liu, Chenran Ye, and Volker J. Sorger. 2015. Indium-tin-oxide for high-performance electro-optic modulation. Nanophotonics4 (Mar.2015).
[30]
Dimitrios Mbakoyiannis, Othon Tomoutzoglou, and George Kornaros. 2018. Energy-performance considerations for data offloading to FPGA-based accelerators over PCIe. ACM Trans. Archit. Code Optim. 15, 1 (Mar.2018). DOI:
[31]
Mario Miscuglio, Yaliang Gui, Xiaoxuan Ma, Zhizhen Ma, Shuai Sun, Tarek El Ghazawi, Tatsuo Itoh, Andrea Alu, and Volker J. Sorger. 2021. Approximate analog computing with metatronic circuits. Commun. Phys. 4, 196 (Aug.2021). Retrieved from https://www.nature.com/articles/s42005-021-00683-4.
[32]
Sanjay Moulik, Rishabh Chaudhary, Zinea Das, and Arnab Sarkar. 2020. EA-HRT: An energy-aware scheduler for heterogeneous real-time systems. In Proceedings of the 25th Asia and South Pacific Design Automation Conference (ASP-DAC). 500–505. DOI:
[33]
David Padua (Ed.). 2011. Latency Hiding. Springer US, Boston, MA, 1006–1006. DOI:
[34]
P. Palmer, A. R. Copson, and S. C. Redshaw. 1959. Investigations into the Use of an Electrical Resistance Analogue for the Solution of Certain Oscillatory-flow Problems. Reports and Memoranda 312. Aeronautical Research Council.
[35]
Robin Panda, Aaron Wood, Nathaniel McVicar, Carl Ebeling, and Scott Hauck. 2021. Extending Course-grained Reconfigurable Arrays with Multi-kernel Dataflow. Retrieved 2021 from https://people.ece.uw.edu/hauck/publications/CARLMosaic2.pdf.
[36]
Rudiger Paschotta. 2020. Coherent Beam Combining. Retrieved 2020 from https://www.rp-photonics.com/coherent_beam_combining.html.
[37]
Luis Pinuel, I. Martin, and Francisco Tirado. 1998. A special-purpose parallel computer for solving partial differential equations. In Proceedings of the 6th Euromicro Workshop on Parallel and Distributed Processing (PDP’98). IEEE.
[38]
Gerald J. Popek and Robert P. Goldberg. 1974. Formal requirements for virtualizable third generation architectures. Commun. ACM 17, 7 (July1974). DOI:
[39]
Alfio Quarteroni. 2017. A Brief Survey of Partial Differential Equations. Springer.
[40]
Jan Rabaey. 2020. The Spice Page. Retrieved 2020 from http://bwrcs.eecs.berkeley.edu/Classes/IcBook/SPICE/f.
[41]
Reza Ramezani. 2021. Dynamic scheduling of task graphs in multi-FPGA systems using critical path. J. Supercomput. 77 (2021), 597–618. DOI:
[42]
J. Ramirez-Angulo and Mark R. DeYong. 2000. Digitally-configurable analog VLSI chip and method for real-time solution of partial differential equations. Patent No. US6141676, Filed July 22, 1998, Issued October 31, 2000.
[43]
Carlos Reano and Federico Silla. 2017. A comparative performance analysis of remote GPU virtualization over three generations of GPUs. In Proceedings of the 46th International Conference on Parallel Processing Workshops (ICPPW’17). IEEE.
[44]
Isaac Richter, Kamil Pas, Xiaochen Guo, Ravi Patel, Ji Liu, Engin Ipek, and Eby G. Friedman. 2015. Memristive accelerator for extreme scale linear solvers. In Proceedings of the Government Microcircuit Applications and Critical Technology Conference (GOMACTECH’15).
[45]
Arun F. Rodrigues. 2010. Using Reconfigurable Functional Units in Conventional Microprocessors. Sandia Report SAND2010-8063. Sandia National Laboratories.
[46]
Wolfgang Rueckner, Rob Hart, Yelena Synkova, and Joe Peidle. 2019. Summary of Rules for Error Propagation. Retrieved from https://sites.fas.harvard.edu/scphys/nsta/error_propagation.pdf.
[47]
Stephen Scalara. 2001. Context Switching Reconfigurable Computing. Final Technical Report AFRL-IF-RS-TR-2001-161. Sanders.
[48]
I. J. Schoenberg. 1973. Cardinal Spline Interpolation. Society for Industrial and Applied Mathematics.
[49]
A. P. S. Selvadurai. 2000. Partial Differential Equations in Mechanics 1: Fundamentals, Laplace’s Equation, Diffusion Equation, Wave Equation. Springer.
[50]
Christian Siemers. 2000. Reconfigurable computing between classifications and metrics—The approach of space/time-scheduling. In Proceedings of the 10th International Workshop on Field-Programmable Logic and Applications: The Roadmap to Reconfigurable Computing (FPL’00), Vol. 1. ACM.
[51]
Volker J. Sorger, Norberto D. Lanzillotti-Kimura, Ren-Min Ma, and Xiang Zhang. 2012. Ultra-compact silicon nanophotonic modulator with broadband response. Nanophotonics1 (May2012).
[52]
Nandagopalan Auviur Srinivasa. 2006. Adaptive Mesh Refinement for a Finite Difference Scheme Using a Quadtree Decomposition Approach. Masters of Science Thesis. Texas Agricultural and Mechanical University.
[53]
Hongyang Sun, Redouane Elghazi, Ana Gainaru, Guillaume Aupy, and Padma Raghavan. 2018. Scheduling parallel tasks under multiple resources: List scheduling vs. pack scheduling. In Proceedings of the IEEE International Parallel and Distributed Processing Symposium (IPDPS). 194–203. DOI:
[54]
Shuai Sun, Mario Miscuglio, Xiaoxuan Ma, Zhizhen Ma, Chen Shen, Engin Kayraklioglu, Jeffery Anderson, Tarek El Ghazawi, and Volker J. Sorger. 2021. Induced homomorphism: Kirchhoff’s law in photonics. Nanophotonics 10, 6 (2021).
[55]
Mohamed Taher and Tarek El-Ghazawi. 2009. Virtual configuration management: A technique for partial runtime reconfiguration. IEEE Trans. Comput. 58, 10 (Oct.2009).
[56]
Alexander N. Tait, John Chang, Bhavin J. Shastri, Mitchell A. Nahmias, and Paul R. Prucnal. 2015. Demonstration of WDM weighted addition for principal component analysis. Opt. Exp. 23, 10 (2015), 12758–12765. DOI:
[57]
Chen Tang, Qinghua Mi, Haiqing Yan, Jianquan Yang, and Shaowei Liu. 2013. PDE(ODE)-based image processing methods for optical interferometry fringe. Proc. SPIE - Int. Societ. Optic. Eng. 8769, 87692D. DOI:
[58]
Jeremiah P. Turpin, Jeremy A. Bossard, Kenneth L. Morgan, Douglas H. Werner, and Pingjuan L. Werner. 2014. Reconfigurable and tunable metamaterials: A review of the theory and applications. Int. J. Anten. Propag. (2014). DOI:
[59]
Peng-Chhay Ung. 2020. COMSOL Blog: Heat Transfer in Deformed Solids. Retrieved 2020 from https://www.comsol.com/blogs/heat-transfer-deformed-solids/.
[60]
Pranav Vaidya and Jaehwan John Lee. 2011. A novel multicontext coarse-grained reconfigurable architecture (CGRA) for accelerating column-oriented databases. ACM Trans. Reconfig. Technol. Syst. 4, 2 (May2011). DOI:
[61]
Anuj Vaishnav, Khoa Dang Pham, and Dirk Koch. 2018. A survey on FPGA virtualization. In Proceedings of the 28th International Conference on Field Programmable Logic and Applications (FPL’18). IEEE.
[62]
Pauli Virtanen, Ralf Gommers, Travis E. Oliphant, Matt Haberland, Tyler Reddy, David Cournapeau, Evgeni Burovski, Pearu Peterson, Warren Weckesser, Jonathan Bright, Stéfan J. van der Walt, Matthew Brett, Joshua Wilson, K. Jarrod Millman, Nikolay Mayorov, Andrew R. J. Nelson, Eric Jones, Robert Kern, Eric Larson, C. J. Carey, İlhan Polat, Yu Feng, Eric W. Moore, Jake Vand erPlas, Denis Laxalde, Josef Perktold, Robert Cimrman, Ian Henriksen, E. A. Quintero, Charles R. Harris, Anne M. Archibald, Antônio H. Ribeiro, Fabian Pedregosa, Paul van Mulbregt, and SciPy 1. 0 Contributors. 2019. SciPy 1.0–Fundamental algorithms for scientific computing in Python. arXiv e-prints, arXiv:1907.10121 (Jul2019).
[63]
Zhe Wang, Qi Tang, Biao Guo, Ji-Bo Wei, and Ling Wang. 2020. Resource partitioning and application scheduling with module merging on dynamically and partially reconfigurable FPGAs. MDPI Electron. 9, 1461 (2020). DOI:
[64]
Hezhu Yin. 2011. Application of Resistivity-Tool-Response Modeling For Formation Evaluation: AAPG Archie Series. Vol. 2. American Association of Petroleum Geologists.
[65]
Y. Zhao. 2008. Lattice Boltzmann based PDE solver on the GPU. Vis. Comput. 24, 5 (2008).
[66]
Michael Zhu and Suyog Gupta. 2017. To prune, or not to prune: Exploring the efficacy of pruning for model compression. arXiv:1710.01878 (112017).

Cited By

View all
  • (2024)Research on Electro-Optic Hybrid Multidigit Digital Multiplier Based on Surface Plasmon Polariton TechnologyPhotonics10.3390/photonics1109078511:9(785)Online publication date: 23-Aug-2024
  • (2024)Solving partial differential equations with waveguide-based metatronic networksAdvanced Photonics Nexus10.1117/1.APN.3.5.0560073:05Online publication date: 1-Sep-2024
  • (2024)Protecting the Intellectual Property of Binary Deep Neural Networks With Efficient Spintronic-Based Hardware ObfuscationIEEE Transactions on Circuits and Systems I: Regular Papers10.1109/TCSI.2024.339792571:7(3146-3156)Online publication date: Jul-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Embedded Computing Systems
ACM Transactions on Embedded Computing Systems  Volume 22, Issue 2
March 2023
560 pages
ISSN:1539-9087
EISSN:1558-3465
DOI:10.1145/3572826
  • Editor:
  • Tulika Mitra
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Journal Family

Publication History

Published: 24 January 2023
Online AM: 22 June 2022
Accepted: 05 June 2022
Revised: 13 April 2022
Received: 12 October 2021
Published in TECS Volume 22, Issue 2

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Virtualization
  2. domain-specific accelerators
  3. parallel processing
  4. scientific computing accelerators
  5. scheduling

Qualifiers

  • Research-article
  • Refereed

Funding Sources

  • NSF RAISE
  • NSF EPMD-ElectroPhotonic Mag Devices
  • CSR-Computer Systems Research, Networking Technology and Systems

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)189
  • Downloads (Last 6 weeks)6
Reflects downloads up to 01 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Research on Electro-Optic Hybrid Multidigit Digital Multiplier Based on Surface Plasmon Polariton TechnologyPhotonics10.3390/photonics1109078511:9(785)Online publication date: 23-Aug-2024
  • (2024)Solving partial differential equations with waveguide-based metatronic networksAdvanced Photonics Nexus10.1117/1.APN.3.5.0560073:05Online publication date: 1-Sep-2024
  • (2024)Protecting the Intellectual Property of Binary Deep Neural Networks With Efficient Spintronic-Based Hardware ObfuscationIEEE Transactions on Circuits and Systems I: Regular Papers10.1109/TCSI.2024.339792571:7(3146-3156)Online publication date: Jul-2024
  • (2024)Radiation-Immune Spintronic Binary Synapse and Neuron for Process-in-Memory ArchitectureIEEE Magnetics Letters10.1109/LMAG.2024.335681515(1-5)Online publication date: 2024
  • (2024)Multi-Retention STT-MRAM Architectures for IoT: Evaluating the Impact of Retention Levels and Memory Mapping SchemesIEEE Access10.1109/ACCESS.2024.336607412(26562-26580)Online publication date: 2024
  • (2024)Computer chipsMechanism Design, Behavioral Science and Artificial Intelligence in International Relations10.1016/B978-0-443-23982-3.00013-0(181-193)Online publication date: 2024
  • (2024)On the Layout-Oriented Investigation of Power Attack Hardness of Spintronic-Based Logic CircuitsCircuits, Systems, and Signal Processing10.1007/s00034-024-02603-743:5(3212-3237)Online publication date: 31-Jan-2024
  • (2023)On the Design of Power Attack Immune Spintronic Associative MemorySPIN10.1142/S201032472350032714:03Online publication date: 29-Dec-2023
  • (2023)Exciton strain mapping in 2D TMDCs for sensing and photodetectionLow-Dimensional Materials and Devices 202310.1117/12.2682123(38)Online publication date: 5-Oct-2023
  • (2023)Demultiplexing OAM beams via Fourier optical convolutional neural networkLaser Beam Shaping XXIII10.1117/12.2682108(8)Online publication date: 26-Sep-2023
  • Show More Cited By

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Full Text

View this article in Full Text.

Full Text

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media