skip to main content
10.1145/3295500.3356145acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
research-article

Bandwidth steering in HPC using silicon nanophotonics

Published: 17 November 2019 Publication History

Abstract

As bytes-per-FLOP ratios continue to decline, communication is becoming a bottleneck for performance scaling. This paper describes bandwidth steering in HPC using emerging reconfigurable silicon photonic switches. We demonstrate that placing photonics in the lower layers of a hierarchical topology efficiently changes the connectivity and consequently allows operators to recover from system fragmentation that is otherwise hard to mitigate using common task placement strategies. Bandwidth steering enables efficient utilization of the higher layers of the topology and reduces cost with no performance penalties. In our simulations with a few thousand network endpoints, bandwidth steering reduces static power consumption per unit throughput by 36% and dynamic power consumption by 14% compared to a reference fat tree topology. Such improvements magnify as we taper the bandwidth of the upper network layer. In our hardware testbed, bandwidth steering improves total application execution time by 69%, unaffected by bandwidth tapering.

References

[1]
[n.d.]. Characterization of the DOE Mini-apps. https://portal.nersc.gov/project/CAL/doe-miniapps.htm. Accessed: 2019-02-16.
[2]
[n.d.]. GTC. https://www.nersc.gov/users/computational-systems/cori/nersc-8-procurement/trinity-nersc-8-rfp/nersc-8-trinity-benchmarks/gtc/. (Accessed on 04/02/2019).
[3]
[n.d.]. MPICH | High-Performance Portable MPI. https://www.mpich.org/. (Accessed on 04/02/2019).
[4]
2015. Mellanox 1U EDR 100Gb/s InfiniBand Switch Systems Hardware User Manual Models: SB7700/SB7790. Technical Report. http://www.mellanox.com/related-docs/user_manuals/1U_HW_UM_SB77X0.pdf
[5]
2018. 100Gb/s QSFP28 MMF Active Optical Cables. Technical Report. https://www.mellanox.com/related-docs/prod_cables/PB_MFA1A00-Cxxx_100GbE_QSFP28_MMF_AOC.pdf
[6]
2018. The Top500 HPC list. https://www.top500.org/green500/lists/2018/11/
[7]
A. H. Abdel-Gawad, M. Thottethodi, and A. Bhatele. 2014. RAHTM: Routing Algorithm Aware Hierarchical Task Mapping. In SC '14: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. 325--335.
[8]
Helgi Adalsteinsson, Scott Cranford, David A. Evensky, Joseph P. Kenny, Jackson Mayo, Ali Pinar, and Curtis L. Janssen. 2010. A Simulator for Large-Scale Parallel Computer Architectures. Int. J. Distrib. Syst. Technol. 1, 2 (April 2010), 57--73.
[9]
M. Adda and A. Peratikou. 2017. Routing and Fault Tolerance in Z-Fat Tree. IEEE Transactions on Parallel and Distributed Systems 28, 8 (Aug 2017), 2373--2386.
[10]
Jung Ho Ahn, Nathan Binkert, Al Davis, Moray McLaren, and Robert S. Schreiber. 2009. HyperX: Topology, Routing, and Packaging of Efficient Large-scale Networks. In Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis (SC '09). Article 41, 11 pages.
[11]
Mohammad Al-Fares, Alexander Loukissas, and Amin Vahdat. 2008. A Scalable, Commodity Data Center Network Architecture. In Proceedings of the ACM SIGCOMM 2008 Conference on Data Communication (SIGCOMM '08). ACM, 63--74.
[12]
George Almási, Charles Archer, José G. Castaños, C. Chris Erway, Philip Heidelberger, Xavier Martorell, José E. Moreira, Kurt Pinnow, Joe Ratterman, Nils Smeds, Burkhard Steinmacher-burow, William Gropp, and Brian Toonen. 2004. Implementing MPI on the BlueGene/L Supercomputer. In Euro-Par 2004 Parallel Processing, Marco Danelutto, Marco Vanneschi, and Domenico Laforenza (Eds.). Springer Berlin Heidelberg, Berlin, Heidelberg, 833--845.
[13]
Xiang Meng Yanir London Jigesh Patel Madeleine Glick Keren Bergman Anthony Rizzo, Liang Yuan Dai. 2019. Ultra-low power consumption silicon photonic link design analysis in the AIM PDK.
[14]
A. Azzouni and G. Pujolle. 2018. NeuTM: A neural network-based framework for traffic matrix prediction in SDN. In NOMS 2018 - 2018 IEEE/IFIP Network Operations and Management Symposium. 1--5.
[15]
P. Balaji, S. Bhagvat, D. K. Panda, R. Thakur, and W. Gropp. 2007. Advanced Flow-control Mechanisms for the Sockets Direct Protocol over InfiniBand. In 2007 International Conference on Parallel Processing (ICPP 2007). 73--73.
[16]
K. J. Barker, A. Benner, R. Hoare, A. Hoisie, A. K. Jones, D. K. Kerbyson, D. Li, R. Melhem, R. Rajamony, E. Schenfeld, S. Shao, C. Stunkel, and P. Walker. 2005. On the Feasibility of Optical Circuit Switching for High Performance Computing Systems. In SC '05: Proceedings of the 2005 ACM/IEEE Conference on Supercomputing. 16--16.
[17]
K. Bergman. 2018. Empowering Flexible and Scalable High Performance Architectures with Embedded Photonics. In 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS), Vol. 00. 378.
[18]
G. Bhanot, A. Gara, P. Heidelberger, E. Lawless, J.C. Sexton, and R. Walkup. 2005. Optimizing task layout on the Blue Gene/L supercomputer. IBM Journal on Research and Development 49 (March/May 2005).
[19]
Andrea Bianco, Paolo Giaccone, and Marco Ricca. 2016. Scheduling Traffic for Maximum Switch Lifetime in Optical Data Center Fabrics. Comput. Netw. 105, C (Aug. 2016), 75--88.
[20]
Wim Bogaerts, Peter De Heyn, Thomas Van Vaerenbergh, Katrien De Vos, Shankar Kumar Selvaraja, Tom Claes, Pieter Dumon, Peter Bienstman, Dries Van Thourhout, and Roel Baets. 2012. Silicon microring resonators. Laser & Photonics Reviews 6, 1 (2012), 47--73.
[21]
C. Camarero, C. Martinez, and R. Beivide. 2017. Random Folded Clos Topologies for Datacenter Networks. In 2017 IEEE International Symposium on High Performance Computer Architecture (HPCA). 193--204.
[22]
C. Camarero, C. Martinez, and R. Beivide. 2018. On Random Wiring in Practicable Folded Clos Networks for Modern Datacenters. IEEE Transactions on Parallel and Distributed Systems 29, 8 (Aug 2018), 1780--1793.
[23]
Andromachi Chatzieleftheriou, Sergey Legtchenko, Hugh Williams, and Antony I. T. Rowstron. 2018. Larry: Practical Network Reconfigurability in the Data Center. In NSDI.
[24]
Qixiang Cheng, Meisam Bahadori, Madeleine Glick, Sébastien Rumley, and Keren Bergman. 2018. Recent advances in optical technologies for data centers: a review. Optica 5, 11 (2018), 1354--1370.
[25]
Qixiang Cheng, Meisam Bahadori, Madeleine Glick, Sébastien Rumley, and Keren Bergman. 2018. Recent advances in optical technologies for data centers: a review. Optica 5, 11 (Nov 2018), 1354--1370.
[26]
Qixiang Cheng, Liang Yuan Dai, Nathan C. Abrams, Yu-Han Hung, Padraic E. Morrissey, Madeleine Glick, Peter O'Brien, and Keren Bergman. 2019. Ultralowcrosstalk, strictly non-blocking microring-based optical switch. Photon. Res. 7, 2 (Feb 2019), 155--161.
[27]
C. Clos. 1953. A study of non-blocking switching networks. The Bell System Technical Journal 32, 2 (March 1953), 406--424.
[28]
Jeffrey Dean and Luiz André Barroso. 2013. The tail at scale. Commun. ACM 56, 2 (2013), 74--80.
[29]
T. DeFanti, M. Brown, J. Leigh, O. Yu, E. He, J. Mambretti, D. Lillethun, and J. Weinberger. 2003. Optical Switching Middleware for the OptIPuter. IEICE Trans. FUNDAMENTALS/COMMUN./ELECTRON./INF. & SYST (Feb. 2003).
[30]
Wolfgang Denzel, Jian Li, Peter Walker, and Yuho Jin. 2010. A Framework for End-to-End Simulation of High-performance Computing Systems. Simulation 86 (05 2010), 331--350.
[31]
Po Dong, Robert Gatdula, Kwangwoong Kim, Jeffrey H. Sinsky, Argishti Melikyan, Young-Kai Chen, Guilhem de Valicourt, and Jeffrey Lee. 2017. Simultaneous wavelength locking of microring modulator array with a single monitoring signal. Opt. Express 25, 14 (Jul 2017), 16040--16046.
[32]
Hans Eberle and Nils Gura. 2002. Separated High-bandwidth and Low-latency Communication in the Cluster Interconnect Clint. In Proceedings of the IEEE Conference on Supercomputing.
[33]
Jack Edmonds. 1965. Paths, Trees and Flowers. Canad. J. Math 17 (1965), 449--467.
[34]
Greg Faanes, Abdulla Bataineh, Duncan Roweth, Tom Court, Edwin Froese, Bob Alverson, Tim Johnson, Joe Kopnick, Mike Higgins, and James Reinhard. 2012. Cray Cascade: A Scalable HPC System Based on a Dragonfly Network. In Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis (SC '12). IEEE Computer Society Press, Article 103, 9 pages.
[35]
P. Festa. 2014. A brief introduction to exact, approximation, and heuristic algorithms for solving hard combinatorial optimization problems. In 2014 16th International Conference on Transparent Optical Networks (ICTON). 1--20.
[36]
Klaus-Tycho Foerster, Manya Ghobadi, and Stefan Schmid. 2018. Characterizing the Algorithmic Complexity of Reconfigurable Data Center Architectures. In Proceedings of the 2018 Symposium on Architectures for Networking and Communications Systems (ANCS '18). ACM, New York, NY, USA, 89--96.
[37]
Message P Forum. 1994. MPI: A Message-Passing Interface Standard. Technical Report. Knoxville, TN, USA.
[38]
Yiannis Georgiou and Matthieu Hautreux. 2013. Evaluating Scalability and Efficiency of the Resource and Job Management System on Large HPC Clusters. In Job Scheduling Strategies for Parallel Processing, Walfredo Cirne, Narayan Desai, Eitan Frachtenberg, and Uwe Schwiegelshohn (Eds.). Springer Berlin Heidelberg, Berlin, Heidelberg, 134--156.
[39]
Albert Greenberg, James R. Hamilton, Navendu Jain, Srikanth Kandula, Changhoon Kim, Parantap Lahiri, David A. Maltz, Parveen Patel, and Sudipta Sengupta. 2009. VL2: A Scalable and Flexible Data Center Network. SIGCOMM Comput. Commun. Rev. 39, 4 (Aug. 2009), 51--62.
[40]
Navid Hamedazimi, Zafar Qazi, Himanshu Gupta, Vyas Sekar, Samir R. Das, Jon P. Longtin, Himanshu Shah, and Ashish Tanwer. 2014. FireFly: A Reconfigurable Wireless Data Center Fabric Using Free-space Optics. In Proceedings of the 2014 ACM Conference on SIGCOMM (SIGCOMM '14). 319--330.
[41]
Vipul Harsh, Sangeetha Abdu Jyothi, Inderdeep Singh, and Philip Brighten Godfrey. 2018. Expander Datacenters: From Theory to Practice. CoRR abs/1811.00212 (2018). arXiv:1811.00212
[42]
Torsten Hoefler, Rolf Rabenseifner, Hubert Ritzdorf, Bronis R. de Supinski, Rajeev Thakur, and Jesper Larsson Träff. 2011. The Scalable Process Topology Interface of MPI 2.2. Concurr. Comput. : Pract. Exper. 23, 4 (March 2011), 293--310.
[43]
T. Hoefler, T. Schneider, and A. Lumsdaine. 2008. Multistage switches are not crossbars: Effects of static routing in high-performance networks. In 2008 IEEE International Conference on Cluster Computing. 116--125.
[44]
Chintan Jain and Deepak Garg. 2012. Improved Edmond Karps Algorithm for Network Flow Problem. International Journal of Computer Applications 37 (01 2012).
[45]
Nikhil Jain, Abhinav Bhatele, Louis H. Howell, David Böhme, Ian Karlin, Edgar A. León, Misbah Mubarak, Noah Wolfe, Todd Gamblin, and Matthew L. Leininger. 2017. Predicting the Performance Impact of Different Fat-tree Configurations. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC '17). Article 50, 13 pages.
[46]
E. Jeannot, G. Mercier, and F. Tessier. 2014. Process Placement in Multicore Clusters:Algorithmic Issues and Practical Techniques. IEEE Transactions on Parallel and Distributed Systems 25, 4 (April 2014), 993--1002.
[47]
Nan Jiang, D. U. Becker, G. Michelogiannakis, J. Balfour, B. Towles, D. E. Shaw, J. Kim, and W. J. Dally. 2013. A detailed and flexible cycle-accurate Network-on-Chip simulator. In 2013 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS). 86--96.
[48]
Nan Jiang, John Kim, and William J. Dally. 2009. Indirect Adaptive Routing on Large Scale Interconnection Networks. In Proceedings of the 36th Annual International Symposium on Computer Architecture (ISCA '09). ACM, New York, NY, USA, 220--231.
[49]
W. Jiang, J. Qi, J. X. Yu, J. Huang, and R. Zhang. 2018. HyperX: A Scalable Hypergraph Framework. IEEE Transactions on Knowledge and Data Engineering (2018), 1--1.
[50]
Norman P. Jouppi, Cliff Young, Nishant Patil, David Patterson, Gaurav Agrawal, Raminder Bajwa, Sarah Bates, Suresh Bhatia, Nan Boden, Al Borchers, Rick Boyle, Pierre-luc Cantin, Clifford Chao, Chris Clark, Jeremy Coriell, Mike Daley, Matt Dau, Jeffrey Dean, Ben Gelb, Tara Vazir Ghaemmaghami, Rajendra Gottipati, William Gulland, Robert Hagmann, C. Richard Ho, Doug Hogberg, John Hu, Robert Hundt, Dan Hurt, Julian Ibarz, Aaron Jaffey, Alek Jaworski, Alexander Kaplan, Harshit Khaitan, Daniel Killebrew, Andy Koch, Naveen Kumar, Steve Lacy, James Laudon, James Law, Diemthu Le, Chris Leary, Zhuyuan Liu, Kyle Lucke, Alan Lundin, Gordon MacKean, Adriana Maggiore, Maire Mahony, Kieran Miller, Rahul Nagarajan, Ravi Narayanaswami, Ray Ni, Kathy Nix, Thomas Norrie, Mark Omernick, Narayana Penukonda, Andy Phelps, Jonathan Ross, Matt Ross, Amir Salek, Emad Samadiani, Chris Severn, Gregory Sizikov, Matthew Snelham, Jed Souter, Dan Steinberg, Andy Swing, Mercedes Tan, Gregory Thorson, Bo Tian, Horia Toma, Erick Tuttle, Vijay Vasudevan, Richard Walter, Walter Wang, Eric Wilcox, and Doe Hyun Yoon. 2017. In-Datacenter Performance Analysis of a Tensor Processing Unit. In Proceedings of the 44th Annual International Symposium on Computer Architecture (ISCA '17). ACM, New York, NY, USA, 1--12.
[51]
Shoaib Kamil, Leonid Oliker, Ali Pinar, and John Shalf. 2010. Communication Requirements and Interconnect Optimization for High-End Scientific Applications. IEEE Trans. Parallel Distrib. Syst. 21, 2 (2010), 188--202.
[52]
S. Kamil, A. Pinar, D. Gunter, M. Lijewski, L. Oliker, and J. Shalf. 2007. Reconfigurable Hybrid Interconnection for Static and Dynamic Scientific Applications. In Proceedings of the ACM International Conference on Computing Frontiers.
[53]
J. Kim, W. J. Dally, S. Scott, and D. Abts. 2008. Technology-Driven, Highly-Scalable Dragonfly Topology. In 2008 International Symposium on Computer Architecture. 77--88.
[54]
A. K. Kodi and A. Louri. 2011. Energy-Efficient and Bandwidth-Reconfigurable Photonic Networks for High-Performance Computing (HPC) Systems. IEEE Journal of Selected Topics in Quantum Electronics 17, 2 (March 2011), 384--395.
[55]
C. Lea. 2015. A Scalable AWGR-Based Optical Switch. Journal of Lightwave Technology 33, 22 (Nov 2015), 4612--4621.
[56]
Benjamin G. Lee. 2018. Photonic switching platform for datacenters enabling rapid network reconfiguration., 10560 - 10560 - 5 pages.
[57]
Jacob S. Levy, Alexander Gondarenko, Mark A. Foster, Amy C. Turner-Foster, Alexander L. Gaeta, and Michal Lipson. 2009. CMOS-compatible multiple-wavelength oscillator for on-chip optical interconnects. Nature Photonics 4 (20 Dec 2009), 37 EP -.
[58]
E. A. LeÃşn, I. Karlin, A. Bhatele, S. H. Langer, C. Chambreau, L. H. Howell, T. D'Hooge, and M. L. Leininger. 2016. Characterizing Parallel Scientific Applications on Commodity Clusters: An Empirical Study of a Tapered Fat-Tree. In SC '16: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. 909--920.
[59]
Y. Li, H. Liu, W. Yang, D. Hu, and W. Xu. 2016. Inter-data-center network traffic prediction with elephant flows. In NOMS 2016 - 2016 IEEE/IFIP Network Operations and Management Symposium. 206--213.
[60]
He Liu, Matthew K. Mukerjee, Conglong Li, Nicolas Feltman, George Papen, Stefan Savage, Srinivasan Seshan, Geoffrey M. Voelker, David G. Andersen, Michael Kaminsky, George Porter, and Alex C. Snoeren. 2015. Scheduling Techniques for Hybrid Circuit/Packet Networks. In Proceedings of the 11th ACM Conference on Emerging Networking Experiments and Technologies (CoNEXT '15). Article 41, 13 pages.
[61]
Robert Lucas, James Ang, Keren Bergman, Shekhar Borkar, William Carlson, Laura Carrington, George Chiu, Robert Colwell, William Dally, Jack Dongarra, Al Geist, Rud Haring, Jeffrey Hittinger, Adolfy Hoisie, Dean Micron Klein, Peter Kogge, Richard Lethin, Vivek Sarkar, Robert Schreiber, John Shalf, Thomas Sterling, Rick Stevens, Jon Bashor, Ron Brightwell, Paul Coteus, Erik Debenedictus, Jon Hiller, K. H. Kim, Harper Langston, Richard Micron Murphy, Clayton Webster, Stefan Wild, Gary Grider, Rob Ross, Sven Leyffer, and James Laros III. 2014. DOE Advanced Scientific Computing Advisory Subcommittee (ASCAC) Report: Top Ten Exascale Research Challenges. (2 2014).
[62]
Lailong Luo, Deke Guo, Wenxin Li, Tian Zhang, Junjie Xie, and Xiaolei Zhou. 2015. Compound graph based hybrid data center topologies. Frontiers of Computer Science 9, 6 (01 Dec 2015), 860--874.
[63]
William M. Mellette, Rob McGuinness, Arjun Roy, Alex Forencich, George Papen, Alex C. Snoeren, and George Porter. 2017. RotorNet: A Scalable, Low-complexity, Optical Datacenter Network. In Proceedings of the Conference of the ACM Special Interest Group on Data Communication (SIGCOMM '17). 267--280.
[64]
G. Michelogiannakis, K. Z. Ibrahim, J. Shalf, J. J. Wilke, S. Knight, and J. P. Kenny. 2017. APHiD: Hierarchical Task Placement to Enable a Tapered Fat Tree Topology for Lower Power and Cost in HPC Networks. In 2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID). 228--237.
[65]
S. H. Mirsadeghi, J. L. TrÃd'ff, P. Balaji, and A. Afsahi. 2017. Exploiting Common Neighborhoods to Optimize MPI Neighborhood Collectives. In 2017 IEEE 24th International Conference on High Performance Computing (HiPC). 348--357.
[66]
M. A. Mollah, P. Faizian, M. S. Rahman, X. Yuan, S. Pakin, and M. Lang. 2018. A Comparative Study of Topology Design Approaches for HPC Interconnects. In 2018 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID). 392--401.
[67]
Giovanni Neglia, Vincenzo Falletta, and Giuseppe Bianchi. 2004. Is TCP Packet Reordering Always Harmful?. In Proceedings of the The IEEE Computer Society's 12th Annual International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunications Systems (MASCOTS '04). 87--94.
[68]
L. Nie, D. Jiang, L. Guo, S. Yu, and H. Song. 2016. Traffic Matrix Prediction and Estimation Based on Deep Learning for Data Center Networks. In 2016 IEEE Globecom Workshops (GC Wkshps). 1--6.
[69]
K. Padmaraju, D. F. Logan, T. Shiraishi, J. J. Ackert, A. P. Knights, and K. Bergman. 2014. Wavelength Locking and Thermally Stabilizing Microring Resonators Using Dithering Signals. Journal of Lightwave Technology 32, 3 (Feb 2014), 505--512.
[70]
I. Plander and M. Stepanovsky. 2017. MEMS technology in optical switching. In 2017 IEEE 14th International Scientific Conference on Informatics. 299--305.
[71]
Rastin Pries, Michael Jarschel, Daniel Schlosser, Michael Klopf, and Phuoc Tran-Gia. 2012. Power Consumption Analysis of Data Center Architectures. In Green Communications and Networking, Joel J. P. C. Rodrigues, Liang Zhou, Min Chen, and Aravind Kailas (Eds.). Springer Berlin Heidelberg, Berlin, Heidelberg, 114--124.
[72]
Francesco Redaelli, Marco D. Santambrogio, and Donatella Sciuto. 2008. Task Scheduling with Configuration Prefetching and Anti-fragmentation Techniques on Dynamically Reconfigurable Systems. In Proceedings of the Conference on Design, Automation and Test in Europe (DATE '08). 519--522.
[73]
G. Rodriguez, C. Minkenberg, R. Beivide, R. P. Luijten, J. Labarta, and M. Valero. 2009. Oblivious routing schemes in extended generalized Fat Tree networks. In 2009 IEEE International Conference on Cluster Computing and Workshops. 1--8.
[74]
Arjun Roy, Hongyi Zeng, Jasmeet Bagga, George Porter, and Alex C. Snoeren. 2015. Inside the Social Network's (Datacenter) Network. In Proceedings of the 2015 ACM Conference on Special Interest Group on Data Communication (SIGCOMM '15). ACM, 123--137.
[75]
SAMTEC. 2019. PCIe Optical Half Cables Application Note. Technical Report. http://suddendocs.samtec.com/notesandwhitepapers/pcie_half_cable_app_note.pdf
[76]
V. Sasikala and K. Chitra. 2018. All optical switching and associated technologies: a review. Journal of Optics 47, 3 (01 Sep 2018), 307--317.
[77]
S. Scott, D. Abts, J. Kim, and W. J. Dally. 2006. The BlackWidow High-Radix Clos Network. In 33rd International Symposium on Computer Architecture (ISCA'06). 16--28.
[78]
John Shalf, Sudip Dosanjh, and John Morrison. 2011. Exascale Computing Technology Challenges. In Proceedings of the 9th International Conference on High Performance Computing for Computational Science (VECPAR'10). Springer-Verlag, Berlin, Heidelberg, 1--25.
[79]
J. Shalf, S. Kamil, L. Oliker, and D. Skinner. 2005. Analyzing Ultra-Scale Application Communication Requirements for a Reconfigurable Hybrid Interconnect. In Proc. SC2005: High performance computing, networking, and storage conference.
[80]
Y. Shen, A. Gazman, Z. Zhu, M. Y. The, M. Hattink, S. Rumley, P. Samadi, and K. Bergman. 2018. Autonomous Dynamic Bandwidth Steering with Silicon Photonic-Based Wavelength and Spatial Switching for Datacom Networks. In 2018 Optical Fiber Communications Conference and Exposition (OFC). 1--3.
[81]
Yiwen Shen, Storm Madeleine Glick, and Keren Bergman. 2019. Silicon photonic-enabled bandwidth steering for resource-efficient high performance computing. In Proceedings Volume 10946, Metro and Data Center Optical Networks and Short-Reach Links II (SPIE), Vol. 10946.
[82]
Yiwen Shen, Maarten H. N. Hattink, Payman Samadi, Qixiang Cheng, Ziyiz Hu, Alexander Gazman, and Keren Bergman. 2018. Software-defined networking control plane for seamless integration of multiple silicon photonic switches in Datacom networks. Opt. Express 26, 8 (Apr 2018), 10914--10929.
[83]
Arjun Singh, Joon Ong, Amit Agarwal, Glen Anderson, Ashby Armistead, Roy Bannon, Seb Boving, Gaurav Desai, Bob Felderman, Paulie Germano, Anand Kanagala, Jeff Provost, Jason Simmons, Eiichi Tanda, Jim Wanderer, Urs Holzle, Stephen Stuart, and Amin Vahdat. 2015. Jupiter Rising: A Decade of Clos Topologies and Centralized Control in Googleś Datacenter Network. In Sigcomm '15.
[84]
Ankit Singla, P. Brighten Godfrey, and Alexandra Kolla. 2014. High Throughput Data Center Topology Design. In Proceedings of the 11th USENIX Conference on Networked Systems Design and Implementation (NSDI'14). USENIX Association, 29--41.
[85]
Ankit Singla, Chi-Yao Hong, Lucian Popa, and P. Brighten Godfrey. 2012. Jellyfish: Networking Data Centers Randomly. In Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation (NSDI'12). USENIX Association, 17--17.
[86]
C. Sun, M. Wade, M. Georgas, S. Lin, L. Alloatti, B. Moss, R. Kumar, A. H. Atabaki, F. Pavanello, J. M. Shainline, J. S. Orcutt, R. J. Ram, M. PopoviÄĞ, and V. StojanoviÄĞ. 2016. A 45 nm CMOS-SOI Monolithic Photonics Platform With Bit-Statistics-Based Resonant Microring Thermal Tuning. IEEE Journal of Solid-State Circuits 51, 4 (April 2016), 893--907.
[87]
Mohammad Mahdi Tajiki, Behzad Akbari, and Nader Mokari. 2017. Optimal Qos-aware Network Reconfiguration in Software Defined Cloud Data Centers. Comput. Netw. 120, C (June 2017), 71--86.
[88]
K. Tang, X. He, S. Gupta, S. S. Vazhkudai, and D. Tiwari. 2018. Exploring the Optimal Platform Configuration for Power-Constrained HPC Workflows. In 2018 27th International Conference on Computer Communication and Networks (ICCCN). 1--9.
[89]
Y. Tang, H. Guo, and J. Wu. 2018. OCBridge: An Efficient Topology Reconfiguration Strategy in Optical Data Center Network. In 2018 Optical Fiber Communications Conference and Exposition (OFC). 1--3.
[90]
Y. Tarutani, Y. Ohsita, and M. Murata. 2014. Virtual network reconfiguration for reducing energy consumption in optical data centers. IEEE/OSA Journal of Optical Communications and Networking 6, 10 (Oct 2014), 925--942.
[91]
E. Tasoulas, E. G. Gran, T. Skeie, and B. D. Johnsen. 2016. Fast hybrid network reconfiguration for large-scale lossless interconnection networks. In 2016 IEEE 15th International Symposium on Network Computing and Applications (NCA). 101--108.
[92]
Yutaka Urino, Tatsuya Usuki, Junichi Fujikata, Masashige Ishizaka, Koji Yamada, Tsuyoshi Horikawa, Takahiro Nakamura, and Yasuhiko Arakawa. 2014. Highdensity and wide-bandwidth optical interconnects with silicon optical interposers. Photon. Res. 2, 3 (Jun 2014), A1--A7.
[93]
Chao Wang, Frank Mueller, Christian Engelmann, and Stephen L. Scott. 2008. Proactive Process-level Live Migration in HPC Environments. In Proceedings of the 2008 ACM/IEEE Conference on Supercomputing (SC '08). IEEE Press, Piscataway, NJ, USA, Article 43, 12 pages. http://dl.acm.org/citation.cfm?id=1413370.1413414
[94]
Chang-Heng Wang, Tara Javidi, and George Porter. 2015. End-to-end scheduling for all-optical data centers. 406--414.
[95]
Guohui Wang, DavidG. Andersen, Michael Kaminsky, Michael Kozuch, T. S. Eugene Ng, Konstantina Papagiannaki, Madeleine Glick, and Lily B. Mummert. 2009. Your Data Center Is a Router: The Case for Reconfigurable Optical Circuit Switched Paths. In HotNets, Lakshminarayanan Subramanian, Will E. Leland, and Ratul Mahajan (Eds.). ACM SIGCOMM.
[96]
Ke Wen, Payman Samadi, Sébastien Rumley, Christine P. Chen, Yiwen Shen, Meisam Bahadroi, Keren Bergman, and Jeremiah Wilke. 2016. Flexfly: Enabling a Reconfigurable Dragonfly Through Silicon Photonics. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC '16). IEEE Press, Piscataway, NJ, USA, 15:1--15:12.
[97]
Yiting Xia, Xiaoye Steven Sun, Simbarashe Dzinamarira, Dingming Wu, Xin Sunny Huang, and T. S. Eugene Ng. 2017. A Tale of Two Topologies: Exploring Convertible Data Center Network Architectures with Flat-tree. In Proceedings of the Conference of the ACM Special Interest Group on Data Communication (SIGCOMM '17). ACM, 295--308.
[98]
Xu Yang and Zhiling Lan. 2016. Cooperative Batch Scheduling for HPC Systems.
[99]
P. Yebenes, J. Escudero-Sahuquillo, P. J. Garcia, F. J. Quiles, and T. Hoefler. 2017. Improving Non-minimal and Adaptive Routing Algorithms in Slim Fly Networks. In 2017 IEEE 25th Annual Symposium on High-Performance Interconnects (HOTI). 1--8.
[100]
Keren Bergman Yiwen Shen, Madeleine Strom Glick. 2019. Silicon photonic-enabled bandwidth steering for resource-efficient high performance computing., 10946 - 10946 - 9 pages.
[101]
Ziyi Zhu, Yiwen Shen, Yishen Huang, Alexander Gazman, Maarten Hattink, and Keren Bergman. 2019. Flexible Resource Allocation Using Photonic Switched Interconnects for Disaggregated System Architectures, In Optical Fiber Communication Conference (OFC) 2019. Optical Fiber Communication Conference (OFC) 2019, M3F.3.

Cited By

View all
  • (2024)Meter-Scale Long Connectorized Paper-like Polymer Waveguide Film for 100 Gbps Board-Level Optical Interconnects ApplicationPolymers10.3390/polym1623335016:23(3350)Online publication date: 29-Nov-2024
  • (2024)Optical switching for data centers and advanced computing systems [Invited]Journal of Optical Communications and Networking10.1364/JOCN.53431717:1(A87)Online publication date: 9-Dec-2024
  • (2024)Fast and scalable all-optical network architecture for distributed deep learningJournal of Optical Communications and Networking10.1364/JOCN.51169616:3(342)Online publication date: 22-Feb-2024
  • Show More Cited By

Index Terms

  1. Bandwidth steering in HPC using silicon nanophotonics

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SC '19: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
    November 2019
    1921 pages
    ISBN:9781450362290
    DOI:10.1145/3295500
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    In-Cooperation

    • IEEE CS

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 17 November 2019

    Permissions

    Request permissions for this article.

    Check for updates

    Qualifiers

    • Research-article

    Funding Sources

    • Department of Energy

    Conference

    SC '19
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,516 of 6,373 submissions, 24%

    Upcoming Conference

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)51
    • Downloads (Last 6 weeks)3
    Reflects downloads up to 03 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Meter-Scale Long Connectorized Paper-like Polymer Waveguide Film for 100 Gbps Board-Level Optical Interconnects ApplicationPolymers10.3390/polym1623335016:23(3350)Online publication date: 29-Nov-2024
    • (2024)Optical switching for data centers and advanced computing systems [Invited]Journal of Optical Communications and Networking10.1364/JOCN.53431717:1(A87)Online publication date: 9-Dec-2024
    • (2024)Fast and scalable all-optical network architecture for distributed deep learningJournal of Optical Communications and Networking10.1364/JOCN.51169616:3(342)Online publication date: 22-Feb-2024
    • (2024)COCSN: A Multi-Tiered Cascaded Optical Circuit Switching Network for Data CenterIEEE Transactions on Cloud Computing10.1109/TCC.2024.348827512:4(1463-1475)Online publication date: Oct-2024
    • (2024)MUSE: A Runtime Incrementally Reconfigurable Network Adapting to HPC Real-Time Traffic2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS)10.1109/IPDPS57955.2024.00073(765-779)Online publication date: 27-May-2024
    • (2024)Inter-Node Message Passing Through Optical Reconfigurable Memory ChannelIEEE Access10.1109/ACCESS.2024.341287812(83057-83071)Online publication date: 2024
    • (2024)The Intelligent Design of Silicon Photonic DevicesAdvanced Optical Materials10.1002/adom.20230133712:7Online publication date: 7-Feb-2024
    • (2023)GRAP: Group-level Resource Allocation Policy for Reconfigurable Dragonfly Network in HPCProceedings of the 37th International Conference on Supercomputing10.1145/3577193.3593732(437-449)Online publication date: 21-Jun-2023
    • (2023)Petabit-Scale Silicon Photonic Interconnects With Integrated Kerr Frequency CombsIEEE Journal of Selected Topics in Quantum Electronics10.1109/JSTQE.2022.319737529:1(1-20)Online publication date: Jan-2023
    • (2023)Efficient Intra-Rack Resource Disaggregation for HPC Using Co-Packaged DWDM Photonics2023 IEEE International Conference on Cluster Computing (CLUSTER)10.1109/CLUSTER52292.2023.00021(158-172)Online publication date: 31-Oct-2023
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media