Abstract
In this article, we propose novel and global Architecture-Aware Analytic MAPping (A3MAP) algorithms applied to Networks-on-Chip (NoCs) not only with homogeneous Processing Elements (PEs) on a regular mesh network as done by most previous application mapping algorithms but also with heterogeneous PEs on an irregular mesh or custom network. As the main contributions, we develop a simple yet efficient interconnection matrix that can easily model any core graph and network. Then, an application mapping problem is exactly formulated to Mixed Integer Quadratic Programming (MIQP). Since MIQP is NP-hard, we propose two effective heuristics, a successive relaxation algorithm achieving short runtime, called A3MAP-SR and a genetic algorithm achieving high mapping quality, called A3MAP-GA. We also propose a partition-based application mapping approach for large-scale NoCs, which provides better trade-off between performance and runtime. Experimental results show that A3MAP algorithms reduce total hop count, compared to the previous application mapping algorithms optimized for a regular mesh network, called NMAP [Murali and Micheli 2004] and for an irregular mesh and custom network, called CMAP [Tornero et al. 2008]. Furthermore, A3MAP algorithms make packets travel shorter distance than CMAP, which is related to energy consumption.
- AIMMS. 2012. Optimization software for operations research applications. http://www.aimms.comGoogle Scholar
- Borkar, S. 2007. Thousand core chips: A technology perspective. In Proceedings of the IEEE/ACM Design Automation Conference (DAC'07). Google ScholarDigital Library
- Benini, L. and Micheli, D. G. 2002. Network on chips: A new SoC paradigm. Comput. 35, 1, 70--78. Google ScholarDigital Library
- Bolotin, E., Cidon, I., Ginosar, R., and Kolodny, A. 2007. Routing table minimization for irregular mesh NoCs. In Proceedings of the Conference on Design, Automation and Test in Europe (DATE'07). 1--6. Google ScholarDigital Library
- Carvalho, E., Calazans, N., and Moraes, F. 2007. Heuristics for dynamic task mapping in NoC-based heterogeneous MPSOCs. In Proceedings of the International Workshop on Rapid System Prototyping. 34--40. Google ScholarDigital Library
- Chan, J. and Parameswaran, S. 2008. NoCOUT: NoC topology generation with mixed packet-switched and point-to-point networks. In Proceedings of the Asia and South Pacific Design Automation Conference (ASP-DAC'08). Google ScholarDigital Library
- Chang, J. M. and Pedram, M. 2000. Codex-dp: Co-Design of communicating systems using dynamic programming. IEEE Trans. Comput. Aided Des. Integr. Circ. Syst. 10, 7, 732--744. Google ScholarDigital Library
- Chang, P. C., Wu, I. W., Shann, J. J., and Chung, C. P. 2008. ETAHM: An energy-aware task allocation algorithm for heterogeneous multiprocessor. In Proceedings of the IEEE/ACM Design Automation Conference (DAC'08). 776--779. Google ScholarDigital Library
- Chatha, S. K., Srinivasan, K., and Konjevod, G. 2008. Automated techniques for synthesis of application specific network-on-chip architectures. IEEE Trans. Comput. Aided Des. Integr. Circ. Syst. 27, 8. Google ScholarDigital Library
- Chen, G., Li, F., Son, W. S., and Kandemir, M. 2008. Application mapping for chip multiprocessor. In Proceedings of the IEEE/ACM Design Automation Conference (DAC'08). 620--625. Google ScholarDigital Library
- Chou, C. L., Ogras, Y. U., and Marculescu, R. 2008. Energy- and performance-aware incremental mapping for networks on chip with multiple voltage levels. IEEE Trans. Comput. Aided Des. Integr. Circ. Syst. 27, 10, 1866--1879. Google ScholarDigital Library
- Dally, J. W. and Towles, B. 2001. Route packets, not wires: On-Chip interconnection networks. In Proceedings of the IEEE/ACM Design Automation Conference (DAC'01). 746--749. Google ScholarDigital Library
- Dick, P. R. 2012. Embedded system synthesis benchmarks suites (E3S). http://www.ece.northwestern.edu/∼dickrp/e3s/Google Scholar
- Dick, P. R., Rhodes, L. D., and Wolf, W. 1998. TGFF: Task graphs for free. In Proceedings of the International Workshop on Hardware/Software Codesign. 97--101. Google ScholarDigital Library
- Dutta, S., Jensen, R., and Rieckkmann, A. 2001. Viper: A multiprocessor SoC for advanced set-top box and digital tv systems. IEEE Des. Test Comput. 18, 5, 21--31. Google ScholarDigital Library
- Faruque, A. A. M., Krist, R., and Henkel, J. 2008. ADAM: Run-Time agent-based distributed application mapping for on-chip communication. In Proceedings of the IEEE/ACM Design Automation Conference (DAC'08). 760--765. Google ScholarDigital Library
- Ghosh, P., Sen, A., and Hall, A. 2009. Energy efficient application mapping to NoC processing elements operating at multiple voltage levels. In Proceedings of the International Symposium on Networks-on-Chip. 80--85. Google ScholarDigital Library
- Gopalakrishnan, P., Li, X., and Pileggi, L. 2006. Architecture-Aware fpga placement using metric embedding. In Proceedings of the IEEE/ACM Design Automation Conference (DAC'06). 460--465. Google ScholarDigital Library
- Grossmann, E. I. and Kravanja, Z. 1997. Mixed-Integer Nonlinear Programming: A Survey of Algorithms and Applications, Large-Scale Optimization with Applications, Part II: Optimal Design and Control. A. R. Conn, L. T. Biegler, T. F. Coleman, and F. N. Santosa, Eds. Springer.Google Scholar
- Hansson, A., Goossens, K., and Radulescu, A. 2005. A unified approach to constrained mapping and routing on network-on-chip architectures. In Proceedings of the International Conference on Hardware/Software Codesign and System Synthesis (CODES + ISSS'05). 75--80. Google ScholarDigital Library
- He, O., Dong, S., Jang, W., Bian, J., and Pan, Z. D. 2011. UNISM: Unified scheduling and mapping for general networks on chip. IEEE Trans. VLSI Syst. 99, 1--14.Google Scholar
- hMETIS. 2012. Hypergraph and circuit partitioning. http://glaros.dtc.umn.edu/gkhome/views/metisGoogle Scholar
- Holsmark, R., Palesi, M., and Kumar, S. 2008. Deadlock free routing algorithms for irregular mesh topology NoC systems with rectangular regions. J. Syst. Archit. 54, 3--4, 384--396. Google ScholarDigital Library
- Hu, J. and Marculescu, R. 2003. Energy-Aware mapping for tile-based NoC architectures under performance constraints. In Proceedings of the Asia and South Pacific Design Automation Conference (ASP-DAC'03). 233--239. Google ScholarDigital Library
- Hu, J. and Marculescu, R. 2005. Communication and task scheduling of application-specific networks-on-chip. IEEE Proc. Comput. Digit. Tech. 152, 5, 643--651.Google ScholarCross Ref
- Jang, W. 2011. Architecture and physical design for advanced networks-on-chip. Ph.D. dissertation, University of Texas at Austin.Google Scholar
- Jang, W. and Pan, Z. D. 2010a. A3MAP: Architecture-Aware analytic mapping for networks-on-chip. In Proceedings of the Asia and South Pacific Design Automation Conference (ASP-DAC'10). 523--528. Google ScholarDigital Library
- Jang, W. and Pan, Z. D. 2010b. An SDRAM-aware router for networks-on-chip. IEEE Trans. Comput. Aided Des. Integr. Circ. Syst. 20, 10, 1572--1585. Google ScholarDigital Library
- Jang, W., Ding, D., and Pan, Z. D. 2010. Voltage and frequency island optimizations for many-core/NoC designs. In Proceedings of the International Conference on Green Circuits and Systems. 217--220.Google Scholar
- Jang, W. and Pan, Z. D. 2011a. A voltage-frequency island aware energy optimization framework for networks-on-chip. IEEE J. Emerg. Select. Topics Circ. Syst. 1, 3, 420--432.Google ScholarCross Ref
- Jang, W. and Pan, Z. D. 2011b. Application-Aware NoC design for efficient sdram access. IEEE Trans. Comput. Aided Des. Integr. Circ. Syst. 30, 10, 1521--1533. Google ScholarDigital Library
- Jang, W., He, O., Yang, J. S., and Pan, Z. D. 2011. In Proceedings of the International Conference on Computer-Aided Design. 207--212. Google ScholarDigital Library
- Le Beux, S., Bois, G., Nicolescu, G., Langevin, M., and Paulin, P. 2010. Combining mapping and partitioning exploration for NoC-based embedded systems. J. Syst. Archit. 56, 7, 223--232. Google ScholarDigital Library
- Markovsky, Y., Patel, Y., and Wawrzynek, J. 2009. Using adaptive routing to compensate for performance heterogeneity. In Proceedings of the International Symposium on Networks on Chip.12--21. Google ScholarDigital Library
- Matousek, J. 2002. Lectures in Discrete Geometry. Springer. Google ScholarDigital Library
- Murali, S., Meloni, P., Angiolini, F., Atienza, D., Carta, S., Benini, L., Micheli, D. G., and Raffo, L. 2007. Designing application-aware networks on chips with floorplan information. In Proceedings of the International Conference on Computer-Aided Design. Google ScholarDigital Library
- Murali, S. and Micheli, D. G. 2004. Bandwidth-Constrained mapping of cores onto NoC architecture. In Proceedings of the Conference on Design, Automation and Test in Europe (DATE'04). 896--901. Google ScholarDigital Library
- Oliver, I., Smith, D., and Holland, J. 1987. A study of permutation crossover operators on the traveling salesman problem. In Proceedings of the Conference on Genetic Algorithms. 224--230. Google ScholarDigital Library
- Sahni, S. and Gonzalez, T. 1976. P-Complete approximation problems. J. ACM 23, 3, 555--565. Google ScholarDigital Library
- Schafer, F. F. M., Hollstein, T., Zimmer, H., and Glesner, M. 2005. Deadlock-Free routing and component placement for irregular mesh-based network-on-chip. In Proceedings of the International Conference on Computer-Aided Design. 238--245. Google ScholarDigital Library
- Shin, D. and Kim, J. 2004. Power-Aware communication optimization for networks-on-chip with voltage scalable links. In Proceedings of the International Conference on Hardware/Software Codesign and System Synthesis. 170--175. Google ScholarDigital Library
- Singh, A. K., Jigang, W., Prakash, A., and Srikanthan, T. 2009. Efficient heuristics for minimizing communication overhead in NoC-based heterogeneous MPSoC platforms. In Proceedings of the International Symposium on Rapid System Prototyping. 55--60. Google ScholarDigital Library
- Singh, A. K., Srikanthan, T., Kumar, A., and Jigang, W. 2010. Communication-Aware heuristics for runtime task mapping on NoC-based MPSoC platforms. J. Syst. Archit. 56, 7, 242--255. Google ScholarDigital Library
- Smit, T. L., Smit, J. M. G., Hurink, L. J., Broersma, H., Paulusma, D., and Wolkotte, T. P. 2004. Run-Time assignment of tasks to multiple heterogeneous processors. In Proceedings of the 4th PROGRESS Workshop on Embedded Systems. 185--192.Google Scholar
- STMicroelectronics. 2012. Nomadik multimedia processors. http://www.st.comGoogle Scholar
- Texas Instruments. 2012. Wireless handset solutions: OMAP platform. http://www.ti.comGoogle Scholar
- Tornero, R., Orduna, M. J., Palesi, M., and Duato, J. 2008. A communication-aware topological mapping technique for NoCs. In Proceedings of the 14th International Conference on Parallel and Distributed Computing. 910--919. Google ScholarDigital Library
- Van Der Tol, B. E. and Jaspers, G. T. E. 2002. Mapping of the mpeg-4 decoding on flexible architecture platform. In Proce. SPIE 4674, 1, 1--13.Google Scholar
Index Terms
- A3MAP: Architecture-aware analytic mapping for networks-on-chip
Recommendations
Cluster-based application mapping method for Network-on-Chip
Network-on-Chip (NoC) is a newly introduced paradigm to overcome the communication problems of System-on-Chip architectures. Mapping applications onto mesh-based NoC architecture is an NP-hard problem and several heuristic methods have been presented to ...
A3MAP: architecture-aware analytic mapping for networks-on-chip
ASPDAC '10: Proceedings of the 2010 Asia and South Pacific Design Automation ConferenceIn this paper, we propose a novel and global A3MAP (Architecture-Aware Analytic Mapping) algorithm applied to NoC (Networks-on-Chip) based MPSoC (Multi-Processor System-on-Chip) not only with homogeneous cores on regular mesh architecture as done by ...
Numerical Experience with Lower Bounds for MIQP Branch-And-Bound
The solution of convex mixed-integer quadratic programming (MIQP) problems with a general branch-and-bound framework is considered. It is shown how lower bounds can be computed efficiently during the branch-and-bound process. Improved lower bounds such ...
Comments