Multiplier Architectures: Challenges and Opportunities with Plasmonic-based Logic : (Special Session Paper)

Emerging technologies such as plasmonics and pho-tonics are promising alternatives to CMOS for high throughput applications, thanks to their waveguide’s low power consumption and high speed of computation. Besides these qualities, these novel technologies also implement logic functionalities uncommon to traditional technologies that can be beneficial to existing CMOS architectures. In this work, we study how plasmonic-based devices can complement CMOS technology to achieve a more efficient implementation of multiplier architectures, which are the core of state-of-the-art data- and signal-processing circuits. A critical part of modern multipliers is the partial-product reduction step, used to reduce the partial product tree into a 2-input addition. In CMOS technology, this step is achieved by using compact and fast counters. On the other hand, the proposed plasmonic cells naturally implement counters of 3-, 9- and 27-inputs within a few logic levels at ultra-high speed. Thus, we present novel multiplier architectures, which take advantage of large plasmonic-based counters to reduce the number of cells and logic levels in the partial product reduction step of the multiplication. Our experimental results show that 3 levels and 30 counters are needed when 27-input cells are used. On the other side, 6 levels and 72 counters are employed with 9-input cells. Finally, we present various 16 × 16 multiplier implementations mixing 9- and 27-input cells, focusing on the trade-off in the number of counters, levels, and area of each architecture.

[1]  Christopher S. Wallace,et al.  A Suggestion for a Fast Multiplier , 1964, IEEE Trans. Electron. Comput..

[2]  Dmitri E. Nikonov,et al.  Overview of Beyond-CMOS Devices and a Uniform Methodology for Their Benchmarking , 2013, Proceedings of the IEEE.

[3]  Alessandro Aimar,et al.  NullHop: A Flexible Convolutional Neural Network Accelerator Based on Sparse Representations of Feature Maps , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[4]  Vojin G. Oklobdzija,et al.  A Method for Speed Optimized Partial Product Reduction and Generation of Fast Parallel Multipliers Using an Algorithmic Approach , 1996, IEEE Trans. Computers.

[5]  N. Sureka,et al.  An efficient high speed Wallace tree multiplier , 2013, 2013 International Conference on Information Communication and Embedded Systems (ICICES).

[6]  Francky Catthoor,et al.  Proposal for nanoscale cascaded plasmonic majority gates for non-Boolean computation , 2017, Scientific Reports.

[7]  Juerg Leuthold,et al.  100 GHz Plasmonic Photodetector , 2018, ACS Photonics.

[8]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[9]  Jalil Fadavi-Ardekani,et al.  M*N Booth encoded multiplier generator using optimized Wallace trees , 1992, Proceedings 1992 IEEE International Conference on Computer Design: VLSI in Computers & Processors.

[10]  W. Barnes,et al.  Surface plasmon subwavelength optics , 2003, Nature.

[11]  William J. Kubitz,et al.  A Compact High-Speed Parallel Multiplication Scheme , 1977, IEEE Transactions on Computers.

[12]  H. John,et al.  Why future supercomputing requires optics , 2010 .

[13]  Mathias Soeken,et al.  Logic Synthesis for Established and Emerging Computing , 2019, Proceedings of the IEEE.

[14]  Michael J. Flynn,et al.  Introduction to Arithmetic for Digital Systems Designers , 1995 .