# N<sup>3</sup>ASICs: Designing Nanofabrics with Fine-Grained CMOS Integration

Pavan Panchapakeshan, Pritish Narayanan and Csaba Andras Moritz Electrical and Computer Engineering University of Massachusetts, Amherst Amherst, MA, USA {ppanchapakesh,andras}@ecs.umass.edu

Abstract—We propose a novel nanofabric approach that mixes unconventional nanomanufacturing with CMOS manufacturing flow and design rules in order to build a reliable nanowire-CMOS fabric called N<sup>3</sup>ASIC with no new manufacturing constraints added. Active devices are formed on a dense uniform semiconductor nanowire array and standard area distributed pins/vias; metal interconnects route the signals in 3D. CMOS design rules are followed. Novel nanowire based devices are envisioned and characterized based on 3D physics modeling. N<sup>3</sup>ASIC Overall fabric design, associated circuits, interconnection approach, and a layer-by-layer assembly sequence for the fabric are introduced. Key system level metrics such as power, performance, and density for a nanoprocessor design built using N<sup>3</sup>ASICs were evaluated and compared against a functionally equivalent CMOS design synthesized with state-of-the-art CAD tools. We show that the N<sup>3</sup>ASICs version of the processor is 3X denser and 5X more power efficient for a comparable performance than the 16-nm scaled CMOS version even without any new/unknown-manufacturing requirement added.

*Keywords-nanowires; 3-D integration; N<sup>3</sup>ASIC; NASIC; nano-CMOS hybrid system.* 

#### I. INTRODUCTION

Reliable manufacturing of integrated nanosystems incorporating novel nanodevices continues to be challenging. Specifically, assembly of nanostructures, achieving reconfigurable devices, interfacing and overlay considerations are key issues for nanoscale computing fabrics. While nanofabrics such as NASICs [1][2][3][4][5][6], CMOL [7] and FPNI [8] have been proposed minimizing certain manufacturing constraints, some or all of the aforementioned concerns still exist.

In this paper we propose a novel approach that mixes unconventional nanomanufacturing with conventional CMOS lithography and design rules to build a new class of 3-D integrated nanofabrics without any new manufacturing requirements. A new nanofabric, called N<sup>3</sup>ASICs (Nanoscale 3-D Application Specific Integrated Circuits) is presented. This fabric combines the advantages of high density obtained from unconventional manufacturing with the reliability and overlay precision of conventional CMOS manufacturing.

In N<sup>3</sup>ASICs, active devices are formed on uniform aligned semiconductor nanowire arrays, and area-distributed interfaces are used to connect to a 3-D CMOS metal stack for routing. To enable full integration with CMOS, lithographic design rules are also followed in shaping the fabric. Furthermore, a single unconventional manufacturing step to pattern/assemble sublithographic nanostructures is carried out at the beginning without any overlay requirement before any lithographic step. Thus, registration and overlay requirements exist only for subsequent photolithography steps, which is very precise (projected to be  $3\sigma = \pm 3.3$ nm for 16nm CMOS [9]). This is in direct contrast to proposals such as CMOL [7] and FPNI [8], where an unconventional manufacturing step (e.g., Nano-Imprint Lithography (NIL) [10]) with fairly poor overlay precision  $(3\sigma = \pm 105 \text{ nm } [11])$  is required after conventional lithography steps.

Core concepts of the N<sup>3</sup>ASIC fabric are introduced. A layer-by-layer assembly sequence is shown demonstrating how the fabric may be realized on a single Silicon-on-Insulator (SOI) wafer. Novel dual-channel Crossed Nanowire Field Effect Transistors (2C-xnwFETs), the active devices in N<sup>3</sup>ASICs, are extensively characterized using accurate 3-D physics-based simulations calibrated with experimental data. Associated circuit styles and interconnection approach are described and validated for functionality. A nanoprocessor design is implemented on N<sup>3</sup>ASICs, and key system-level metrics, including area, power and performance are evaluated.

The key contributions of this paper are: (i) N<sup>3</sup>ASIC, a new hybrid CMOS/nano computational fabric is described; (ii) Extensive device-level characterization of novel 2CxnwFETs for N<sup>3</sup>ASIC is shown; (iii) Key system-level metrics such as density, performance and power for N<sup>3</sup>ASIC are evaluated and compared against an equivalent 16nm CMOS design. We show that N<sup>3</sup>ASICs has 3X density and 5X power advantage over an end-of-the-line 16nm CMOS with comparable performance even if all CMOS design rules

This work was supported in part by the Center for Hierarchical Manufacturing (CHM) at UMass Amherst, Focus Center Research Program (FCRP) – Center on Functionally Engineering Nano Architectonics (FENA), and NSF awards CCR:0105516, NER:0508382, and CCR:051066.

are accommodated. Future N<sup>3</sup>ASIC fabrics could likely relax some of these rules, for example: by using new alignment techniques or by new via technologies.

The rest of the paper is organized as follows: Section II presents N<sup>3</sup>ASIC fabric and manufacturing aspects in detail, Section III presents evaluations of the fabric at device, circuit and architectural levels. Section IV discusses N<sup>3</sup>ASICs variants and directions for the future. Section V concludes the paper.

## II. N<sup>3</sup>ASIC FABRIC OVERVIEW

Fig. 1 shows the envisioned N<sup>3</sup>ASIC fabric built on a standard Silicon-on-Insulator (SOI) wafer. It consists of uniform parallel semiconductor nanowire arrays on which logic is implemented. Area-distributed standard pins or vias are used to connect inputs and outputs of these logic planes to the CMOS routing stack. Metal interconnections between vias achieve arbitrary routing. Support peripheral CMOS circuitry can be used for external control and dynamic clocking.

The underlying uniform nanowire array at the bottom layer can be direct patterned on an ultra-thin SOI substrate using approaches such as Nano-Imprint Lithography (NIL) [10] or Superlattice Nanowire Pattern Transfer (SNAP) [12][13]. For example, SNAP has shown uniform Silicon nanowire arrays at dimensions as low as 7nm width and 13nm pitch [14]. All subsequent steps, including the creation of vias, contacts and metal interconnect are achieved using conventional lithography and by obeying standard design rules.

To enable full and fine-grained integration with CMOS (e.g., not only IO signals but also inputs/outputs for each nanowire gate) without new manufacturing requirements, lithographic design rules need to be followed. Fig. 2 shows representative  $\lambda$  design rules applied to the N<sup>3</sup>ASIC fabric. All requirements for via overhang, metal-via and metal-metal spacing etc. are followed (e.g. [15] projects metal pitch = 40nm or 5 $\lambda$  for 16nm technology). Since lithographic requirements decide spacing, more sub-lithographically patterned nanowires may be bundled within the same dimension without loss of density. This allows for better contact, performance and inherent defect resilience, as will be shown in subsequent sections.



Figure 1 Nano-CMOS integrated N<sup>3</sup>ASIC fabric



Figure 2 CMOS Design rules applied to N<sup>3</sup>ASIC

Fig. 3 shows a layer-by-layer assembly sequence for N<sup>3</sup>ASICs. At the bottom of the fabric is a uniform semiconductor nanowire array (Fig. 3A). Metal gates (shown in green) are deposited at certain positions to define 2CxnwFETs (Fig. 3B) using conventional lithography. A selfaligning ion implantation is then used to create n+/p/n+structures for enhancement mode 2C-xnwFETs similar to conventional CMOS. All device channels are oriented along the same direction and lie on the substrate itself. Power and dynamic control rails are also established to define two separate logic planes. Metal lines and vias may then be laid down for interconnection. Inputs are received through an M1 array (light blue lines) and vias are dropped on to the nanowires to tap the outputs (blue dots) (Fig. 3C). In Fig. 3D, outputs from the left logic plane are cascaded to the inputs of the right plane using M2 (orange lines). This approach can be scaled to a large scale design with multiple cascaded logic planes.



Figure 3 Simplified assembly sequence for a N<sup>3</sup>ASIC tile: A) Shows a parallel array of nanowires; B) Shows the deposited metal gates to define transistor positions; C) Shows the M1/M2 metal interconnects and vias for output; D) 3D View of the proposed N<sup>3</sup>ASIC



Since a single unconventional patterning step such as SNAP or NIL is carried out *a priori* to any lithography, it will not have any registration or overlay requirement. Furthermore, registration of the first lithographic mask against the patterned nanowires can be achieved by transferring alignment markers to the substrate in the same step as logic nanowires (which ensures that the features are self-aligned). For an approach such as NIL, an arbitrary alignment marker could be created. For SNAP, where it may not be possible to create arbitrary markers as part of the superlattice, Moire patterns [16] could be used for registration. Furthermore, the underlying pattern of nanowires is uniform, which implies that the first lithographic mask can be offset with some tolerance and no loss of functionality.

This approach is in direct contrast to such as CMOL and FPNI, where the fabric organization requires nonconventional techniques such as imprint lithography to be employed *after* fabrication of CMOS layers. Overlay alignment for imprint lithography is  $3\sigma = 105$ nm [11], which implies significant challenges in alignment against previously formed CMOS features and would result in very low or zero yield.

### A. Device Structure

The use of standard design rules and lithography for manufacturing determines device structure and dimensions. Given that channel nanowires could have much smaller dimensions than metal vias, they are bundled into pairs to





make better contact, and provide for dual channel FETs.

In this paper the 2C-xnwFET along with an omega-like structured deposited metal gate shown in Fig. 4 was used. The gate width and the channel length of the device are defined by the technology node as they are lithographically defined. So for the purpose of study, devices with 16nm gate lengths were simulated. A high-k dielectric (HfO<sub>2</sub> [17]) was used as gate oxide material. A gate self-aligned process with etch back can be used for defining the oxide structure. Since this is an Omega-gated structure (somewhat similar to multi gate FETs [18]), one can expect good electrostatic control of the gate over the channel as it has a better gate to channel coupling as opposed to a top-gated structure. A better electrostatic control over the channel gives a higher on to off current ratio. The use of dual channels implies higher on-current, with potential benefits for system-level performance. Furthermore, the dualchannel structure implies inherent defect resilience against broken nanowires and some types of stuck-off defects, without a density impact. Stuck-off defects are very difficult to mask in general (vs. stuck-on defects that can be masked with redundancy fairly easily): this therefore is a good compromise.

#### B. Circuit Style

N<sup>3</sup>ASICs uses a dynamic circuit style similar to the circuit style employed by NASICs [3]. These dynamic circuit styles are amenable to implementation on regular nanowire arrays without the need for complementary devices, arbitrary sizing or placement, simplifying manufacturing requirements on N<sup>3</sup>ASICs. Logic customization is limited to defining the positions of the 2C-xnwFETs on the logic planes. Cascading and noise concerns for dynamic circuits arising from high output impedance are carefully managed through device design and intelligent fabric-level sequencing schemes similar to the approaches presented in [19][20][21].

One dynamic sequencing scheme for cascading is shown in Fig. 5. In this scheme, successive stages are clocked using different precharge and evaluate signals, with hold phases inserted for correct cascading. During a hold phase, the output node of a given stage is implicitly latched, and used for evaluation of the next stage, similar to [19][21]. Implicit latching implies that area expensive latches or flip-flops requiring complementary devices/local feedback paths are not needed.

Fig. 6 shows the top view of a 1-bit full adder circuit built using two N<sup>3</sup>ASIC logic planes. In this example, a 2-level NAND-NAND [3] logic style is used (other logic styles, such as based on H2L [1] logic are also possible). Stage 1 generates the minterms based on the inputs (marked stage 1 outputs). Minterms are fed to stage 2 using horizontal metal interconnects. Stage 2, using a combination of minterms generates different outputs. The outputs available on the right side of this stage can be routed to subsequent tiles using additional metal interconnects.

### III. EVALUATION AND RESULTS

The N<sup>3</sup>ASIC fabric was extensively evaluated at device, circuit and architectural levels. Device I-V and C-V characteristics were extracted, reflecting accurate 3-D physics. An integrated device-fabric methodology was used to create behavioral models of devices for a circuit simulator. Circuit level simulations were carried out to verify functionality. System-level metrics such as power and performance were evaluated for an N<sup>3</sup>ASIC processor design. The following subsections describe each phase in detail.

## A. Device Simulations

Enhancement mode Dual-Channel Crossed Nanowire FETs (2C-xnwFETs, Fig. 4) were extensively characterized using accurate physics-based 3D simulation of the electrostatics and operations using Synopsys Sentaurus<sup>TM</sup> [22]. The 2C-xnwFETs employ metal Omega gate structures for tighter electrostatic control. Gate material work function is 4.6 eV. 16nm channel devices were simulated given that it is the minimum feature size for lithographically defined gates. The notation N<sup>3</sup>ASIC-16 represents N<sup>3</sup>ASIC constructed with 16nm CMOS design rules, which implies  $\lambda$ the scale length, is equal to 8nm. The channels are doped ptype of the order of  $10^{18}$  cm<sup>-3</sup> and the source/drain regions were doped n-type of the order of 10<sup>20</sup>cm<sup>-3</sup>. A substrate bias of -3V was assumed to deplete the channel and adjust device parameters such as threshold voltage and on/off current ratios for correct cascading. A high-k HfO2 material is used for gate oxide. The gate oxide thickness was 3nm. Drift diffusion transport models [23]were used to simulate the 3D devices. Simulations were calibrated to account for interface scattering. surface roughness and interface trapped charges as explained



Figure 7  $I_{DS}$  Vs  $V_{DS}$  curves of N<sup>3</sup>ASIC-16 2C-xnwFET



in [20].

Drain current vs. drain voltage ( $I_{DS}-V_{DS}$ ), drain current vs. gate voltage ( $I_{DS}-V_{GS}$ ), and different parasitic capacitances vs. gate voltage (C vs V<sub>GS</sub>) were simulated. On-current ( $I_{ON}$ ) and on/off ( $I_{ON}/I_{OFF}$ ) current ratio were extracted. Fig. 7 shows the  $I_{DS}-V_{DS}$  curve for different V<sub>GS</sub> values. Fig. 8 shows the  $I_{DS}-V_{GS}$  curves for different V<sub>DS</sub> values. These simulations verify inversion mode behavior for 2C-xnwFETs with a positive threshold voltage. Table 1 shows key device parameters for N<sup>3</sup>ASIC-16 2C-xnwFET and for also the NASIC xnwFET described in [20]. Due to the dual channel the N<sup>3</sup>ASIC-16 2C-xnwFET have a higher ON current compared to the NASIC xnwFET that lowers intrinsic delay and can improve circuit performance. Also, VTH > 0.2, and ION/IOFF > 10<sup>4</sup> were obtained, implying that the devices meet circuit requirements for correct functionality and noise [20].

| TABLE I.        | DEVICE SIMULATION RESULTS           |                     |  |
|-----------------|-------------------------------------|---------------------|--|
| Parameter       | N <sup>3</sup> ASIC-16<br>2C-xnwFET | NASIC<br>xnwFET[20] |  |
| V <sub>TH</sub> | 0.27V                               | 0.21V               |  |
| I <sub>on</sub> | 39.6µA                              | 18.5µA              |  |
| Ion/IOFF        | 26218                               | 10782               |  |

#### B. Circuit-level Simulations

Novel nanoelectronic devices such as 2C-xnwFETs do not have built-in analytical models in traditional circuit simulators



Figure 9 C<sub>G</sub> vs V<sub>GS</sub> of N<sup>3</sup>ASIC-16 2C-xnwFET



N<sup>3</sup>ASIC are connected

such as HSPICE. Therefore, device simulation data were used to create behavioral models for 2C-xnwFETs compatible with Synopsys HSPICE [24] as explained in [20][19]. The behavioral models incorporate mathematical expressions for device current as a function of  $V_{GS}$  and  $V_{DS}$ , and piecewise-linear approximations of various parasitic capacitances vs.  $V_{GS}$ . For example, Fig. 9 shows  $C_G$  vs  $V_{GS}$  curves for N<sup>3</sup>ASIC-16.

Simulations were carried out in HSPICE to evaluate the performance and power of  $N^3ASICs$  design. Fig. 10 shows a circuit-level abstraction of cascaded NAND-NAND stages realized on the  $N^3ASIC$  fabric. Behavioral models are used for 2C-xnwFETs. Since vias and metal interconnects are used to route signals, CMOS interconnect models are necessary to evaluate the performance of  $N^3ASIC$ . The interconnects were modeled using the Predictive Technology Model (PTM) [25][26] models. The dimensions and parameters for scaled CMOS interconnect were chosen as projected by ITRS [9] and [15]. With the help of behavioral models, HSPICE simulations were carried out to verify functionality and measure the performance and power of  $N^3ASIC-16$ .

The full-adder in Fig. 6 was simulated in HSPICE to verify expected circuit level behavior. Fig. 11 shows the output waveforms of the one bit full adder simulated in HSPICE with the behavioral model. These simulations verify functionality of the circuits and adequate noise margins. It can be noted that the data on the output node is latched during the hold phases.

#### C. System-level Evaluation

For the purpose of system-level evaluation WISP-0 [5][27], a processor incorporating nanopipeling was chosen. Area of each tile in N<sup>3</sup>ASIC-16 WISP-0 was calculated based



Figure 11 Simulation wavefroms of N<sup>3</sup>ASIC One bit full adder

on the design rules and the number of metal tracks. A HSPICE circuit definition of the entire WISP-0 was created with proper interconnects to calculate the power and performance of  $N^3ASIC-16$  WISP-0. Key system level metrics such as area, performance and power were compared with a functionally equivalent 16nm static CMOS baseline.

The 16nm static CMOS baseline was created using the following methodology. A functional description of WISP-0 was written in Verilog. Using Synopsys Design Compiler, and standard cell library, gate level Verilog netlist was created. This was converted to a circuit-level netlist using the nettran utility. HSPICE definition of the standard cell library was used for this purpose. The MOSFET device dimensions were scaled to the 16nm technology node. The netlist and PTM 16nm MOSFET high performance models were used to run circuit level simulations in Synopsys HSPICE to measure the performance and power of the CMOS design. For area estimation the WISP-0 was synthesized using 45nm standard cell library and quadratically scaled down to 16nm.

Fig. 12 shows the density advantage of N<sup>3</sup>ASICs at various technology nodes. The proposed N<sup>3</sup>ASIC-16 is 3X denser compared to 16nm CMOS. The density improvement is due to the regular dense nanowire logic array at the bottom, use of single type of FET, smaller device footprint, and use of implicit latching without the need for area expensive flip flops. Since CMOS design rules are used for pitch and spacing, the scaling trend is almost constant across other technology nodes considered.

Power and performance comparisons are shown in Table 2. We notice that the performance of N<sup>3</sup>ASIC-16 is comparable to that of 16nm CMOS equivalent WISP-0. These simulations do not consider key optimizations for 2C-xnwFETs making comparisons pessimistic. For example, while the PTM models employ strained silicon, no straining was assumed for 2C-xnwFETs. It is expected that a better mobility and hence better performance could be obtained when straining techniques are employed in N<sup>3</sup>ASIC.

TABLE II. KEY SYSTEM LEVEL METRICS FOR WISP-0

|                         | Area<br>(µm²) | Performance<br>(GHz) | Power<br>(µw) |
|-------------------------|---------------|----------------------|---------------|
| CMOS<br>Baseline(16nm)  | 66.24         | 6.25                 | 77.90         |
| N <sup>3</sup> ASIC-16  | 22            | 6.32                 | 14.36         |
| Relative<br>Improvement | 3.01          | 1.01                 | 5.42          |

A significant reduction in average power of 5.4X was observed in case of N<sup>3</sup>ASIC-16. To clearly explain this, experiments were carried out with different circuits and varying number of inputs. With the voltage and the frequency of operation being the same, the capacitances were investigated. Since there is no arbitrary sizing in the case of N<sup>3</sup>ASICs and all 2C-xnwFETs are identical, the maximum input gate capacitance is always 20.42aF (Fig. 9). In case of the CMOS WISP-0 design, the transistors are sized, contributing to increased gate capacitance. The input gate capacitance in the case of minimum sized inverter in CMOS is 75.14aF which is more than 3.5X that of the N<sup>3</sup>ASICs. The largest NMOS device used has a gate capacitance of 135.4aF and the largest PMOS device has a gate capacitance of



372.38aF. A plot of the distribution of the transistor widths in the case of CMOS-WISP-0 is shown in Fig. 13. Since a dynamic logic style with only single type FET is used, N<sup>3</sup>ASIC-16 uses a fewer number of transistors to realize the logic. Implicit latching [21][27] of signals on the nanowires further reduces the number of transistors required. The transistor counts were 1306 and 3252 in case of N3ASICs and CMOS respectively. With the use of transistors of various widths, the gate capacitance further increases leading to increased dynamic power consumption for CMOS WISP-0.

#### IV. DISCUSSION AND SCOPE FOR FUTURE WORK

One variant of the N<sup>3</sup>ASICs was evaluated in detail. Other variants of N<sup>3</sup>ASICs to improve the density and performance of the fabric will be explored. Currently the density of the fabric is determined by metal 1 pitch and the via spacing. The density of the fabric would greatly improve if we just have the vias/pins to connect the inputs and outputs of a tile. This would be possible if nanowires are used to route signals within the tile instead of metal interconnects. By reducing the number of vias/pins we can take advantage of more densely packed nanowire arrays.

Another benefit of the 2C-xnwFET is that it provides fault tolerance against stuck-open devices in comparison to a single channel device. For example, dual channel structures are more resilient to broken nanowires. A single conducting nanowire can still achieve correct functionality. Given that nanowires can have much smaller dimensions than metal vias, multiple channel devices (greater than 2) could also be considered. From a fault-tolerance perspective, more channels would imply better resilience to stuck-off defects. However, due to





reduced channel cross-section scattering effects would increase causing deterioration in device performance. More detailed evaluations will be carried out as part of future work. Ultimately, defect distributions and performance targets will drive device design.

In order to improve the performance of the device, techniques like strain engineering [28][29] can be applied to increase the mobility of the charge carriers in the channel. Apart from the enhancement mode devices, junctionless devices similar to [29][30] can be used for N<sup>3</sup>ASICs. This would imply manufacturing and performance benefits. In a junctionless FET, a uniform doping profile is used on the channel without the need for n+p/n+ junctions, which implies simpler manufacturability. The performance of devices and circuits could be expected to increase owing to bulk conduction in junctionless devices unlike the enhancement mode devices which exhibit inversion layer conduction.

To reduce the manufacturing costs involved, a structured N<sup>3</sup>ASICs can be envisioned similar to the structured ASIC [31] approach. All nanowire logic planes could be identically sized and with pre-defined 2C-xnwFET positions. Arbitrary functionality and logic may be achieved purely with routing customizations using custom metal interconnects. This can potentially reduce design time and the manufacturing cost as it reduces the number of masks required.

#### V CONCLUSION

N<sup>3</sup>ASICs, a 3-D integrated nano-CMOS hybrid fabric was shown. Integration is fine grained: each input and output of a nanowire gate can be routed to any CMOS gate. The fabric uses unconventional manufacturing processes in conjunction with CMOS design rules for full 3-D integration without any special manufacturing requirements. A detailed layer by layer assembly sequence was presented. Detailed fabric evaluations were carried out at device, circuit, and system levels. A nanoprocessor implemented using the proposed N<sup>3</sup>ASIC fabric was shown to be 3X denser than an equivalent CMOS design even if all conservative CMOS design rules are obeyed. At a 5X lower power consumption the N<sup>3</sup>ASIC fabric is able to achieve the same performance as that of the CMOS processor even without device optimizations like straining that were supported in the 16nm CMOS device. With straining and by relaxing some of the design rule requirements much additional benefits may be possible.

#### REFERENCES

- [1] T. Wang, P. Narayanan, and C. Andras Moritz, "Heterogeneous Two-Level Logic and Its Density and Fault Tolerance Implications in Nanoscale Fabrics," IEEE Transactions on Nanotechnology, vol. 8, no. 1, pp. 22-30, Jan. 2009.
- C. A. Moritz et al., "Fault-Tolerant Nanoscale Processors on [2] Semiconductor Nanowire Grids," IEEE Transactions on Circuits and Systems I: Regular Papers, vol. 54, no. 11, pp. 2422-2437, Nov. 2007.
- [3] P. Narayanan, M. Leuchtenburg, T. Wang, and C. A. Moritz, "CMOS Control Enabled Single-Type FET NASIC," in 2008 IEEE Computer Society Annual Symposium on VLSI, Montpellier, France, 2008, pp. 191-196.
- [4] C. A. Moritz, P. Narayanan, and C. O. Chui, Nanoscale Application Specific Integrated Circuits.
- [5] T. Wang, P. Narayanan, and C. A. Moritz, "Combining 2-level logic families in grid-based nanoscale fabrics," in 2007 IEEE International

Symposium on Nanoscale Architectures, San Jose, CA, USA, 2007, pp. 101-108.

- [6] P. Narayanan, K. W. Park, C. O. Chui, and C. A. Moritz, "Manufacturing pathway and associated challenges for nanoscale computational systems," in *Nanotechnology*, 2009. *IEEE-NANO* 2009. 9th IEEE Conference on, 2009, pp. 119-122.
- [7] K. K. Likharev, "CMOL: Second life for silicon?," *Microelectronics Journal*, vol. 39, p. 177–183, Feb. 2008.
- [8] G. S. Snider and R. S. Williams, "Nano/CMOS architectures using a field-programmable nanowire interconnect," *Nanotechnology*, vol. 18, no. 3, p. 035204, Jan. 2007.
- [9] "2009 ITRS." [Online]. Available: http://www.itrs.net/Links/2009ITRS/Home2009.htm.
- [10] T. Mårtensson, P. Carlberg, M. Borgström, L. Montelius, W. Seifert, and L. Samuelson, "Nanowire Arrays Defined by Nanoimprint Lithography," *Nano Letters*, vol. 4, no. 4, pp. 699-702, Apr. 2004.
- [11] C. Picciotto, J. Gao, Z. Yu, and W. Wu, "Alignment for imprint lithography using nDSE and shallow molds," *Nanotechnology*, vol. 20, no. 25, p. 255304, Jun. 2009.
- [12] D. Wang, Y. Bunimovich, A. Boukai, and J. R. Heath, "Twodimensional single-crystal nanowire arrays," Dec-2007. [Online]. Available: http://www.nanoarchive.org/1853/.
- [13] D. Wang, B. A. Sheriff, M. McAlpine, and J. R. Heath, "Development of ultra-high density silicon nanowire arrays for electronics applications," *Nano Research*, vol. 1, no. 1, pp. 9-21, Jul. 2008.
- [14] J. R. Heath, "Superlattice Nanowire Pattern Transfer (SNAP)," Accounts of Chemical Research, vol. 41, no. 12, pp. 1609-1617, Dec. 2008.
- [15] C. Bencher, H. Dai, and Y. Chen, "Gridded design rule scaling: taking the CPU toward the 16nm node," in *Proceedings of SPIE*, San Jose, CA, USA, 2009, p. 72740G-72740G-10.
- [16] S. H. Zaidi, "Moire interferometric alignment and overlay techniques," in *Proceedings of SPIE*, San Jose, CA, USA, 1994, pp. 371-382.
- [17] Supratik Guha and Vijay Narayanan, "High-κ/Metal Gate Science and Technology," 06-Jul-2009. [Online]. Available: http://www.annualreviews.org/doi/abs/10.1146/annurev-matsci-082908-145320.

- [18] C. Pacha et al., "Circuit design issues in multi-gate FET CMOS technologies," in 2006 IEEE International Solid State Circuits Conference - Digest of Technical Papers, San Francisco, CA, 2006, pp. 1656-1665.
- [19] P. Narayanan, C. A. Moritz, K. W. Park, and C. O. Chui, "Validating cascading of crossbar circuits with an integrated devicecircuit exploration," in 2009 IEEE/ACM International Symposium on Nanoscale Architectures, San Francisco, CA, USA, 2009, pp. 37-42.
- [20] P. Narayanan, J. Kina, P. Panchapakeshan, C. O. Chui, and C. A. Moritz, "Integrated Device-Fabric explorations and Noise Mitigation in Nanoscale Fabrics," *Submitted to TNANO under review*.
- [21] T. Wang, "Fault tolerant nanoscale microprocessor design on semiconductor nanowire grids," University of Massachusetts, Amherst.
- [22] "Synopsys- Sentaurus User Guide," 2009.
- [23] Streetman and Banerjee, Solid state electronic devices, 6th ed. Englewood Cliffs, NJ: Prentice-Hall, 2010.
- [24] "HSPICE Simulation and Analysis Guide," 2009.
- [25] W. Zhao and Y. Cao, "New Generation of Predictive Technology Model for Sub-45nm Design Exploration," in *Proceedings of the 7th International Symposium on Quality Electronic Design*, 2006, pp. 585-590.
- [26] "Predictive Technology Model (PTM)." [Online]. Available: http://ptm.asu.edu/.
- [27] T. Wang, M. Ben-naser, Y. Guo, and C. A. Moritz, "Wire-streaming processors on 2-D nanowire fabrics," *IN NANOTECH 2005. NANO* SCIENCE AND TECHNOLOGY INSTITUTE, 2005.
- [28] V. Chan et al., "Strain for CMOS performance improvement," in Proceedings of the IEEE 2005 Custom Integrated Circuits Conference, 2005., San Jose, CA, USA, pp. 662-669.
- [29] J.-P. Colinge et al., "Nanowire transistors without junctions," Nat Nano, vol. 5, no. 3, pp. 225-229, Mar. 2010.
- [30] C.-W. Lee, A. Afzalian, N. D. Akhavan, R. Yan, I. Ferain, and J.-P. Colinge, "Junctionless multigate field-effect transistor," *Applied Physics Letters*, vol. 94, no. 5, p. 053511, 2009.
- [31] B. Zahiri, "Structured ASICs: opportunities and challenges," in Proceedings 21st International Conference on Computer Design, San Jose, CA, USA, pp. 404-409.