### Digital IC & Sytems Design

### Iain McNally

### Digital IC & Sytems Design

### Iain McNally

 $\approx 10$  lectures

### Koushik Maharatna

 $\approx 12$  lectures

### Basel Halak

 $\approx 12$  lectures

1001

### Digital IC & Sytems Design

### Assessment

10% Coursework L-Edit Gate Design (BIM)

90% Examination

### Books

### **Integrated Circuit Design**

a.k.a. Principles of CMOS VLSI Design - A Circuits and Systems Perspective  $\,$ 

Neil Weste & David Harris

Pearson, 2011

### Digital System Design with SystemVerilog

Mark Zwolinski

Pearson Prentice-Hall, 2010

### Integrated Circuit Design

### Content

- Introduction
- Overview of Technologies
- Layout
- CMOS Processing
- Design Rules and Abstraction
- Cell Design and Euler Paths
- System Design using Standard Cells
- Wider View

### • Notes & Resources

http://users.ecs.soton.ac.uk/bim/notes/icd

1003

### History

### 1947 First Transistor

John Bardeen, Walter Brattain, and William Shockley (Bell Labs)

### 1952 Integrated Circuits Proposed

Geoffrey Dummer (Royal Radar Establishment) - prototype failed...

### 1958 First Integrated Circuit

Jack Kilby (Texas Instruments) - Co-inventor

### 1959 First Planar Integrated Circuit

Robert Noyce (Fairchild) - Co-inventor

### 1961 First Commercial ICs

Simple logic functions from TI and Fairchild

### 1965 Moore's Law

Gordon Moore (Fairchild) observes the trends in integration.

### History

### Moore's Law

Predicts exponential growth in the number of components per chip.

### 1965 - 1975 Doubling Every Year

In 1965 Gordon Moore observed that the number of components per chip had doubled every year since 1959 and predicted that the trend would continue through to 1975.

Moore describes his initial growth predictions as "ridiculously precise".

### 1975 - 201? Doubling Every Two Years

In 1975 Moore revised growth predictions to doubling every two years.

Growth would now depend only on process improvements rather than on more efficient packing of components.

In 2000 he predicted that the growth would continue at the same rate for another 10-15 years before slowing due to physical limits.

1005

### History

### Moore's Law at Intel<sup>1</sup>



<sup>&</sup>lt;sup>1</sup>Intel was founded by Gordon Moore and Robert Noyce from Fairchild

1006

### History

### Moore's Law; a Self-fulfilling Prophesy

The whole industry uses the Moore's Law curve to plan new fabrication facilities.

**Slower** - wasted investment

Must keep up with the Joneses<sup>2</sup>.

**Faster** - too costly

Cost of capital equipment to build ICs doubles approximately every 4 years.

Moore's law is not dead (at least not quite), although there are worries that below 20nm, clever processing required for smaller transistors means that cost per transistor is going up rather than down.

| 2               |     |        |  |
|-----------------|-----|--------|--|
| <sup>∠</sup> or | the | Intels |  |

1947 Point Contact transistor 1961 Fairchild Bipolar RTL RS Flip-Flop



Moore's Law (1965) Number of transistor has doubled every year and will continue to do so until 1975

Moore's Law (1975) Number of transistors will double every two years



### Overview of Technologies

### Components for Logic

Diode



**Bipolar Transistors** 





### **MOS Transistors**



### Overview of Technologies

### RTL Inverter and NOR gate



### Overview of Technologies

All functions can be realized using only the NOR gates  $^{\rm l}$  available in the RTL logic family.  $^{\rm 2}$ 



<sup>&</sup>lt;sup>1</sup>Note that an inverter is a special case of a NOR gate with only one input.

2003

### Overview of Technologies

### Other Bipolar Technologies





- TTL gives faster switching than RTL at the expense of greater complexity<sup>3</sup>. The characteristic multi-emitter transistor reduces the overall component count.
- ECL is a very high speed, high power, non-saturating technology.

<sup>&</sup>lt;sup>2</sup>NAND gates could be used instead for logic families which support only NAND gates.

<sup>&</sup>lt;sup>3</sup>Most TTL families are more complex than the basic version shown here

### Overview of Technologies

NMOS - a VLSI technology.



- Circuit function determined by series/parallel combination of devices.
- Depletion transistor acts as non-linear load resistor.
   Resistance increases as the enhancement device turns on, thus reducing power consumption.
- The low output voltage is determined by the size ratio of the devices.

2005

### Overview of Technologies

### Alternative transistors representations for NMOS circuits



Various shorthands are used for simplifying NMOS circuit diagrams.

- Substrate connections need not be drawn since all must connect to Gnd.
- Since source and drain are indistinguishable in the layout, there is no need to show the source on the circuit diagram.

Note that schematic tools not designed for IC design will usually include inappropriate 3-terminal symbols.

### Overview of Technologies

### **CMOS** logic

CMOS - state of the art VLSI.



- An active PMOS device complements the NMOS device giving:
  - rail to rail output swing.
  - negligible static power consumption.

2007

### **Digital CMOS Circuits**

### Alternative transistor representations for CMOS circuits



Digital CMOS circuits<sup>4</sup> tend to use simplified symbols like their NMOS counterparts.

- In general substrate connections are not drawn where they connect to Vdd (PMOS) and Gnd (NMOS).
- All CMOS devices are enhancement mode.
- Transistors act as simple digitally controlled switches.

<sup>&</sup>lt;sup>4</sup>in analog CMOS circuits we may have wells not connected to Vdd/GND

### Digital CMOS Circuits

### Static CMOS complementary gates



• For any set of inputs there will exist either a path to Vdd or a path to Gnd.

2009

### **Digital CMOS Circuits**

### **Compound Gates**



- All compound gates are inverting.
- $\bullet$  Realisable functions are arbitrary AND/OR expressions with inverted output.

Digital CMOS Circuits

### Compound Gate Example





• Bipolar Transitors with Resistors - MSI/LSI

RTL - NOR

TTL - NAND

ECS - OR/NOR

• MOS Transistors (no resistors) - VLSI

**NMOS** 

CMOS - No static power!

Both allow construction of NOR, NAND & Compound gate (always inverting)

### Components for IC Design

### Diodes and Bipolar Transistors

Diode



- Ideal structure 1D
- Real structure 3D
- Depth controlled implants.

3001

### Components for IC Design

### Diodes and Bipolar Transistors

NPN Transistor



• Two n-type implants.

### Components for IC Design

### **MOS Transistors**

Simple NMOS Transistor





3003

### Components for IC Design

### Simple NMOS Transistor

- Active Area mask defines extent of *Thick Oxide*.
- Polysilicon mask also controls extent of *Thin Oxide* (alias *Gate Oxide*).
- N-type implant has no extra mask.
- It is blocked by thick oxide and by polysilicon.
- The implant is *Self Aligned*.
- Substrate connection is to bottom of wafer.
- All substrates to ground.
- Gate connection not above transistor area.
  - Design Rule.

### Interconnect



- Crossing conductors on different masks do not interact<sup>1</sup>.
- Explicit contact/via is required for connection.
- Crossing conductors on the same mask are always connected.

<sup>1</sup>the exception to this rule is that polysilicon crossing diffusion gives us a transistor 3005

### Interconnect

### Resistance



$$R = \left(\frac{\rho}{t}\right) \left(\frac{l}{w}\right)$$

where  $\rho$  is the resistivity constant  $3.2\times 10^{-8}\Omega m \ \ {\rm for\ aluminium}$   $1.7\times 10^{-8}\Omega m \ \ {\rm for\ copper}$ 

Since t and  $\rho$  are fixed for a paricular mask layer, the value that is normally used is the sheet resistance:  $R_s = \binom{\rho}{t}$ .



$$R = R_s \left(\frac{l}{w}\right)$$

where  $R_s$  is sheet resistance  $0.1\Omega/\Box$  for 170nm thick copper

 $R_s=$  resistance of a square (i.e. w=l) so the units for  $R_s$  are  $\Omega/\square$  (ohms per square).

### Components for IC Design



- for larger resistances we need minimum width poly (often combined with a *serpantine* shape) to save on area
- corner squares count as half<sup>2</sup> squares
- for predicatability and matching we may need wider tracks without corners

3007

### Components for IC Design

### Capacitors



- Coupling capacitance to adjacent track  $C = C_c \, l/s$ where  $C_a$ ,  $C_f$ ,  $C_c$  are constants for a given layer and process in digital designs our only aim is to minimise parasitic capacitance

<sup>&</sup>lt;sup>2</sup>effective resistance  $\approx 0.56R_s$ 

## Diode





**NPN Transistor** 





NMOS Enhancement transistor NMOS Process





### Resistor

 1
 2
 3
 4
 5
 6
 7
 8
 9
 1

 3
 18
 17
 16
 15
 14
 13
 12
 11
 2

 19
 2
 2
 2
 2
 2
 2
 2
 2
 2
 2
 2
 2
 2
 2
 2
 2
 2
 2
 2
 2
 3
 3
 3
 3
 3
 3
 3
 3
 3
 3
 3
 3
 3
 3
 3
 3
 3
 3
 3
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4
 4

Capacitors





### **CMOS**

### NMOS Transistor – with top substrate connection



4001

### **CMOS**

### NMOS Transistor – with top substrate connection

Where it is not suitable for substrate connections to be shared, a more complex process is used.

- Five masks must be used to define the transistor:
  - P Well
  - Active Area
  - Polysilicon
  - N+ implant
  - P+ implant
- P Well, for isolation.
- Top *substrate* connection.
- P+/N+ implants produce good *ohmic* contacts.

# CMOS Inverter N-well Active Area N implant P pimplant Polysilicon Contact Window Metal 4003

### **CMOS**

### **CMOS** Inverter

- The process described here is an *N Well process* since it has only an N Well. P Well and Twin Tub processes also exist.
- Note that the P-N junction between chip substrate and N Well will remain reverse biased.

Thus the transistors remain isolated.

- N implant defines NMOS source/drain and PMOS substrate contact.
- P implant defines PMOS source/drain and NMOS substrate contact.

### Processing – Photolithography



### Processing – Mask Making



• Optical reduction allows narrower line widths.





### Design Rules

To prevent chip failure, designs must conform to design rules:

• Single layer rules



• Multi-layer rules



### Derivation of Design Rules



### $0.5~\mu m$ CMOS inverter



5003

### Abstraction

### Levels of Abstraction



- Mask Level Design
- Laborious Technology/Process dependent.
- Design rules may change during a design!
- Transistor Level Design
- Process independent, Technology dependent.
- Gate Level Design
- $Process/Technology\ independent.$

### Abstraction - Stick Diagrams



Stick diagrams give us many of the benefits of abstraction:

- Much easier/faster than full mask specification.
- Process independent (valid for any CMOS process).
- Easy to change.

while avoiding some of the problems:

• Optimized layout may be generated much more easily from a stick diagram than from transistor or gate level designs.<sup>1</sup>

5005

### Digital CMOS Design



### Digital CMOS Design

### Stick Diagrams



5007

### Digital CMOS Design

### Stick Diagrams

- Explore your Design Space.
  - Implications of crossovers.
  - Number of contacts.
  - Arrangement of devices and connections.
- Process independent layout.
- Easy to expand to a full layout for a particular process.

<sup>&</sup>lt;sup>1</sup>note that all IC designs must end at the mask level.

### Sticks and CAD - Symbolic Capture



- Transistors are placed and explicitly sized.
- components are joined with zero width wires.
- contacts are automatically selected as required.
- $\bullet$  A semi-automatic compaction process will create DRC correct layout.

5009

### Sticks and CAD - Magic



- Log style design (sticks with width) DRC errors are flagged immediately.
- again contacts are automatically selected as required.
- On-line DRC leads to rapid generation of correct designs.
- symbolic capture style compaction is available if desired.



### Digital CMOS Design

### A logical approach to gate layout.

 All complementary gates may be designed using a single row of n-transistors above or below a single row of p-transistors, aligned at common gate connections.



### Digital CMOS Design

### **Euler Path**

- For the majority of these gates we can find an arrangement of transistors such that we can butt adjoining transistors.
  - Careful selection of transistor ordering.
  - Careful orientation of transistor source and drain.
- Referred to as *line of diffusion*.



Digital CMOS Design

### Finding an Euler Path

### Computer Algorithms

• It is relatively easy for a computer to consider all possible arrangements of transistors in search of a suitable Euler path.

This is not so easy for the human designer.

### One Human Algorithm

- Find a path which passes through all n-transistors exactly once.
- Express the path in terms of the gate connections.
- Is it possible to follow a similarly labelled path through the p-transistors?
  - Yes you've succeeded.
  - No try again (you may like to try a p path first this time).

6003

### Digital CMOS Design

### Finding an Euler Path



Here there are four possible Euler paths.

### Digital CMOS Design

### Finding an Euler Path





6005



Finding an Euler Path

No possible path through n-transistors! 6007

### Digital CMOS Design

### **Euler Path Example**





- 1. Find Euler path
- 3. Route power nodes 5. Route remaining nodes
- 2. Label poly columns 4. Route output node
  - 6. Add taps<sup>1</sup> for PMOS and NMOS

A combined contact and tap, •, may be used only where a power contact exists at the end of a line of diffusion. Where this is not the case a simple tap,  $\blacksquare$ , should be used.

### Digital CMOS Design

### Finding an Euler Path





<sup>&</sup>lt;sup>1</sup>1 tap is good for about 6 transistors – insufficient taps may leave a chip vulnerable to latch-up 6006

### Finding an Euler Path



No possible path through p-transistors. No re-arrangement will create a solution!

# Investigation of Euler paths leads to more efficient layout\*



### Digital CMOS Design

### Multiple gates



### Digital CMOS Design

### Multiple gates

- Gates should all be of same height.
  - Power and ground rails will then line up when butted.
- All gate inputs and outputs are available at top and bottom.
  - All routing is external to cells.
  - Preserves the benefits of hierarchy.
- Interconnect is via two conductor routing.
  - In this case Polysilicon vertically and Metal horizontally.

### Digital CMOS Design

### Two-layer Metal



Most modern VLSI processes support two or more metal layers.

The norm is to use only metal for inter-cell routing.

usually Metal1 for horizontal vertical inter-cell routing (and for power rails) (and for cell inputs and outputs).

7003

### Standard Cell Design

Many ICs are designed using the standard cell method.

### • Cell Library Creation

A cell library, containing commonly used logic gates<sup>1</sup> is created for a process. This is often carried out by or on behalf of the foundry.

### ASIC<sup>2</sup> Design

The ASIC designer must design a circuit using the logic gates available in the library.

The ASIC designer usually has no access to the full layout of the standard cells and doesn't create any new cells for the library.

Layout work performed by the ASIC designer is divided into two stages:

- Placement
- Routing

¹note that a standard cell may include transistors from more than one basic function (e.g. NAND + inverter to give AND) but will normally be designed *flat* i.e. without layout hierarchy.

<sup>&</sup>lt;sup>2</sup>Application Specific Integrated Circuit

### Standard Cell Design

### Layout and Abstract Views of a Standard Cell



The partial cell layout usually given to the ASIC designer is known as a black box or *abstract* view. The abstract:

- must include cell ports and a cell boundary
- may include some or all of the metal mask information

7005

### Placement & Routing

### Placement



Cells are placed in one or several equal length lines with inter-digitated power and ground rails.

### Routing



In the routing channels between the cells we route metal1 horizontally and metal2 vertically.

7007

### Placement & Routing

### Two conductor routing

- Conductor A for horizontal vertical inter-cell routing <sup>3</sup>
- This logical approach means that we should never have to worry about signals crossing.
- This makes life considerably easier for a computer (or even a human) to complete the routing.
- We must only ensure that two signals will not meet in the same horizontal or vertical channel.
- Computer algorithms can be used to ensure placement of cells such that wires are short.<sup>4</sup>
- Further computer algorithms can be used to optimize the routing itself.

<sup>&</sup>lt;sup>3</sup>In the two-metal example Conductor A is Metal1 and Conductor B is Metal2

<sup>&</sup>lt;sup>4</sup>In VLSI circuits we often find that inter-cell wiring occupies more area than the cells themselves.

### Standard Cell Design

 $\label{eq:more Metal Layers} More\ Metal\ Layers$  With three or more metal layers it is possible to take a different approach. The simplest example uses three metal layers.

- Standard Cells Use only metal1 except for I/O which is in metal2
- Two Conductor Routing Uses metal2 and metal3



### Standard Cell Design

# More Metal Layers

With this approach we can route safely over the cell to the specified pins leading to much smaller gaps between cell rows.

7010

### Standard Cell Design

### Alternative Placement Style

|   | Vdd! |      | Vdd!  |   |     |      | Vdd! |     |      | Vdd | !! |        |   |
|---|------|------|-------|---|-----|------|------|-----|------|-----|----|--------|---|
|   | GND! |      | GND!  |   |     |      | GND  | )!  |      | GN  | D! |        |   |
|   | GND! | GND! |       |   | GN  |      |      | GN  | D!   |     |    | GND!   |   |
|   | Vdd! | Vdd! | _Y_ * |   | Vdc |      |      | Vdc |      |     |    | Vdd!   | D |
|   | Vdd! | Vd   | d!    | Ι | Vd  | d!   |      |     | Vdd! |     |    | dd!    |   |
|   | GND! | GN   | ID!   |   | GN  | ID!  |      |     | GND! |     | G  | ND!    |   |
|   | GND! | GND! | • •   |   |     | GND! |      |     |      | GNI |    | • -• • |   |
| ĺ | Vdd! | Vdd! |       |   |     | Vdd! |      |     |      | Vdd |    |        |   |

By flipping every second row it may be possible to eliminate gaps between rows. N-wells are merged and power or ground rails are shared.

This approach is normally associated with sparse rows and non channel based routing algorithms.



### System Design Choices

- Programmable Logic
  - PLD
  - e.g. Lattice ispGAL22V10, Atmel ATF1502 CPLD
  - Field Programmable Gate Array (FPGA)
  - e.g. Altera Cyclone III, Xilinx Artix-7/Zync-7000
- Semi-Custom Design
  - Mask Programmable Gate Array
  - e.g. ECS CMOS Gate Array
    Altera HardCopy II structured ASICs
  - Standard Cell Design
  - e.g. Alcatel Mietec MTC45000  $0.35 \mu m$  cell library
- Full Custom Design

8001

8002

### System Design Choices

### • Programmable Logic

- Best possible design turnaround time
- Cheapest for prototyping
- Best time to market
- Minimum skill required
- Semi-Custom Design
- Full Custom Design
  - Cheapest for mass production
  - Fastest
  - Lowest Power
  - Highest Density<sup>1</sup>
  - Most skill required



ICT PEEL22CV10 Source: ICT

- One time use Fuse programmable.
- Reprogrammable UV/Electrically Erasable.

8003

### Field Programmable Gate Array - Xilinx XC4000



8004

- Configurable Logic Blocks (CLBs) & I/O Blocks<sup>2</sup>
- Programmable Interconnect

START HERE

optimization limited by speed/power/area trade off

 $<sup>^2</sup>$ Xilinx XC4013 has 576 (24 × 24) CLBs and up to 192 (4 × 48) user I/O pins.

### Field Programmable Gate Array - Xilinx XC4000 CLB



Source: Xilinx

8005



Artix-7 – SLICEM CLB Source: Xilinx



Source: Xilinx

- 4x 6-input Look-Up Tables (LUTs) for combinational logic
- Carry chain supporting fast carry lookahead
- 8x storage elements

LUTs can be alternatively configured as

- 256 bits RAM
- 32-bit shift register

8007

### FPGA - System On Chip

Modern FPGAs are big enough for:

- One or more soft-core processors
- Program memory
- Data memory
- + specialist hardware

The new trend is for FPGAs with hard processors built in:

- Xilinx Zync-7000 includes dual-core ARM A9
- Altera Arria V includes dual-core ARM A9
- Cypress PSoC 4 includes ARM Cortex-M0 and programmable digital<sup>4</sup> and analog blocks

<sup>&</sup>lt;sup>3</sup>Xilinx XC7A200T has 16,825 CLBs (each containing 2 slices) and up to 500 user I/O pins.

<sup>&</sup>lt;sup>4</sup>here the digital block is PLD rather than FPGA

### Mask Programmable Gate Array



### Mask Programmable Gate Array



• Customize Metal and Contact Window masks only.

### Standard Cell Design

• Logic Functions



- Auto Generated Macro Blocks
  - PLA
  - ROM
  - RAM
- System Level Blocks
  - Microprocessor core<sup>5</sup>

8011

### **Full Custom**

All design styles need full custom designers

- to design the base programmable logic chips
- to design building blocks for semi-custom

Where large ASICs use full custom techniques they are likely to be used alongside semi-custom techniques.

e.g. Hand-held computer game chip

- Full custom bitslice datapath hand crafted for optimum area efficiency and low power consumption
- Standard cell controller
- Macro block RAM, ROM

<sup>&</sup>lt;sup>5</sup>Will support System On Chip applications.





All design styles need full custom designers
A large ASIC (especially SoC) may mix Semi–Custom and Full Custom