Top-Down? Bottom Up? A Survey of Hierarchical Design Methodologies

> Trent McConaghy @trentmc0

# Pragmatic Ways to View Al, by Example

## **Classification**, in 2D



#### **Regression**, in 1D



## **Regression**, in 2D



How: Polynomials, splines, neural networks, support vector machines, Gaussian process models, boosted trees, ... [many refs]

#### **Symbolic Regression (SR)** (Like regression, but output a symbolic model too)



[e.g. McConaghy 2005; McConaghy 2011]

#### AI Has a Toolbox of Ways to Solve...

- •Classification Fraud detection, spam filtering ...
- Regression Stock prediction, sensitivity analysis ...
- •Whitebox regression Scientific discovery ...
- •Optimization Airfoil design, circuit simulation ...
- •Structural synthesis Analog synthesis, robotics ...
- Pattern recognition Face recognition, object recog ...
- •System identification Scientific discovery ...

•

- •Ranking Web search, ad serving, social discovery ...
- •Control Auto-driving autos, spacecraft trajectories ...

#### **AI Sub-fields**

- machine learning
- neural networks
- evolutionary computation
- fuzzy logic
- data mining
- artificial general intelligence
- pattern recognition
- .
- (nee) nonlinear programming
- (nee) databases
- .

#### AI Sub-fields of sub fields

- machine learning + neural networks
  - recurrent neural networks
  - sparse linear regression
  - deep learning
- evolutionary computation
  - evolutionary programming, evolution strategies
  - genetic algorithms
  - genetic programming
- ullet

**—** ...

## **Genetic Programming (GP):** A branch of a branch of Al But a super-cool one..

#### **Evolution**









#### **GP for SR**



Searches through the space of trees:

- 1. Initial random population; evaluate
- 2. Create children from parents via operators; evaluate
- 3. Select best; goto 2

#### **GP for SR: Crossover Operator**



#### SR with Vanilla GP – Koza 1991

```
(+ (- (% (RLOG (COS X)) (* (RLOG 0.48800004)
                         (* (+ (- X X) (COS - 0.8))
                           X)))
   (-(\cos -0.8)(\cos -0.8)))
(* (COS (- (COS (COS (+ (RLOG X)
                         (RLOG (COS X)))))
            (RLOG X)))
   (* (COS (- (COS -0.8) (RLOG X)))
      (* (- (% (RLOG (COS X))
               (* (RLOG 0.48800004)
                  (* (+ (- X X) (COS - 0.8)) X)))
            (SIN X))
         (RLOG (COS (RLOG X))))))).
```

#### SR With Fancier GP – McConaghy 2005



| Perf.           | Expression                                                                                   |
|-----------------|----------------------------------------------------------------------------------------------|
|                 |                                                                                              |
| ALF             | -10.3 + 7.08e-5 / id1                                                                        |
|                 | + 1.87 * ln( -1.95e+9 + 1.00e+10 / (vsg1*vsg3)+ 1.42e+9 *(vds2*vsd5) / (vsg1*vgs2*vsg5*id2)) |
| fu              | 10^( 5.68 - 0.03 * vsg1 / vds2 - 55.43 * id1+ 5.63e-6 / id1 )                                |
| PM              | 90.5 + 190.6 * id1 / vsg1 + 22.2 * id2 / vds2                                                |
| Voffset         | - 2.00e-3                                                                                    |
| SRp             | 2.36e+7 + 1.95e+4 * id2 / id1 - 104.69 / id2 + 2.15e+9 * id2 + 4.63e+8 * id1                 |
| SR <sub>n</sub> | - 5.72e+7 - 2.50e+11 * (id1*id2) / vgs2 + 5.53e+6 * vds2 / vgs2 + 109.72 / id1               |

#### **Circuit synthesis with GP – Koza 2001**



#### **Circuit synthesis with GP – McConaghy 2005**





#### Synthesis of jewelry – Hornby 2011







## Let's Get Ambitious! SCALE

#### Auto-design a piston?

Talafatatatatatatatatatatata

**Evolution**?

#### Auto-design a car?

### **Evolution?**



#### How do you evolve a 5K-part car?

# How do you evolve a 10G-part chip?



# How do you evolve a 37T cell animal?



### Approach the Design "Flat"?

- Simple
- What 99% of optimization does
- But, doesn't scale
- Yes, massive compute helps a lot. Eg deep learning
- But even for that: what about a 10x larger net? 1000x?

#### **Top-Down** Design? How, Specifically?



#### **Hierarchy in a Circuit**



#### **Bottom-up** Design? How, Specifically?



### **Hierarchy! Bringing Method to the Madness**

- Divide, conquer, stitch it back together
- Difficulty(design) ≈ Difficulty(hardest block)



### **Hierarchical Design Methodologies**

- Target behavior of each sub-block is well defined
- Design involves
  - A way to estimate performance of a candidate block
  - A way to search each sub-block
  - A way to stitch together the blocks
- Many possible ways!



# A key requirement: we need a way to estimate performance of each sub-block

- Input: design parameters (e.g. device sizes)
- Output: performance metrics (e.g. power, ...)
- Higher-level blocks are lower fidelity (to simulate faster)







• Step 1: map specifications down the hierarchy (via search)







 bottom-up verification to verify performance against specifications



- What happens when constraints cannot be met?
- General heuristic: backtrack







## Feasibility modeling bottom-up

- Model feasible performance space of a design
- For entire range of design parameters
- E.g. for a car: "can have 200mph top speed and 20 mpg separately but not together". Represent this continuously.



#### **Top-down constraint-driven design**

#### • Pros:

- Scale! (Compared to flat)
- Feasible runtime
- Can re-use feasibility models of building blocks
- "Natural choice" for advocates of top-down design

#### • Cons:

- Need a way to know what performance combinations of sub-blocks are feasible
- Once feasibility modeling is done, still need to do top-down steps (i.e. optimization at each node in hierarchy)



#### **Concurrent hierarchical methodology**



- **Pros:** an alternative to "flat"
- **Cons:** scales badly; more complex than "flat"



#### Multi-objective bottom-up design (MOBU)

- Pareto front of a lower level block is "dimensions" in the design space of each layer that uses it
- Effectively "compresses" lower level design spaces into the meaningful part (i.e. into the tradeoffs)



#### **MOBU Example #1**



#### **MOBU Example #2**



#### Multi-objective bottom-up design (MOBU)

#### • Pros:

- Feasible runtime
- Provides system level tradeoffs
- Can re-use multi-objective tradeoffs of building blocks
- "Model" of tradeoffs is simply the discrete designs themselves
- Once once top-level tradeoffs are known, the designs already exist too. No need for a subsequent top-down sizing
- Cons:
  - No full model of feasibility, just of tradeoffs
  - Cannot refine designs precisely, the way that top-down does

## How do you evolve a 37T cell *animal*?





















#### **Bottom-up Ossification**

- Two levels at a time are evolving
- Then lower level's mutation rate →0 (ossifies), and a new upper level emerges



#### **Bottom-up Ossification**

- Two levels at a time are evolving
- Then lower level's mutation rate →0 (ossifies), and a new upper level emerges



## Conclusion

- Hierarchical design methodologies enable ridiculously complicated designs
  - In machine learning
  - And in nature!
- They are *structured*, not ad-hoc
  - Top-down constraint-driven
  - Multi-objective bottom up
  - And more
- Are you ready for 1000x larger nets?

Trent McConaghy @trentmc0

## **Survey / further reading**

- Georges Gielen, Trent McConaghy, and Tom Eeckelaert, "Performance Space Modeling for Hierarchical Synthesis of Analog Integrated Circuits", Proc. Design Automation Conference, 2005. http://trent.st/content/2005\_DAC\_hierarchy.pdf
- For further references, see references of that work