class: center, middle, inverse, title-slide

# Regression III
## More Flexible Fitting Methods
### Dave Armstrong

---

<style type="text/css">
/* custom.css */
.left-code {
  color: #777;
  width: 35%;
  height: 92%;
  float: left;
}
.right-plot {
  width: 63%;
  float: right;
  padding-left: 1%;
}
.right-plot-shift {
  width: 63%;
  float: right;
  padding-left: 1%;
  position: relative;
  top: -100px;
}
.right-plot-shift2 {
  width: 63%;
  float: right;
  padding-left: 1%;
  position: relative;
  top: -50px;
}
.right-plot-shift3 {
  width: 63%;
  float: right;
  padding-left: 1%;
  position: relative;
  top: -25px;
}
.shift {
  position: relative;
  top: -100px;
}
.shift150 {
  position: relative;
  top: -150px;
}
.plot-callout {
  height: 225px;
  width: 450px;
  bottom: 5%;
  right: 5%;
  position: absolute;
  padding: 0px;
  z-index: 100;
}
.plot-callout img {
  width: 100%;
  border: 4px solid #23373B;
}
.pull-right-shift {
  float: right;
  width: 47%;
  position: relative;
  top: -100px;
}
.pull-right-shift2 {
  float: right;
  width: 47%;
  position: relative;
  top: -50px;
}
.pull-right ~ * {
  clear: both;
}
.mycol {
  float: left;
  width: 30%;
  padding: 5px;
}
/* Clear floats after image containers */
.myrow::after {
  content: "";
  clear: both;
  display: table;
}
</style>

# Goals for Today

1. Discuss methods for more flexible fitting.
    - Classification and Regression Trees (CART)
    - Random Forest Regression
    - Multivariate Adaptive Regression Splines (MARS)
    - Adaptive LASSO with Polynomial Expansion (Polywog)
2. Discuss MARS in an inferential context.

---

# Classification and Regression Trees (CART)

CART works in a decision-tree framework.

- Considering all independent variables, find the dichotomization of one of them that explains the most variance.
- Conditional on the previous *split*, find the next dichotomization that explains the most variance.
- The loss function is the familiar residual sum of squares.
- Continue until some stopping rule is met.

---

## Notes

.can-edit[Type notes here...]
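---

# Sketch: One Greedy Split

The first CART step can be sketched in a few lines of base R (a simplified stand-in for what `rpart` does, not its actual implementation): for every candidate cutpoint on a variable, predict the mean of `y` on each side and keep the cut that minimizes the residual sum of squares. The data here are simulated for illustration.

```r
# Find the best single split on x by exhaustive search: each
# candidate cutpoint is scored by the residual sum of squares
# around the two side-specific means.
best_split <- function(x, y) {
  cuts <- sort(unique(x))
  cuts <- (cuts[-1] + cuts[-length(cuts)]) / 2  # midpoints between values
  rss <- sapply(cuts, function(cp) {
    left  <- y[x <  cp]
    right <- y[x >= cp]
    sum((left - mean(left))^2) + sum((right - mean(right))^2)
  })
  list(cut = cuts[which.min(rss)], rss = min(rss))
}

set.seed(1)
x <- runif(200, 20, 65)                         # e.g., age
y <- ifelse(x < 40, 2, 3) + rnorm(200, 0, .2)   # true step at 40

best_split(x, y)$cut   # recovers a cutpoint near 40
```

Applying `best_split()` recursively within each side, until a stopping rule kicks in, is the whole CART algorithm in miniature.

---

## Notes

.can-edit[Type notes here...]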
---

# Notation

`$$f(X_i) = T(X_i, \Theta)\equiv \sum_{b=1}^{B}c_{b}I(X_i \in R_b)$$`

- `\(T()\)` is a regression tree, with rules `\(\Theta\)` governing tree depth, stopping rules, etc.
- `\(X_i\)` is the data.
- `\(c_b\)` is the predicted value in each of the `\(B\)` regions.
- `\(I()\)` is an indicator function.
- `\(R_b\)` defines the different regions in the space.

---

## Notes

.can-edit[Type notes here...]

---

# Stopping Rules

- Each candidate split must increase `\(R^{2}\)` by a pre-specified amount (the `cp` parameter, default=.01).
- A node must contain at least `minsplit` observations before a split is attempted (default=20).
- Each terminal node must have at least `minbucket` observations in it. Defaults to `round(minsplit/3)`.
- Tree depth (`maxdepth`) - with the root node at depth 0, the maximum allowed depth of any node (defaults to 30).

---

## Notes

.can-edit[Type notes here...]

---

# Example

```r
library(rpart)
library(dplyr)
library(car)
data(SLID)
SLID <- SLID %>% 
  dplyr::select(wages, age, education) %>% 
  na.omit
mod <- rpart(log(wages) ~ age + education, data=SLID)
mod
```

```
## n= 4014 
## 
## node), split, n, deviance, yval
## * denotes terminal node
## 
## 1) root 4014 1018.09900 2.619255 
## 2) age< 23.5 615 64.49511 2.064677 *
## 3) age>=23.5 3399 730.23310 2.719598 
## 6) education< 15.65 2536 482.80130 2.641571 
## 12) age< 31.5 554 85.64258 2.488905 *
## 13) age>=31.5 1982 380.63750 2.684244 
## 26) education< 13.95 1560 288.89010 2.646745 *
## 27) education>=13.95 422 81.44458 2.822866 *
## 7) education>=15.65 863 186.62170 2.948886 
## 14) age< 29.5 209 37.94680 2.617102 *
## 15) age>=29.5 654 118.31580 3.054915 *
```

---

## Notes

.can-edit[Type notes here...]

---

# Decision Tree

```r
plot(mod)
text(mod)
```

<img src="lecture12_2020_files/figure-html/regtreeplot-1.png" width="504" height="60%" style="display: block; margin: auto;" />

---

## Notes

.can-edit[Type notes here...]
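---

# Sketch: The `cp` Rule

The `cp` rule can be illustrated with a toy base-R check (illustrative only, not `rpart`'s code): a split is kept only when it reduces the root node's deviance by at least a fraction `cp`, i.e., improves `\(R^2\)` by at least `cp`.

```r
# Keep a split only if the proportional reduction in deviance
# (the gain in R-squared) is at least cp.
keep_split <- function(y, in_left, cp = 0.01) {
  root <- sum((y - mean(y))^2)
  kids <- sum((y[in_left]  - mean(y[in_left]))^2) +
          sum((y[!in_left] - mean(y[!in_left]))^2)
  (root - kids) / root >= cp
}

set.seed(2)
y <- c(rnorm(50, 0), rnorm(50, 4))          # two well-separated groups
in_left <- rep(c(TRUE, FALSE), each = 50)   # the "right" split

keep_split(y, in_left)             # TRUE: a large improvement
keep_split(y, in_left, cp = 0.9)   # FALSE: demands too much improvement
```

---

## Notes

.can-edit[Type notes here...]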
--- # Surface Plot .left-code[ ```r age.s <- seq(20, 95, length=25) educ.s <- seq(9, 20, length=25) cartpred <- function(x,y){ predict(mod, newdata=data.frame( age = x, education=y)) } p <- outer(age.s, educ.s, cartpred) ``` ```r library(plotly) plot_ly() %>% add_surface(x=~age.s, y=~educ.s, z=~exp(p)) %>% layout( scene= list( xaxis=list(title="Age"), yaxis=list(title="Education"), zaxis=list(title="Predicted Wage"))) ``` ] .right-plot-shift[
]

---

## Notes

.can-edit[Type notes here...]

---

# Compare to Other Models

<img src="lecture12_2020_files/figure-html/comptreemods-1.png" width="504" height="80%" style="display: block; margin: auto;" />

---

## Notes

.can-edit[Type notes here...]

---

# Smoothness of CART Models

Problems with CART:

- Not particularly smooth - step functions aren't great approximations for smooth curves (though they can do OK).
- No real means for inference here. Bootstrapping can be problematic because the function is "non-regular" (small changes in the data can result in wild changes in the model).
- In more complicated models, it's difficult to figure out what effects look like.

---

## Notes

.can-edit[Type notes here...]

---

# Visualizing Partial Effects: Partial Dependence Plot

The PDP plots the change in the average predicted value as a subset of features `\(S\)` changes, averaged over the remaining features `\(C\)`, where `\(C\)` is the complement of `\(S\)`. Formally:

`$$f_S = \mathbb{E}_{x_{C}}\left[f(\mathbf{x}_{S}, \mathbf{x}_{C})\right] = \int f(\mathbf{x}_{S}, \mathbf{x}_{C})dP(\mathbf{x}_{C})$$`

In words: we are predicting with `\(f()\)` as the variables in `\(S\)` change, averaged over all of the variables in `\(C\)`.

---

## Notes

.can-edit[Type notes here...]

---

# ICE Plots

ICE disaggregates the PDP.

- The PDP is obtained by averaging over all of the ICE curves.
- Plots `\(N\)` different curves to enable evaluation of effect heterogeneity.
- Heterogeneity essentially means interactions with variables in `\(C\)`.

Each observation's curve fixes the `\(C\)` variables at that observation's own values, so no averaging is involved:

`$$f_{S}^{(i)} = f(\mathbf{x}_{S}, \mathbf{x}_{C_{i}})$$`

---

## Notes

.can-edit[Type notes here...]
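---

# Sketch: ICE and PDP by Hand

The definitions above translate directly into code. This base-R sketch uses a toy linear model (simulated data, so it is fully self-contained) rather than the CART fit: build the ICE curves row by row, then average them to get the PDP.

```r
# Simulate data and fit a model; then, for each grid value g of x,
# set x = g for every observation and predict. Each row of ice_mat
# is one observation's ICE curve; the PDP is the column-wise mean.
set.seed(3)
d   <- data.frame(x = runif(100), z = runif(100))
d$y <- d$x^2 + d$z + rnorm(100, sd = 0.1)
fit <- lm(y ~ poly(x, 2, raw = TRUE) + z, data = d)

grid    <- seq(0, 1, length.out = 11)
ice_mat <- sapply(grid, function(g) {
  d2 <- d
  d2$x <- g                  # hold x fixed at g, keep each row's own z
  predict(fit, newdata = d2)
})                           # 100 rows (curves) x 11 grid points
pdp <- colMeans(ice_mat)     # averaging the ICE curves gives the PDP
```

Plotting the rows of `ice_mat` against `grid` gives the ICE plot; here every curve is a parallel shift of the same quadratic, which is exactly what "no interaction with the `\(C\)` variables" looks like.

---

## Notes

.can-edit[Type notes here...]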
---

# Ice Ice Baby

.left-code[

```r
library(ICEbox)
library(RColorBrewer)
ice1 <- ice(mod, SLID, y=mod$y, 
    predictor="age")
cice1 <- clusterICE(ice1, 
    nClusters=3, 
    plot_legend=TRUE, 
    colorvec=brewer.pal(3, "Set1"))
```

]

.right-plot-shift[

<img src="lecture12_2020_files/figure-html/unnamed-chunk-3-1.png" width="90%" style="display: block; margin: auto;" />

]

---

## Notes

.can-edit[Type notes here...]

---

# PDPs in R

.left-code[

```r
library(pdp)
p1 <- partial(mod, pred.var="age", 
    chull=TRUE)
plotPartial(p1)
```

]

.right-plot-shift[

<img src="lecture12_2020_files/figure-html/unnamed-chunk-4-1.png" width="90%" style="display: block; margin: auto;" />

]

---

## Notes

.can-edit[Type notes here...]

---

# Ensemble Methods

Ensemble methods produce many `\((M)\)` trees to better fit `\(f(X)\)` and to prevent overfitting, with general form:

`$$f(X_i) = \sum_{m=1}^{M}T_{m}(X_i, \Theta_m)$$`

**Tree Bagging (Bootstrap Aggregating)**

- Draw lots of random samples from the data.
- Fit a _deep_ tree to each random sample.
- Average across the trees: `\(\hat{f}_{\text{bag}}(X_i) = \frac{1}{M}\sum_{m=1}^{M}T_m(X_i, \Theta_m)\)`

The variance reduction depends on the often dubious assumption of independence across trees.

---

## Notes

.can-edit[Type notes here...]

---

# Random Forests

.pull-left[

A tree-bagging algorithm meant to increase independence across trees.

- In each random sample, only a small random subset `\((a)\)` of the total `\(j\)` covariates is used in the splitting algorithm.
- Reduces the correlation across trees - and thus the variance of the aggregate - when `\(a\)` is small.

```r
library(randomForest)
rfmod <- randomForest(
  log(wages) ~ age+education, 
  data=SLID, 
  type="regression")
```

]

.pull-right-shift2[
]

---

## Notes

.can-edit[Type notes here...]

---

# Compare to Other Models

<img src="lecture12_2020_files/figure-html/comprfmods-1.png" width="504" height="80%" style="display: block; margin: auto;" />

---

## Notes

.can-edit[Type notes here...]

---

# Multivariate Adaptive Regression Splines (MARS)

.pull-left[

The main component of MARS is a pair of piecewise linear (hinge) splines.

`$$\begin{aligned} (x-t)_{+} &= \left\{\begin{array}{ll} x-t & \text{ if } x > t\\ 0 & \text{ otherwise.} \end{array}\right.\\ (t-x)_{+} &= \left\{\begin{array}{ll} t-x & \text{ if } x < t\\ 0 & \text{ otherwise.} \end{array}\right. \end{aligned}$$`

]

.pull-right-shift[

<img src="lecture12_2020_files/figure-html/hinge-1.png" width="100%" />

]

---

## Notes

.can-edit[Type notes here...]

---

# MARS Notation

MARS takes the form:

`$$f(x) = \beta_0 + \sum_{m=1}^{M} \beta_mh_m(x)$$`

where each `\(h_m\)` is a hinge function (added in pairs on the forward pass) or a product of hinge functions.

Computationally:

- Forward pass - add pairs of hinge functions by greatest reduction in SSRes until the maximum number of terms is reached.
- Backward pass - remove individual functions, one at a time, by minimum increase in SSRes, keeping the model that optimizes the GCV criterion.

---

## Notes

.can-edit[Type notes here...]

---

# Interactions

- The `degree` parameter in the R algorithm controls the degree of interaction you want to allow.
- This can make the model really complicated because it expands all possible interactions among hinge functions and then prunes them in the backward pass.
- This model is more easily constrained (particularly w.r.t. additivity) than the other models we talked about before.
- You can also identify variables that will enter the model linearly *if they enter the model at all*.

---

## Notes

.can-edit[Type notes here...]
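---

# Sketch: Hinge Functions

The hinge pair is one line of base R. As a check on the intuition (simulated data, with the knot fixed at its true location rather than searched for, as MARS's forward pass would do), regressing on one hinge pair recovers a kinked line:

```r
# (x - t)_+ : zero below the knot t, linear above it.
h <- function(x, t) pmax(x - t, 0)

set.seed(4)
x <- runif(200, 0, 10)
y <- ifelse(x > 5, x - 5, 0) + rnorm(200, sd = 0.2)  # true kink at x = 5

# One MARS basis pair with the knot at 5: h(x, 5) and h(5, x)
fit <- lm(y ~ h(x, 5) + h(5, x))
coef(fit)   # slope near 1 above the knot, near 0 below it
```

MARS's real work is choosing the knot `t` and the variables entering each `h_m`; the fitting step for any given basis is just this least-squares regression.

---

## Notes

.can-edit[Type notes here...]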
--- # MARS Wages .pull-left[ ```r library(earth) emod <- earth(log(wages) ~ age + education, data=SLID, degree=3) ``` ] .pull-right-shift2[ ```r summary(emod) ``` ``` ## Call: earth(formula=log(wages)~age+education, data=SLID, degree=3) ## ## coefficients ## (Intercept) 2.67861807 ## h(32-age) -0.04945215 ## h(age-32) -0.03079281 ## h(12.6-education) -0.16237528 ## h(education-12.6) 0.15308353 ## h(age-32) * h(education-17.1) -0.01150617 ## h(age-32) * h(17.1-education) 0.00666845 ## h(age-34) * h(education-12.6) 0.00552501 ## h(51-age) * h(education-12.6) -0.00489692 ## h(58-age) * h(12.6-education) 0.00426955 ## h(age-58) * h(12.6-education) -0.01536656 ## ## Selected 11 of 12 terms, and 2 of 2 predictors ## Termination condition: RSq changed by less than 0.001 at 12 terms ## Importance: age, education ## Number of terms at each degree of interaction: 1 4 6 ## GCV 0.166029 RSS 657.835 GRSq 0.3457333 RSq 0.3538597 ``` ] --- ## Notes .can-edit[Type notes here...] --- # Surface Plot .left-code[ ```r marspred <- function(x,y){ predict(emod, newdata=data.frame(age = x, education=y)) } p2 <- outer(age.s, educ.s, marspred) ``` ```r plot_ly() %>% add_surface(x=~age.s, y=~educ.s, z=~exp(p2)) %>% layout( scene= list( xaxis=list(title="Age"), yaxis=list(title="Education"), zaxis=list(title="Predicted Wage") )) ``` ] .right-plot-shift[
]

---

## Notes

.can-edit[Type notes here...]

---

# Plots

.pull-left[

<img src="lecture12_2020_files/figure-html/ice2-1.png" width="100%" style="display: block; margin: auto;" />

]

.pull-right[

<img src="lecture12_2020_files/figure-html/pdp2a-1.png" width="100%" style="display: block; margin: auto;" />

]

---

## Notes

.can-edit[Type notes here...]

---

# Compare to Other Models

<img src="lecture12_2020_files/figure-html/comptreemods2-1.png" width="504" height="80%" style="display: block; margin: auto;" />

---

## Notes

.can-edit[Type notes here...]

---

# Variance Models

- You can't get confidence intervals from these models because they don't take into account the selection mechanism.
- MARS picks terms essentially because they are good predictors, so the terms in the model will necessarily have small p-values.
- You can, however, get prediction intervals - essentially the variability in future observations predicted by the model.
- The `varmod.method` argument allows you to model the residual variance by regressing the absolute residuals on the fitted values.
- Prediction variance is:

`$$\varepsilon_{i,future}^{2} = \frac{(y_i-\hat{y}_{i})^{2}}{(1-h_{ii})} + \text{modvar}_{i}$$`

---

## Notes

.can-edit[Type notes here...]

---

# Prediction Variances in earth

```r
library(mgcv)
e2 <- earth(log(wages) ~ age + education, 
    data=SLID, nfold=10, ncross=10, 
    pmethod="cv", degree=2, 
    varmod.meth="gam")
plotmo(e2, pt.col=1, level=.95)
```

---

## Notes

.can-edit[Type notes here...]

---

# Plots

.shift[

<img src="lecture12_2020_files/figure-html/fitvals-1.png" width="50%" style="display: block; margin: auto;" />

]

---

## Notes

.can-edit[Type notes here...]
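---

# Sketch: Leverage-Corrected Residuals

The first term of the prediction variance above is the squared residual inflated by leverage (a PRESS-style correction). A base-R sketch with a plain linear model on simulated data (in `earth`, the second `modvar` term would be added on top of this):

```r
# Squared residuals divided by (1 - h_ii): high-leverage observations
# get their apparent error inflated, approximating out-of-sample
# (future) squared error.
set.seed(5)
d   <- data.frame(x = runif(100))
d$y <- 1 + 2 * d$x + rnorm(100)
fit <- lm(y ~ x, data = d)

err2_future <- residuals(fit)^2 / (1 - hatvalues(fit))
mean(err2_future) > mean(residuals(fit)^2)   # TRUE: the correction always inflates
```

Because `\(0 < h_{ii} < 1\)`, the corrected errors are always at least as large as the in-sample squared residuals - which is the point: the model looks better on the data it was fit to than it will on new data.

---

## Notes

.can-edit[Type notes here...]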
---

# Polywog

Polywog is a method developed by Kenkel and Signorino that puts together two pieces we've already considered:

- Polynomial expansion: If the degree = 3 and we have variables `\(\{x_1, x_2\}\)` in our model, then the following terms would be included in the expansion: `\(x_1, x_2, x_1^2, x_2^2, x_1^3, x_2^3, x_1x_2, x_1^2x_2, x_2^2x_1\)`.
- Adaptive LASSO: We use the adaptive LASSO to figure out which of the polynomial expansion terms to keep in the model.

---

## Notes

.can-edit[Type notes here...]

---

# Polywog Example

.pull-left[

```r
library(polywog)
p1 <- polywog(log(wages) ~ age + education, 
    data=SLID, degree=4)
```

]

.pull-right[

```
## 
## Call:
## polywog(formula = log(wages) ~ age + education, data = SLID, 
## degree = 4)
## 
## Coefficients:
## Estimate Std. Error
## (Intercept) 1.438e+00 NA
## age 1.830e-02 NA
## education 0.000e+00 NA
## age^2 0.000e+00 NA
## age.education 4.063e-03 NA
## education^2 -4.891e-03 NA
## age^3 0.000e+00 NA
## age^2.education -7.690e-05 NA
## age.education^2 1.129e-04 NA
## education^3 5.197e-05 NA
## age^4 1.172e-08 NA
## age^3.education 0.000e+00 NA
## age^2.education^2 0.000e+00 NA
## age.education^3 0.000e+00 NA
## education^4 0.000e+00 NA
## 
## Regularization method: Adaptive LASSO
## Adaptive weights: inverse linear model coefficients
## Number of observations: 4014
## Polynomial expansion degree: 4
## Model family: gaussian
## Bootstrap iterations: 0
## Penalization parameter (lambda): 123.5
```

]

---

## Notes

.can-edit[Type notes here...]

---

# Plots

.pull-left[

<img src="lecture12_2020_files/figure-html/ice3-1.png" width="100%" style="display: block; margin: auto;" />

]

.pull-right[

<img src="lecture12_2020_files/figure-html/pdp3a-1.png" width="100%" style="display: block; margin: auto;" />

]

---

## Notes

.can-edit[Type notes here...]
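---

# Sketch: The Polynomial Expansion

The degree-3 expansion for two variables listed above can be generated with base R's `polym()` (this reproduces the set of terms, not `polywog`'s internal code):

```r
# All raw terms x1^a * x2^b with 1 <= a + b <= 3.
x1 <- c(1, 2, 3)
x2 <- c(3, 4, 5)
X  <- polym(x1, x2, degree = 3, raw = TRUE)

colnames(X)   # e.g., "1.0" is x1, "0.1" is x2, "2.1" is x1^2 * x2
ncol(X)       # 9 columns, matching the nine terms listed above
```

The adaptive LASSO then shrinks most of these columns' coefficients exactly to zero, which is how polywog keeps the expansion from overfitting.

---

## Notes

.can-edit[Type notes here...]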
--- # Surface Plot .left-code[ ```r pwogpred <- function(x,y){ predict(p1, newdata=data.frame(age = x, education=y)) } p3 <- outer(age.s, educ.s, pwogpred) ``` ```r plot_ly() %>% add_surface(x=~age.s, y=~educ.s, z=~exp(p3)) %>% layout( scene= list( xaxis=list(title="Age"), yaxis=list(title="Education"), zaxis=list(title="Predicted Wage") )) ``` ] .right-plot-shift[
]

---

## Notes

.can-edit[Type notes here...]

---

# Compare to Other Models

<img src="lecture12_2020_files/figure-html/comptreemods3-1.png" width="504" height="80%" style="display: block; margin: auto;" />

---

## Notes

.can-edit[Type notes here...]

---

# Barry and Kleinberg Data

```r
library(gamlss)
load(file("https://quantoid.net/files/reg3/bk.rda"))
bk <- as.data.frame(bk)
model2 <- gamlss(usfdi2000_adj ~ L_ab_sum + L_hse_usnum + 
    L_hse_sanction + L_growth + L_lnpercap + L_lnpop + 
    L_polity2 + L_durable + L_civintensity + L_spending + 
    L_s_us + L_lnustrade + L_us_distance + L_usstock2000_adj + 
    allfdi2000, data = bk)
```

```
## GAMLSS-RS iteration 1: Global Deviance = 16189.29 
## GAMLSS-RS iteration 2: Global Deviance = 16189.29
```

```r
model3 <- gamlss(usfdi2000_adj ~ L_sancmeanshare + L_tradeshare + 
    L_hse_sanction + L_growth + L_lnpercap + L_lnpop + 
    L_polity2 + L_durable + L_civintensity + L_spending + 
    L_s_us + L_lnustrade + L_us_distance + L_usstock2000_adj + 
    allfdi2000, data = bk)
```

```
## GAMLSS-RS iteration 1: Global Deviance = 16182.26 
## GAMLSS-RS iteration 2: Global Deviance = 16182.26
```

---

## Notes

.can-edit[Type notes here...]

---

# CART Models

```r
bk.cart1 <- rpart(usfdi2000_adj ~ L_ab_sum + L_hse_usnum + 
    L_hse_sanction + L_growth + L_lnpercap + L_lnpop + 
    L_polity2 + L_durable + L_civintensity + L_spending + 
    L_s_us + L_lnustrade + L_us_distance + L_usstock2000_adj + 
    allfdi2000, data = bk)
bk.cart2 <- rpart(usfdi2000_adj ~ L_sancmeanshare + L_tradeshare + 
    L_hse_sanction + L_growth + L_lnpercap + L_lnpop + 
    L_polity2 + L_durable + L_civintensity + L_spending + 
    L_s_us + L_lnustrade + L_us_distance + L_usstock2000_adj + 
    allfdi2000, data = bk)
```

---

## Notes

.can-edit[Type notes here...]
---

# CART Model Results

.pull-left[

```r
bk.cart1
```

```
## n= 2863 
## 
## node), split, n, deviance, yval
## * denotes terminal node
## 
## 1) root 2863 61629.460 2.976535 
## 2) L_usstock2000_adj< 8.908139 2184 36891.230 1.973858 
## 4) L_usstock2000_adj< 6.56151 1169 12158.840 1.038018 *
## 5) L_usstock2000_adj>=6.56151 1015 22529.440 3.051688 
## 10) L_growth< 2.675 330 8621.694 1.847877 *
## 11) L_growth>=2.675 685 13199.140 3.631627 *
## 3) L_usstock2000_adj>=8.908139 679 15480.040 6.201639 
## 6) L_usstock2000_adj< 10.03128 327 8397.456 4.996460 *
## 7) L_usstock2000_adj>=10.03128 352 6166.405 7.321224 *
```

]

.pull-right[

```r
bk.cart2
```

```
## n= 2863 
## 
## node), split, n, deviance, yval
## * denotes terminal node
## 
## 1) root 2863 61629.460 2.976535 
## 2) L_usstock2000_adj< 8.908139 2184 36891.230 1.973858 
## 4) L_usstock2000_adj< 6.56151 1169 12158.840 1.038018 *
## 5) L_usstock2000_adj>=6.56151 1015 22529.440 3.051688 
## 10) L_growth< 2.675 330 8621.694 1.847877 *
## 11) L_growth>=2.675 685 13199.140 3.631627 *
## 3) L_usstock2000_adj>=8.908139 679 15480.040 6.201639 
## 6) L_usstock2000_adj< 10.03128 327 8397.456 4.996460 *
## 7) L_usstock2000_adj>=10.03128 352 6166.405 7.321224 *
```

]

---

## Notes

.can-edit[Type notes here...]
---

# Random Forests

```r
bk.rf1 <- randomForest(usfdi2000_adj ~ L_ab_sum + L_hse_usnum + 
    L_hse_sanction + L_growth + L_lnpercap + L_lnpop + 
    L_polity2 + L_durable + L_civintensity + L_spending + 
    L_s_us + L_lnustrade + L_us_distance + L_usstock2000_adj + 
    allfdi2000, data = bk, mtry=3)
bk.rf2 <- randomForest(usfdi2000_adj ~ L_sancmeanshare + L_tradeshare + 
    L_hse_sanction + L_growth + L_lnpercap + L_lnpop + 
    L_polity2 + L_durable + L_civintensity + L_spending + 
    L_s_us + L_lnustrade + L_us_distance + L_usstock2000_adj + 
    allfdi2000, data = bk, mtry=3)
```

---

## Notes

.can-edit[Type notes here...]

---

# Variable Importance

.pull-left[

```r
imp1 <- importance(bk.rf1)
imp1/max(imp1)
```

```
## IncNodePurity
## L_ab_sum 0.13914331
## L_hse_usnum 0.27336546
## L_hse_sanction 0.02201843
## L_growth 0.43484693
## L_lnpercap 0.57810416
## L_lnpop 0.47848065
## L_polity2 0.23685643
## L_durable 0.39365502
## L_civintensity 0.06230792
## L_spending 0.37971906
## L_s_us 0.37797556
## L_lnustrade 0.70098276
## L_us_distance 0.24532186
## L_usstock2000_adj 1.00000000
## allfdi2000 0.33512307
```

]

.pull-right[

```r
imp2 <- importance(bk.rf2)
imp2/max(imp2)
```

```
## IncNodePurity
## L_sancmeanshare 0.55040707
## L_tradeshare 0.39862913
## L_hse_sanction 0.01993749
## L_growth 0.42616856
## L_lnpercap 0.56690562
## L_lnpop 0.44173764
## L_polity2 0.21967758
## L_durable 0.38148594
## L_civintensity 0.06104718
## L_spending 0.38414386
## L_s_us 0.39166370
## L_lnustrade 0.69009928
## L_us_distance 0.22720748
## L_usstock2000_adj 1.00000000
## allfdi2000 0.35339199
```

]

---

## Notes

.can-edit[Type notes here...]
---

# ICE Plot

.left-code[

```r
library(RColorBrewer)
bk.i <- ice(bk.rf1, bk, 
    predictor = "L_hse_usnum", 
    frac_to_build = 1)
crv <- bk.i$ice_curves
crv <- t(apply(crv, 1, function(x)x-mean(x)))
sapply(2:10, function(i){
  k <- kmeans(crv, centers=i)
  k$betweenss/(k$tot.withinss+k$betweenss)
})
bk.c <- clusterICE(bk.i, nClusters = 6, 
    colorvec=brewer.pal(6, "Set1"), 
    plot_legend = TRUE)
```

]

.right-plot-shift[

<img src="lecture12_2020_files/figure-html/unnamed-chunk-20-1.png" width="90%" style="display: block; margin: auto;" />

]

---

## Notes

.can-edit[Type notes here...]

---

# ICE Plot

.left-code[

```r
bk.i2 <- ice(bk.rf2, bk, 
    predictor = "L_sancmeanshare", 
    frac_to_build = 1, 
    num_grid_pts = 25)
crv <- bk.i2$ice_curves
crv <- t(apply(crv, 1, function(x)x-mean(x)))
sapply(2:10, function(i){
  k <- kmeans(crv, centers=i)
  k$betweenss/(k$tot.withinss+k$betweenss)
})
bk.c <- clusterICE(bk.i2, nClusters = 5, 
    colorvec=brewer.pal(5, "Set1"), 
    plot_legend = TRUE)
```

]

.right-plot-shift[

<img src="lecture12_2020_files/figure-html/unnamed-chunk-21-1.png" width="90%" style="display: block; margin: auto;" />

]

---

## Notes

.can-edit[Type notes here...]

---

# MARS Models

```r
bk.e1 <- earth(usfdi2000_adj ~ L_ab_sum + L_hse_usnum + 
    L_hse_sanction + L_growth + L_lnpercap + L_lnpop + 
    L_polity2 + L_durable + L_civintensity + L_spending + 
    L_s_us + L_lnustrade + L_us_distance + L_usstock2000_adj + 
    allfdi2000, data = bk, degree=3, pmethod="backward")
bk.e2 <- earth(usfdi2000_adj ~ L_sancmeanshare + L_tradeshare + 
    L_hse_sanction + L_growth + L_lnpercap + L_lnpop + 
    L_polity2 + L_durable + L_civintensity + L_spending + 
    L_s_us + L_lnustrade + L_us_distance + L_usstock2000_adj + 
    allfdi2000, data = bk, degree=3, pmethod="backward")
```

---

## Notes

.can-edit[Type notes here...]
--- # Results 1 ```r summary(bk.e1) ``` ``` ## Call: earth(formula=usfdi2000_adj~L_ab_sum+L_hse_usnum+L_hse_sanction+...), ## data=bk, pmethod="backward", degree=3) ## ## coefficients ## (Intercept) 0.63392319 ## h(17-L_hse_usnum) 0.24534151 ## h(L_hse_usnum-17) 0.14932381 ## h(5.84698-L_usstock2000_adj) -0.11501938 ## h(L_usstock2000_adj-5.84698) 1.39884572 ## h(17-L_hse_usnum) * h(L_s_us-0.452007) -2.07349368 ## h(17-L_hse_usnum) * h(L_lnpercap-8.72583) * h(L_s_us-0.452007) 1.32325615 ## h(17-L_hse_usnum) * h(8.72583-L_lnpercap) * h(L_s_us-0.452007) 0.60472031 ## h(17-L_hse_usnum) * h(L_lnpop-16.1764) * h(L_s_us-0.452007) 0.71042312 ## h(17-L_hse_usnum) * h(16.1764-L_lnpop) * h(L_s_us-0.452007) 0.63388703 ## h(17-L_hse_usnum) * h(0.452007-L_s_us) * h(6.39192-L_lnustrade) -0.26949929 ## h(L_hse_usnum-17) * h(0.593291-L_s_us) * h(L_lnustrade-9.24532) -0.51248303 ## h(L_hse_usnum-17) * h(0.593291-L_s_us) * h(9.24532-L_lnustrade) -0.08992691 ## h(13-L_growth) * h(L_lnpercap-9.49552) * h(L_usstock2000_adj-5.84698) -0.05865711 ## h(13-L_growth) * h(9.49552-L_lnpercap) * h(L_usstock2000_adj-5.84698) -0.04877511 ## h(13-L_growth) * h(L_lnpop-18.6438) * h(L_usstock2000_adj-5.84698) 0.10606737 ## h(13-L_growth) * h(18.6438-L_lnpop) * h(L_usstock2000_adj-5.84698) -0.00737439 ## h(L_growth- -7.84) * h(-9-L_polity2) * h(L_usstock2000_adj-5.84698) -0.10431613 ## h(13-L_growth) * h(21.5-L_spending) * h(L_usstock2000_adj-5.84698) 0.00391730 ## h(13-L_growth) * h(L_usstock2000_adj-5.84698) * h(59834.9-allfdi2000) -0.00000080 ## ## Selected 20 of 31 terms, and 10 of 15 predictors ## Termination condition: Reached nk 31 ## Importance: L_usstock2000_adj, L_growth, allfdi2000, L_hse_usnum, ... ## Number of terms at each degree of interaction: 1 4 1 14 ## GCV 15.89249 RSS 43971.69 GRSq 0.262229 RSq 0.2865151 ``` --- ## Notes .can-edit[Type notes here...] 
--- # Results 2 ```r summary(bk.e2) ``` ``` ## Call: earth(formula=usfdi2000_adj~L_sancmeanshare+L_tradeshare+L_hse_s...), ## data=bk, pmethod="backward", degree=3) ## ## coefficients ## (Intercept) 1.2590580 ## h(L_s_us-0.759834) 23.4287388 ## h(5.84698-L_usstock2000_adj) -0.1385368 ## h(L_usstock2000_adj-5.84698) 1.4170450 ## h(13-L_growth) * h(L_usstock2000_adj-5.84698) 0.0553836 ## h(L_lnpercap-10.0476) * h(0.759834-L_s_us) 8.1246653 ## h(8.17926-L_lnustrade) * h(L_usstock2000_adj-5.84698) 0.5108149 ## h(L_lnustrade-8.17926) * h(L_usstock2000_adj-5.84698) -0.2692848 ## h(13-L_growth) * h(9.4572-L_lnpercap) * h(L_usstock2000_adj-5.84698) -0.0572491 ## h(13-L_growth) * h(L_lnpop-18.6438) * h(L_usstock2000_adj-5.84698) 0.1023436 ## h(13-L_growth) * h(18.6438-L_lnpop) * h(L_usstock2000_adj-5.84698) -0.0263415 ## h(13-L_growth) * h(16.9-L_spending) * h(L_usstock2000_adj-5.84698) 0.0032823 ## h(1.39-L_growth) * h(L_lnustrade-8.17926) * h(L_usstock2000_adj-5.84698) 0.0617883 ## h(L_growth-1.39) * h(L_lnustrade-8.17926) * h(L_usstock2000_adj-5.84698) 0.0421413 ## h(13-L_growth) * h(L_usstock2000_adj-5.84698) * h(59834.9-allfdi2000) -0.0000012 ## h(L_lnpercap-10.3123) * h(L_lnustrade-8.17926) * h(L_usstock2000_adj-5.84698) -1.8565817 ## h(-9-L_polity2) * h(8.17926-L_lnustrade) * h(L_usstock2000_adj-5.84698) -0.9600373 ## h(0.333333-L_s_us) * h(8.17926-L_lnustrade) * h(L_usstock2000_adj-5.84698) -2.9656317 ## h(L_s_us-0.333333) * h(8.17926-L_lnustrade) * h(L_usstock2000_adj-5.84698) -1.6569208 ## h(8.17926-L_lnustrade) * h(5425-L_us_distance) * h(L_usstock2000_adj-5.84698) 0.0001429 ## ## Selected 20 of 31 terms, and 10 of 15 predictors ## Termination condition: Reached nk 31 ## Importance: L_usstock2000_adj, L_growth, allfdi2000, L_lnpercap, ... ## Number of terms at each degree of interaction: 1 3 4 12 ## GCV 16.07655 RSS 44480.94 GRSq 0.2536847 RSq 0.278252 ``` --- ## Notes .can-edit[Type notes here...] 
--- # ICE Plot .left-code[ ```r bk.i <- ice(bk.e1, bk, predictor = "L_hse_usnum", frac_to_build = 1) crv <- bk.i$ice_curves crv <- t(apply(crv, 1, function(x)x-mean(x))) sapply(2:10, function(i){ k <- kmeans(crv, centers=i) k$betweenss/(k$tot.withinss+k$betweenss) }) bk.c <- clusterICE(bk.i, nClusters = 4, colorvec=brewer.pal(4, "Set1"), plot_legend = TRUE) ``` ] .right-plot-shift[ <img src="lecture12_2020_files/figure-html/unnamed-chunk-25-1.png" width="90%" style="display: block; margin: auto;" /> ] --- ## Notes .can-edit[Type notes here...] --- ## Investigating clusters .pull-left[ ```r library(nnet) bk$cluster <- bk.c$cluster clust.mod <- multinom(cluster ~ L_ab_sum + L_hse_usnum + L_hse_sanction + L_growth + L_lnpercap + L_lnpop + L_polity2 + L_durable + L_civintensity + L_spending + L_s_us + L_lnustrade + L_us_distance + L_usstock2000_adj + allfdi2000, data=bk) ``` ``` ## # weights: 68 (48 variable) ## initial value 3968.960756 ## iter 10 value 3516.858960 ## iter 20 value 3460.161139 ## iter 30 value 3448.755375 ## iter 40 value 3361.707671 ## iter 50 value 3283.336646 ## final value 3281.728662 ## converged ``` ] .pull-right[ ```r DAMisc::mnlChange(clust.mod, bk) ``` ``` ## [,1] [,2] [,3] [,4] ## L_ab_sum -0.007 -0.105* -0.058* 0.170* ## L_hse_usnum -0.049* -0.047* 0.043* 0.054* ## L_hse_sanction 0.013* 0.030* -0.039* -0.003* ## L_growth 0.119* -0.149* -0.011* 0.041* ## L_lnpercap -0.288* 0.141* 0.245* -0.098* ## L_lnpop -0.298* 0.038* 0.276* -0.017* ## L_polity2 0.099* -0.016* -0.096* 0.012* ## L_durable 0.250* -0.115* 0.006 -0.141* ## L_civintensity 0.114* -0.086* -0.022* -0.007* ## L_spending -0.221* -0.018 0.181* 0.058* ## L_s_us -0.152* -0.002* 0.071* 0.084* ## L_lnustrade 0.294* 0.205* -0.146* -0.354* ## L_us_distance 0.228* -0.177* -0.006 -0.045 ## L_usstock2000_adj 0.191* -0.091* -0.395* 0.296* ## allfdi2000 0.060 0.042 -0.016 -0.086* ``` ] --- ## Notes .can-edit[Type notes here...] 
---

# Venus

Venus is a project that I am working on with Duncan Murdoch.

- Some variables are subject to a MARS fit.
  - A good way to control for those variables flexibly.
- Other variables are included in their assumed parametric form.

---

## Notes

.can-edit[Type notes here...]

---

# Details

`$$\mathbf{y} = \alpha + \mathbf{X\beta} + \mathbf{Z\Gamma} + \varepsilon$$`

We create `\(e^{(y)}\)` with MARS:

`$$\mathbf{y} = \mathbf{\lambda}H(\mathbf{Z}) + e^{(y)}$$`

and `\(\mathbf{e}^{(X)}\)` with

`$$\mathbf{X} = \mathbf{\theta}H(\mathbf{Z}) + \mathbf{e}^{(X)}$$`

Then we regress `\(e^{(y)}\)` on `\(\mathbf{e}^{(X)}\)` to obtain estimates of `\(\beta\)` controlling for `\(\mathbf{Z}\)` in a flexible way.

---

## Notes

.can-edit[Type notes here...]

---

# In R

```r
remotes::install_github("dmurdoch/venus")
```

```r
library(venus)
bk.v1 <- venus(usfdi2000_adj ~ L_ab_sum + L_hse_usnum, 
    usfdi2000_adj ~ L_hse_sanction + L_growth + L_lnpercap + 
    L_lnpop + L_polity2 + L_durable + L_civintensity + 
    L_spending + L_s_us + L_lnustrade + L_us_distance + 
    L_usstock2000_adj + allfdi2000, data = bk)
bk.v2 <- venus(usfdi2000_adj ~ L_sancmeanshare + L_tradeshare, 
    usfdi2000_adj ~ L_hse_sanction + L_growth + L_lnpercap + 
    L_lnpop + L_polity2 + L_durable + L_civintensity + 
    L_spending + L_s_us + L_lnustrade + L_us_distance + 
    L_usstock2000_adj + allfdi2000, data = bk)
```

---

## Notes

.can-edit[Type notes here...]

---

# Summary

.pull-left[

```r
summary(bk.v1$mainFit)
```

```
## 
## Call:
## lm(formula = yResids ~ mainModelmatrix)
## 
## Residuals:
## Min 1Q Median 3Q Max 
## -16.664 -1.225 1.323 2.558 9.208 
## 
## Coefficients:
## Estimate Std. Error t value Pr(>|t|) 
## (Intercept) -1.170e-16 7.493e-02 0.000 1.000000 
## mainModelmatrixL_ab_sum 2.330e-02 3.994e-02 0.583 0.559609 
## mainModelmatrixL_hse_usnum -5.278e-02 1.433e-02 -3.684 0.000234 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 
0.1 ' ' 1 ## ## Residual standard error: 4.009 on 2860 degrees of freedom ## Multiple R-squared: 0.004733, Adjusted R-squared: 0.004037 ## F-statistic: 6.801 on 2 and 2860 DF, p-value: 0.001131 ``` ] .pull-right[ ```r summary(bk.v2$mainFit) ``` ``` ## ## Call: ## lm(formula = yResids ~ mainModelmatrix) ## ## Residuals: ## Min 1Q Median 3Q Max ## -16.558 -1.157 1.368 2.544 9.225 ## ## Coefficients: ## Estimate Std. Error t value Pr(>|t|) ## (Intercept) -9.826e-17 7.508e-02 0.000 1.000 ## mainModelmatrixL_sancmeanshare 5.799e-02 5.111e-02 1.135 0.257 ## mainModelmatrixL_tradeshare -9.313e-03 8.569e-03 -1.087 0.277 ## ## Residual standard error: 4.017 on 2860 degrees of freedom ## Multiple R-squared: 0.0008612, Adjusted R-squared: 0.0001625 ## F-statistic: 1.233 on 2 and 2860 DF, p-value: 0.2917 ``` ] --- ## Notes .can-edit[Type notes here...] --- # In GAMLSS ```r remotes::install_url("https://quantoid.net/files/gamlss.add2_1.0-0.tar.gz") ``` ```r library(gamlss) library(gamlss.add) bk.g1 <- gamlss(usfdi2000_adj ~ L_ab_sum + L_hse_usnum + tr(~L_hse_sanction + L_growth + L_lnpercap + L_lnpop + L_polity2 + L_durable + L_civintensity + L_spending + L_s_us + L_lnustrade + L_us_distance + L_usstock2000_adj + allfdi2000), data = bk, trace=FALSE, control=gamlss.control(n.cyc=100)) ``` ``` ## GAMLSS-RS iteration 1: Global Deviance = 16188.54 ## GAMLSS-RS iteration 2: Global Deviance = 16188.54 ``` ```r bk.g2 <- gamlss(usfdi2000_adj ~ L_sancmeanshare + L_tradeshare + tr(~L_hse_sanction + L_growth + L_lnpercap + L_lnpop + L_polity2 + L_durable + L_civintensity + L_spending + L_s_us + L_lnustrade + L_us_distance + L_usstock2000_adj + allfdi2000), data = bk, trace=FALSE) ``` --- ## Notes .can-edit[Type notes here...] 
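---

# Sketch: Why Residual-on-Residual Works

The venus two-step scheme follows the Frisch-Waugh-Lovell logic. With `lm()` standing in for the MARS stage (simulated data), the residual-on-residual coefficient matches the full-model coefficient exactly:

```r
# Purge y and x of z, then regress residuals on residuals.
set.seed(6)
d <- data.frame(x = rnorm(100), z = rnorm(100))
d$y <- 1 + 2 * d$x + 3 * d$z + rnorm(100)

ey <- residuals(lm(y ~ z, data = d))  # e^(y): y with z partialled out
ex <- residuals(lm(x ~ z, data = d))  # e^(X): x with z partialled out

b_fwl  <- coef(lm(ey ~ ex))[["ex"]]            # two-step estimate
b_full <- coef(lm(y ~ x + z, data = d))[["x"]] # full-model estimate
```

With `lm()` in both stages the two estimates agree to machine precision; venus replaces the first stage with MARS so the partialling out of `\(\mathbf{Z}\)` is flexible rather than linear.

---

## Notes

.can-edit[Type notes here...]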
--- # Summary .pull-left[ ```r summary(bk.g1) ``` ``` ## ****************************************************************** ## Family: c("NO", "Normal") ## ## Call: gamlss(formula = usfdi2000_adj ~ L_ab_sum + L_hse_usnum + ## tr(~L_hse_sanction + L_growth + L_lnpercap + L_lnpop + ## L_polity2 + L_durable + L_civintensity + L_spending + ## L_s_us + L_lnustrade + L_us_distance + L_usstock2000_adj + ## allfdi2000), data = bk, control = gamlss.control(n.cyc = 100), ## trace = FALSE) ## ## Fitting method: RS() ## ## ------------------------------------------------------------------ ## Mu link function: identity ## Mu Coefficients: ## Estimate Std. Error t value Pr(>|t|) ## (Intercept) 3.25029 0.19402 16.75 <2e-16 *** ## L_ab_sum 0.05211 0.04071 1.28 0.2006 ## L_hse_usnum -0.01590 0.00935 -1.70 0.0892 . ## --- ## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 ## ## ------------------------------------------------------------------ ## Sigma link function: log ## Sigma Coefficients: ## Estimate Std. Error t value Pr(>|t|) ## (Intercept) 1.40826 0.01322 106.6 <2e-16 *** ## --- ## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 ## ## ------------------------------------------------------------------ ## NOTE: Additive smoothing terms exist in the formulas: ## i) Std. Error for smoothers are for the linear effect only. ## ii) Std. Error for the linear terms maybe are not accurate. ## ------------------------------------------------------------------ ## No. of observations in the fit: 2863 ## Degrees of Freedom for the fit: 14 ## Residual Deg. 
of Freedom: 2849 ## at cycle: 2 ## ## Global Deviance: 16188.54 ## AIC: 16216.54 ## SBC: 16299.98 ## ****************************************************************** ``` ] .pull-right[ ```r summary(bk.g2) ``` ``` ## ****************************************************************** ## Family: c("NO", "Normal") ## ## Call: gamlss(formula = usfdi2000_adj ~ L_sancmeanshare + ## L_tradeshare + tr(~L_hse_sanction + L_growth + ## L_lnpercap + L_lnpop + L_polity2 + L_durable + ## L_civintensity + L_spending + L_s_us + L_lnustrade + ## L_us_distance + L_usstock2000_adj + allfdi2000), ## data = bk, trace = FALSE) ## ## Fitting method: RS() ## ## ------------------------------------------------------------------ ## Mu link function: identity ## Mu Coefficients: ## Estimate Std. Error t value Pr(>|t|) ## (Intercept) 2.786760 0.103522 26.920 < 2e-16 *** ## L_sancmeanshare 0.364642 0.052158 6.991 3.38e-12 *** ## L_tradeshare -0.013476 0.008437 -1.597 0.11 ## --- ## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 ## ## ------------------------------------------------------------------ ## Sigma link function: log ## Sigma Coefficients: ## Estimate Std. Error t value Pr(>|t|) ## (Intercept) 1.41930 0.01322 107.4 <2e-16 *** ## --- ## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 ## ## ------------------------------------------------------------------ ## NOTE: Additive smoothing terms exist in the formulas: ## i) Std. Error for smoothers are for the linear effect only. ## ii) Std. Error for the linear terms maybe are not accurate. ## ------------------------------------------------------------------ ## No. of observations in the fit: 2863 ## Degrees of Freedom for the fit: 10 ## Residual Deg. of Freedom: 2853 ## at cycle: 2 ## ## Global Deviance: 16251.75 ## AIC: 16271.75 ## SBC: 16331.35 ## ****************************************************************** ``` ] --- ## Notes .can-edit[Type notes here...] 
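---

## Comparing the Fits

The deviance-based criteria in the two summaries can be compared side by side; a minimal sketch using `GAIC()` from gamlss, which with `k = 2` reproduces the AIC printed above and with `k = log(n)` the SBC:

```r
# Penalized-deviance comparison of the two specifications
GAIC(bk.g1, bk.g2, k = 2)          # AIC values as printed: 16216.54 vs 16271.75
GAIC(bk.g1, bk.g2, k = log(2863))  # SBC values as printed: 16299.98 vs 16331.35
```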
--- # MARS ```r library(gamlss.add2) bk.g3 <- gamlss(usfdi2000_adj ~ L_ab_sum + L_hse_usnum + ma(~L_hse_sanction + L_growth + L_lnpercap + L_lnpop + L_polity2 + L_durable + L_civintensity + L_spending + L_s_us + L_lnustrade + L_us_distance + L_usstock2000_adj + allfdi2000), data = bk, trace=FALSE, control=gamlss.control(n.cyc=100)) ``` ``` ## GAMLSS-RS iteration 1: Global Deviance = 16051.26 ## GAMLSS-RS iteration 2: Global Deviance = 16057.73 ## GAMLSS-RS iteration 3: Global Deviance = 16057.73 ``` ```r bk.g4 <- gamlss(usfdi2000_adj ~ L_sancmeanshare + L_tradeshare + ma(~L_hse_sanction + L_growth + L_lnpercap + L_lnpop + L_polity2 + L_durable + L_civintensity + L_spending + L_s_us + L_lnustrade + L_us_distance + L_usstock2000_adj + allfdi2000), data = bk, trace=FALSE) ``` --- ## Notes .can-edit[Type notes here...] --- # Summary .pull-left[ ```r summary(bk.g3) ``` ``` ## ****************************************************************** ## Family: c("NO", "Normal") ## ## Call: gamlss(formula = usfdi2000_adj ~ L_ab_sum + L_hse_usnum + ## ma(~L_hse_sanction + L_growth + L_lnpercap + L_lnpop + ## L_polity2 + L_durable + L_civintensity + L_spending + ## L_s_us + L_lnustrade + L_us_distance + L_usstock2000_adj + ## allfdi2000), data = bk, control = gamlss.control(n.cyc = 100), ## trace = FALSE) ## ## Fitting method: RS() ## ## ------------------------------------------------------------------ ## Mu link function: identity ## Mu Coefficients: ## Estimate Std. Error t value Pr(>|t|) ## (Intercept) 4.564214 0.189639 24.068 <2e-16 *** ## L_ab_sum 0.010584 0.039788 0.266 0.79 ## L_hse_usnum -0.082545 0.009139 -9.032 <2e-16 *** ## --- ## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 ## ## ------------------------------------------------------------------ ## Sigma link function: log ## Sigma Coefficients: ## Estimate Std. Error t value Pr(>|t|) ## (Intercept) 1.38542 0.01322 104.8 <2e-16 *** ## --- ## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 
0.1 ' ' 1 ## ## ------------------------------------------------------------------ ## NOTE: Additive smoothing terms exist in the formulas: ## i) Std. Error for smoothers are for the linear effect only. ## ii) Std. Error for the linear terms maybe are not accurate. ## ------------------------------------------------------------------ ## No. of observations in the fit: 2863 ## Degrees of Freedom for the fit: 19 ## Residual Deg. of Freedom: 2844 ## at cycle: 3 ## ## Global Deviance: 16057.73 ## AIC: 16095.73 ## SBC: 16208.97 ## ****************************************************************** ``` ] .pull-right[ ```r summary(bk.g4) ``` ``` ## ****************************************************************** ## Family: c("NO", "Normal") ## ## Call: gamlss(formula = usfdi2000_adj ~ L_sancmeanshare + ## L_tradeshare + ma(~L_hse_sanction + L_growth + ## L_lnpercap + L_lnpop + L_polity2 + L_durable + ## L_civintensity + L_spending + L_s_us + L_lnustrade + ## L_us_distance + L_usstock2000_adj + allfdi2000), ## data = bk, trace = FALSE) ## ## Fitting method: RS() ## ## ------------------------------------------------------------------ ## Mu link function: identity ## Mu Coefficients: ## Estimate Std. Error t value Pr(>|t|) ## (Intercept) 2.955280 0.100640 29.365 < 2e-16 *** ## L_sancmeanshare 0.149396 0.050706 2.946 0.00324 ** ## L_tradeshare -0.013936 0.008202 -1.699 0.08944 . ## --- ## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 ## ## ------------------------------------------------------------------ ## Sigma link function: log ## Sigma Coefficients: ## Estimate Std. Error t value Pr(>|t|) ## (Intercept) 1.39107 0.01322 105.3 <2e-16 *** ## --- ## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 ## ## ------------------------------------------------------------------ ## NOTE: Additive smoothing terms exist in the formulas: ## i) Std. Error for smoothers are for the linear effect only. ## ii) Std. 
Error for the linear terms maybe are not accurate. ## ------------------------------------------------------------------ ## No. of observations in the fit: 2863 ## Degrees of Freedom for the fit: 18 ## Residual Deg. of Freedom: 2845 ## at cycle: 2 ## ## Global Deviance: 16090.13 ## AIC: 16126.13 ## SBC: 16233.4 ## ****************************************************************** ``` ] --- ## Notes .can-edit[Type notes here...] --- # Tests ```r VC.test(model2, bk.g1) ``` ``` ## Vuong's test: -0.733 it is not possible to discriminate between models: model2 and bk.g1 ## Clarke's test: 1381 p-value= 0.0616 it is not possible to discriminate between models: model2 and bk.g1 ``` ```r VC.test(bk.g1, bk.g3) ``` ``` ## Vuong's test: -2.898 model bk.g3 is preferred over bk.g1 ## Clarke's test: 1232 p-value= 0 bk.g3 is preferred over bk.g1 ``` ```r VC.test(model3, bk.g2) ``` ``` ## Vuong's test: 0.44 it is not possible to discriminate between models: model3 and bk.g2 ## Clarke's test: 1474 p-value= 0.1164 it is not possible to discriminate between models: model3 and bk.g2 ``` ```r VC.test(bk.g2, bk.g4) ``` ``` ## Vuong's test: -3.37 model bk.g4 is preferred over bk.g2 ## Clarke's test: 1214 p-value= 0 bk.g4 is preferred over bk.g2 ``` --- ## Notes .can-edit[Type notes here...] --- # Inference Venus has good inferential properties. - Bias, MSE, and CI coverage errors go to 0 as `\(N\)` increases. - Dominates a naive linear model in all but the perfectly additive, linear case. Other models: - The Venus result suggests that the GAMLSS model should have decent inferential properties on the parametric terms. - Trees, MARS, and Polywog would need data splitting or something similar to make appropriate inferences. - Model building could be done on a training sample, and then the appropriate model could be estimated on the other half of the data. --- ## Notes .can-edit[Type notes here...]
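---

## Sketch: Sample Splitting

The split-sample strategy in the last bullet can be sketched in a few lines (an illustration only; the seed and the 50/50 split are arbitrary choices):

```r
# Halve the data: select the model on the training half, then
# re-estimate the chosen specification on the held-out half so the
# reported standard errors are not distorted by model selection.
set.seed(1234)                                  # arbitrary seed
train <- sample(seq_len(nrow(bk)), floor(nrow(bk)/2))
test  <- setdiff(seq_len(nrow(bk)), train)
# e.g., fit earth()/rpart()/polywog() on bk[train, ], translate the
# selected terms into a formula, and estimate it with lm() on bk[test, ]
```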
--- # MARS Example with Inference ```r set.seed(734) train.samp <- sample(1:nrow(bk), floor(nrow(bk)*.6), replace=FALSE) test.samp <- setdiff(1:nrow(bk), train.samp) bk.e1 <- earth(usfdi2000_adj ~ L_ab_sum + L_hse_usnum + L_hse_sanction + L_growth + L_lnpercap + L_lnpop + L_polity2 + L_durable + L_civintensity + L_spending + L_s_us + L_lnustrade + L_us_distance + L_usstock2000_adj + allfdi2000, data = bk[train.samp, ], degree=3, pmethod = "backward") ``` Generating predictions: ```r h <- function(x){ out <- eval(x) out <- out*(out > 0) out } X <- model.matrix(bk.e1) hinges <- colnames(X)[-1] hinges <- gsub("*", ":", hinges, fixed=TRUE) form <- reformulate(hinges, response="usfdi2000_adj") lmod1 <- lm(form, data=bk[test.samp, ]) lmod2 <- lm(form, data=bk[train.samp, ]) ``` --- ## Notes .can-edit[Type notes here...] --- # Summary ```r summary(lmod1) ``` ``` ## ## Call: ## lm(formula = form, data = bk[test.samp, ]) ## ## Residuals: ## Min 1Q Median 3Q Max ## -15.754 -1.439 1.442 2.656 10.274 ## ## Coefficients: ## Estimate ## (Intercept) 3.269e-01 ## h(L_usstock2000_adj - 6.53771) 6.859e-01 ## h(6.53771 - L_usstock2000_adj) -1.476e-01 ## h(L_hse_usnum - 17) 3.786e-01 ## h(17 - L_hse_usnum) 1.763e-01 ## h(L_lnpop - 16.6189) 3.585e-01 ## h(L_hse_usnum - 20) -2.383e-01 ## h(L_usstock2000_adj - 6.53771):h(L_growth - -3.5) 6.052e-02 ## h(L_usstock2000_adj - 6.53771):h(-3.5 - L_growth) 7.856e-02 ## h(L_usstock2000_adj - 6.53771):h(L_lnpercap - 10.2256) -9.912e+00 ## h(L_hse_usnum - 17):h(0.814617 - L_s_us) -1.885e-01 ## h(L_usstock2000_adj - 6.53771):h(8.30968 - L_lnustrade) 5.019e-01 ## h(L_hse_usnum - 20):h(-9 - L_polity2) -3.030e-01 ## h(L_usstock2000_adj - 6.53771):h(10.2256 - L_lnpercap):h(9.40286 - L_lnustrade) -1.324e-01 ## h(L_usstock2000_adj - 6.53771):h(L_lnpercap - 10.2256):L_lnustrade 9.339e-01 ## h(L_usstock2000_adj - 6.53771):h(L_lnustrade - 8.30968):h(21612.3 - allfdi2000) 2.400e-06 ## h(L_usstock2000_adj - 6.53771):h(L_lnustrade - 8.30968):h(1.89 - L_growth)
3.078e-02 ## h(L_usstock2000_adj - 6.53771):h(L_lnustrade - 8.30968):h(0.399586 - L_s_us) 4.860e+00 ## h(L_usstock2000_adj - 6.53771):h(10.2256 - L_lnpercap):h(L_durable - 104) 7.385e-02 ## Std. Error ## (Intercept) 4.251e-01 ## h(L_usstock2000_adj - 6.53771) 1.964e-01 ## h(6.53771 - L_usstock2000_adj) 6.393e-02 ## h(L_hse_usnum - 17) 1.690e-01 ## h(17 - L_hse_usnum) 4.769e-02 ## h(L_lnpop - 16.6189) 1.797e-01 ## h(L_hse_usnum - 20) 1.856e-01 ## h(L_usstock2000_adj - 6.53771):h(L_growth - -3.5) 2.363e-02 ## h(L_usstock2000_adj - 6.53771):h(-3.5 - L_growth) 1.346e-01 ## h(L_usstock2000_adj - 6.53771):h(L_lnpercap - 10.2256) 1.009e+01 ## h(L_hse_usnum - 17):h(0.814617 - L_s_us) 9.619e-02 ## h(L_usstock2000_adj - 6.53771):h(8.30968 - L_lnustrade) 2.011e-01 ## h(L_hse_usnum - 20):h(-9 - L_polity2) 2.387e-01 ## h(L_usstock2000_adj - 6.53771):h(10.2256 - L_lnpercap):h(9.40286 - L_lnustrade) 4.741e-02 ## h(L_usstock2000_adj - 6.53771):h(L_lnpercap - 10.2256):L_lnustrade 9.446e-01 ## h(L_usstock2000_adj - 6.53771):h(L_lnustrade - 8.30968):h(21612.3 - allfdi2000) 1.209e-05 ## h(L_usstock2000_adj - 6.53771):h(L_lnustrade - 8.30968):h(1.89 - L_growth) 3.254e-02 ## h(L_usstock2000_adj - 6.53771):h(L_lnustrade - 8.30968):h(0.399586 - L_s_us) 3.169e+00 ## h(L_usstock2000_adj - 6.53771):h(10.2256 - L_lnpercap):h(L_durable - 104) 9.278e-02 ## t value ## (Intercept) 0.769 ## h(L_usstock2000_adj - 6.53771) 3.492 ## h(6.53771 - L_usstock2000_adj) -2.309 ## h(L_hse_usnum - 17) 2.240 ## h(17 - L_hse_usnum) 3.696 ## h(L_lnpop - 16.6189) 1.995 ## h(L_hse_usnum - 20) -1.284 ## h(L_usstock2000_adj - 6.53771):h(L_growth - -3.5) 2.561 ## h(L_usstock2000_adj - 6.53771):h(-3.5 - L_growth) 0.584 ## h(L_usstock2000_adj - 6.53771):h(L_lnpercap - 10.2256) -0.983 ## h(L_hse_usnum - 17):h(0.814617 - L_s_us) -1.960 ## h(L_usstock2000_adj - 6.53771):h(8.30968 - L_lnustrade) 2.496 ## h(L_hse_usnum - 20):h(-9 - L_polity2) -1.269 ## h(L_usstock2000_adj - 6.53771):h(10.2256 - L_lnpercap):h(9.40286 - 
L_lnustrade) -2.793 ## h(L_usstock2000_adj - 6.53771):h(L_lnpercap - 10.2256):L_lnustrade 0.989 ## h(L_usstock2000_adj - 6.53771):h(L_lnustrade - 8.30968):h(21612.3 - allfdi2000) 0.198 ## h(L_usstock2000_adj - 6.53771):h(L_lnustrade - 8.30968):h(1.89 - L_growth) 0.946 ## h(L_usstock2000_adj - 6.53771):h(L_lnustrade - 8.30968):h(0.399586 - L_s_us) 1.533 ## h(L_usstock2000_adj - 6.53771):h(10.2256 - L_lnpercap):h(L_durable - 104) 0.796 ## Pr(>|t|) ## (Intercept) 0.441962 ## h(L_usstock2000_adj - 6.53771) 0.000498 ## h(6.53771 - L_usstock2000_adj) 0.021120 ## h(L_hse_usnum - 17) 0.025316 ## h(17 - L_hse_usnum) 0.000230 ## h(L_lnpop - 16.6189) 0.046238 ## h(L_hse_usnum - 20) 0.199530 ## h(L_usstock2000_adj - 6.53771):h(L_growth - -3.5) 0.010565 ## h(L_usstock2000_adj - 6.53771):h(-3.5 - L_growth) 0.559554 ## h(L_usstock2000_adj - 6.53771):h(L_lnpercap - 10.2256) 0.325893 ## h(L_hse_usnum - 17):h(0.814617 - L_s_us) 0.050286 ## h(L_usstock2000_adj - 6.53771):h(8.30968 - L_lnustrade) 0.012687 ## h(L_hse_usnum - 20):h(-9 - L_polity2) 0.204575 ## h(L_usstock2000_adj - 6.53771):h(10.2256 - L_lnpercap):h(9.40286 - L_lnustrade) 0.005315 ## h(L_usstock2000_adj - 6.53771):h(L_lnpercap - 10.2256):L_lnustrade 0.323053 ## h(L_usstock2000_adj - 6.53771):h(L_lnustrade - 8.30968):h(21612.3 - allfdi2000) 0.842735 ## h(L_usstock2000_adj - 6.53771):h(L_lnustrade - 8.30968):h(1.89 - L_growth) 0.344383 ## h(L_usstock2000_adj - 6.53771):h(L_lnustrade - 8.30968):h(0.399586 - L_s_us) 0.125458 ## h(L_usstock2000_adj - 6.53771):h(10.2256 - L_lnpercap):h(L_durable - 104) 0.426210 ## ## (Intercept) ## h(L_usstock2000_adj - 6.53771) *** ## h(6.53771 - L_usstock2000_adj) * ## h(L_hse_usnum - 17) * ## h(17 - L_hse_usnum) *** ## h(L_lnpop - 16.6189) * ## h(L_hse_usnum - 20) ## h(L_usstock2000_adj - 6.53771):h(L_growth - -3.5) * ## h(L_usstock2000_adj - 6.53771):h(-3.5 - L_growth) ## h(L_usstock2000_adj - 6.53771):h(L_lnpercap - 10.2256) ## h(L_hse_usnum - 17):h(0.814617 - L_s_us) . 
## h(L_usstock2000_adj - 6.53771):h(8.30968 - L_lnustrade) * ## h(L_hse_usnum - 20):h(-9 - L_polity2) ## h(L_usstock2000_adj - 6.53771):h(10.2256 - L_lnpercap):h(9.40286 - L_lnustrade) ** ## h(L_usstock2000_adj - 6.53771):h(L_lnpercap - 10.2256):L_lnustrade ## h(L_usstock2000_adj - 6.53771):h(L_lnustrade - 8.30968):h(21612.3 - allfdi2000) ## h(L_usstock2000_adj - 6.53771):h(L_lnustrade - 8.30968):h(1.89 - L_growth) ## h(L_usstock2000_adj - 6.53771):h(L_lnustrade - 8.30968):h(0.399586 - L_s_us) ## h(L_usstock2000_adj - 6.53771):h(10.2256 - L_lnpercap):h(L_durable - 104) ## --- ## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1 ## ## Residual standard error: 4.102 on 1127 degrees of freedom ## Multiple R-squared: 0.2374, Adjusted R-squared: 0.2252 ## F-statistic: 19.49 on 18 and 1127 DF, p-value: < 2.2e-16 ``` --- ## Notes .can-edit[Type notes here...] --- # Make PDP .pull-left[ ```r seq_range <- function(x, n=25){ x <- na.omit(x); seq(min(x), max(x), length=n) } usnum.s <- seq_range(bk$L_hse_usnum) predfun <- function(x,y){ z <- bk[x, ] z$L_hse_usnum <- y predict(lmod1, newdata=z) } res <- outer(test.samp, usnum.s, predfun) res <- t(apply(res, 1, function(x)x-mean(x))) k <- lapply(2:10, function(k)kmeans(res, centers=k)) sapply(k, function(x)x$betweenss/(x$tot.withinss+x$betweenss)) ``` ``` ## [1] 0.4453257 0.9078453 0.9475397 0.9785455 0.9440876 0.9890874 0.9893255 ## [8] 0.9899874 0.9941667 ``` ```r res2 <- rbind(k[[4]]$centers, res) d2 <- dist(res2) d2 <- as.matrix(d2) diag(d2) <- max(c(d2)) closest <- apply(d2[1:5, ], 1, which.min) ``` ] .pull-right[ ```r library(purrr) tmp <- bk[test.samp[closest], ] tmp$cluster <- 1:5 dats <- map(1:5, ~as.list(tmp[.x, ])) %>% map(., ~modify_at(.x, "L_hse_usnum", ~usnum.s)) %>% map(., ~do.call(data.frame, .x)) %>% bind_rows() fits <- predict(lmod1, newdata=dats, se.fit=TRUE) dats <- dats %>% mutate(fit = fits$fit) %>% group_by(cluster) %>% mutate(fit = fit-mean(fit)) %>% ungroup %>% mutate(lwr = fit - 
1.96*fits$se.fit, upr = fit + 1.96*fits$se.fit) ``` ] --- ## Notes .can-edit[Type notes here...] --- # PDP .shift[ <img src="lecture12_2020_files/figure-html/unnamed-chunk-46-1.png" width="50%" style="display: block; margin: auto;" /> ] --- ## Notes .can-edit[Type notes here...] --- # Significant Differences For how many observations are there significant differences when moving across the values of `L_hse_usnum`? .pull-left[ ```r sigdiffs <- NULL combs <- combn(25, 2) D <- matrix(0, ncol=ncol(combs), nrow=25) D[cbind(combs[1,], 1:ncol(combs))] <- -1 D[cbind(combs[2,], 1:ncol(combs))] <- 1 for(i in 1:length(test.samp)){ dk <- as.list(bk[test.samp[i], ]) dk$L_hse_usnum <- usnum.s A <- model.matrix(formula(lmod1), data=do.call(data.frame, dk)) preds <- A %*% coef(lmod1) vpreds <- A %*% vcov(lmod1) %*% t(A) diffs <- c(t(D) %*% preds) vdiffs <- t(D) %*% vpreds %*% D tdiffs <- abs(diffs)/sqrt(diag(vdiffs)) tdiffs <- ifelse(is.finite(tdiffs), tdiffs, 0) sigdiffs <- c(sigdiffs, sum(tdiffs > 1.96)) } ``` ] .pull-right[ <img src="lecture12_2020_files/figure-html/unnamed-chunk-48-1.png" width="80%" style="display: block; margin: auto;" /> ]
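---

## Aside: The Contrast Matrix `D`

The matrix `D` built with `combn()` on the previous slide generalizes to any grid size: its columns encode every pairwise difference among the predictions. A small self-contained check with four hypothetical values:

```r
# t(D) %*% preds returns preds[j] - preds[i] for every pair i < j
n <- 4
combs <- combn(n, 2)
D <- matrix(0, nrow = n, ncol = ncol(combs))
D[cbind(combs[1, ], seq_len(ncol(combs)))] <- -1
D[cbind(combs[2, ], seq_len(ncol(combs)))] <-  1
preds <- c(1, 3, 6, 10)   # hypothetical predicted values
c(t(D) %*% preds)
## [1] 2 5 9 3 7 4
```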