|
3884 | 3884 | "\n",
|
3885 | 3885 | "We will be doing this graphically as well, which may seem like an even less objective method. The alternative is to use *Bayesian p-values*. These are still subjective, as the proper cutoff between good and bad is arbitrary. Gelman emphasises that the graphical tests are more illuminating [3] than p-value tests. We agree.\n",
|
3886 | 3886 | "\n",
|
3887 | | - "The following graphical test is a novel data-viz approach to logistic regression. The plots are called *separation plots*[4]. For a suite of models we wish to compare, each model is plotted on an individual separation plot. I leave most of the technical details about separation plots to the very accessible [original paper](http://mdwardlab.com/sites/default/files/GreenhillWardSacks.pdf), but I'll summarize their use here.\n", |
| 3887 | + "The following graphical test is a novel data-viz approach to logistic regression. The plots are called *separation plots*[4]. For a suite of models we wish to compare, each model is plotted on an individual separation plot. I leave most of the technical details about separation plots to the very accessible [original paper](https://onlinelibrary.wiley.com/doi/10.1111/j.1540-5907.2011.00525.x), but I'll summarize their use here.\n", |
3888 | 3888 | "\n",
|
3889 | 3889 | "For each model, we calculate the proportion of times the posterior simulation proposed a value of 1 for a particular temperature, i.e. compute $P( \\;\\text{Defect} = 1 | t, \\alpha, \\beta )$ by averaging. This gives us the posterior probability of a defect at each data point in our dataset. For example, for the model we used above:"
|
3890 | 3890 | ]
|
|
4028 | 4028 | "def separation_plot( p, y, **kwargs ):\n",
|
4029 | 4029 | " \"\"\"\n",
|
4030 | 4030 | " This function creates a separation plot for logistic and probit classification. \n",
|
4031 | | - " See http://mdwardlab.com/sites/default/files/GreenhillWardSacks.pdf\n", |
| 4031 | + " See https://onlinelibrary.wiley.com/doi/10.1111/j.1540-5907.2011.00525.x\n", |
4032 | 4032 | " \n",
|
4033 | 4033 | " p: The proportions/probabilities, can be a nxM matrix which represents M models.\n",
|
4034 | 4034 | " y: the 0-1 response variables.\n",
|
|
0 commit comments