The notebooks highlight techniques such as:
* [Sensitivity and residual analysis](https://github.com/jphall663/interpretable_machine_learning_with_python#testing-machine-learning-models-for-accuracy-trustworthiness-and-stability-with-python-and-h2o---notebook)
* [Advanced sensitivity analysis for model debugging](https://github.com/jphall663/interpretable_machine_learning_with_python#part-1-sensitivity-analysis---notebook)
* [Advanced residual analysis for model debugging](https://github.com/jphall663/interpretable_machine_learning_with_python#part-2-residual-analysis---notebook)
* [Detailed model comparison and model selection by cross-validated ranking](https://github.com/jphall663/interpretable_machine_learning_with_python#from-glm-to-gbm-building-the-case-for-complexity---notebook)
* [*Machine Learning: Considerations for fairly and transparently expanding access to credit*](http://info.h2o.ai/rs/644-PKX-778/images/Machine%20Learning%20-%20Considerations%20for%20Fairly%20and%20Transparently%20Expanding%20Access%20to%20Credit.pdf)
* [*A Responsible Machine Learning Workflow with Focus on Interpretable Models, Post-hoc Explanation, and Discrimination Testing*](https://www.mdpi.com/2078-2489/11/3/137)
* [*An Introduction to Machine Learning Interpretability, 2nd Edition*](https://www.h2o.ai/wp-content/uploads/2019/08/An-Introduction-to-Machine-Learning-Interpretability-Second-Edition.pdf)
* [*On the Art and Science of Explainable Machine Learning*](https://arxiv.org/pdf/1810.02909.pdf)
In general, residual analysis can be characterized as a careful study of when and how models make mistakes. A better understanding of mistakes will hopefully lead to fewer of them. This notebook uses variants of residual analysis to find error mechanisms and security vulnerabilities and to assess stability and fairness in a trained XGBoost model. It begins by loading the UCI credit card default data and then training an interpretable, monotonically constrained XGBoost gradient boosting machine (GBM) model. (Pearson correlation with the prediction target determines the direction of the monotonicity constraint for each input variable.) After the model is trained, its logloss residuals are thoroughly analyzed and explained, and the constrained GBM is compared to a benchmark linear model. These model debugging exercises uncover accuracy, drift, and security problems such as over-emphasis of important variables and strong signal in model residuals. Several remediation mechanisms are proposed, including missing value injection during training, additional data collection, and use of assertions to correct known problems during scoring.
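
To make the constrained training and residual computation concrete, here is a minimal sketch of the steps described above. The file name `default_of_credit_card_clients.csv`, the target column `DEFAULT_NEXT_MONTH`, and all hyperparameters are hypothetical placeholders, not the notebook's exact settings.

```python
import numpy as np
import pandas as pd
import xgboost as xgb

# Hypothetical file and column names; substitute your local copy of the
# UCI credit card default data and its actual target column.
data = pd.read_csv("default_of_credit_card_clients.csv")
target = "DEFAULT_NEXT_MONTH"
features = [c for c in data.select_dtypes("number").columns if c != target]

# The direction of each monotonicity constraint follows the sign of the
# Pearson correlation between that input and the prediction target
# (+1 increasing, -1 decreasing, 0 unconstrained).
constraints = tuple(int(np.sign(data[f].corr(data[target]))) for f in features)

model = xgb.XGBClassifier(
    monotone_constraints=constraints,
    n_estimators=200,
    max_depth=3,
    learning_rate=0.05,
)
model.fit(data[features], data[target])

# Per-row logloss residuals: large values flag rows the model gets
# confidently wrong, which are the raw material for residual analysis.
p = model.predict_proba(data[features])[:, 1]
y = data[target].to_numpy()
eps = 1e-12
logloss_residual = -(y * np.log(p + eps) + (1 - y) * np.log(1 - p + eps))
```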
### From GLM to GBM: Building the Case For Complexity - [Notebook](https://nbviewer.jupyter.org/github/jphall663/interpretable_machine_learning_with_python/blob/master/glm_mgbm_gbm.ipynb)
This notebook uses the same credit card default scenario to show how monotonicity constraints, Shapley values and other post-hoc explanations, and discrimination testing enable practitioners to create direct comparisons between GLM and GBM models. Several candidate probability-of-default models are selected for comparison using feature selection methods, such as LASSO, and by cross-validated ranking. These comparisons enable building from a GLM to more complex GBM models in a step-by-step manner, while retaining model transparency and the ability to test for discrimination. The notebook shows that GBMs can yield better accuracy and more revenue, and that they are also likely to fulfill many model documentation, adverse action notice, and discrimination testing requirements.
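
Below is a minimal sketch of the cross-validated ranking step, under the same hypothetical file and column names as the sketch above. The candidate models, hyperparameters, and AUC scoring metric are illustrative assumptions, not the notebook's exact choices.

```python
import numpy as np
import pandas as pd
import xgboost as xgb
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

# Same hypothetical file and column names as the previous sketch.
data = pd.read_csv("default_of_credit_card_clients.csv")
target = "DEFAULT_NEXT_MONTH"
features = [c for c in data.select_dtypes("number").columns if c != target]
constraints = tuple(int(np.sign(data[f].corr(data[target]))) for f in features)

candidates = {
    # Penalized GLM: L1 (LASSO) regularization doubles as feature selection.
    "glm_lasso": LogisticRegression(penalty="l1", solver="liblinear", C=0.1),
    # Monotonically constrained GBM, as in the previous sketch.
    "mono_gbm": xgb.XGBClassifier(
        monotone_constraints=constraints, n_estimators=200, max_depth=3
    ),
}

# Rank candidates by mean cross-validated AUC; higher is better.
ranking = sorted(
    (
        (name, cross_val_score(est, data[features], data[target],
                               cv=5, scoring="roc_auc").mean())
        for name, est in candidates.items()
    ),
    key=lambda kv: kv[1],
    reverse=True,
)
for name, auc in ranking:
    print(f"{name}: mean CV AUC = {auc:.4f}")
```

Ranking GLM and GBM candidates on the same cross-validation folds and metric is what makes the step-by-step comparison direct: each added increment of complexity must earn its place against the simpler model.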