@@ -263,3 +263,37 @@ d. Logistic Regression Cross Validation
263
263
- Slightly below 70% target
264
264
- Better performance on real data;
265
265
266
+ -----------------------------------------------------------------------------------------------------------------------
267
+ Key Points
268
+ -------------
269
+ -> Machine Learning is Data driven;
270
+ -> Follow the Machine Learning WorkFlow;
271
+
272
+ * Machine Learning Workflow
273
+ ----------------------------
274
+ i. Asking the right question
275
+ - Started with question;
276
+ - Used requiredments and knowledge to transform;
277
+ - Resulted in solution statement;
278
+
279
+ ii. Preparing the data
280
+ - Retrieved diabetes data;
281
+ - Cleaned data;
282
+ - Molded data;
283
+
284
+ iii. Selecting the algorithm
285
+ - Learning type
286
+ - Result type
287
+ - Complexity
288
+ -Basic vs Enhanced
289
+
290
+ iv. Training the model
291
+ - Split data - 70% / 30%;
292
+ - Trained with training data;
293
+
294
+ v. Testing the model
295
+ - Evaluated prediction;
296
+ - Selected Logistic Regression
297
+ - Achieved success
298
+ - Used Cross Validation version for better general performance;
299
+ -------------------------------------------------------------------------------------------------------------------------------
0 commit comments