Machine Learning Program

Question 1

I've written a program that finds the difference between data and gives output. Here's the code:

import json
import numpy as np
print("This is a basic machine learning thing.")
baseData = {"collecting":True,"x":[],"y":[]}
while baseData["collecting"]:
 baseData["x"].append(float(input("X:")))
 baseData["y"].append(float(input("Y:")))
 if input("Do you want to keep feeding data? Press enter for yes, or type anything for no.") != "":
 baseData["collecting"] = False
if len(baseData["x"]) == len(baseData["y"]):
 xdata = baseData["x"]
 ydata = baseData["y"]
 nums = []
 for i in range(len(xdata)):
 nums.append(xdata[i] - ydata[i])
 median = np.median(nums)
else:
 print("malformed data")
def getY(x):
 pass
while True:
 data = input("X/Data:")
 print(int(data)-median)

To work the program, give it X and Y data, then give it X data and it will predict Y data.

Question 2

Do you have a particular question or concern with your code?

Question 3

@KaPy3141 I want to know some ways I can improve or minify this code.

Question 4

Maybe you should check validity of the input and ask again if input is wrong?

while baseData["collecting"]:
 baseData["x"].append(float(input("X:")))

This is always True, so just discard this part:

if len(baseData["x"]) == len(baseData["y"]):

Maybe you should give an option to exit?

while True:
 data = input("X/Data:")
 print(int(data)-median)

And in general, calling this a "machiene-learning thing" is quite a stretch of imagination, no? Maybe you should have a look at how to fit data. I.e. a basic linear model fit.

Question 5

baseData should be split into 3 separate variables(collecting, xdata, ydata); there is no reason for it to be a dict.

nums = []
for i in range(len(xdata)):
 nums.append(xdata[i] - ydata[i])

can be written more Pythonically as:

nums = []
for x, y in zip(xdata, ydata):
 nums.append(x - y)

or even just:

nums = [x - y for x, y in zip(xdata, ydata)]

You don't need to import numpy only for the media; stdlib statistics.median should work just fine.

KaPy3141 KaPy3141 2631 silver badge10 bronze badges · Answer 1 · 2021-03-13 23:48:03Z

Maybe you should check validity of the input and ask again if input is wrong?

while baseData["collecting"]:
 baseData["x"].append(float(input("X:")))

This is always True, so just discard this part:

if len(baseData["x"]) == len(baseData["y"]):

Maybe you should give an option to exit?

while True:
 data = input("X/Data:")
 print(int(data)-median)

And in general, calling this a "machiene-learning thing" is quite a stretch of imagination, no? Maybe you should have a look at how to fit data. I.e. a basic linear model fit.

SuperStormer SuperStormer 5653 silver badges13 bronze badges · Answer 2 · 2021-03-14 00:30:00Z

baseData should be split into 3 separate variables(collecting, xdata, ydata); there is no reason for it to be a dict.

nums = []
for i in range(len(xdata)):
 nums.append(xdata[i] - ydata[i])

can be written more Pythonically as:

nums = []
for x, y in zip(xdata, ydata):
 nums.append(x - y)

or even just:

nums = [x - y for x, y in zip(xdata, ydata)]

You don't need to import numpy only for the media; stdlib statistics.median should work just fine.

Stack Exchange Network

Machine Learning Program

2 Answers 2

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Hot Network Questions

Machine Learning Program

2 Answers 2

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Related

Hot Network Questions