Common Steps to Use a Machine Learning Model
1) Load the data & Split data into X & y
pandas.read_csv("./data/1.csv")
X = df.drop("target", axis=1) # using all columns besides target
y = df["target"] # predicting y using X
m= RandomForestClassifier(n_estimators=50)
m.fit(X_train,y_train);
5) Make prediction
ypreds=m.predict(X_test)
7) To Evaluate Model use score() function on test and train data
m.score(X_train,y_train)
8) To improve the model by changing its hyperParameters
# Use different numbers of n_estimators as hyperparameter
np.random.seed(40)for i in range(10, 100, 5):
print(f"Trying model with {i} estimators...")
m= RandomForestClassifier(n_estimators=i).fit(X_train, y_train)
print(f"Model accruacy on test data set: {m.score(X_test, y_test)}")
pickle.dump(m, open("My_Random_forest_model.pkl", "wb"))
10) Load a saved model and make a prediction on a single examplesaved_model = pickle.load(open("My_Random_forest_model.pkl", "rb"))
saved_model.score(X_train,y_train)