當前位置：首頁 > 编程资源 > 编程问答 >内容正文

编程问答

【sklearn学习】模型网格化调参

發布時間：2023/12/15 编程问答 34 豆豆

生活随笔收集整理的這篇文章主要介紹了【sklearn学习】模型网格化调参小編覺得挺不錯的,現在分享給大家,幫大家做個參考.

導入包?

from sklearn.datasets import load_breast_cancer from sklearn.esemble import RandomForestClassifier from sklearn.model_selection import GridSearchCV from sklearn.model_selection import cross_val_score import matplotlib.pyplot as plt import pandas as pd import numpy as np

?確定大致的學習曲線

score1 = [] for i in range(0,200,10):rfc = RandomForestClassifier(n_estimators=i+1,n_jobs=-1,random_state=90)score = cross_val_score(rfc,data.data,data.target,cv=10).mean()score1.append(score)print(max(score1),(score1.index(max(score1))*10)+1)plt.figure(figsize=[20,5]) plt.plot(range(1,201,10),score1) plt.show()

細化學習曲線

score1 = [] for i in range(35,45):rfc = RandomForestClassifier(n_estimators=i+1,n_jobs=-1,random_state=90)score = cross_val_score(rfc,data.data,data.target,cv=10).mean()score1.append(score)print(max(score1),([*range(35,45)][score1.index(max(score1))]))plt.figure(figsize=[20,5]) plt.plot(range(35,45),score1) plt.show()

調整max_depth，往左推，模型復雜度降低，泛化誤差增加，準確率降低

param_grid= {‘max_depth':np.arange(1, 20, 1)}rfc = RandomForestClassfier(n_estimators=39,random_state=90) GS = GridSearchCV(rfc, param_grid, cv=10) GS.fir(data.data, data.target)# 顯示調整出來的最佳參數 GS.best_params_ # 顯示調整出來的最佳參數的準確率 GS.best_score_

?調整max_features，往右推，模型復雜度增加，泛化誤差增加，準確率降低

param_grid= {‘max_features':np.arange(5, 30, 1)}rfc = RandomForestClassfier(n_estimators=39,random_state=90) GS = GridSearchCV(rfc, param_grid, cv=10) GS.fir(data.data, data.target)# 顯示調整出來的最佳參數 GS.best_params_ # 顯示調整出來的最佳參數的準確率 GS.best_score_

整個調參過程首先調整n_estimators，然后調整max_depth，通過max_depth產生的結果，判斷模型位于復雜度-泛化誤差的哪一邊，從而選擇應該調整的參數。

總結

以上是生活随笔為你收集整理的【sklearn学习】模型网格化调参的全部內容，希望文章能夠幫你解決所遇到的問題。

如果覺得生活随笔網站內容還不錯，歡迎將生活随笔推薦給好友。

上一篇：深情鬼攻略有哪些
下一篇：征信花了度小满好申请吗征信花了度小满容