Machine learning 评估autoML结果的输出

Machine learning 评估autoML结果的输出,machine-learning,deep-learning,Machine Learning,Deep Learning,我如何解释以下结果?基于Autoglion summary的最佳训练算法是什么 *** Summary of fit() *** Estimated performance of each model: model score_val fit_time pred_time_val stack_level 19 weighted_ensemble_k0_l2 -0.035874

我如何解释以下结果?基于Autoglion summary的最佳训练算法是什么

*** Summary of fit() ***
Estimated performance of each model:
                                    model  score_val     fit_time  pred_time_val  stack_level
19                weighted_ensemble_k0_l2  -0.035874     1.848907       0.002517            2
18                weighted_ensemble_k0_l1  -0.040987     1.837416       0.002259            1
16          CatboostClassifier_STACKER_l1  -0.042901  1559.653612       0.083949            1
11    ExtraTreesClassifierGini_STACKER_l1  -0.047882     7.307266       1.057873            1
...
...
0   RandomForestClassifierGini_STACKER_l0  -0.291987     9.871649       1.054538            0
生成上述结果的代码:

import pandas as pd
from autogluon import TabularPrediction as task
from sklearn.datasets import load_digits

digits = load_digits()
savedir = "otto_models/"  # where to save trained models

train_data = pd.DataFrame(digits.data)
train_target = pd.DataFrame(digits.target)
train_data = pd.merge(train_data, train_target, left_index=True, right_index=True)
label_column = "0_y"

predictor = task.fit(
    train_data=train_data,
    label=label_column,
    output_directory=savedir,
    eval_metric="log_loss",
    auto_stack=True,
    verbosity=2,
    visualizer="tensorboard",
)

results = predictor.fit_summary()  # display detailed summary of fit() process

在这种情况下,哪种算法似乎有效?

就验证分数(score\val)而言,加权集成(weighted_emblem)k0\u l2是最好的结果,因为它具有最高的值。您可能希望执行
predictor.leadboard(测试数据)
以获得每个模型的测试分数


请注意,结果显示为负值,因为Autoglion始终认为越高越好。如果某个特定度量(如logloss)希望值越低越好,则AutoGlion会翻转该度量的符号。我想在你的情况下,val_分数为0是一个完美的分数。

这是否意味着-0.03分数就相当于97%的准确率?不,在这种情况下,它将是0.97-0.03给定您提供的代码将意味着0.03 logloss。