Chunk #47 — Results — Explaining differences: datasets’ meta-features — Subgroup analyses: meta-features

Source
Random forest versus logistic regression: a large-scale benchmark experiment.

Text

Figure 5 displays boxplots of the differences in accuracy for subgroups based on the four selected meta-features p, n, $\frac{p}{n}$ and $C_{max}$. For each of the four meta-features, subgroups are defined successively using different cut-off values, denoted as t. The histograms of the four meta-features for the 243 datasets are shown in the bottom row of the figure, where the considered cut-off values are marked as vertical lines. Similar pictures are obtained for the two alternative performance measures auc and brier; see Additional file 1.
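The subgroup construction described above can be illustrated with a minimal sketch. It is not the authors' code. The split rule (meta-feature below t versus at or above t), the record layout, the field name `acc_diff`, the helper names, and all toy values are assumptions made for illustration only; none of the numbers come from the benchmark.

```python
from statistics import median


def split_by_cutoff(records, feature, t):
    """Split records into two subgroups: feature < t and feature >= t.

    The strict/non-strict boundary is an assumption of this sketch.
    """
    below = [r for r in records if r[feature] < t]
    above = [r for r in records if r[feature] >= t]
    return below, above


def median_diff(records):
    """Median difference in accuracy within a subgroup (None if empty)."""
    if not records:
        return None
    return median(r["acc_diff"] for r in records)


# Hypothetical toy records, one per dataset. "acc_diff" stands for the
# difference in accuracy between the two methods; values are made up.
toy = [
    {"p": 5, "n": 100, "acc_diff": 0.01},
    {"p": 20, "n": 200, "acc_diff": 0.03},
    {"p": 50, "n": 100, "acc_diff": 0.06},
    {"p": 8, "n": 1000, "acc_diff": -0.01},
]

# Derived meta-feature p/n, computed per dataset.
for r in toy:
    r["p_over_n"] = r["p"] / r["n"]

# Apply several cut-off values t to the same meta-feature in turn.
for t in (10, 30):
    below, above = split_by_cutoff(toy, "p", t)
    print(f"p, t={t}: "
          f"below n={len(below)}, median={median_diff(below)}; "
          f"above n={len(above)}, median={median_diff(above)}")
```

Repeating the loop with `"n"`, `"p_over_n"`, or a majority-class field in place of `"p"` would give the analogous subgroups for the other meta-features; a boxplot per subgroup would then replace the median summary used here.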