Chunk #26 — Methods — Inclusion criteria and subgroup analyses

Source
Random forest versus logistic regression: a large-scale benchmark experiment.

Text

When working with a very large database of datasets, one must define explicit criteria for inclusion in the benchmarking experiment. Inclusion criteria of this kind have no long tradition in computational science. The criteria that researchers (including ourselves before the present study) use to select datasets are most often entirely non-transparent. Typically, researchers select a number of datasets that were found to somehow fit the scope of the investigated methods, without clearly defining this scope.