Skip to main content
Overview

Ensemble

August 31, 2021
1 min read

In production, the effort that would go into ensemble is usually spent on optimizing the model and training pipeline. But in competitions, where the fight is over fractions of a decimal point, using ensemble to push up the score matters.

Ensemble

When training most models, overfitting occurs frequently. Of course, underfitting can happen if the data is too small and biased, but that is less common.

The figure below might help understand this. ref: https://bywords.tistory.com/entry/%EB%B2%88%EC%97%AD-%EC%9C%A0%EC%B9%98%EC%9B%90%EC%83%9D%EB%8F%84-%EC%9D%B4%ED%95%B4%ED%95%A0-%EC%88%98-%EC%9E%88%EB%8A%94-biasvariance-tradeoff

Voting

ref: https://devkor.tistory.com/entry/Soft-Voting-%EA%B3%BC-Hard-Voting

  • Hard voting: selects the majority class
  • Soft voting: outputs the average across classes
  • Weight voting: multiplies each model’s output by its weight and divides by the sum of weights
Loading comments...