adaboost算法具体代码

以下是adaboost算法的Python代码：

import numpy as np

class AdaBoost:
    def __init__(self, n_estimators=50):
        self.n_estimators = n_estimators
        self.alpha = []
        self.estimators = []

    def fit(self, X, y):
        n_samples = X.shape[0]
        w = np.ones(n_samples) / n_samples
        for i in range(self.n_estimators):
            clf = DecisionTreeClassifier(max_depth=1)
            clf.fit(X, y, sample_weight=w)
            y_pred = clf.predict(X)
            error = w.dot(y_pred != y)
            alpha = 0.5 * np.log((1 - error) / error)
            w *= np.exp(-alpha * y * y_pred)
            w /= np.sum(w)
            self.alpha.append(alpha)
            self.estimators.append(clf)

    def predict(self, X):
        y_pred = np.zeros(X.shape[0])
        for alpha, clf in zip(self.alpha, self.estimators):
            y_pred += alpha * clf.predict(X)
        return np.sign(y_pred)

该代码中，我们使用了决策树作为基分类器，因为它是adaboost算法最常用的分类器之一。我们使用样本权重$w$来调整模型的训练，每个基分类器都会计算出一个错误率$error$和一个系数$\alpha$，系数$\alpha$表示基分类器的重要程度。在每次迭代中，我们都会更新样本权重$w$，让分类错误的样本权重增加，分类正确的样本权重减少，以便让基分类器更关注于分类错误的样本。

最后，我们将所有基分类器的预测结果加权求和，得到最终的预测结果