LinearBoost
diff --git a/‎README.md‎
Lines changed: 30 additions & 3 deletions b/‎README.md‎
Lines changed: 30 additions & 3 deletions
diff --git a/‎notebooks/optuna_sefrboost_demo.ipynb‎
Lines changed: 186 additions & 0 deletions b/‎notebooks/optuna_sefrboost_demo.ipynb‎
Lines changed: 186 additions & 0 deletions
diff --git a/‎src/linearboost/__init__.py‎
Lines changed: 4 additions & 1 deletion b/‎src/linearboost/__init__.py‎
Lines changed: 4 additions & 1 deletion
@@ -1,11 +1,15 @@
 # LinearBoost Classifier
 
-![Latest Release](https://img.shields.io/badge/release-v0.1.9-green)
+![Latest Release](https://img.shields.io/badge/release-v0.2.0-green)
 [![PyPI Version](https://img.shields.io/pypi/v/linearboost)](https://pypi.org/project/linearboost/)
 ![Python Versions](https://img.shields.io/badge/python-3.8%20%7C%203.9%20%7C%203.10%20%7C%203.11%20%7C%203.12%20%7C%203.13-blue)
 [![PyPI Downloads](https://static.pepy.tech/badge/linearboost)](https://pepy.tech/projects/linearboost)
 
-## 🧪 Quickstart Demo
+**Current release: v0.2.0.** This version adds **SEFRBoost**—gradient boosting with linear SEFR splits at tree nodes—via `SEFRBoostClassifier` and `SEFRBoostRegressor` (`from linearboost import …` or `linearboost.sefr_boost`). **LinearBoost security issues have been updated** in this release; upgrade from earlier versions to stay patched.
+
+## 🧪 Quickstart demos
+
+### LinearBoost
 
 Want to see how LinearBoost works in practice?
 
@@ -16,6 +20,10 @@ This Jupyter notebook shows how to:
 - Train `LinearBoostClassifier`
 - Evaluate using F1 score and cross-validation
 
+### SEFRBoost
+
+- **[`optuna_sefrboost_demo.ipynb`](notebooks/optuna_sefrboost_demo.ipynb)** — Hyperparameter search for **`SEFRBoostClassifier`** with **[Optuna](https://optuna.org/)** on sklearn’s Breast Cancer Wisconsin data: default baseline, then **5-fold stratified CV** optimizing F1 (install `optuna` in addition to `linearboost` and `scikit-learn`).
+
 LinearBoost is a fast and accurate classification algorithm built to enhance the performance of the linear classifier SEFR. It combines efficiency and accuracy, delivering state-of-the-art F1 scores and classification performance.
 
 In benchmarks across seven well-known datasets, LinearBoost:
@@ -32,11 +40,30 @@ Key Features:
 
 ---
 
+## 🚀 New in Version 0.2.0
+
+### SEFRBoost
+
+SEFRBoost provides binary classification and regression with shallow trees whose internal nodes use **SEFR** hyperplane splits on pseudo-residuals—an oblique-split GBDT-style alternative that lives alongside `LinearBoostClassifier`.
+
+```python
+from linearboost import SEFRBoostClassifier, SEFRBoostRegressor
+# or: from linearboost.sefr_boost import SEFRBoostClassifier, SEFRBoostRegressor
+```
+
+Hands-on examples: [`notebooks/optuna_sefrboost_demo.ipynb`](notebooks/optuna_sefrboost_demo.ipynb) (Optuna tuning).
+
+### Security updates
+
+LinearBoost-related security issues are addressed in v0.2.0. Upgrade from older releases.
+
+---
+
 ## 🚀 New in Version 0.1.9
 
 ### Security Updates
 
-Version 0.1.9 includes security updates. We recommend upgrading from earlier versions to stay current.
+Version 0.1.9 included security updates. For the latest fixes, use **v0.2.0** or newer.
 
 ---
 
 
@@ -0,0 +1,186 @@
+{
+ "cells": [
+  {
+   "cell_type": "markdown",
+   "id": "7fb27b941602401d91542211134fc71a",
+   "metadata": {},
+   "source": [
+    "# Optuna + `SEFRBoostClassifier`\n",
+    "\n",
+    "Tune [`SEFRBoostClassifier`](https://github.com/LinearBoost/linearboost-classifier) (gradient boosting with SEFR oblique splits) using [Optuna](https://optuna.org/) on sklearn’s **Breast Cancer Wisconsin** dataset (binary).\n",
+    "\n",
+    "**Install (if needed):** `pip install linearboost optuna scikit-learn` — or install this repo editable: `pip install -e .` from the repository root."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "acae54e37e7d407bbb7b55eff062a284",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "import warnings\n",
+    "\n",
+    "import numpy as np\n",
+    "import optuna\n",
+    "from sklearn.datasets import load_breast_cancer\n",
+    "from sklearn.metrics import f1_score, roc_auc_score\n",
+    "from sklearn.model_selection import StratifiedKFold, train_test_split\n",
+    "from sklearn.pipeline import Pipeline\n",
+    "from sklearn.preprocessing import StandardScaler\n",
+    "\n",
+    "from linearboost import SEFRBoostClassifier\n",
+    "\n",
+    "warnings.filterwarnings(\"ignore\")\n",
+    "optuna.logging.set_verbosity(optuna.logging.WARNING)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "9a63283cbaf04dbcab1f6479b197f3a8",
+   "metadata": {},
+   "source": [
+    "## 1. Load data and train / test split\n",
+    "\n",
+    "`SEFRBoostClassifier` expects **dense numeric** input; we use `StandardScaler` in a pipeline."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "8dd0d8092fe74a7c96281538738b07e2",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "X, y = load_breast_cancer(return_X_y=True)\n",
+    "X_train, X_test, y_train, y_test = train_test_split(\n",
+    "    X, y, test_size=0.25, stratify=y, random_state=42\n",
+    ")\n",
+    "print(\"Train:\", X_train.shape, \"Test:\", X_test.shape, \"Classes:\", np.unique(y))"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "72eea5119410473aa328ad9291626812",
+   "metadata": {},
+   "source": [
+    "## 2. Quick baseline (default hyperparameters)\n",
+    "\n",
+    "`Pipeline(StandardScaler → SEFRBoostClassifier)`."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "8edb47106e1a46a883d545849b8ab81b",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "baseline = Pipeline(\n",
+    "    [\n",
+    "        (\"scale\", StandardScaler()),\n",
+    "        (\"clf\", SEFRBoostClassifier(n_estimators=50, random_state=42)),\n",
+    "    ]\n",
+    ")\n",
+    "baseline.fit(X_train, y_train)\n",
+    "y_pred = baseline.predict(X_test)\n",
+    "y_proba = baseline.predict_proba(X_test)[:, 1]\n",
+    "print(\"Baseline F1 (weighted):\", f1_score(y_test, y_pred, average=\"weighted\"))\n",
+    "print(\"Baseline ROC-AUC:\", roc_auc_score(y_test, y_proba))"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "10185d26023b46108eb7d9f57d49d2b3",
+   "metadata": {},
+   "source": [
+    "## 3. Optuna: maximize cross-validated F1\n",
+    "\n",
+    "Objective: suggest tree size, learning rate, depth, leaf constraints, and subsample; evaluate with **5-fold stratified CV** on the training set only (fast enough for local runs)."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "8763a12b2bbd4a93a75aff182afb95dc",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=42)\n",
+    "\n",
+    "\n",
+    "def objective(trial: optuna.Trial) -> float:\n",
+    "    params = {\n",
+    "        \"n_estimators\": trial.suggest_int(\"n_estimators\", 20, 150),\n",
+    "        \"learning_rate\": trial.suggest_float(\"learning_rate\", 0.02, 0.3, log=True),\n",
+    "        \"max_depth\": trial.suggest_int(\"max_depth\", 2, 6),\n",
+    "        \"min_samples_leaf\": trial.suggest_int(\"min_samples_leaf\", 5, 40),\n",
+    "        \"min_samples_split\": trial.suggest_int(\"min_samples_split\", 10, 80),\n",
+    "        \"subsample\": trial.suggest_float(\"subsample\", 0.6, 1.0),\n",
+    "        \"random_state\": 42,\n",
+    "    }\n",
+    "    pipe = Pipeline(\n",
+    "        [\n",
+    "            (\"scale\", StandardScaler()),\n",
+    "            (\"clf\", SEFRBoostClassifier(**params)),\n",
+    "        ]\n",
+    "    )\n",
+    "    scores = []\n",
+    "    for train_idx, val_idx in cv.split(X_train, y_train):\n",
+    "        pipe.fit(X_train[train_idx], y_train[train_idx])\n",
+    "        pred = pipe.predict(X_train[val_idx])\n",
+    "        scores.append(f1_score(y_train[val_idx], pred, average=\"weighted\"))\n",
+    "    return float(np.mean(scores))\n",
+    "\n",
+    "\n",
+    "study = optuna.create_study(direction=\"maximize\")\n",
+    "study.optimize(objective, n_trials=30, show_progress_bar=True)\n",
+    "print(\"Best trial:\", study.best_trial.number, \"F1 (CV mean):\", study.best_value)\n",
+    "print(\"Best params:\", study.best_params)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "id": "7623eae2785240b9bd12b16a66d81610",
+   "metadata": {},
+   "source": [
+    "## 4. Fit tuned model on full training set and evaluate on held-out test"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "7cdc8c89c7104fffa095e18ddfef8986",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "best = study.best_params.copy()\n",
+    "best[\"random_state\"] = 42\n",
+    "tuned = Pipeline(\n",
+    "    [\n",
+    "        (\"scale\", StandardScaler()),\n",
+    "        (\"clf\", SEFRBoostClassifier(**best)),\n",
+    "    ]\n",
+    ")\n",
+    "tuned.fit(X_train, y_train)\n",
+    "y_pred_t = tuned.predict(X_test)\n",
+    "y_proba_t = tuned.predict_proba(X_test)[:, 1]\n",
+    "print(\"Tuned F1 (weighted):\", f1_score(y_test, y_pred_t, average=\"weighted\"))\n",
+    "print(\"Tuned ROC-AUC:\", roc_auc_score(y_test, y_proba_t))"
+   ]
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "name": "python",
+   "version": "3.11.0"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 5
+}
@@ -1,9 +1,12 @@
-__version__ = "0.1.9"
+__version__ = "0.2.0"
 
 from .linear_boost import LinearBoostClassifier
 from .sefr import SEFR
+from .sefr_boost import SEFRBoostClassifier, SEFRBoostRegressor
 
 __all__ = [
     "LinearBoostClassifier",
     "SEFR",
+    "SEFRBoostClassifier",
+    "SEFRBoostRegressor",
 ]