
refactored decision tree into reusable components#316

Merged
Mec-iS merged 4 commits into smartcorelib:development from DanielLacina:decision_tree_refactor
Jul 12, 2025

Conversation

@DanielLacina
Contributor

No description provided.

@DanielLacina DanielLacina requested a review from Mec-iS as a code owner July 9, 2025 18:58

#[cfg_attr(feature = "serde", derive(Serialize, Deserialize))]
#[derive(Debug, Clone)]
struct Node {
@Mec-iS (Collaborator) Jul 11, 2025

this should probably be Node<TX>; why should it be limited to f64? In general, every time you write a fixed numerical type you should ask yourself whether it is generalisable.
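A generalised version might look like the sketch below. The field names beyond `output` are illustrative assumptions, not the actual struct from the codebase:

```rust
// Hypothetical sketch: Node generic over the numeric type TX
// instead of hard-coding f64. Field names are illustrative.
#[derive(Debug, Clone)]
struct Node<TX> {
    output: TX,              // value predicted at this node
    split_value: Option<TX>, // threshold used to split, if any
}

fn main() {
    // The same struct now works for both f32 and f64.
    let n32 = Node { output: 1.5f32, split_value: None };
    let n64 = Node { output: 1.5f64, split_value: Some(0.5) };
    println!("{} {}", n32.output, n64.output);
}
```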

@DanielLacina (Contributor, Author)

All this code is copied and pasted from the DecisionTree file, with the only difference being that I use the name BaseTree instead of DecisionTree. This implementation is already in your codebase: https://github.com/smartcorelib/smartcore/blob/development/src/tree/decision_tree_regressor.rs#L129.

@DanielLacina (Contributor, Author)

Plus, Node is an implementation detail (no pub keyword), so it doesn't matter what numeric type it contains.


impl PartialEq for Node {
    fn eq(&self, other: &Self) -> bool {
        (self.output - other.output).abs() < f64::EPSILON
    }
}
@Mec-iS (Collaborator)

again, will this work only for f64?
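If the comparison were generalised, one option is a small helper trait so the epsilon check works for any float width. This is a hypothetical sketch, not the codebase's approach (smartcore has its own numeric traits):

```rust
// Hypothetical sketch: approximate equality generic over the float type,
// instead of hard-coding f64::EPSILON in the PartialEq impl.
#[derive(Debug, Clone)]
struct Node<TX> {
    output: TX,
}

// Helper trait so the comparison works for both f32 and f64.
trait ApproxEq {
    fn approx_eq(self, other: Self) -> bool;
}

impl ApproxEq for f64 {
    fn approx_eq(self, other: Self) -> bool {
        (self - other).abs() < f64::EPSILON
    }
}

impl ApproxEq for f32 {
    fn approx_eq(self, other: Self) -> bool {
        (self - other).abs() < f32::EPSILON
    }
}

impl<TX: ApproxEq + Copy> PartialEq for Node<TX> {
    fn eq(&self, other: &Self) -> bool {
        self.output.approx_eq(other.output)
    }
}

fn main() {
    // 0.1 + 0.2 != 0.3 bit-for-bit, but they are equal within epsilon.
    let a = Node { output: 0.1f64 + 0.2 };
    let b = Node { output: 0.3f64 };
    println!("{}", a == b);
}
```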

@DanielLacina (Contributor, Author)

Node is an implementation detail


#[cfg_attr(feature = "serde", derive(Serialize, Deserialize))]
#[derive(Debug, Clone, Default)]
pub enum Splitter {
@Mec-iS (Collaborator)

missing docstring
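A doc comment in the style the codebase already uses might look like the following. The variant names here are assumptions for illustration, not necessarily the enum's actual variants:

```rust
/// The strategy used to choose the split threshold at each tree node.
///
/// `Best` evaluates every candidate threshold; `Random` samples them.
/// (Variant names are assumptions for illustration.)
#[derive(Debug, Clone, Default)]
pub enum Splitter {
    /// Search all candidate thresholds for the best split.
    #[default]
    Best,
    /// Pick the best among randomly drawn candidate thresholds.
    Random,
}

fn main() {
    println!("{:?}", Splitter::default());
}
```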


#[cfg_attr(feature = "serde", derive(Serialize, Deserialize))]
#[derive(Debug, Clone)]
/// Parameters of Regression base_tree
@Mec-iS (Collaborator)

this should explain a little more like:

/// Parameters for configuring the behavior of a decision tree regressor.
/// 
/// This struct controls various aspects of tree construction including
/// depth limits, sample requirements, and splitting strategies.

let mut order: Vec<Vec<usize>> = Vec::new();

for i in 0..num_attributes {
    let mut col_i: Vec<TX> = x.get_col(i).iterator(0).copied().collect();
@Mec-iS (Collaborator)

maybe this one could be just:

let mut col_view: Vec<f64> = x.get_col(i).iterator(0).copied().collect();
let indices = col_view.argsort_mut();
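`argsort_mut` is not a method on a standard `Vec`; assuming it were written as a free-standing helper, a plain-Rust argsort might look like this sketch:

```rust
// Hypothetical helper: return the indices that would sort `values`
// (an "argsort"), leaving the data itself untouched.
fn argsort(values: &[f64]) -> Vec<usize> {
    let mut indices: Vec<usize> = (0..values.len()).collect();
    // total_cmp gives a total order over f64, including NaN.
    indices.sort_by(|&a, &b| values[a].total_cmp(&values[b]));
    indices
}

fn main() {
    let col = vec![3.0, 1.0, 2.0];
    // Indices of `col` in ascending order of value.
    println!("{:?}", argsort(&col));
}
```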

@DanielLacina (Contributor, Author)

I copied and pasted this code from the decision tree file. That would probably be more readable. I'll be honest: the decision tree implementation was pretty hard to read, but it is indeed optimal.

fn parameters(&self) -> &BaseTreeRegressorParameters {
    self.parameters.as_ref().unwrap()
}
/// Get estimate of intercept, return value
@Mec-iS (Collaborator)

I don't think this doc comment is correct; please double-check.

@DanielLacina (Contributor, Author)

That's already written in the codebase.

@DanielLacina (Contributor, Author)

@Mec-iS
Mec-iS commented Jul 11, 2025

Thank you!

This is a nice starting point, making it possible to implement:

  • Classification Trees
  • Ensemble Methods: random forests
  • Feature Selection: mtry can be used for feature sampling

Mec-iS commented Jul 11, 2025

This opens up to implementations like:

trait TreeRegressor<TX, TY, X, Y> {
    fn fit_tree(&self, x: &X, y: &Y, samples: Vec<usize>) -> Result<Self, Failed>;
    fn predict_tree(&self, x: &X) -> Result<Y, Failed>;
}

trait EnsembleRegressor<TX, TY, X, Y> {
    fn fit_ensemble(&self, x: &X, y: &Y, n_estimators: usize) -> Result<Self, Failed>;
    fn predict_ensemble(&self, x: &X) -> Result<Y, Failed>;
}

is that correct?

Recommended Implementation Priority

  • Random Forest Regressor: Highest priority due to existing foundation
  • Gradient Boosting Regressor: High impact for predictive performance
  • Extra Trees Regressor: Low implementation cost with good diversity benefits
  • Quantile Regression Trees: Valuable for uncertainty quantification
  • Pruned Regression Trees: Important for model interpretability

@DanielLacina (Contributor, Author)

I think we should build Base models that other more complex models can build on top of.

@DanielLacina (Contributor, Author)

I don't know if traits are necessary. I think all we need is, for example, a complex model that stores the base model as an attribute; then we can add extra logic on top of the base model.
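The composition approach could be sketched like this. All type names and the placeholder logic are hypothetical, not from the codebase:

```rust
// Hypothetical sketch of composition over traits: an ensemble model
// that simply owns a collection of base trees as an attribute.
#[derive(Debug)]
struct BaseTree {
    max_depth: usize,
}

impl BaseTree {
    fn predict(&self, x: f64) -> f64 {
        // Placeholder standing in for a real tree traversal.
        x * self.max_depth as f64
    }
}

#[derive(Debug)]
struct RandomForest {
    trees: Vec<BaseTree>, // the base model stored as an attribute
}

impl RandomForest {
    fn predict(&self, x: f64) -> f64 {
        // Extra logic on top of the base model: average the tree outputs.
        let sum: f64 = self.trees.iter().map(|t| t.predict(x)).sum();
        sum / self.trees.len() as f64
    }
}

fn main() {
    let forest = RandomForest {
        trees: vec![BaseTree { max_depth: 2 }, BaseTree { max_depth: 4 }],
    };
    // Average of the two tree outputs (2.0 and 4.0).
    println!("{}", forest.predict(1.0));
}
```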

@Mec-iS Mec-iS self-requested a review July 12, 2025 10:24
@Mec-iS Mec-iS merged commit c5816b0 into smartcorelib:development Jul 12, 2025
11 checks passed