Explicitly mention that TableVectorizer selects columns automatically, based on each column's dtype
Introduce the concept of "low/high cardinality" and demonstrate the effect of cardinality_threshold on the "native-country" column in the Adult Census dataset.
Update the "Visualizing scikit-learn pipelines" video to use TableVectorizer (with scikit-learn version >= 1.8)
Modify the wrap-up quizzes that use the Ames Housing dataset (i.e. M1, M4 and M5) to select a subset of numerical columns with pandas
Replace ColumnTransformer with skrub.TableVectorizer, starting from the "Using numerical and categorical variables together" notebook
Replace OrdinalEncoder by skrub.ToCategorical.
TableReport
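The low/high cardinality idea can be sketched with pandas alone. The helper `cardinality_kind` below is hypothetical and does not call skrub. It assumes the rule that a column counts as low cardinality when it has strictly fewer unique values than the threshold, which matches our reading of the TableVectorizer documentation but should be checked against the installed version. The data is a toy stand-in for the "native-country" column.

```python
import pandas as pd

# Hypothetical toy data standing in for the Adult Census dataset.
df = pd.DataFrame({
    "native-country": [
        "United-States", "Mexico", "India", "Cuba", "United-States", "Canada",
    ],
    "sex": ["Male", "Female", "Female", "Male", "Male", "Female"],
})


def cardinality_kind(column: pd.Series, cardinality_threshold: int) -> str:
    """Return "low" if the column has strictly fewer unique values
    than the threshold, and "high" otherwise (assumed rule)."""
    return "low" if column.nunique() < cardinality_threshold else "high"


# Changing the threshold changes which group "native-country" falls into.
for threshold in (3, 40):
    kinds = {name: cardinality_kind(df[name], threshold) for name in df.columns}
    print(threshold, kinds)
```

In this toy example "native-country" has 5 distinct values, so it is treated as high cardinality with a threshold of 3 and as low cardinality with a threshold of 40.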
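For the wrap-up quizzes item, selecting a subset of numerical columns with pandas could look like the sketch below. The dataframe is a small illustrative stand-in for the Ames Housing data, and the chosen subset `numerical_columns` is an assumption, not the list used in the quizzes.

```python
import pandas as pd

# Illustrative rows only, using a few Ames Housing column names.
ames = pd.DataFrame({
    "LotArea": [8450, 9600, 11250],
    "YearBuilt": [2003, 1976, 2001],
    "Neighborhood": ["CollgCr", "Veenker", "CollgCr"],
    "SalePrice": [208500, 181500, 223500],
})

# Separate the target from the features.
target_name = "SalePrice"
data = ames.drop(columns=target_name)

# Option 1: pick an explicit subset of numerical columns by name.
numerical_columns = ["LotArea", "YearBuilt"]  # hypothetical subset
data_numerical = data[numerical_columns]

# Option 2: let pandas keep every column with a numeric dtype.
all_numerical = data.select_dtypes(include="number")

print(data_numerical.columns.tolist())
print(all_numerical.columns.tolist())
```

Selecting by name keeps the quiz independent of how dtypes are inferred at load time, while `select_dtypes` is shorter when every numeric column is wanted.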