fix some typos

glemaitre · glemaitre · commit b204b82d04a9 · 2025-08-17T10:50:29.000+02:00
diff --git a/content/python_files/feature_engineering.py b/content/python_files/feature_engineering.py
@@ -62,7 +62,7 @@
 # We wrap the resulting polars dataframe in a `skrub` DataOp to benefit
 # from the built-in `skrub.TableReport` display in the notebook. Using the
 # `skrub` DataOps will also be useful for other reasons: all
-# operations in this notebook chain operations chained together in a directed
+# operations in this notebook are chained together in a directed
 # acyclic graph that is automatically tracked by `skrub`. This allows us to
 # extract the resulting pipeline and apply it to new data later on, exactly
 # like a trained scikit-learn pipeline. The main difference is that we do so
diff --git a/content/python_files/single_horizon_prediction.py b/content/python_files/single_horizon_prediction.py
@@ -159,7 +159,7 @@
 #
 # In the example below, we define that the training data should be at most 2 years
 # worth of data and the test data should be 24 weeks long. We also define a gap of
-# 1 week between the training.
+# 1 week between the training and the testing sets.
 #
 # Let's check those statistics by iterating over the different folds provided by the
 # splitter.
@@ -286,7 +286,7 @@
 # A true model is navigating between the diagonal and the oracle model. The area between
 # the diagonal and the Lorenz curve of a model is called the Gini index.
 #
-# For our model, we observe that each oracle model is not far from the diagonal. It
+# For our use case, we observe that each oracle model is not far from the diagonal. It
 # means that the observed values do not contain a couple of large values with high
 # variability. Therefore, it informs us that the complexity of our problem at hand is
 # not too high. Looking at the Lorenz curve of each model, we observe that it is quite
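
Note on the last hunk: the comment describes the Lorenz curve of a model, the oracle model, and the area between the diagonal and the curve. The sketch below is a minimal pure-Python illustration of that idea and is not the code used in `single_horizon_prediction.py`. The helper names `lorenz_curve` and `area_to_diagonal` and the toy data are hypothetical. It assumes samples are ranked by increasing predicted value, and that the area is reported as-is (some conventions multiply it by two to obtain the Gini index).

```python
def lorenz_curve(observed, predicted):
    """Return the (x, y) points of a Lorenz curve.

    Samples are ranked by increasing `predicted` value (an assumption of this
    sketch); y is the cumulative share of `observed` covered so far.
    """
    n = len(observed)
    total = sum(observed)
    order = sorted(range(n), key=lambda i: predicted[i])
    xs, ys = [0.0], [0.0]
    running = 0.0
    for rank, i in enumerate(order, start=1):
        running += observed[i]
        xs.append(rank / n)
        ys.append(running / total)
    return xs, ys


def area_to_diagonal(xs, ys):
    """Area between the diagonal and the curve, using the trapezoidal rule."""
    area_under_curve = sum(
        (xs[k] - xs[k - 1]) * (ys[k] + ys[k - 1]) / 2 for k in range(1, len(xs))
    )
    # The area under the diagonal of the unit square is 0.5.
    return 0.5 - area_under_curve


# Hypothetical toy data: one large observed value among small ones.
observed = [10.0, 12.0, 11.0, 13.0, 50.0]
predicted = [11.0, 11.5, 12.0, 12.5, 30.0]

# The oracle ranks samples by the observed values themselves.
oracle_area = area_to_diagonal(*lorenz_curve(observed, observed))
model_area = area_to_diagonal(*lorenz_curve(observed, predicted))
print(f"oracle area: {oracle_area:.3f}, model area: {model_area:.3f}")
```

An oracle area close to zero means the oracle curve stays near the diagonal, which is the situation the comment describes: the observed values are not dominated by a few large ones.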
