@@ -738,7 +738,7 @@ <h3>Data loading<a class="headerlink" href="#data-loading" title="Link to this h
738738< span class ="n "> penguins</ span > < span class ="o "> =</ span > < span class ="n "> pd</ span > < span class ="o "> .</ span > < span class ="n "> read_csv</ span > < span class ="p "> (</ span > < span class ="s2 "> "../datasets/penguins_regression.csv"</ span > < span class ="p "> )</ span >
739739< span class ="n "> feature_name</ span > < span class ="o "> =</ span > < span class ="s2 "> "Flipper Length (mm)"</ span >
740740< span class ="n "> target_name</ span > < span class ="o "> =</ span > < span class ="s2 "> "Body Mass (g)"</ span >
741- < span class ="n "> data</ span > < span class ="p "> ,</ span > < span class ="n "> target</ span > < span class ="o "> =</ span > < span class ="n "> penguins</ span > < span class ="p "> [[</ span > < span class ="n "> feature_name</ span > < span class ="p "> ]],</ span > < span class ="n "> penguins</ span > < span class ="p "> [</ span > < span class ="n "> target_name</ span > < span class ="p "> ]</ span >
741+ < span class ="n "> data</ span > < span class ="p "> ,</ span > < span class ="n "> target</ span > < span class ="o "> =</ span > < span class ="n "> penguins</ span > < span class ="p "> [[</ span > < span class ="n "> feature_name</ span > < span class ="p "> ]],</ span > < span class ="n "> penguins</ span > < span class ="p "> [[ </ span > < span class ="n "> target_name</ span > < span class ="p "> ] ]</ span >
742742</ pre > </ div >
743743</ div >
744744</ div >
@@ -823,7 +823,7 @@ <h2>Main exercise<a class="headerlink" href="#main-exercise" title="Link to this
823823< span class ="k "> def</ span > < span class ="w "> </ span > < span class ="nf "> goodness_fit_measure</ span > < span class ="p "> (</ span > < span class ="n "> true_values</ span > < span class ="p "> ,</ span > < span class ="n "> predictions</ span > < span class ="p "> ):</ span >
824824 < span class ="c1 "> # we compute the error between the true values and the predictions of our</ span >
825825 < span class ="c1 "> # model</ span >
826- < span class ="n "> errors</ span > < span class ="o "> =</ span > < span class ="n "> np </ span > < span class =" o " > . </ span > < span class =" n " > ravel </ span > < span class =" p " > ( </ span > < span class =" n " > true_values</ span > < span class =" p " > ) </ span > < span class ="o "> -</ span > < span class ="n "> np </ span > < span class =" o " > . </ span > < span class =" n " > ravel </ span > < span class =" p " > ( </ span > < span class =" n " > predictions</ span > < span class =" p " > ) </ span >
826+ < span class ="n "> errors</ span > < span class ="o "> =</ span > < span class ="n "> true_values</ span > < span class ="o "> -</ span > < span class ="n "> predictions</ span >
827827 < span class ="c1 "> # We have several possible strategies to reduce all errors to a single value.</ span >
828828 < span class ="c1 "> # Computing the mean error (sum divided by the number of element) might seem</ span >
829829 < span class ="c1 "> # like a good solution. However, we have negative errors that will misleadingly</ span >
@@ -859,15 +859,15 @@ <h2>Main exercise<a class="headerlink" href="#main-exercise" title="Link to this
859859< div class ="cell_output docutils container ">
860860< div class ="output stream highlight-myst-ansi notranslate "> < div class ="highlight "> < pre > < span > </ span > Model #0:
861861-40.00 (g / mm) * flipper length + 15000.00 (g)
862- Error: 2764.854
862+ Error: nan
863863
864864Model #1:
86586545.00 (g / mm) * flipper length + -5000.00 (g)
866- Error: 338.523
866+ Error: nan
867867
868868Model #2:
86986990.00 (g / mm) * flipper length + -14000.00 (g)
870- Error: 573.041
870+ Error: nan
871871</ pre > </ div >
872872</ div >
873873</ div >
0 commit comments