File tree Expand file tree Collapse file tree 7 files changed +59
-61
lines changed
_freeze/chapters/10_data_ops/execute-results Expand file tree Collapse file tree 7 files changed +59
-61
lines changed Original file line number Diff line number Diff line change 22 "cells" : [
33 {
44 "cell_type" : " markdown" ,
5- "id" : " b5dcbf2f " ,
5+ "id" : " 43539b7a " ,
66 "metadata" : {},
77 "source" : [
88 " # Exercise: exploring a new table\n " ,
1515 {
1616 "cell_type" : " code" ,
1717 "execution_count" : null ,
18- "id" : " a39de6a3 " ,
18+ "id" : " ffca3764 " ,
1919 "metadata" : {},
2020 "outputs" : [],
2121 "source" : [
2626 },
2727 {
2828 "cell_type" : " markdown" ,
29- "id" : " 6bbb0310 " ,
29+ "id" : " 71340102 " ,
3030 "metadata" : {},
3131 "source" : [
3232 " Now use the skrub `TableReport` and answer the following questions:"
3535 {
3636 "cell_type" : " code" ,
3737 "execution_count" : null ,
38- "id" : " 04295734 " ,
38+ "id" : " 85261d65 " ,
3939 "metadata" : {},
4040 "outputs" : [],
4141 "source" : [
4747 },
4848 {
4949 "cell_type" : " markdown" ,
50- "id" : " 61f7edb6 " ,
50+ "id" : " 7fdd5f56 " ,
5151 "metadata" : {
5252 "lines_to_next_cell" : 0
5353 },
7676 },
7777 {
7878 "cell_type" : " markdown" ,
79- "id" : " 7ba6e8d2 " ,
79+ "id" : " f06677b4 " ,
8080 "metadata" : {},
8181 "source" : [
8282 " ## Answers\n " ,
104104 },
105105 {
106106 "cell_type" : " markdown" ,
107- "id" : " b5d2a481 " ,
107+ "id" : " dbbeba4e " ,
108108 "metadata" : {},
109109 "source" : [
110110 " # Exercise: clean a dataframe using the `Cleaner`\n " ,
114114 {
115115 "cell_type" : " code" ,
116116 "execution_count" : null ,
117- "id" : " 6e6b0d18 " ,
117+ "id" : " 543d7760 " ,
118118 "metadata" : {},
119119 "outputs" : [],
120120 "source" : [
125125 },
126126 {
127127 "cell_type" : " markdown" ,
128- "id" : " cf0b8964 " ,
128+ "id" : " d28bd083 " ,
129129 "metadata" : {},
130130 "source" : [
131131 " Use the `TableReport` to answer the following questions:\n " ,
138138 {
139139 "cell_type" : " code" ,
140140 "execution_count" : null ,
141- "id" : " e8d0ec23 " ,
141+ "id" : " 26cd518d " ,
142142 "metadata" : {},
143143 "outputs" : [],
144144 "source" : [
149149 },
150150 {
151151 "cell_type" : " markdown" ,
152- "id" : " c8f51c8b " ,
152+ "id" : " 5fe48fa7 " ,
153153 "metadata" : {},
154154 "source" : [
155155 " Then, use the `Cleaner` to sanitize the data so that:\n " ,
162162 {
163163 "cell_type" : " code" ,
164164 "execution_count" : null ,
165- "id" : " 4fb2a1ba " ,
165+ "id" : " 3aa9513d " ,
166166 "metadata" : {},
167167 "outputs" : [],
168168 "source" : [
182182 {
183183 "cell_type" : " code" ,
184184 "execution_count" : null ,
185- "id" : " 0c3c78dc " ,
185+ "id" : " 28416486 " ,
186186 "metadata" : {},
187187 "outputs" : [],
188188 "source" : [
205205 },
206206 {
207207 "cell_type" : " markdown" ,
208- "id" : " 8a1c1797 " ,
208+ "id" : " 47ccd712 " ,
209209 "metadata" : {},
210210 "source" : [
211211 " We can inspect which columns were dropped and what transformations were applied:"
214214 {
215215 "cell_type" : " code" ,
216216 "execution_count" : null ,
217- "id" : " d8d13e9a " ,
217+ "id" : " 8fea1c0a " ,
218218 "metadata" : {},
219219 "outputs" : [],
220220 "source" : [
Original file line number Diff line number Diff line change 22 "cells" : [
33 {
44 "cell_type" : " markdown" ,
5- "id" : " 0a1656ab " ,
5+ "id" : " 386e67d8 " ,
66 "metadata" : {},
77 "source" : [
88 " # Exercise: using selectors together with `ApplyToCols`\n " ,
1212 {
1313 "cell_type" : " code" ,
1414 "execution_count" : null ,
15- "id" : " 94bc1caa " ,
15+ "id" : " b6e85b11 " ,
1616 "metadata" : {
1717 "lines_to_next_cell" : 0
1818 },
2424 {
2525 "cell_type" : " code" ,
2626 "execution_count" : null ,
27- "id" : " 743b063a " ,
27+ "id" : " 0b0b1069 " ,
2828 "metadata" : {},
2929 "outputs" : [],
3030 "source" : [
4646 },
4747 {
4848 "cell_type" : " markdown" ,
49- "id" : " 4da47e99 " ,
49+ "id" : " 78be5c02 " ,
5050 "metadata" : {},
5151 "source" : [
5252 " Using the skrub selectors and `ApplyToCols`:\n " ,
5959 {
6060 "cell_type" : " code" ,
6161 "execution_count" : null ,
62- "id" : " 190aa32e " ,
62+ "id" : " 5f1b9b9f " ,
6363 "metadata" : {},
6464 "outputs" : [],
6565 "source" : [
8484 {
8585 "cell_type" : " code" ,
8686 "execution_count" : null ,
87- "id" : " b9c6b4ee " ,
87+ "id" : " 2a5ed58a " ,
8888 "metadata" : {},
8989 "outputs" : [],
9090 "source" : [
103103 },
104104 {
105105 "cell_type" : " markdown" ,
106- "id" : " 6e321108 " ,
106+ "id" : " 10d0a234 " ,
107107 "metadata" : {},
108108 "source" : [
109109 " Given the same dataframe and using selectors, drop only string columns that contain\n " ,
113113 {
114114 "cell_type" : " code" ,
115115 "execution_count" : null ,
116- "id" : " 81750237 " ,
116+ "id" : " 0173a43b " ,
117117 "metadata" : {},
118118 "outputs" : [],
119119 "source" : [
132132 {
133133 "cell_type" : " code" ,
134134 "execution_count" : null ,
135- "id" : " f0721941 " ,
135+ "id" : " b3498f67 " ,
136136 "metadata" : {},
137137 "outputs" : [],
138138 "source" : [
143143 },
144144 {
145145 "cell_type" : " markdown" ,
146- "id" : " d3f434c8 " ,
146+ "id" : " ba37c85e " ,
147147 "metadata" : {},
148148 "source" : [
149149 " Now write a custom function that selects columns where all values are lower than\n " ,
153153 {
154154 "cell_type" : " code" ,
155155 "execution_count" : null ,
156- "id" : " de74b276 " ,
156+ "id" : " 6954ae31 " ,
157157 "metadata" : {},
158158 "outputs" : [],
159159 "source" : [
172172 {
173173 "cell_type" : " code" ,
174174 "execution_count" : null ,
175- "id" : " 333aaa5f " ,
175+ "id" : " 643b3c89 " ,
176176 "metadata" : {},
177177 "outputs" : [],
178178 "source" : [
189189 {
190190 "cell_type" : " code" ,
191191 "execution_count" : null ,
192- "id" : " d5751f7a " ,
192+ "id" : " d06d56eb " ,
193193 "metadata" : {},
194194 "outputs" : [],
195195 "source" : []
Original file line number Diff line number Diff line change 22 "cells" : [
33 {
44 "cell_type" : " markdown" ,
5- "id" : " 564b4538 " ,
5+ "id" : " 49231767 " ,
66 "metadata" : {},
77 "source" : [
88 " # Exercise\n " ,
1919 {
2020 "cell_type" : " code" ,
2121 "execution_count" : null ,
22- "id" : " 0bf5f727 " ,
22+ "id" : " cd9fec26 " ,
2323 "metadata" : {},
2424 "outputs" : [],
2525 "source" : [
2929 {
3030 "cell_type" : " code" ,
3131 "execution_count" : null ,
32- "id" : " 0fa39f1b " ,
32+ "id" : " 5524d644 " ,
3333 "metadata" : {},
3434 "outputs" : [],
3535 "source" : [
5353 {
5454 "cell_type" : " code" ,
5555 "execution_count" : null ,
56- "id" : " ee0af0a3 " ,
56+ "id" : " 49cbe625 " ,
5757 "metadata" : {},
5858 "outputs" : [],
5959 "source" : [
7777 {
7878 "cell_type" : " code" ,
7979 "execution_count" : null ,
80- "id" : " 17c31bf7 " ,
80+ "id" : " 10ef6861 " ,
8181 "metadata" : {},
8282 "outputs" : [],
8383 "source" : [
100100 {
101101 "cell_type" : " code" ,
102102 "execution_count" : null ,
103- "id" : " e27be910 " ,
103+ "id" : " 96f3c600 " ,
104104 "metadata" : {},
105105 "outputs" : [],
106106 "source" : [
120120 },
121121 {
122122 "cell_type" : " markdown" ,
123- "id" : " 145332dd " ,
123+ "id" : " 68fa9b12 " ,
124124 "metadata" : {},
125125 "source" : [
126126 " Modify the script so that the `DatetimeEncoder` adds periodic encoding with sine\n " ,
130130 {
131131 "cell_type" : " code" ,
132132 "execution_count" : null ,
133- "id" : " 9340e84c " ,
133+ "id" : " f28f8625 " ,
134134 "metadata" : {},
135135 "outputs" : [],
136136 "source" : [
153153 },
154154 {
155155 "cell_type" : " markdown" ,
156- "id" : " 3ee9e75a " ,
156+ "id" : " b2c56e8a " ,
157157 "metadata" : {},
158158 "source" : [
159159 " Now modify the script above to add spline features (`periodic_encoding=\" spline\" `).\n "
162162 {
163163 "cell_type" : " code" ,
164164 "execution_count" : null ,
165- "id" : " 584444a9 " ,
165+ "id" : " ec6b6d1f " ,
166166 "metadata" : {},
167167 "outputs" : [],
168168 "source" : [
188188 {
189189 "cell_type" : " code" ,
190190 "execution_count" : null ,
191- "id" : " 36cfe30a " ,
191+ "id" : " 59f4879f " ,
192192 "metadata" : {},
193193 "outputs" : [],
194194 "source" : []
You can’t perform that action at this time.
0 commit comments