Skip to content

Commit 67489f1

Browse files
Merge pull request #161 from LaunchCodeEducation/eda-with-pands-audit
Typos, among some calrity issues fixed for all reading material
2 parents 7b1ee46 + b6862eb commit 67489f1

7 files changed

Lines changed: 19 additions & 19 deletions

File tree

content/eda-with-pandas/reading/conditional-formatting/_index.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ draft = false
55
weight = 5
66
+++
77

8-
When exploring data, using pandas you can also apply conditional formatting similar to how you did using spreadsheets. Say, for instance, you only want to display related to a specific city, state, movie genre, or name, you can do so!
8+
When exploring data, using pandas you can also apply conditional formatting similar to how you did using spreadsheets. Say, for instance, you only want to display data related to a specific city, state, movie genre, or name, you can do so!
99

1010
We will begin by identifying rows based on a condition using one column of data.
1111

content/eda-with-pandas/reading/dataframes-with-pandas/_index.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ weight = 3
77

88
A pandas **DataFrame** is the second type of class that is capable of handling data.
99

10-
Similar to a spreadsheet, a DataFrame can be visualzed as having multiple columns and rows associated with the data inside. The data within can be of any type.
10+
Similar to a spreadsheet, a DataFrame can be visualized as having multiple columns and rows associated with the data inside. The data within can be of any type.
1111

1212
A DataFrame can also be considered a collection or assortment of Series. Similar to a Series there are multiple ways that a DataFrame can be created:
1313
1. Using a multi-dimensional list, dictionary, or tuple
@@ -66,7 +66,7 @@ One thing to note about lists when they are added into a DataFrame is that each
6666
import pandas as pd
6767

6868
# Create a pandas DataFrame by providing a dictionary
69-
movie_dictionary_dataframe = pd.DataFrame(movies = {'Name': ["Interstellar", "Pride and Prejudice", "Inception", "Barbie"],'Release': [2014, 2005, 2010, 2003]})
69+
movie_dictionary_dataframe = pd.DataFrame({'Name': ["Interstellar", "Pride and Prejudice", "Inception", "Barbie"],'Release': [2014, 2005, 2010, 2003]})
7070

7171
# Create a pandas DataFrame from a pre-existing dictionary
7272
movies = {'Name': ["Interstellar", "Pride and Prejudice", "Inception", "Barbie"],'Release': [2014, 2005, 2010, 2003]}
@@ -131,7 +131,7 @@ movies_genres_dataframe = pd.concat([movies, genres], axis=1) # axis 1 specifies
131131
the `axis` parameter specifies whether the data will be joined or combined along the *row* or *column*. Take a look at the table below. If you do not specify `axis=1` it will default to `axis=0`.
132132

133133
| Axis | Represents | Use Case |
134-
|---|---|---|---|
134+
|---|---|---|
135135
| 0 (default) | **Row** | Operations performed **across rows** |
136136
| 1 | **Column** | Operations performed **down each column** |
137137
{{% /notice %}}

content/eda-with-pandas/reading/exploring-data-with-pandas/_index.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ weight = 1
77

88
Exploratory Data Analysis or EDA as you already know is a critical step when beginning your analysis work. Similar to the EDA work with spreadsheets we will do the same with Python and **pandas** in order to accomplish the following:
99

10-
1. Form a hypotheses about what is the underlying forces effecting your data.
10+
1. Form a hypothesis about what is the underlying forces effecting your data.
1111
1. Challenge previous assumptions that may have been made when discussing the business issue.
1212
1. Guide you on what tools and techniques you should use when working with that dataset.
1313

@@ -55,7 +55,7 @@ import pandas as pd
5555

5656
## NumPy
5757

58-
The NumPy library will be used in conjuction with pandas so that we can perform mathematical operations on some of our datasets. As we explore our data and in later chapters, begin cleaning and manipulating data we will use the tools it provides to make our life easier.
58+
The NumPy library will be used in conjunction with pandas so that we can perform mathematical operations on some of our datasets. As we explore our data and in later chapters, begin cleaning and manipulating data we will use the tools it provides to make our life easier.
5959

6060
{{% notice blue Note "rocket" %}}
6161
Once NumPy is installed, it can be imported into your workspace in the following way:

content/eda-with-pandas/reading/numpy-intro/_index.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ weight = 2
77

88
NumPy, or Numerical Python, provides additional flexibility in working with data than standard Python lists. The foundational element of NumPy are arrays, and the main structure found in NumPy is the `ndarray`.
99

10-
Arrays can hold a collection of the same data type and can be one-dimesional, also called a vector, or multi-dimensional, such as a matrix. The benefits of NumPy arrays are efficient element access and data manipulation / transformation.
10+
Arrays can hold a collection of the same data type and can be one-dimensional, also called a vector, or multi-dimensional, such as a matrix. The benefits of NumPy arrays are efficient element access and data manipulation / transformation.
1111

1212
### The basics of NumPy arrays
1313

@@ -26,7 +26,7 @@ print(array1)
2626
[8 5 3 2 1 1]
2727
```
2828

29-
Multi-dimensional arrays can be created applying the same method using mutiple nested lists, such as the following two-dimensional array (or matrix)
29+
Multi-dimensional arrays can be created applying the same method using multiple nested lists, such as the following two-dimensional array (or matrix)
3030

3131
```python {linenos=table}
3232
# import numpy as np
@@ -45,7 +45,7 @@ print(array2)
4545

4646
### Basic array operations
4747

48-
Many functions can be performed on arrays, such as determing attributes of the array, sorting, spliting or combining arrays, and, of course, numerical operations.
48+
Many functions can be performed on arrays, such as determining attributes of the array, sorting, spliting or combining arrays, and, of course, numerical operations.
4949

5050
We will introduce a brief selection of the broad array (no pun intended) of functions which provide useful insights into NumPy arrays.
5151

@@ -118,7 +118,7 @@ print(array5)
118118
[3 7 9]]
119119
```
120120

121-
Lastly for this introduction, arthimetic functions can be performed on an array as a whole, or for a specific dimension.
121+
Lastly for this introduction, arithmetic functions can be performed on an array as a whole, or for a specific dimension.
122122

123123
```python {linenos=table}
124124
# import numpy as np

content/eda-with-pandas/reading/pandas-functions/_index.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -29,14 +29,14 @@ Two functions below that may stick out to you might be the `.info()` and the `.d
2929
| `.fillna(desired_value)` | Fills null values in a DataFrame with the specified value. |
3030
| `.groupby('column_name')` | Groups a DataFrame by the unique values in the specified column. |
3131
| `.value_counts()` | Computes the value counts for each element in a DataFrame. |
32-
| `.shape()` | Returns a tuple representing the dimensions of your data structure |
32+
| `.shape` | Returns a tuple representing the dimensions of your data structure |
3333
| `.loc[]` | Method used to locate rows or columns based on label-indexing |
3434
| `.iloc[]` | Method used to locate rows or columns based on index positions `(0, 1, 2 ,3 ,etc..)` |
3535

3636
{{% notice blue Note "rocket" %}}
37-
When using the `.shape()` function for a DataFrame it will show the number of rows and columns `(rows, columns)`.
37+
When using the `.shape` attribute for a DataFrame it will show the number of rows and columns `(rows, columns)`.
3838

39-
When using the `.shape()` function for a Series it will only return the number of rows and the column will be empty `(rows, )`
39+
When using the `.shape` attribute for a Series it will only return the number of rows and the column will be empty `(rows, )`
4040
{{% /notice %}}
4141

4242
## Statistics with pandas
@@ -87,7 +87,7 @@ How do we view only the first 13 rows of a DataFrame?
8787
{{% /notice %}}
8888

8989
{{% notice green Question "rocket" %}}
90-
Which pandas function will prin the number of records, three quartiles, mean, standard deviation, minimum and maximum values of a DataFrame?
90+
Which pandas function will print the number of records, three quartiles, mean, standard deviation, minimum and maximum values of a DataFrame?
9191

9292
<!-- Solution: .describe() -->
9393
{{% /notice %}}

content/eda-with-pandas/reading/series-with-pandas/_index.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -170,7 +170,7 @@ What does index-labeling default to if none are provided?
170170
{{% /notice %}}
171171

172172
{{% notice green Question "rocket" %}}
173-
What type of data is a pandas Series capable of holidng?
173+
What type of data is a pandas Series capable of holding?
174174

175175
<!-- Solution: Any type -->
176176
{{% /notice %}}

content/eda-with-pandas/reading/visuals-with-pandas/_index.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ weight = 4
66
hidden = true
77
+++
88

9-
The `pandas` library works in conjuction and is able to integrate other libraries very easily. One of those libraries is **Matplotlib**, which is a library used to visalize the data that you are working with.
9+
The `pandas` library works in conjuction and is able to integrate other libraries very easily. One of those libraries is **Matplotlib**, which is a library used to visualize the data that you are working with.
1010

1111
## Installation
1212

@@ -22,7 +22,7 @@ import matplotlib as mpl
2222
```
2323

2424
```python
25-
# best practice when is to import matplotlib.pyplot as plt
25+
# best practice is to import matplotlib.pyplot as plt
2626
import matplotlib.pyplot as plt
2727
```
2828

@@ -31,7 +31,7 @@ import matplotlib.pyplot as plt
3131

3232
When importing matplotlib you are also importing `pyplot` as well. You could reference pyplot as such: `mpl.pyplot.function_here()`
3333

34-
Often times pyplot is used to generate 2-D graphics and since it is referenced often importing it separately and assigning it it's own alias is helpful and may save some time!
34+
Often times pyplot is used to generate 2-D graphics and since it is referenced often importing it separately and assigning it its own alias is helpful and may save some time!
3535
{{% /notice %}}
3636

3737
## Example
@@ -54,7 +54,7 @@ plt.show()
5454
```
5555

5656
{{% notice blue Note "rocket" %}}
57-
When using tools like Jupyter Notebooks you may not need to include the `plt.show()` line of code shown on line 11 in the above code block.
57+
When using tools like Jupyter Notebooks you may not need to include the `plt.show()` line of code shown in the above code block.
5858

5959
The `plt.show()` when used within a terminal emulator environment will create a separate pop-out of the graph. Tools like Jupyter Notebooks will include the visual within the notebook itself, eliminating the need for the pop out window.
6060
{{% /notice %}}

0 commit comments

Comments
 (0)