+# noqa: E501
 """
-This is to clean the data pulled
-by the photos_detail.py script
-so as to further delete useless columns
-and reorganize the dataset as this form:
+This is to clean the data pulled by the photos_detail.py script so as to
+further delete useless columns and reorganize the dataset as this form:

-locations | amount | time | license | content_categories | highest_comment | total_view
-Minneapolis, United States | 20 | 2022-10-22 | 4 | football, life | 105 | 100000
-São José do Rio Preto SP, Brasil | 30 | 2022-10-22 | 4 | football, life | 50 | 300000
+| locations                        | amount | time       | license | content_categories | highest_comment | total_view | # noqa: E501
+| -------------------------------- | -----: | ---------- | ------: | ------------------ | --------------: | ---------: | # noqa: E501
+| Minneapolis, United States       |     20 | 2022-10-22 |       4 | football, life     |             105 |     100000 | # noqa: E501
+| São José do Rio Preto SP, Brasil |     30 | 2022-10-22 |       4 | football, life     |              50 |     300000 | # noqa: E501
 ...
+
 Note:
 content_categories will be got from basic NLP on the tags column
 """
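
The note about deriving content_categories from the tags column could be read as a simple frequency count over normalized tags. The sketch below is only one possible interpretation, not the script's actual method: the function name `content_categories`, the `top_n` parameter, and the assumption that each photo's tags arrive as a comma-separated string are all hypothetical.

```python
from collections import Counter


def content_categories(tag_strings, top_n=2):
    """Return the top_n most frequent tags as a comma-separated string.

    tag_strings: iterable of raw tag strings, one per photo, each holding
    comma-separated tags (an assumed layout for the tags column).
    """
    counts = Counter()
    for raw in tag_strings:
        # Normalize: split on commas, trim whitespace, lowercase, drop empties.
        # A set is used so a tag repeated within one photo counts only once.
        tags = {t.strip().lower() for t in raw.split(",") if t.strip()}
        counts.update(tags)
    # Sort by descending count, then alphabetically, so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return ", ".join(tag for tag, _ in ranked[:top_n])


# Hypothetical tags for four photos taken at the same location.
photos = ["Football, stadium", "football, Life", "life, street", "football"]
result = content_categories(photos)
```

With the sample input above, `result` is `"football, life"`, which matches the shape of the content_categories column shown in the docstring table.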