Skip to content

Commit d885b9c

Browse files
Sync notes and files as of 2025-03-17 19:00:12 UTC
1 parent e8cff6f commit d885b9c

5 files changed

Lines changed: 14 additions & 9 deletions

File tree

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,12 +1,12 @@
11
---
2-
title: Learn about API
2+
title: Learn API
33
lang: en
44
category: data-coding
55
tags:
66
- learn
77
season: autumn
88
created: 2024-03-18
9-
updated: 2024-12-22
9+
updated: 2025-03-17
1010
sources: Misc
1111
---
1212

_notes/data-coding/Learn Git.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ category: data-coding
55
tags: [learn, git]
66
season: summer
77
created: 2021-02-13
8-
updated: 2025-02-26
8+
updated: 2025-03-17
99
sources: https://stackoverflow.com/questions/18418718/git-still-adds-and-tracks-folders-marked-in-gitignore
1010
---
1111

@@ -109,7 +109,7 @@ sources: https://stackoverflow.com/questions/18418718/git-still-adds-and-tracks-
109109
- Version-control large datasets, esp. in ML projects, using [DVC](https://dvc.org/)
110110
- [Merging vs Rebasing](https://www.atlassian.com/git/tutorials/merging-vs-rebasing)
111111
- [Git's data model](https://missing.csail.mit.edu/2020/version-control/)
112-
- Git uses hashing via [SHA-1](https://en.wikipedia.org/wiki/SHA-1) – maps arbitrary-sized inputs to 160-bit outputs (which can be represented as 40 hexadecimal characters, e.g. commit hashes) but [no longer unbroken](https://shattered.io/); more info in [[Learn about cryptography]]
112+
- Git uses hashing via [SHA-1](https://en.wikipedia.org/wiki/SHA-1) – maps arbitrary-sized inputs to 160-bit outputs (which can be represented as 40 hexadecimal characters, e.g. commit hashes) but [no longer unbroken](https://shattered.io/); more info in [[Learn cryptography]]
113113
- There are [many](https://nvie.com/posts/a-successful-git-branching-model/) [different](https://www.endoflineblog.com/gitflow-considered-harmful) [workflows](https://www.atlassian.com/git/tutorials/comparing-workflows/gitflow-workflow), i.e. practices to follow when working on big projects.
114114
- [Analyse how a Git repo grows over time](https://github.com/erikbern/git-of-theseus)
115115
- You can also [store your private data](https://github.com/sobolevn/git-secret) inside a git repo

_notes/data-coding/Learn about cryptography.md renamed to _notes/data-coding/Learn cryptography.md

Lines changed: 4 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,11 +1,12 @@
11
---
2-
title: Learn about cryptography
2+
title: Learn cryptography
33
lang: en
44
category: data-coding
5-
tags: [learn]
5+
tags:
6+
- learn
67
season: summer
78
created: 2022-08-22
8-
updated: 2025-01-03
9+
updated: 2025-03-17
910
sources: https://missing.csail.mit.edu/2020/security/
1011
---
1112

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,4 @@
1+
2+
- [Custom Formats Builder](https://customformats.com/): for Microsoft Excel and Google Sheets
3+
- [`strftime()`](https://docs.python.org/3/library/datetime.html#strftime-and-strptime-format-codes): for date formatting, introduced by Unix via C
4+

_notes/data-coding/Use MD5 function to create unique IDs.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -5,15 +5,15 @@ category: data-coding
55
tags: [idea, sql]
66
season: spring
77
created: 2021-04-03
8-
updated: 2023-05-06
8+
updated: 2025-03-17
99
sources: https://blog.getdbt.com/one-analysts-guide-for-going-from-good-to-great/
1010
---
1111

1212
- I first encountered this function when trying to join two tables together using about eight separate fields. Not ideal.
1313
- The natural inclination is to create your own ID by simply concatenating a bunch of fields together. These columns are bad because they _kind of_ look like data but operate as an ID. It’s important to have a column whose sole function is to be a unique identifier for that row.
1414
- Instead, use [MD5 functions](https://docs.aws.amazon.com/redshift/latest/dg/r_MD5.html) to create unique IDs on AWS. IDs that are obviously IDs reduce confusion among junior analyst and end users by removing semi-comprehensible data strings throughout your database.
1515
- At any rate, be aware that MD5 is [no longer considered strong](https://valerieaurora.org/hash.html) as a hash function, should it contain sensitive information
16-
- More info on how this works is in [[Learn about cryptography]]
16+
- More info on how this works is in [[Learn cryptography]]
1717
```sql
1818
select md5('Amazon Redshift')
1919
# ---

0 commit comments

Comments
 (0)