Skip to content

Commit 1cb5bd6

Browse files
18-10-25
1 parent 025ac28 commit 1cb5bd6

4 files changed

Lines changed: 44 additions & 1 deletion

File tree

knowledge_base/AI/Evaluation.md

Lines changed: 16 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,16 @@
1+
identification problem
2+
- F1 score
3+
4+
translation problem
5+
- BLEU
6+
- CODEBLUE - https://arxiv.org/pdf/2009.10297
7+
- BERT
8+
- CODEBERT
9+
10+
Papers to read
11+
- https://arxiv.org/pdf/2301.09043
12+
- https://arxiv.org/pdf/2408.16498
13+
- https://arxiv.org/pdf/1906.04903
14+
- https://arxiv.org/pdf/2501.12934
15+
16+

knowledge_base/AI/Observability.md

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,5 @@
1+
# Observability
2+
Collect useful information from application for debugging or app insight purpose
3+
4+
[FastAPI Observability Setup](https://github.com/blueswen/fastapi-observability)
5+
- will prometheus, loki, tempo uses db to store data?

knowledge_base/AI/ThinkPad.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -88,4 +88,5 @@
8888
- not much useful
8989

9090
# explore
91-
- anthropic prompt improver and prompt evaluator
91+
- anthropic prompt improver and prompt evaluator
92+
- neo4j certification

knowledge_base/AI/Transformers.md

Lines changed: 21 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,21 @@
1+
2+
## Understanding Transformers
3+
4+
Softmax
5+
- Normalizing values so that adding them will result 1 and they will represent probablity
6+
- Each value is raised to e (ie e^x) to convert negatives and divided by the total
7+
8+
Temperature
9+
- the `t` variable in softmax function `e^(x/t)`
10+
- the higher the `t`, the smoother the distribution, so less related token will be sampled
11+
12+
13+
### Defeating Non determinism in LLMs
14+
[Blog](https://thinkingmachines.ai/blog/defeating-nondeterminism-in-llm-inference/)
15+
- What is non determinism?
16+
- If same prompt is given to LLM multiple times, all the responses will not be exactly similar
17+
- Why it happens?
18+
- Output Sampling - temperature parameter
19+
- Set to 0 to avoid it
20+
- concurrent operations on floating point numbers
21+
- Make each operation batch invariant

0 commit comments

Comments
 (0)