Commit b55cc8e
authored
fix: batch FAISS embedding to avoid 429 rate limit exhaustion (#289)
Split document embedding into 100-chunk batches with a 1s delay
between batches so a 429 only retries one batch (~1 API call) rather
than restarting FAISS.from_documents from scratch (~87 calls). Also
raise retry wait times from max 120s to max 600s to give the quota
time to reset before the next attempt.
Signed-off-by: Jack Luar <jluar@precisioninno.com>1 parent 8a1258a commit b55cc8e
1 file changed
Lines changed: 11 additions & 3 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | 1 | | |
| 2 | + | |
2 | 3 | | |
3 | 4 | | |
4 | 5 | | |
| |||
74 | 75 | | |
75 | 76 | | |
76 | 77 | | |
77 | | - | |
| 78 | + | |
78 | 79 | | |
79 | 80 | | |
80 | 81 | | |
81 | 82 | | |
82 | 83 | | |
83 | | - | |
| 84 | + | |
84 | 85 | | |
85 | 86 | | |
86 | 87 | | |
| |||
90 | 91 | | |
91 | 92 | | |
92 | 93 | | |
| 94 | + | |
| 95 | + | |
| 96 | + | |
| 97 | + | |
| 98 | + | |
| 99 | + | |
| 100 | + | |
93 | 101 | | |
94 | 102 | | |
95 | 103 | | |
| |||
229 | 237 | | |
230 | 238 | | |
231 | 239 | | |
232 | | - | |
| 240 | + | |
233 | 241 | | |
234 | 242 | | |
235 | 243 | | |
| |||
0 commit comments