Commit 74a0fc4
fix: split bullet-list emails Teams concatenates without separators
Teams delivers bullet lists in activity.text as flat HTML-entity-encoded
text with the <li> boundaries removed, so "a@adobe.com" and "b@adobe.com"
arrive glued as "a@adobe.comb@adobe.com". The email regex then backtracks
into nonsense (TLD=.comb, etc).
Decode common HTML entities, then insert a space after a known common TLD
when it's immediately followed by a letter (only possible when two
emails were concatenated). Trade-off: addresses in less common TLDs that
share a prefix with a common one (e.g. .coffee) lose correctness, but
this covers the Adobe/customer corporate domains we actually see.
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>1 parent 22f49c6 commit 74a0fc4
1 file changed
Lines changed: 24 additions & 7 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
2 | 2 | | |
3 | 3 | | |
4 | 4 | | |
5 | | - | |
6 | | - | |
7 | | - | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
8 | 24 | | |
9 | | - | |
| 25 | + | |
10 | 26 | | |
11 | | - | |
| 27 | + | |
| 28 | + | |
12 | 29 | | |
13 | 30 | | |
14 | 31 | | |
| |||
18 | 35 | | |
19 | 36 | | |
20 | 37 | | |
21 | | - | |
22 | | - | |
| 38 | + | |
| 39 | + | |
23 | 40 | | |
24 | 41 | | |
25 | 42 | | |
| |||
0 commit comments