Commit ed0d168
committed
perf(dict): 简拼查询改为精确长度匹配,消除短简拼卡顿
LookupAbbrev 旧实现用 PrefixCollect 收集整棵 code* 子树,短简拼
(如 sf)需扫描数千叶子 / 上万词条再排序截断,造成 50~100ms 卡顿,
且会把更长简拼词(sf→sfg 三字)当噪声召回,不符合"N 声母 = N 字"语义。
改为 ExactMatch 精确匹配,与 binformat DictReader.LookupAbbrev 行为对齐。
新增 TestWdatReader_LookupAbbrevExactNoPrefix 锁定精确匹配语义。1 parent 18efb8d commit ed0d168
2 files changed
Lines changed: 48 additions & 24 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
320 | 320 | | |
321 | 321 | | |
322 | 322 | | |
| 323 | + | |
| 324 | + | |
| 325 | + | |
| 326 | + | |
| 327 | + | |
| 328 | + | |
| 329 | + | |
| 330 | + | |
| 331 | + | |
| 332 | + | |
| 333 | + | |
| 334 | + | |
| 335 | + | |
| 336 | + | |
| 337 | + | |
| 338 | + | |
| 339 | + | |
| 340 | + | |
| 341 | + | |
| 342 | + | |
| 343 | + | |
| 344 | + | |
| 345 | + | |
| 346 | + | |
| 347 | + | |
| 348 | + | |
| 349 | + | |
| 350 | + | |
| 351 | + | |
| 352 | + | |
| 353 | + | |
| 354 | + | |
| 355 | + | |
| 356 | + | |
| 357 | + | |
| 358 | + | |
| 359 | + | |
323 | 360 | | |
324 | 361 | | |
325 | 362 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
285 | 285 | | |
286 | 286 | | |
287 | 287 | | |
288 | | - | |
| 288 | + | |
| 289 | + | |
| 290 | + | |
| 291 | + | |
| 292 | + | |
| 293 | + | |
| 294 | + | |
289 | 295 | | |
290 | 296 | | |
291 | 297 | | |
292 | 298 | | |
293 | 299 | | |
294 | | - | |
295 | | - | |
296 | | - | |
297 | | - | |
298 | | - | |
299 | | - | |
300 | | - | |
301 | | - | |
302 | | - | |
303 | | - | |
304 | | - | |
305 | | - | |
306 | | - | |
307 | | - | |
308 | | - | |
309 | | - | |
310 | | - | |
311 | | - | |
| 300 | + | |
| 301 | + | |
312 | 302 | | |
313 | 303 | | |
314 | 304 | | |
315 | | - | |
316 | | - | |
317 | | - | |
318 | | - | |
319 | | - | |
| 305 | + | |
| 306 | + | |
320 | 307 | | |
321 | 308 | | |
322 | 309 | | |
| |||
0 commit comments