Commit 4c7ab4a
committed
Address second review round: clearer docs, eager check in SqliteITarEntryReader
- data_prep.md: replace "consumed sequentially" with the precise constraint
("not used as auxiliary or mounted").
- SqliteIndexReader: expose has_sample_tables as a constructor-time attribute
(mirrors db_has_sample_parts); drop the per-method _check_samples_table guard.
- SqliteITarEntryReader: raise MissingSamplesTableError at __init__ when the
samples table is missing — fail fast at the boundary that actually requires it.
- Test updated to assert at SqliteITarEntryReader construction.
Signed-off-by: Pei Li <pei.li@kaiko.ai>1 parent 88245f2 commit 4c7ab4a
4 files changed
Lines changed: 21 additions & 42 deletions
File tree
- docs/source/basic
- src/megatron/energon/flavors/webdataset
- tests
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
688 | 688 | | |
689 | 689 | | |
690 | 690 | | |
691 | | - | |
| 691 | + | |
692 | 692 | | |
693 | 693 | | |
694 | 694 | | |
695 | 695 | | |
696 | 696 | | |
697 | | - | |
| 697 | + | |
698 | 698 | | |
699 | 699 | | |
700 | 700 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
397 | 397 | | |
398 | 398 | | |
399 | 399 | | |
400 | | - | |
| 400 | + | |
401 | 401 | | |
402 | 402 | | |
403 | 403 | | |
| |||
413 | 413 | | |
414 | 414 | | |
415 | 415 | | |
416 | | - | |
417 | | - | |
418 | | - | |
419 | | - | |
420 | | - | |
421 | | - | |
422 | | - | |
423 | | - | |
424 | | - | |
425 | | - | |
426 | | - | |
427 | | - | |
428 | | - | |
429 | 416 | | |
430 | 417 | | |
431 | 418 | | |
432 | | - | |
433 | | - | |
434 | | - | |
| 419 | + | |
435 | 420 | | |
436 | 421 | | |
437 | 422 | | |
| |||
466 | 451 | | |
467 | 452 | | |
468 | 453 | | |
469 | | - | |
470 | 454 | | |
471 | 455 | | |
472 | 456 | | |
| |||
479 | 463 | | |
480 | 464 | | |
481 | 465 | | |
482 | | - | |
483 | 466 | | |
484 | 467 | | |
485 | 468 | | |
| |||
505 | 488 | | |
506 | 489 | | |
507 | 490 | | |
508 | | - | |
509 | 491 | | |
510 | 492 | | |
511 | 493 | | |
| |||
525 | 507 | | |
526 | 508 | | |
527 | 509 | | |
528 | | - | |
529 | 510 | | |
530 | 511 | | |
531 | 512 | | |
532 | 513 | | |
533 | 514 | | |
534 | 515 | | |
535 | 516 | | |
536 | | - | |
537 | 517 | | |
538 | 518 | | |
539 | 519 | | |
| |||
549 | 529 | | |
550 | 530 | | |
551 | 531 | | |
552 | | - | |
553 | 532 | | |
554 | 533 | | |
555 | 534 | | |
| |||
579 | 558 | | |
580 | 559 | | |
581 | 560 | | |
582 | | - | |
583 | 561 | | |
584 | 562 | | |
585 | 563 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
485 | 485 | | |
486 | 486 | | |
487 | 487 | | |
| 488 | + | |
488 | 489 | | |
489 | 490 | | |
490 | 491 | | |
| |||
496 | 497 | | |
497 | 498 | | |
498 | 499 | | |
499 | | - | |
| 500 | + | |
| 501 | + | |
| 502 | + | |
| 503 | + | |
500 | 504 | | |
501 | 505 | | |
502 | 506 | | |
| |||
507 | 511 | | |
508 | 512 | | |
509 | 513 | | |
| 514 | + | |
| 515 | + | |
| 516 | + | |
| 517 | + | |
510 | 518 | | |
511 | 519 | | |
512 | 520 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
50 | 50 | | |
51 | 51 | | |
52 | 52 | | |
| 53 | + | |
53 | 54 | | |
54 | 55 | | |
55 | 56 | | |
| |||
1862 | 1863 | | |
1863 | 1864 | | |
1864 | 1865 | | |
1865 | | - | |
1866 | | - | |
| 1866 | + | |
1867 | 1867 | | |
1868 | 1868 | | |
1869 | 1869 | | |
1870 | 1870 | | |
1871 | 1871 | | |
1872 | 1872 | | |
1873 | | - | |
1874 | | - | |
1875 | | - | |
1876 | | - | |
1877 | | - | |
1878 | | - | |
1879 | | - | |
1880 | | - | |
1881 | | - | |
1882 | | - | |
1883 | | - | |
| 1873 | + | |
| 1874 | + | |
| 1875 | + | |
| 1876 | + | |
| 1877 | + | |
| 1878 | + | |
1884 | 1879 | | |
1885 | 1880 | | |
1886 | 1881 | | |
| |||
1894 | 1889 | | |
1895 | 1890 | | |
1896 | 1891 | | |
1897 | | - | |
1898 | | - | |
1899 | 1892 | | |
1900 | 1893 | | |
1901 | 1894 | | |
| |||
0 commit comments