Skip to content

Commit abd3ffa

Browse files
committed
chore: update config for may 2026 webgraph
1 parent 8caa3db commit abd3ffa

1 file changed

Lines changed: 2 additions & 2 deletions

File tree

src/script/hostgraph/hostgraph_config.sh

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -12,7 +12,7 @@
1212
### saved as tuples <from_host, to_host>
1313

1414
# crawls to be processed
15-
CRAWLS=("CC-MAIN-2025-47" "CC-MAIN-2025-51" "CC-MAIN-2026-04")
15+
CRAWLS=("CC-MAIN-2026-12" "CC-MAIN-2026-17" "CC-MAIN-2026-21")
1616

1717
INPUT_BASE_URL="s3://commoncrawl/"
1818

@@ -42,7 +42,7 @@ S3A_OUTPUT_PREFIX=s3a://commoncrawl-webgraph
4242
################################################################################
4343
# construct a merged graph of multiple monthly crawls
4444

45-
MERGE_NAME=cc-main-2025-26-nov-dec-jan
45+
MERGE_NAME=cc-main-2026-mar-apr-may
4646

4747
# Naming convention should be the three months' crawls that are
4848
# used to generate this graph release. In the event of multiple months

0 commit comments

Comments
 (0)