You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Add change to s3 scan source to create partitions after each page of … (#6006)
* Add change to s3 scan source to create partitions after each page of objects listed
Signed-off-by: Taylor Gray <tylgry@amazon.com>
(cherry picked from commit bf74cb7)
* Renew the unit test to handle the two scans and seperate buckets
Signed-off-by: Siqi Ding <dingdd@amazon.com>
* Add global state ownership renewal during partition creation
Signed-off-by: Siqi Ding <dingdd@amazon.com>
* Add ownership renewal during S3 partition creation
Prevents timeout during long-running operations by renewing global state ownership every 3 minutes
Signed-off-by: Siqi Ding <dingdd@amazon.com>
* Remove the each time update in ScanObjectWorker; Add the lastGlobalOwnershipRenewal related renewGlobalOwnershipIfNeeded to ensures ownership is maintained both before starting and throughout the entire partition creation process
Signed-off-by: Siqi Ding <dingdd@amazon.com>
* Make renewGlobalStateOwnershipForPartitionCreation private
Signed-off-by: Siqi Ding <dingdd@amazon.com>
---------
Signed-off-by: Taylor Gray <tylgry@amazon.com>
Signed-off-by: Siqi Ding <dingdd@amazon.com>
Co-authored-by: Taylor Gray <tylgry@amazon.com>
* Should be called by the source when it has fully processed a given partition
68
74
* @throws org.opensearch.dataprepper.model.source.coordinator.exceptions.PartitionNotFoundException if the partition key could not be found in the distributed store
Copy file name to clipboardExpand all lines: data-prepper-core/src/main/java/org/opensearch/dataprepper/core/sourcecoordination/LeaseBasedSourceCoordinator.java
+32-1Lines changed: 32 additions & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -60,6 +60,8 @@ public class LeaseBasedSourceCoordinator<T> implements SourceCoordinator<T> {
Copy file name to clipboardExpand all lines: data-prepper-plugins/s3-source/src/main/java/org/opensearch/dataprepper/plugins/source/s3/S3ScanPartitionCreationSupplier.java
LOG.info("Running in folder_partitions mode at depth {}, found {} unique prefixes from {} objects", folderPartitioningOptions.getFolderDepth(), folderPartitions.size(), allPartitionIdentifiers.size());
163
-
164
-
returnnewArrayList<>(folderPartitions);
165
-
} else {
166
-
LOG.info("Returning partitions for {} S3 objects from bucket {}", allPartitionIdentifiers.size(), bucket);
LOG.info("Running in folder_partitions mode at depth {}, found {} unique prefixes from {} objects", folderPartitioningOptions.getFolderDepth(), folderPartitions.size(), objectPartitionIdentifiers.size());
Copy file name to clipboardExpand all lines: data-prepper-plugins/s3-source/src/main/java/org/opensearch/dataprepper/plugins/source/s3/ScanObjectWorker.java
+1-2Lines changed: 1 addition & 2 deletions
Original file line number
Diff line number
Diff line change
@@ -156,7 +156,7 @@ public ScanObjectWorker(final S3Client s3Client,
0 commit comments