You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: sagemaker-core/src/sagemaker/core/shapes/shapes.py
+48-1Lines changed: 48 additions & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -6354,7 +6354,7 @@ class ProcessingS3Input(Base):
6354
6354
6355
6355
Attributes
6356
6356
----------------------
6357
-
s3_uri: The URI of the Amazon S3 prefix Amazon SageMaker downloads data required to run a processing job.
6357
+
s3_uri: The URI of the Amazon S3 prefix Amazon SageMaker downloads data required to run a processing job. Also accepts local file or directory paths, which will be automatically uploaded to S3 during job normalization.
6358
6358
local_path: The local path in your container where you want Amazon SageMaker to write input data to. LocalPath is an absolute path to the input data and must begin with /opt/ml/processing/. LocalPath is a required parameter when AppManaged is False (default).
6359
6359
s3_data_type: Whether you use an S3Prefix or a ManifestFile for the data type. If you choose S3Prefix, S3Uri identifies a key name prefix. Amazon SageMaker uses all objects with the specified key name prefix for the processing job. If you choose ManifestFile, S3Uri identifies an object that is a manifest file containing a list of object keys that you want Amazon SageMaker to use for the processing job.
6360
6360
s3_input_mode: Whether to use File or Pipe input mode. In File mode, Amazon SageMaker copies the data from the input source onto the local ML storage volume before starting your processing container. This is the most commonly used input mode. In Pipe mode, Amazon SageMaker streams input data from the source directly to your processing container into named pipes without using the ML storage volume.
@@ -6474,6 +6474,11 @@ class DatasetDefinition(Base):
# Default constants for ProcessingS3Input creation from source parameter
6478
+
_DEFAULT_S3_DATA_TYPE = "S3Prefix"
6479
+
_DEFAULT_S3_INPUT_MODE = "File"
6480
+
6481
+
6477
6482
class ProcessingInput(Base):
6478
6483
"""
6479
6484
ProcessingInput
@@ -6485,12 +6490,54 @@ class ProcessingInput(Base):
6485
6490
app_managed: When True, input operations such as data download are managed natively by the processing job application. When False (default), input operations are managed by Amazon SageMaker.
6486
6491
s3_input: Configuration for downloading input data from Amazon S3 into the processing container.
6487
6492
dataset_definition: Configuration for a Dataset Definition input.
6493
+
source: Convenience parameter that accepts a local file/directory path or S3 URI.
6494
+
When provided (and s3_input is not), a ProcessingS3Input is automatically created.
6495
+
Local paths will be uploaded to S3 during job normalization.
0 commit comments