HDDS-15179. Support Chunked Transfer without Content Length #10196
peterxcli wants to merge 4 commits into apache:master
Conversation
…quests Signed-off-by: peterxcli <peterxcli@gmail.com>
I notice the Jira ticket should be a subtask under https://issues.apache.org/jira/browse/HDDS-8423. Is there any way to fix it? I couldn't find the move option.
Co-authored-by: Doroszlai, Attila <6454655+adoroszlai@users.noreply.github.com>
adoroszlai
left a comment
Thanks @peterxcli for the patch.
Based on my understanding of S3, requests fall into one of two cases: they either include a Content-Length header or an x-amz-decoded-content-length header. That being said, I totally understand that this proposed change adds flexibility to the upload process, and for me it is ok.
OzoneBucket bucket = context.getBucket();
final String lengthHeader = getHeaders().getHeaderString(HttpHeaders.CONTENT_LENGTH);
long length = lengthHeader != null ? Long.parseLong(lengthHeader) : 0;
if (lengthHeader == null && body != null) {
Regarding S3, we should also consider the scenario where the Content-Length header is absent, but the x-amz-decoded-content-length header is provided. In this case, since we already know the exact size of the payload, we wouldn't need to manually calculate the upload length. See https://docs.aws.amazon.com/AmazonS3/latest/API/sigv4-streaming.html
For all requests, you must include the x-amz-decoded-content-length header, specifying the size of the object in bytes.
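For illustration, a minimal sketch (not the actual S3G code) of resolving the effective payload length with that fallback could look like the block below; the header-name string literal and the -1 "unknown" sentinel are assumptions for the example:

```java
import javax.ws.rs.core.HttpHeaders;

public final class PayloadLengthResolver {
  // Hypothetical helper: prefer Content-Length, otherwise fall back to the
  // x-amz-decoded-content-length sent with aws-chunked (SigV4 streaming)
  // uploads, and only report "unknown" (-1) when neither header is present.
  public static long resolve(HttpHeaders headers) {
    String contentLength = headers.getHeaderString(HttpHeaders.CONTENT_LENGTH);
    if (contentLength != null) {
      return Long.parseLong(contentLength);
    }
    String decodedLength =
        headers.getHeaderString("x-amz-decoded-content-length");
    if (decodedLength != null) {
      return Long.parseLong(decodedLength);
    }
    return -1; // length genuinely unknown; only then spool or peek the body
  }
}
```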
Signed-off-by: peterxcli <peterxcli@gmail.com>
Signed-off-by: peterxcli <peterxcli@gmail.com>
chungen0126
left a comment
+1 LGTM. Let's wait for the CI to pass.
if (lengthHeader == null && body != null && !hasMultiChunksUpload) {
  spooledBody = new FileBackedOutputStream(32);
  length = IOUtils.copyLarge(body, spooledBody, new byte[getIOBufferSize(0)]);
  body = spooledBody.asByteSource().openStream();
  hasCalculatedLength = true;
}
Is this going to copy the whole object into a temporary file on the S3G machine? In other words, is S3G required to allocate some disk space for this length calculation?
If so, this might be problematic: many clusters (including mine) currently assume that S3G has no variable disk requirement, so this would require some kind of persistent volume for S3G when deployed on K8s. Additionally, writing the whole object to a file and then re-reading it incurs extra disk IO.
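For context on that concern, here is a rough, self-contained illustration (not the PR's code) of what the spooling step does: Guava's FileBackedOutputStream keeps at most the given threshold (32 bytes here) in memory and spills everything beyond that to a temp file on the S3G host, so essentially every non-trivial object hits local disk.

```java
import com.google.common.io.FileBackedOutputStream;
import org.apache.commons.io.IOUtils;

import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;

public final class SpoolDemo {
  public static void main(String[] args) throws Exception {
    InputStream body = new ByteArrayInputStream(
        "payload larger than the 32-byte threshold goes to a temp file"
            .getBytes(StandardCharsets.UTF_8));

    FileBackedOutputStream spooled = new FileBackedOutputStream(32);
    long length = IOUtils.copyLarge(body, spooled);        // counts bytes while copying
    InputStream replay = spooled.asByteSource().openStream(); // re-read, now from the temp file

    System.out.println("calculated length = " + length);
    replay.close();
    spooled.reset();  // releases the buffer and deletes the backing temp file
  }
}
```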
if (canCreateDirectory &&
    (length == 0 || hasAmzDecodedLengthZero) &&
    hasKnownZeroLength &&
    StringUtils.endsWith(keyPath, "/")
From what I see, the length is only needed to check whether it is 0 or not. In that case, is it possible to simply check whether the first byte exists?
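A hypothetical sketch of that idea, using PushbackInputStream to peek a single byte and then restore it so the body can still be streamed to the upload unchanged:

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.PushbackInputStream;

public final class EmptyBodyCheck {
  // Peek one byte to decide whether the payload is empty, then push it back
  // so the returned stream still yields the full body for the real upload.
  public static InputStream peek(InputStream body, boolean[] isEmpty)
      throws IOException {
    PushbackInputStream pushback = new PushbackInputStream(body, 1);
    int first = pushback.read();
    isEmpty[0] = (first == -1);
    if (first != -1) {
      pushback.unread(first);
    }
    return pushback;
  }
}
```

One caveat with this approach: for aws-chunked requests the raw body carries chunk framing, so it is non-empty even for a zero-byte object, and the peek would have to happen after decoding.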
Thanks for the review, let me think about this.
What changes were proposed in this pull request?
Currently, a chunked transfer fails if we don't specify the content length, with the following error:
https://github.com/ceph/s3-tests/blob/fb8b73092bb1dd8db829f1205a9e52e73bf9a232/s3tests/functional/test_s3.py#L1589-L1599
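For illustration, a minimal Java reproduction of such a request could look like the sketch below; the endpoint, bucket, and key are placeholders, and a real request would also need SigV4 authentication headers:

```java
import java.io.OutputStream;
import java.net.HttpURLConnection;
import java.net.URL;
import java.nio.charset.StandardCharsets;

public final class ChunkedPutRepro {
  public static void main(String[] args) throws Exception {
    // Placeholder S3G endpoint, bucket, and key (no auth headers for brevity).
    URL url = new URL("http://localhost:9878/bucket1/key1");
    HttpURLConnection conn = (HttpURLConnection) url.openConnection();
    conn.setDoOutput(true);
    conn.setRequestMethod("PUT");
    conn.setChunkedStreamingMode(0);  // Transfer-Encoding: chunked, no Content-Length
    try (OutputStream out = conn.getOutputStream()) {
      out.write("hello chunked world".getBytes(StandardCharsets.UTF_8));
    }
    System.out.println("HTTP status: " + conn.getResponseCode());
  }
}
```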
What is the link to the Apache JIRA
https://issues.apache.org/jira/browse/HDDS-15179
How was this patch tested?
(Please explain how this patch was tested. Ex: unit tests, manual tests, workflow run on the fork git repo.)
(If this patch involves UI changes, please attach a screenshot; otherwise, remove this.)