Limit S3 client connections when using virtual threads #6369

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Open

jorgee wants to merge 14 commits into master

from virtual-threads-s3-client

Open

Limit S3 client connections when using virtual threads #6369

jorgee wants to merge 14 commits into master from virtual-threads-s3-client

+126 −628

Conversation

jorgee

Copy link

Contributor

@jorgee jorgee commented Aug 28, 2025 •

edited

Loading

close #4995
In SDK v1, S3 tranfers are managed by the transfer manager and a pool of threads to the transfers with a sync client. In SDK v2, the transfer manager is using an async client and the pool of threads is just used for preliminar work but not the tranfers. However, previous steps in publish dir ( check if is a directory, etc.) are using the sync client producing the same timeout errors.

This PR make the changes to limit the resources when using virtual threads is S3 transfers. A semaphore with a permit equal to the client's max connections is set in the Netflow's S3 client. It avoids to perform too many concurrent calls in the S3 client.

Tested with a pipeline with 15 tasks that generate 1000 files of 25 MB each one. It generated timeout errors when using in both 25.04 (SDK v1 and master (SDK v2). They disappear with this PR.

@jorgee


 add semaphore for s3 client when using virtual threads

0ef27e5

Signed-off-by: jorgee <jorge.ejarque@seqera.io>

@netlify Netlify

Copy link

netlify bot commented Aug 28, 2025 •

edited

Loading

✅ Deploy Preview for nextflow-docs-staging canceled.

Name	Link
🔨 Latest commit	`7d952cd`
🔍 Latest deploy log	https://app.netlify.com/projects/nextflow-docs-staging/deploys/68f120a88308570008db8f9f

@bentsherman bentsherman self-requested a review

August 28, 2025 15:56

@bentsherman bentsherman added the storage/aws label

Aug 28, 2025

@bentsherman bentsherman changed the title ~~(削除) Limiting S3 sync client connections when using virtual threads (削除ここまで)~~ (追記) Limit S3 client connections when using virtual threads (追記ここまで)

Aug 28, 2025

bentsherman added 2 commits

August 28, 2025 12:13

@bentsherman


 Remove extraneous config options, cleanup

fab3aa5

Signed-off-by: Ben Sherman <bentshermann@gmail.com>

@bentsherman


 Fix compile error

46cfa21

Signed-off-by: Ben Sherman <bentshermann@gmail.com>

@bentsherman

Copy link

Member

bentsherman commented Aug 28, 2025

I have removed the config options we discussed and updated the docs to show the desired state.

The sync client should use the semaphore to limit virtual threads based on the maxConnections. All calls to the sync client are wrapped with runWithPermit to enforce this
The async client should limit itself internally based on either the maxConnections or target throughput. Neither of these settings are specified by default in AwsS3Config, so the default behavior should be that the async client uses a target throughput of 10 Gbps
The sync client uses a default maxConnections of 50 to ensure that it always has a connection limit

@bentsherman bentsherman added this to the 25.10 milestone

Aug 28, 2025

@jorgee

Copy link

Contributor Author

jorgee commented Sep 1, 2025

As commented last week, I got some OutOfMemory issues when using virtual threads with small heap sizes (<2GB). So, I have inspected a heap dump for 15K file uploads and as you can see in the following image, the main memory consumption comes from two parts:
image

The XML deserializer used internally by the AWS Sync client is storing the XMLInputFactory using thread locals, a instance of this is replicated in each thread. These objects are keeping the XML request' responses, so the retained size of these objects is more than 500MB in this case. For the case of the issue (~60K transfer), it could be 2GB.
The async request handling used in the trasnsfer manager client is also consuming a lot of heap space with large number of transfers. For instance, in the case of the following figure, it is doing a bucket-to-bucket copy but, as the size exceeds the multipart-threshold, so a copy is converted to several multi-part copy requests. For 15K copies, it is creating more than 50k async requests. Each request is using 12KB, so it is consuming 600MB. In the case of the issue (60K), it could consume another 2GB.

image

As they are internal to the AWS SDK implementation, I think we can not currently fix it. So, I think the easier option would be to limit the number of publish transfers, as it is done in the file porter. We could have the same issues for other providers. However, in case we don't want to limit it, other options that could work is moving the S3 sync to async as well as adding a limit of concurrent async operations submitted to the async client, to limit the number of AsyncHandlers.
@bentsherman, @pditommaso, what do you think?

@pditommaso

Copy link

Member

pditommaso commented Sep 1, 2025

not sure it's worth optimising for virtual threads until they are not property supported by the AWS sdk

@bentsherman

Copy link

Member

bentsherman commented Sep 2, 2025

@jorgee is it possible to use a sync client for S3 transfers? then we could apply the semaphore to it and ensure that there are no more than a few hundred concurrent requests by limiting aws.client.maxConnections.

Right now it seems that the async client is allocating thousands of request handlers even with a smaller max concurrency, is that right?

It seems like we should move towards sync clients anyway so that we can just control it with virtual threads + semaphore.

It will be nice if the AWS SDK can provide the target throughput functionality using virtual threads, but we also need to find some kind of solution since many customers have been asking for it, and we have no control over AWS timeline.

@jorgee

Copy link

Contributor Author

jorgee commented Sep 2, 2025 •

edited

Loading

The async client is mandatory for the S3 transfer manager in AWS SDKv2. We should manage multi-part stuff in Nextflow again, and I think in sdk v1 we also managed some transfers with S3 transfer manager. So we should also rewrite them to not use the transfer manager.

@bentsherman

Copy link

Member

bentsherman commented Sep 2, 2025

Well, I don't know if we should go that far. I doubt we'll be able to manage the transfers as well as the transfer manager.

I have an idea that I'd like to try. I'll push it if it works

@bentsherman


 Wrap S3 async client in sync client limited by semaphore

f099b8b

Signed-off-by: Ben Sherman <bentshermann@gmail.com>

@bentsherman

Copy link

Member

bentsherman commented Sep 2, 2025

I managed to wrap the S3 transfer manager in a synchronous interface. It just calls the async client and immediately waits for the result. I also wrapped these calls in the semaphore limiter so that the async client is controlled directly by aws.client.maxConnections.

This should ensure that the async client doesn't create too many requests when using virtual threads.

I also kept the target throughput setting so that users can control the maximum throughput, but it will still be limited by maxConnections.

If AWS ever improves the SDK to support virtual threads natively, we need only remove the wrapper class while the config options remain the same. This way we don't have to wait for AWS to give us a solution to customer issues.

@bentsherman

Copy link

Member

bentsherman commented Sep 2, 2025

@jorgee can you test the PR with your benchmark and see if the OOM errors are resolved?

@jorgee

Copy link

Contributor Author

jorgee commented Sep 2, 2025 •

edited

Loading

I will do. I will also try to test with this AWS SDK v2 preview version aws/aws-sdk-java-v2#6268

jorgee added 3 commits

September 16, 2025 12:11

@jorgee


 Add woodstox as default stax implementation

8a5c0d9

Signed-off-by: jorgee <jorge.ejarque@seqera.io>

@jorgee


 Modify thread pool manager to pass a ThreadFactory for virtual threads

8d97c51

Signed-off-by: jorgee <jorge.ejarque@seqera.io>

@jorgee


 fix javax.xml.stream.XMLInputFactory service loader file

4a7821b

Signed-off-by: jorgee <jorge.ejarque@seqera.io>

@jorgee

Copy link

Contributor Author

jorgee commented Sep 17, 2025

Some updates regarding Virtual threads.
1- The master implementation was not using Virtual Threads at least for publishDir. It was due to the CustomThreadFactory which was creating a platform thread per publishDir. I noticed it because I got this error when trying to transfer more than 30K files.
Server VM warning: INFO: os::commit_memory(0x000076957f9ff000, 16384, 0) failed; error='Not enough space' (errno=12) [259,030s][warning][os,thread] Failed to start thread "Unknown thread" - pthread_create failed (EAGAIN) for attributes: stacksize: 1024k, guardsize: 0k, detached.
This means that it was trying to create a pthread per publishDir thread. After the change to a Virtual thread factory, I can see 30K VirtualThread objects but not the platform threads.

2- From the heap memory usage, the proposed @bentsherman change was limiting the memory because CRT handlers but not the XMLInputFactory. I finally found a solution by changing the stax implementation to woodstox

So the current implementation has solved the memory issues for uploads and copies. For downloads, we just need to merge with PR #6402 because few big downloads can increase a lot the heap. @bentsherman, @pditommaso do you want to keep it in separate PRs or merge both in one?

@pditommaso

Copy link

Member

pditommaso commented Sep 17, 2025

@claude review

@pditommaso

Copy link

Member

pditommaso commented Sep 17, 2025

Smaller, well scope pr is better IMO

@claude Claude

Copy link

claude bot commented Sep 17, 2025 •

edited

Loading

Claude finished @pditommaso's task —— View job

PR Review: Limit S3 client connections when using virtual threads

Review Tasks:

Analyze S3Client wrapper and semaphore implementation
Review virtual thread factory changes
Examine woodstox stax implementation addition
Check configuration and documentation updates
Review deleted classes and their replacements
Provide overall assessment and recommendations

✅ Overall Assessment: Well-architected solution

This PR successfully addresses the S3 timeout issues with virtual threads through a thoughtful architectural approach. The key insight is wrapping the async S3 transfer manager in a synchronous interface while using semaphores to limit concurrency.

Key Strengths

🎯 Excellent Semaphore Implementation (`S3Client.java:86-107`)

Conditionally created only when Threads.useVirtual() is true
Uses maxConnections (default 50) as permit count - perfect balance
runWithPermit() method properly handles acquire/release with try-finally blocks
Correct interrupt handling with Thread.currentThread().interrupt()
Applied consistently across all S3 operations (200+ call sites)

🔧 Clean S3TransferManagerSync Wrapper

Simple but effective design:

Converts async operations to blocking via .completionFuture().get()
Proper exception propagation (InterruptedException, ExecutionException)
Minimal overhead while maintaining full functionality

🧵 Virtual Thread Factory Fix (`ThreadPoolManager.groovy:92-94`)

Replaced problematic CustomThreadFactory with proper virtual thread factory
Fixes the platform thread creation issue that was causing EAGAIN errors
Clean conditional logic: virtual threads when enabled, legacy pool otherwise

🚀 Memory Issue Resolution

Woodstox Integration: Added com.fasterxml.woodstox:woodstox-core:7.1.1 dependency
Service Loader Configuration: META-INF/services/javax.xml.stream.XMLInputFactory points to com.ctc.wstx.stax.WstxInputFactory
Addresses XMLInputFactory ThreadLocal memory leaks (500MB+ per test case)

Configuration & Documentation

✅ Simplified Configuration (`AwsS3Config.groovy`)

Removed complex virtual thread specific configs
Kept essential maxConnections and targetThroughputInGbps settings
Documentation clearly explains the new approach with virtual threads

📚 Updated Documentation (`aws-java-sdk-v2.md:20-25`)

Clear guidance on:

Using maxConnections for HTTP connection limits
Using targetThroughputInGbps for transfer concurrency (default 10 Gbps)
Virtual thread optimization recommendations

Deleted Classes Analysis

✅ Appropriate Removals

CustomThreadFactory: Replaced by proper virtual thread factory
PriorityThreadPool: No longer needed with virtual threads
S3UploadHelper: Functionality moved to transfer manager wrapper
Clean removal of 300+ lines of complex thread management code

Minor Suggestions

Error Message Enhancement: Consider more descriptive error messages in S3TransferManagerSync exception handling
Semaphore Permit Tuning: The TODO mentions potentially using more permits than maxConnections - consider documenting the reasoning for 1:1 ratio
Memory Monitoring: Consider adding debug logging for semaphore permit usage in high-concurrency scenarios

Technical Correctness

✅ Proper resource management (semaphore acquire/release)
✅ Exception handling maintains interrupt semantics
✅ Thread-safe implementation
✅ No deadlock risks identified
✅ Backwards compatible configuration approach

Recommendation: ✅ APPROVE

This PR demonstrates excellent engineering judgment in solving a complex concurrency problem. The semaphore-based limiting approach is elegant and the memory issues have been properly addressed with woodstox. The virtual thread factory fix resolves the platform thread exhaustion issue.

The solution is production-ready and should significantly improve S3 performance with virtual threads while avoiding the timeout and memory issues described in the issue.

@jorgee


 add a VirtualThreadFactoryBuilder class to allow using Java 17 withou...

e0370ed

...t virtual threads
Signed-off-by: jorgee <jorge.ejarque@seqera.io>

@jorgee jorgee marked this pull request as ready for review

September 17, 2025 18:28

@jorgee jorgee requested a review from a team as a code owner

September 17, 2025 18:28

cursor[bot]

This comment was marked as outdated.

@pditommaso

Copy link

Member

pditommaso commented Sep 17, 2025

The invasion of bot

@jorgee


 add runWithPermits in deleteObject

58a1548

Signed-off-by: jorgee <jorge.ejarque@seqera.io>

@jorgee

Copy link

Contributor Author

jorgee commented Sep 17, 2025

The deleteObject method isn't wrapped with runWithPermit(), unlike other S3 client operations in this class. This means delete operations bypass the semaphore-based connection limiting, which defeats the purpose of controlling concurrent S3 connections when using virtual threads.

good catch cursor! Fixed

cursor[bot]

This comment was marked as outdated.

@jorgee


 fix interrupted management

Signed-off-by: jorgee <jorge.ejarque@seqera.io>

christopher-hakkaart

christopher-hakkaart reviewed

Sep 18, 2025

View reviewed changes

Copy link

Collaborator

@christopher-hakkaart christopher-hakkaart left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added some suggestions. Overall the docs are great. These suggestions change the tense to make them active and present, but otherwise awesome.

docs/guides/aws-java-sdk-v2.md Outdated Show resolved Hide resolved

@jorgee


 Merge branch 'master' into virtual-threads-s3-client

96067f5

Signed-off-by: jorgee <jorge.ejarque@seqera.io>

@pditommaso pditommaso force-pushed the master branch from b7b4221 to c1114bc Compare

October 9, 2025 20:41

bentsherman

bentsherman reviewed

Oct 16, 2025

View reviewed changes

docs/reference/config.md Outdated

Comment on lines 179 to 182

`aws.client.minimumPartSize`

: :::{versionadded} 25.06.0-edge

:::

: The minimum part size used by the S3 transfer manager for multi-part uploads (default: `8 MB`).

: The minimum part size used for multi-part uploads to S3 (default: `8 MB`).

Copy link

Member

@bentsherman bentsherman Oct 16, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@jorgee do these multi-part settings apply only to uploads or also to downloads? I keep getting confused about this, it seems like they are used for both

Copy link

Contributor Author

@jorgee jorgee Oct 16, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

They are applied to uploads and copies

Copy link

Member

@bentsherman bentsherman Oct 16, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

But I see that this setting is being used for downloads as well, such as AwsS3Config::checkDownloadBufferParams() and ExtendedS3TransferManager::setDownloadBufferProperties() ?

Copy link

Contributor Author

@jorgee jorgee Oct 16, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ah its true. It is also used in downloads to setup the initial buffer size, that by default is 10 times the part size. So it is used in all transfers

Copy link

Member

@bentsherman bentsherman Oct 16, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also, I thought the multipart threshold was 5 GB but here it is 8 MB. I must be missing something

Copy link

Member

@bentsherman bentsherman Oct 16, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Because if the threshold is only 8 MB then that would mean that our benchmarks with 1 GB files were using multipart

bentsherman

bentsherman reviewed

Oct 16, 2025

View reviewed changes

plugins/nf-amazon/src/main/nextflow/cloud/aws/nio/S3Client.java

if( transferManager == null ) {

log.debug("Creating S3 transfer manager pool - max-treads={};", transferManagerThreads);

transferPool = ThreadPoolManager.create("S3TransferManager", transferManagerThreads);

transferPool = ThreadPoolManager.create("S3TransferManager");

Copy link

Member

@bentsherman bentsherman Oct 16, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@jorgee I am restoring the maxConcurrency and maxNativeMemory, and removing the semaphore from the S3 transfers. Do we still need to limit the transfer manager thread pool size here? I figured the async client already limits the number of concurrent transfers

Copy link

Contributor Author

@jorgee jorgee Oct 16, 2025 •

edited

Loading

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fine with restoring the maxConcurrency and maxNativeMemory, and we don't need to limit the transfer manager thread pool.
However I tried to remove the semaphore for s3 transfers when benchmarking the virtual threads. If you see the result, I got a heap space error when copying 1000 files of 1GB in the large instance. It is basically caused by the number of Native handlers as I saw in #6369 (comment) The benefit of removing the limit is very small and just for uploading small files. This is why I didn't remove the limit.
image

Copy link

Member

@bentsherman bentsherman Oct 16, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I see. But we also agreed that we should use separate limits for the S3 client vs the transfer manager right? So maybe I should just add a semaphore to the ExtendedS3TransferManager

Copy link

Member

@bentsherman bentsherman Oct 16, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also, I was concerned that putting a hard limit on the S3 transfer manager might bottleneck the case of performing many s3-to-s3 copies in parallel

So when you did the benchmark that showed 50% reduction in publishing time, that was with a limit of 50 concurrent transfers?

Copy link

Contributor Author

@jorgee jorgee Oct 16, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, it was with the default 50 concurrent transfers

Copy link

Member

@bentsherman bentsherman Oct 16, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Okay, maybe we can bump the default to 100 👀

bentsherman added 3 commits

October 16, 2025 10:25

@bentsherman


 Restore maxConcurrency and maxNativeMemory, remove semaphore from s3 ...

5b8163d

...transfers
Signed-off-by: Ben Sherman <bentshermann@gmail.com>

@bentsherman


 minor edits

781a2c8

Signed-off-by: Ben Sherman <bentshermann@gmail.com>

@bentsherman


 Fix build error

7d952cd

Signed-off-by: Ben Sherman <bentshermann@gmail.com>

Labels

storage/aws

4 participants

@jorgee @bentsherman @pditommaso @christopher-hakkaart

Limit S3 client connections when using virtual threads #6369

Are you sure you want to change the base?

Limit S3 client connections when using virtual threads #6369

Conversation

@jorgee jorgee commented Aug 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

netlify bot commented Aug 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for nextflow-docs-staging canceled.

Uh oh!

bentsherman commented Aug 28, 2025

Uh oh!

jorgee commented Sep 1, 2025

Uh oh!

pditommaso commented Sep 1, 2025

Uh oh!

bentsherman commented Sep 2, 2025

Uh oh!

jorgee commented Sep 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bentsherman commented Sep 2, 2025

Uh oh!

bentsherman commented Sep 2, 2025

Uh oh!

bentsherman commented Sep 2, 2025

Uh oh!

jorgee commented Sep 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jorgee commented Sep 17, 2025

Uh oh!

pditommaso commented Sep 17, 2025

Uh oh!

pditommaso commented Sep 17, 2025

Uh oh!

claude bot commented Sep 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Review: Limit S3 client connections when using virtual threads

✅ Overall Assessment: Well-architected solution

Key Strengths

🎯 Excellent Semaphore Implementation (S3Client.java:86-107)

🔧 Clean S3TransferManagerSync Wrapper

🧵 Virtual Thread Factory Fix (ThreadPoolManager.groovy:92-94)

🚀 Memory Issue Resolution

Configuration & Documentation

✅ Simplified Configuration (AwsS3Config.groovy)

📚 Updated Documentation (aws-java-sdk-v2.md:20-25)

Deleted Classes Analysis

✅ Appropriate Removals

Minor Suggestions

Technical Correctness

Recommendation: ✅ APPROVE

Uh oh!

This comment was marked as outdated.

Uh oh!

pditommaso commented Sep 17, 2025

Uh oh!

jorgee commented Sep 17, 2025

Uh oh!

This comment was marked as outdated.

Uh oh!

@christopher-hakkaart christopher-hakkaart left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

@bentsherman bentsherman Oct 16, 2025

Choose a reason for hiding this comment

Uh oh!

@jorgee jorgee Oct 16, 2025

Choose a reason for hiding this comment

Uh oh!

@bentsherman bentsherman Oct 16, 2025

Choose a reason for hiding this comment

Uh oh!

@jorgee jorgee Oct 16, 2025

Choose a reason for hiding this comment

@jorgee jorgee commented Aug 28, 2025 •

edited

Loading

netlify bot commented Aug 28, 2025 •

edited

Loading

jorgee commented Sep 2, 2025 •

edited

Loading

jorgee commented Sep 2, 2025 •

edited

Loading

claude bot commented Sep 17, 2025 •

edited

Loading

🎯 Excellent Semaphore Implementation (`S3Client.java:86-107`)

🧵 Virtual Thread Factory Fix (`ThreadPoolManager.groovy:92-94`)

✅ Simplified Configuration (`AwsS3Config.groovy`)

📚 Updated Documentation (`aws-java-sdk-v2.md:20-25`)

@jorgee jorgee Oct 16, 2025 •

edited

Loading