Skip to content

Conversation

TomAugspurger
Copy link
Contributor

Description

Adds docs for how these workflows were added to the database.

@@ -36,6 +36,6 @@ collections:
# But we start at depth=2 thanks to the split, so we use a depth of 5 here.
min_depth: 5
max_depth: 5
chunk_length: 5000
chunk_length: 20000
Copy link
Contributor Author

@TomAugspurger TomAugspurger Jun 16, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Increasing the chunk size to decrease the Batch Task parallelism here: we managed to saturate the storage account, I think because this is using a thread pool to walk in parallel.

image

(That blue spike is 50x errors from the storage service)

@TomAugspurger TomAugspurger merged commit 6e8a0a3 into main Jun 17, 2024
@TomAugspurger TomAugspurger deleted the user/tom/doc/stac-geoparquet-update branch June 17, 2024 15:12
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant