Thanks for checking up (I'm working with Josh on this problem). It seems there isn't a built-in process for this kind of use case currently, and that the best process right now is to handle our own bundling and threading in the DoFn. If you had any other suggestions, or anything to keep in mind in doing this, let us know!

Our team has pipelines that make external network calls. These pipelines are currently super slow, and our hypothesis is that they are slow because we are not threading the network calls. The GitHub issue below provides some discussion around this:

In Beam 1.0, there was IntraBundleParallelization, which helped with this. However, it was removed because it didn't comply with a few Beam paradigms.

Questions going forward:

What is advised for jobs that make blocking network calls? It seems that bundling the elements into groups of size X before passing them to the DoFn, and managing the threading within the function, might work. Thoughts?
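
To make the idea concrete, here is a rough sketch of what we have in mind, using the Python SDK. It is only an illustration under our own assumptions, not an established Beam pattern: the class name ThreadedCallFn, the call argument, and the max_workers default are all hypothetical. The DoFn receives a whole batch as one element, fans the blocking calls out to a thread pool, and emits the results from the calling thread before process() finishes. The import fallback is there only so the sketch runs without Beam installed.

```python
from concurrent.futures import ThreadPoolExecutor

try:
    import apache_beam as beam
    _DoFnBase = beam.DoFn
except ImportError:  # fallback so the sketch also runs without Beam installed
    _DoFnBase = object


class ThreadedCallFn(_DoFnBase):
    """Hypothetical DoFn: runs one blocking call per element of a batch on a thread pool."""

    def __init__(self, call, max_workers=8):
        super().__init__()
        self._call = call                # blocking function, e.g. an HTTP request (assumed)
        self._max_workers = max_workers  # pool size is a guess; tune for the service
        self._pool = None                # created in setup() so the DoFn stays picklable

    def setup(self):
        # One pool per DoFn instance, reused across batches.
        self._pool = ThreadPoolExecutor(max_workers=self._max_workers)

    def process(self, batch):
        # batch is a list of elements produced by an upstream batching step.
        # map() runs the calls concurrently and returns results in input order;
        # an exception raised by any call is re-raised here, failing the batch.
        yield from self._pool.map(self._call, batch)

    def teardown(self):
        if self._pool is not None:
            self._pool.shutdown(wait=True)
            self._pool = None
```

In a pipeline we would expect to feed this from a batching transform, for example `beam.BatchElements(min_batch_size=50, max_batch_size=50)` followed by `beam.ParDo(ThreadedCallFn(fetch))`, where fetch is a placeholder for our network call and 50 is an arbitrary batch size. Because results are only emitted inside process(), no work outlives the call that started it, which we assume is what keeps this within the bundle model.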