We found a few instances where we didn't have enough time to post-process our results. We should leave uploads in the queue for longer.
For some reason, I thought the queue timeout was much shorter than it actually is: our PROCESS_TIMEOUT is already 24 hours. So instead of extending it, I scaled up the number of worker processes that do the post-processing. This seems like a better solution.
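A rough sketch of why adding workers helps more than a longer timeout. The arrival rate and per-upload processing time below are made-up illustrative numbers, not measurements from our system, and `min_workers` / `backlog_after` are hypothetical helpers, not functions in our codebase. Only the 24-hour PROCESS_TIMEOUT comes from the description above.

```python
import math

# From the description above; everything else here is an assumption.
PROCESS_TIMEOUT_HOURS = 24


def min_workers(uploads_per_hour: float, minutes_per_upload: float) -> int:
    """Smallest worker count whose combined throughput keeps up with arrivals."""
    per_worker_per_hour = 60 / minutes_per_upload
    return math.ceil(uploads_per_hour / per_worker_per_hour)


def backlog_after(hours: float, uploads_per_hour: float,
                  minutes_per_upload: float, workers: int) -> float:
    """Uploads still waiting after `hours`, starting from an empty queue."""
    processed_per_hour = workers * 60 / minutes_per_upload
    return max(0.0, (uploads_per_hour - processed_per_hour) * hours)


if __name__ == "__main__":
    # Hypothetical load: 120 uploads/hour, 5 minutes of post-processing each.
    print(min_workers(120, 5))
    print(backlog_after(PROCESS_TIMEOUT_HOURS, 120, 5, workers=8))
    print(backlog_after(PROCESS_TIMEOUT_HOURS, 120, 5, workers=10))
```

If the workers cannot keep up with arrivals, the backlog keeps growing, so a longer timeout only delays the point where uploads expire. Once throughput matches the arrival rate, the existing 24-hour window should be plenty.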