performance - Efficient handling of long running HTTP connections in an nginx/gunicorn/django web architecture - Stack Overflow

Essentially, switch the gunicorn workers to an async worker class, so that connections are not dropped while a worker is simply waiting on long-running I/O. We would still need to work out how to avoid running out of memory (OOM).
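As a rough sketch of what the async-worker change might look like, here is a minimal gunicorn config file (gunicorn config files are plain Python). It assumes the `gevent` package is installed. All numeric values are illustrative guesses, not recommendations from the question. The caps on connections and requests per worker are one possible way to bound memory, not a guaranteed fix for OOM.

```python
# gunicorn.conf.py: hypothetical values, tune them for your own workload
import multiprocessing

# Async worker class: each worker handles many connections on greenlets,
# so a request blocked on slow I/O does not tie up the whole worker.
# Assumes gevent is installed alongside gunicorn.
worker_class = "gevent"

# Keep the process count modest; concurrency comes from greenlets.
workers = max(2, multiprocessing.cpu_count())

# Upper bound on simultaneous clients per worker. Bounding this is one way
# to cap how much memory in-flight requests can consume.
worker_connections = 200

# Restart each worker after roughly this many requests to limit the effect
# of slow memory leaks; the jitter staggers restarts across workers.
max_requests = 1000
max_requests_jitter = 100

# Worst-case number of in-flight connections for the whole server,
# useful for a back-of-the-envelope memory estimate.
max_in_flight = workers * worker_connections
```

This would be loaded with something like `gunicorn -c gunicorn.conf.py myproject.wsgi:application`, where `myproject` is a placeholder for the actual Django project name. Multiplying `max_in_flight` by an estimate of per-request memory gives a rough ceiling to compare against available RAM.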