TL;DR: For CPU-bound apps, increase workers and/or cores. For I/O-bound apps, use “pseudo-threads”.

Gunicorn is a Python WSGI HTTP server that usually sits between a reverse proxy (e.g., Nginx) or load balancer (e.g., AWS ELB) and a web application such as Django or Flask.

Gunicorn architecture

Gunicorn implements a UNIX pre-fork web server. Great, what does that mean? Gunicorn starts a single m