SCP38: Low project throttling
What it does
Reports a setting module (e.g. settings.py) where:
CONCURRENT_REQUESTS_PER_DOMAINis higher than1.DOWNLOAD_DELAYis lower than1.0.
Why is this bad?
Your setting module defines the default settings for all your spiders.
Even if all your current spiders target websites that can handle low throttling, it is recommended [1] to set higher throttling values (i.e. lower concurrency, higher delay), in case future spiders target websites that are more sensitive to scraping load.
Additional, more granular adjustments can be made on a per-domain basis
(DOWNLOAD_SLOTS) or on a per-spider basis (e.g. in
custom_settings).
Example
CONCURRENT_REQUESTS_PER_DOMAIN = 8
DOWNLOAD_DELAY = 0.0
Instead use:
CONCURRENT_REQUESTS_PER_DOMAIN = 1
DOWNLOAD_DELAY = 1.0