Is this hardcoded or is there a parameter to tweak?
Where can I watch these numbers?
How can I double the number of crawling slots and the timeout?
I can see tons of timeout errors in the recjected URLS list, although they can be reached and I do not think that all of these have “I do not want to be crawled automatically” restrictions implemented.
(although this cloudflare garbage is getting more and more popular)
If this is the reason for the poor performance, shouldn’t show the “Network Access” a full workload at the right side (which it only does during GET robots.txt)?
Time | URL | Fail-Reason |
---|---|---|
2020/12/26 17:43:48 | http://www.www.gasthaus-rosengarten.ch/ | TEMPORARY_NETWORK_FAILURE cannot load: load error - Client can’t execute: www.www.gasthaus-rosengarten.ch duration=144 for url http://www.www.gasthaus-rosengarten.ch/ |
2020/12/26 17:43:48 | https://www.chin-min.ch/ | TEMPORARY_NETWORK_FAILURE cannot load: load error - Client can’t execute: www.chin-min.ch duration=155 for url https://www.chin-min.ch/ |
2020/12/26 17:43:48 | https://www.kulturgut.ch/ | TEMPORARY_NETWORK_FAILURE cannot load: load error - CRAWLER Redirect of URL=https://www.kulturgut.ch/ to https://www.denkmal.ch/ placed on crawler queue for double-check |
2020/12/26 17:43:47 | https://www.schoenzeit.webstores.ch/robots.txt | TEMPORARY_NETWORK_FAILURE no response body (http return code = 404) |
2020/12/26 17:43:47 | http://cajon.ch/ | TEMPORARY_NETWORK_FAILURE cannot load: load error - CRAWLER Redirect of URL=http://cajon.ch/ to https://cajon.ch/ placed on crawler queue for double-check |
2020/12/26 17:43:47 | https://www.dalucia.ch/ | TEMPORARY_NETWORK_FAILURE cannot load: load error - Client can’t execute: www.dalucia.ch duration=356 for url https://www.dalucia.ch/ |
2020/12/26 17:43:47 | https://www.sineq.ch/ | TEMPORARY_NETWORK_FAILURE cannot load: load error - Client can’t execute: www.sineq.ch duration=468 for url https://www.sineq.ch/ |
2020/12/26 17:43:47 | https://www.schoenshop.ch/ | TEMPORARY_NETWORK_FAILURE cannot load: load error - CRAWLER Redirect of URL=https://www.schoenshop.ch/ to https://www.schoenzeit.webstores.ch/ placed on crawler queue for double-check |
2020/12/26 17:43:46 | https://www.heime-consulting.ch/ | TEMPORARY_NETWORK_FAILURE cannot load: load error - CRAWLER Redirect of URL=https://www.heime-consulting.ch/ to https://www.heime-consulting.ch/home.html placed on crawler queue for double-check |
2020/12/26 17:43:46 | http://www.www.telefonmonteur.ch/ | TEMPORARY_NETWORK_FAILURE cannot load: load error - Client can’t execute: www.www.telefonmonteur.ch duration=135 for url http://www.www.telefonmonteur.ch/ |
2020/12/26 17:43:46 | https://suli.ch/ | TEMPORARY_NETWORK_FAILURE cannot load: load error - CRAWLER Redirect of URL=https://suli.ch/ to https://www.suli.ch/ placed on crawler queue for double-check |
2020/12/26 17:43:46 | https://www.alpsu.ch/robots.txt | TEMPORARY_NETWORK_FAILURE no response body (http return code = 404) |
2020/12/26 17:43:46 | https://citywettingen.ch/ | TEMPORARY_NETWORK_FAILURE cannot load: load error - CRAWLER Redirect of URL=https://citywettingen.ch/ to https://www.citywettingen.ch/ placed on crawler queue for double-check |
2020/12/26 17:43:46 | http://www.alpsu.ch/ | TEMPORARY_NETWORK_FAILURE cannot load: load error - CRAWLER Redirect of URL=http://www.alpsu.ch/ to https://www.alpsu.ch/ placed on crawler queue for double-check |
2020/12/26 17:43:46 | https://www.tdcag.ch/robots.txt | TEMPORARY_NETWORK_FAILURE no response body (http return code = 404) |
2020/12/26 17:43:46 | https://stattboden-riet.ch/ | TEMPORARY_NETWORK_FAILURE cannot load: load error - Client can’t execute: stattboden-riet.ch duration=73 for url https://stattboden-riet.ch/ |
2020/12/26 17:43:46 | https://www.lenzerheide2020.ch/ | TEMPORARY_NETWORK_FAILURE cannot load: load error - CRAWLER Redirect of URL=https://www.lenzerheide2020.ch/ to https://www.biathlon-lenzerheide.swiss/ placed on crawler queue for double-check |
2020/12/26 17:43:45 | https://www.reklamegrafik.ch/ | TEMPORARY_NETWORK_FAILURE cannot load: load error - CRAWLER Redirect of URL=https://www.reklamegrafik.ch/ to http://www.artatelier.ch/ placed on crawler queue for double-check |
2020/12/26 17:43:45 | https://www.etude-avocat-belhocine.ch/ | TEMPORARY_NETWORK_FAILURE cannot load: load error - Client can’t execute: www.etude-avocat-belhocine.ch duration=464 for url https://www.etude-avocat-belhocine.ch/ |
2020/12/26 17:43:45 | http://tdcag.ch/ | TEMPORARY_NETWORK_FAILURE cannot load: load error - CRAWLER Redirect of URL=http://tdcag.ch/ to https://www.tdcag.ch/ placed on crawler queue for double-check |
2020/12/26 17:43:44 | https://suxessm |