2019-01-05 14:04:50 [csrc][scrapy.core.engine] DEBUG: Crawled (404) <GET http://www.csrc.gov.cn/pub/zjhpublic/G00306202/201806/t20180622_340238.htm> (referer: http://www.csrc.gov.cn/pub/newsite/xxpl/yxpl/index_16.html)
2019-01-05 14:04:51 [csrc][scrapy.spidermiddlewares.httperror] INFO: Ignoring response <404 http://www.csrc.gov.cn/pub/zjhpublic/G00306202/201806/t20180622_340238.htm>: HTTP status code is not handled or not allowed
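When I crawl this site, some requests come back as 404. Most pages are fetched without any problem; only a few individual URLs return 404. Why does this happen?
Start URL: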
http://www.csrc.gov.cn/pub/newsite/xxpl/yxpl/index.html
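To narrow this down, here is a minimal sketch (standard library only) that pulls each 404 URL and its referer out of log lines like the ones above, so the index page that produced the dead link can be opened and checked by hand. The name `find_404s` and the regular expression are my own assumptions for illustration; they are not part of Scrapy.

```python
import re

# Assumed pattern for Scrapy engine lines such as:
#   ... DEBUG: Crawled (404) <GET http://...> (referer: http://...)
CRAWLED_RE = re.compile(
    r"Crawled \((?P<status>\d{3})\) <GET (?P<url>\S+)> \(referer: (?P<referer>[^)]*)\)"
)


def find_404s(lines):
    """Return (url, referer) pairs for every crawled response with status 404."""
    hits = []
    for line in lines:
        m = CRAWLED_RE.search(line)
        if m and m.group("status") == "404":
            hits.append((m.group("url"), m.group("referer")))
    return hits


# The log line from this post, used as sample input.
sample = [
    "2019-01-05 14:04:50 [csrc][scrapy.core.engine] DEBUG: Crawled (404) "
    "<GET http://www.csrc.gov.cn/pub/zjhpublic/G00306202/201806/t20180622_340238.htm> "
    "(referer: http://www.csrc.gov.cn/pub/newsite/xxpl/yxpl/index_16.html)",
]

for url, referer in find_404s(sample):
    print(url, "<-", referer)
```

If the 404 responses need to be inspected inside the spider itself, Scrapy's `handle_httpstatus_list` spider attribute (for example `handle_httpstatus_list = [404]`) lets them reach the callback instead of being dropped by the HttpError middleware.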