就是这个 http://blog.csdn.net/lanbing510/article/details/45887075 运行$ python doubanSpider.py 后一直在下载,是什么意思额? /usr/local/lib/python2.7/dist-packages/bs4/init.py:166: UserWarning: No parser was explicitly specified, so I'm using the best available HTML parser for this system ("html.parser"). This usually isn't a problem, but if you run this code on another system, or in a different virtual environment, it may use a different parser and behave differently.
To get rid of this warning, change this:
BeautifulSoup([your markup])
to this:
BeautifulSoup([your markup], "html.parser")
markup_type=markup_type)) Downloading Information From Page 1 Downloading Information From Page 2 Downloading Information From Page 3 Downloading Information From Page 4 Downloading Information From Page 5 Downloading Information From Page 6 WARNING:root:Some characters could not be decoded, and were replaced with REPLACEMENT CHARACTER. Downloading Information From Page 7 Downloading Information From Page 8 Downloading Information From Page 9 Downloading Information From Page 10 Downloading Information From Page 11 Downloading Information From Page 12 Downloading Information From Page 13 Downloading Information From Page 14 Downloading Information From Page 15 Downloading Information From Page 16 Downloading Information From Page 17 Downloading Information From Page 18 Downloading Information From Page 19 Downloading Information From Page 20 Downloading Information From Page 21 Downloading Information From Page 22 Downloading Information From Page 23 Downloading Information From Page 24
1
woniu127 2016-07-02 19:28:09 +08:00
BeautifulSoup([your markup], "lxml")
|