Python 简单网络爬虫 爬取图片

精选圈子榜单优站

编程综合

技术

20关注

编程技术记录、分享，记录你的编程生活点点滴滴！

Python 简单网络爬虫爬取图片

原理：

1.抓取网页内容

2.从网页内容读取图片链接

代码（详解见代码）：

import requests
import re
import time
# 获取到网页内容
r = requests.get('https://www.douyu.com') # 斗鱼
html=r.text
# 通过正则表达式获取网页内容的图片
images=re.findall(r'data-original="(.*?\.(jpg|png))"',html)
print(images)
# 通过遍历下载图片存储到本地
for image in images:
    s=image[0]
    index=s.rindex('/')
    # 睡眠一秒下载
    time.sleep(1)
    req=requests.get(s)
    # 将文件url文件最后一个/后面的字符作为文件名称
    with open('/Users/lyf/test/%s'%s[index+1:],'wb') as code:
        code.write(req.content)

效果

E56C2FF2-89AF-43EE-AAAC-51A2650CF1D7

若文章侵犯了您的权益，请联系我们进行处理。

python

2018-01-14
1757阅读

Python 简单网络爬虫 爬取图片

评论

作者

圈子热门

01.

02.

03.

04.

05.

06.

07.

08.

举报文章

Python 简单网络爬虫爬取图片