# python_web-crawler **Repository Path**: zuikeai/python_web-crawler ## Basic Information - **Project Name**: python_web-crawler - **Description**: python爬虫项目 - **Primary Language**: Python - **License**: Not specified - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2018-01-04 - **Last Updated**: 2020-12-19 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # python_web-crawler > **个人简书地址:http://www.jianshu.com/u/c0d2d4bcfe9b** **python爬虫实战项目导航** > **doubanmovies(3 Types)** 豆瓣电影网站即将上映电影的爬取 douban_01.py (requests,BeautifulSoup(bs4)、lxml) douban_02.py (urllib2、HTMLParser) douban_03.py (requests,lxml(xpath)) > **model(1 Types)** 某模特网站模特照片爬取程序 model_01.py (requests,BeautifulSoup(bs4)、urllib、sys.setrecursionlimit、os) > **qiushibaike(1 Types)** python2爬虫实战——糗事百科 qiushibaike_01.py (urllib、urllib2、re、thread、time) > **taobaogirls(1 Types)** python2爬虫实战——淘宝女郎照片爬取 taobaogirls_01.py (pyspider、os) > **tieba(1 Types)** python2爬虫实战——贴吧帖子爬取 taobaogirls_01.py (urllib、urllib2、re) > **BSBDJ(1 Types)** 百思不得姐网站视频爬取程序 BSBDJ_01.py (urllib(urlretrieve)、urllib2、re)