# python_web-crawler

**Repository Path**: zuikeai/python_web-crawler

## Basic Information

- **Project Name**: python_web-crawler
- **Description**: python爬虫项目
- **Primary Language**: Python
- **License**: Not specified
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No

## Statistics

- **Stars**: 0
- **Forks**: 0
- **Created**: 2018-01-04
- **Last Updated**: 2020-12-19

## Categories & Tags

**Categories**: Uncategorized

**Tags**: None

## README

# python_web-crawler
	
> **个人简书地址：http://www.jianshu.com/u/c0d2d4bcfe9b**


 **python爬虫实战项目导航** 

	
> **doubanmovies（3 Types）**

	豆瓣电影网站即将上映电影的爬取
	douban_01.py	(requests,BeautifulSoup(bs4)、lxml)
	douban_02.py	(urllib2、HTMLParser)
	douban_03.py	(requests,lxml(xpath))
	

> **model（1 Types）**

	某模特网站模特照片爬取程序
	model_01.py		(requests,BeautifulSoup(bs4)、urllib、sys.setrecursionlimit、os)


> **qiushibaike（1 Types）**

	python2爬虫实战——糗事百科
	qiushibaike_01.py	(urllib、urllib2、re、thread、time)

> **taobaogirls（1 Types）**

	python2爬虫实战——淘宝女郎照片爬取
	taobaogirls_01.py	(pyspider、os)

> **tieba（1 Types）**

	python2爬虫实战——贴吧帖子爬取
	taobaogirls_01.py	(urllib、urllib2、re)

> **BSBDJ（1 Types）**

	百思不得姐网站视频爬取程序
	BSBDJ_01.py	(urllib(urlretrieve)、urllib2、re)