码迷,mamicode.com
首页 > 编程语言 > 详细

Python_ip代理

时间:2017-11-04 13:24:55      阅读:254      评论:0      收藏:0      [点我收藏+]

标签:.com   mysqldb   ldb   close   ack   range   int   www   gecko   

#encoding=utf8
import urllib
import urllib2
import sys
sys.path.append(‘D:/python/beautifulsoup‘)
sys.path.append(‘C:/Python27/Lib/site-packages‘)
from bs4 import BeautifulSoup
import MySQLdb
import re
print sys.getdefaultencoding()
User_Agent = ‘Mozilla/5.0 (Windows NT 6.3; WOW64; rv:43.0) Gecko/20100101 Firefox/43.0‘
header = {}
header[‘User-Agent‘] = User_Agent

url = ‘http://www.xicidaili.com/nn/1‘
req = urllib2.Request(url,headers=header)
res = urllib2.urlopen(req).read()

soup = BeautifulSoup(res)
ips = soup.findAll(‘tr‘)
#print ips
f = open("proxy.txt","w")

for x in range(1,len(ips)):
ip = ips[x]
tds = ip.findAll("td")
#print tds
ip_temp = tds[1].contents[0]+"\t"+tds[2].contents[0]+"\n"
print ip_temp
#print tds[2].contents[0]+"\t"+tds[3].contents[0]
f.write(ip_temp)
f.close()

Python_ip代理

标签:.com   mysqldb   ldb   close   ack   range   int   www   gecko   

原文地址:http://www.cnblogs.com/kongxc/p/7782838.html

(0)
(0)
   
举报
评论 一句话评论(0
登录后才能评论!
© 2014 mamicode.com 版权所有  联系我们:gaon5@hotmail.com
迷上了代码!