码迷,mamicode.com
首页 > 其他好文 > 详细

IP地址爬取

时间:2015-02-28 15:57:27      阅读:104      评论:0      收藏:0      [点我收藏+]

标签:

ip_spider.py= = =

#!/usr/bin/python
# coding: utf-8
import os
import sys
import requests
import re
import urllib

import sys
reload(sys)
sys.setdefaultencoding( "utf-8" )

def getUrl(urlIP):
    url = ‘http://www.123cha.com/ip/?q=%s‘ % urlIP
    r = requests.get(url)
    reg = r‘<td class="tg-data">(.+?.)</td>‘
    gre = re.compile(reg)
    number = re.findall(gre,r.text)
    print number[0]
    print number[2]
    fsock = open(‘ipaddress.txt‘, ‘a+‘)
    fsock.write("%s|%s\n" % (str(number[0]),str(number[2])))


if __name__ == ‘__main__‘:

    file_object = open(‘ipfile3‘)
    list_of_all_the_lines = file_object.readlines( )
    # print list_of_all_the_lines
    for dd in list_of_all_the_lines:
        getUrl(‘%s‘ % dd)

  

IP地址爬取

标签:

原文地址:http://www.cnblogs.com/firstrate/p/4305456.html

(0)
(0)
   
举报
评论 一句话评论(0
登录后才能评论!
© 2014 mamicode.com 版权所有  联系我们:gaon5@hotmail.com
迷上了代码!