python截取搜索引擎关键词

时间：2014-11-02 22:27:04 阅读：221 评论：0 收藏：0 [点我收藏+]

标签：style blog http color ar os 使用 for sp

这段代码是自己学了python的基本语法之后，参考一个网上视频写的代码，功能是截取搜索引擎360的关键词。

代码：

 1 #!/usr/bin/python
 2 #encoding:utf-8
 3 
 4 import urllib
 5 import urllib2
 6 import re
 7 import time
 8 from random import choice
 9 
10 ipList = [‘1.9.189.65:3128‘, ‘27.24.158.130:80‘, ‘27.24.158.154:80‘]
11 
12 listKeyWords = ["集团", "科技"]
13 for item in listKeyWords:
14     ip = choice(ipList)
15     gjc = urllib.quote(item)
16     url = "http://sug.so.360.cn/suggest?callback=suggest_so&encodein=utf-8&encodeout=utf-8&word=" + gjc
17     headers = {
18         "GET":url,
19         "Host":"sug.so.360.cn", 
20         "Referer":"http://www.so.com/", 
21         "User-Agent":"Mozilla/5.0 (X11; Linux i686) AppleWebKit/537.36 (KHTML, like Gecko) Ubuntu Chromium/37.0.2062.120 Chrome/37.0.2062.120 Safari/537.36"
22     }
23 
24     #proxy_support = urllib2.ProxyHandler({"http":"http://"+ip})
25 
26     #opener = urllib2.build_opener(proxy_support)
27     #urllib2.install_opener(opener)
28     req = urllib2.Request(url)
29 
30     for key in headers:
31         req.add_header(key, headers[key])
32 
33     html = urllib2.urlopen(req).read()
34     print html
35 
36     ss = re.findall("\"(.*?)\"", html)
37     for item in ss:
38         print item
39         
40     time.sleep(3)

主要使用了python自i带的几个库，用法可以查看帮助文档。

python截取搜索引擎关键词

标签：style blog http color ar os 使用 for sp

原文地址：http://www.cnblogs.com/lit10050528/p/4069980.html

踩

(0)

评论一句话评论（0）

分享档案

更多>

2021年07月29日 (22)
2021年07月28日 (40)
2021年07月27日 (32)
2021年07月26日 (79)
2021年07月23日 (29)
2021年07月22日 (30)
2021年07月21日 (42)
2021年07月20日 (16)
2021年07月19日 (90)
2021年07月16日 (35)

周排行