python strip_tags 支持保留指定标签

时间：2015-05-06 17:03:44 阅读：143 评论：0 收藏：0 [点我收藏+]

标签：

#coding:utf-8

import re

def strip_tags(string, allowed_tags=‘‘):
  if allowed_tags != ‘‘:
    # Get a list of all allowed tag names.
    allowed_tags = allowed_tags.split(‘,‘)
    allowed_tags_pattern = [‘</?‘+allowed_tag+‘[^>]*>‘ for allowed_tag in allowed_tags]
    all_tags = re.findall(r‘<[^>]+>‘, string, re.I)
    not_allowed_tags = []
    tmp = 0
    for tag in all_tags:
        for pattern in allowed_tags_pattern:
            rs = re.match(pattern,tag)
            if rs:
                tmp += 1
            else:
                tmp += 0
        if not tmp:
            not_allowed_tags.append(tag)
        tmp = 0
    for not_allowed_tag in not_allowed_tags:
        string = re.sub(re.escape(not_allowed_tag), ‘‘,string)
    print not_allowed_tags
  else:
    # If no allowed tags, remove all.
    string = re.sub(r‘<[^>]*?>‘, ‘‘, string)
 
  return string

标签：

原文地址：http://www.cnblogs.com/bushe/p/4482114.html

踩

(0)

评论一句话评论（0）

分享档案

更多>

2021年07月29日 (22)
2021年07月28日 (40)
2021年07月27日 (32)
2021年07月26日 (79)
2021年07月23日 (29)
2021年07月22日 (30)
2021年07月21日 (42)
2021年07月20日 (16)
2021年07月19日 (90)
2021年07月16日 (35)

周排行