Filtering sensitive words in an article with a Python class

Latest recommended article published on 2022-12-15 12:30:48

代序春秋


Category: python

Article link: https://blog.csdn.net/geek64581/article/details/102750552


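This post shows how to filter sensitive words out of an article in Python using recursion, in two steps: building a sensitive word list and writing the filtering code.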


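After one filtering pass has replaced the sensitive words, the characters that remain can combine into a new sensitive word. Recursion handles this case: keep filtering until the output of a pass is identical to its input, and only then stop.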
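The stopping rule described above can be sketched as follows. This is a minimal illustration, not the author's class: the function name `filter_text`, its `replacement` parameter and the sample word are all assumptions made for the example. It also assumes the replacement string never contains a sensitive word itself, otherwise the recursion would not terminate.

```python
def filter_text(text, words, replacement="*"):
    """Replace every sensitive word, recursing until a pass changes nothing.

    Hypothetical helper for illustration; `replacement` must not itself
    contain a sensitive word, or the recursion never reaches a fixed point.
    """
    filtered = text
    for word in words:
        if word:  # an empty string would match at every position
            filtered = filtered.replace(word, replacement)
    if filtered == text:
        # The pass left the text unchanged, so nothing is left to filter.
        return filtered
    # Replacing may have joined neighbouring characters into a new match.
    return filter_text(filtered, words, replacement)


# Removing "bad" from "xbabaddy" first yields "xbady", which again
# contains "bad"; the second pass reduces it to "xy".
print(filter_text("xbabaddy", ["bad"], replacement=""))
```

A `while` loop with the same exit condition would avoid Python's recursion limit on very long chains of newly formed words; the recursive form is kept here because it mirrors the description.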

Step 1: Build a sensitive word list (a .txt text file)

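The layout of the word list file is not shown in the text, so the sketch below assumes the simplest one: a UTF-8 file with one sensitive word per line. The function name `load_sensitive_words` and the `encoding` parameter are hypothetical and not taken from the original code.

```python
from pathlib import Path


def load_sensitive_words(path, encoding="utf-8"):
    """Read a word list with one entry per line (assumed file layout)."""
    words = []
    for line in Path(path).read_text(encoding=encoding).splitlines():
        word = line.strip()
        # Skip blank lines and duplicates while keeping the file order.
        if word and word not in words:
            words.append(word)
    return words
```

If the file's encoding is not known in advance, it would have to be detected or passed in explicitly; the fixed `utf-8` default here is only a convenience for the example.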

Step 2: Write the code that filters sensitive words out of the article (recursive implementation)

# -*- coding: utf-8 -*-
# author 代序春秋
import os
import chardet

# Absolute path of the directory that contains this script
curr_dir = os.path.dirname(os.path.abspath(__file__))
# Build the word list path with os.path.join()
sensitive_word_stock_path = os.path.join(curr_dir, 'sensitive_word_stock.txt')


# Path where the sensitive word list is stored
# print(sensitive_w