.NET脏字过滤算法(转) - 走看看

zoukankan html css js c++ java

.NET脏字过滤算法(转)

来源：xingd.net - 博客园
　　但在我这里测试的时候，RegEx要快一倍左右。但是还是不太满意，应为我们网站上脏字过滤用的相当多，对效率已经有了一些影响，经过一番思考后，自己做了一个算法。在自己的机器上测试了一下，使用原文中的脏字库，0x19c的字符串长度，1000次循环，文本查找耗时1933.47ms，RegEx用了1216.719ms，而我的算法只用了244.125ms.

　　主要算法如代码所示
private static Dictionary dic = new Dictionary();
private static BitArray fastcheck = new BitArray(char.MaxValue);
static void Prepare()
{
string[] badwords = // read from file
foreach (string word in badwords)
{
if (!dic.ContainsKey(word))
{
dic.Add(word, null);
maxlength = Math.Max(maxlength, word.Length);
int value = word[0];
fastcheck[word[0]] = true;
}
}
}

　　使用的时候
int index = 0;
while (index ＜ target.Length)
{
if (!fastcheck[target[index]])
{
while (index ＜target.Length - 1 && !fastcheck[target[++index]]) ;
}
for (int j = 0; j ＜ Math.Min(maxlength, target.Length - index); j++)
{
string sub = target.Substring(index, j);
if (dic.ContainsKey(sub))
{
sb.Replace(sub, "***", index, j);
index += j;
break;
}
}
index++;
}

查看全文

相关阅读:
时间操作、时间戳
 滚动条大于120px时，判断pc端的情况下，导航条固定定位
 通过js中的useragrent来判断设备是pc端还是移动端，跳转不同的地址
 js构建函数，点击按钮显示div，再点击按钮或其他区域，隐藏div
localStorage用法总结
 轮播插件、原生js编写，弄懂这个，基本上各种轮播都可以自己写了
 （原）选择远比努力重要
 Java线程之间通信
 迪杰斯特拉(Java)
FFTW中文参考

原文地址：https://www.cnblogs.com/ami/p/906453.html

Copyright © 2011-2022 走看看