10个.NET中删除空白字符串的方法(2)

日期：2020-06-12 栏目：程序人生浏览：次

用代码实现循环，并使用StringBuilder类，通过依靠StringBuilder的内在优化来创建新的字符串。为了避免任何其他因素对本实施产生干扰，不调用其他的方法，并且通过缓存到本地变量避免访问类成员。最后通过设置StringBuilder.Length将缓冲区调整到合适大小。

// Code suggested by

public static string TrimAllWithLexerLoop(string s) { int length = s.Length; var buffer = new StringBuilder(s); var dstIdx = 0; for (int index = 0; index < s.Length; index++) { char ch = s[index]; switch (ch) { case '\u0020': case '\u00A0': case '\u1680': case '\u2000': case '\u2001': case '\u2002': case '\u2003': case '\u2004': case '\u2005': case '\u2006': case '\u2007': case '\u2008': case '\u2009': case '\u200A': case '\u202F': case '\u205F': case '\u3000': case '\u2028': case '\u2029': case '\u0009': case '\u000A': case '\u000B': case '\u000C': case '\u000D': case '\u0085': length--; continue; default: break; } buffer[dstIdx++] = ch; } buffer.Length = length; return buffer.ToString();; }

循环字符法

这种方法几乎和前面的循环交换法相同，不过它采用if语句来调用isWhiteSpace()，而不是乱七八糟的switch伎俩 :)。

public static string TrimAllWithLexerLoopCharIsWhitespce(string s) { int length = s.Length; var buffer = new StringBuilder(s); var dstIdx = 0; for (int index = 0; index < s.Length; index++) { char currentchar = s[index]; if (isWhiteSpace(currentchar)) length--; else buffer[dstIdx++] = currentchar; } buffer.Length = length; return buffer.ToString();; }

原地改变字符串法（不安全）

这种方法使用不安全的字符指针和指针运算来原地改变字符串。我不推荐这个方法，因为它打破了.NET框架在生产中的基本约定：字符串是不可变的。

public static unsafe string TrimAllWithStringInplace(string str) { fixed (char* pfixed = str) { char* dst = pfixed; for (char* p = pfixed; *p != 0; p++) if (!isWhiteSpace(*p)) *dst++ = *p; /*// reset the string size * ONLY IT DIDN'T WORK! A GARBAGE COLLECTION ACCESS VIOLATION OCCURRED AFTER USING IT * SO I HAD TO RESORT TO RETURN A NEW STRING INSTEAD, WITH ONLY THE PERTINENT BYTES * IT WOULD BE A LOT FASTER IF IT DID WORK THOUGH... Int32 len = (Int32)(dst - pfixed); Int32* pi = (Int32*)pfixed; pi[-1] = len; pfixed[len] = '\0';*/ return new string(pfixed, 0, (int)(dst - pfixed)); } }

原地改变字符串法V2（不安全）

这种方法几乎和前面那个相同，不过此处使用类似数组的指针访问。我很好奇，不知道这两种哪种存储访问会更快。

public static unsafe string TrimAllWithStringInplaceV2(string str) { var len = str.Length; fixed (char* pStr = str) { int dstIdx = 0; for (int i = 0; i < len; i++) if (!isWhiteSpace(pStr[i])) pStr[dstIdx++] = pStr[i]; // since the unsafe string length reset didn't work we need to resort to this slower compromise return new string(pStr, 0, dstIdx); } } String.Replace（“”，“”）

这种实现方法很天真，由于它只替换空格字符，所以它不使用空白的正确定义，因此会遗漏很多其他的空格字符。虽然它应该算是本文中最快的方法，但功能不及其他。

但如果你只需要去掉真正的空格字符，那就很难用纯.NET写出胜过string.Replace的代码。大多数字符串方法将回退到手动优化本地C ++代码。而String.Replace本身将用comstring.cpp调用C ++方法：

FCIMPL3(Object*, COMString::ReplaceString, StringObject* thisRefUNSAFE, StringObject* oldValueUNSAFE, StringObject* newValueUNSAFE)

下面是基准测试套件方法：

public static string TrimAllWithStringReplace(string str) { // This method is NOT functionaly equivalent to the others as it will only trim "spaces" // Whitespace comprises lots of other characters return str.Replace(" ", ""); }

以上就是.NET中删除空白字符串的10大方法，希望对大家的学习有所帮助。

您可能感兴趣的文章:

转载注明出处：https://www.heiqu.com/wjwpzp.html

10个.NET中删除空白字符串的方法(2)

相关推荐