从File.ReadAllBytes（byte ）中删除字节顺序标记

我有一个HTTPHandler，它读取一组CSS文件并将它们组合起来然后GZipping它们。但是，一些CSS文件包含一个字节顺序标记（由于TFS 2005自动合并中的一个错误），而在FireFox中，BOM被作为实际内容的一部分被读取，所以它搞砸了我的类名等。我怎么能剥离出BOM字符？有没有一种简单的方法可以在没有手动浏览字节数组的情况下查找“ï»¿”？

用样本扩展Jon的评论。

var name = GetFileName(); var bytes = System.IO.File.ReadAllBytes(name); System.IO.File.WriteAllBytes(name, bytes.Skip(3).ToArray());

扩展JaredPar示例以递归子目录：

 using System.Linq; using System.IO; namespace BomRemover { ///  /// Remove UTF-8 BOM (EF BB BF) of all *.php files in current & sub-directories. /// 
 class Program { private static void removeBoms(string filePattern, string directory) { foreach (string filename in Directory.GetFiles(directory, file Pattern)) { var bytes = System.IO.File.ReadAllBytes(filename); if(bytes.Length > 2 && bytes[0] == 0xEF && bytes[1] == 0xBB && bytes[2] == 0xBF) { System.IO.File.WriteAllBytes(filename, bytes.Skip(3).ToArray()); } } foreach (string subDirectory in Directory.GetDirectories(directory)) { removeBoms(filePattern, subDirectory); } } static void Main(string[] args) { string filePattern = "*.php"; string startDirectory = Directory.GetCurrentDirectory(); removeBoms(filePattern, startDirectory); } } }

在您尝试执行基本PHP下载文件时发现UTF-8 BOM损坏文件后，我需要C＃代码片段。

 var text = File.ReadAllText(args.SourceFileName); var streamWriter = new StreamWriter(args.DestFileName, args.Append, new UTF8Encoding(false)); streamWriter.Write(text); streamWriter.Close();

另一种方法，假设UTF-8为ASCII。

 File.WriteAllText(filename, File.ReadAllText(filename, Encoding.UTF8), Encoding.ASCII);

对于较大的文件，请使用以下代码; 记忆效率高！

 StreamReader sr = new StreamReader(path: @"", detectEncodingFromByteOrderMarks: true); StreamWriter sw = new StreamWriter(path: @"", append: false, encoding: new UnicodeEncoding(bigEndian: false, byteOrderMark: false)); var lineNumber = 0; while (!sr.EndOfStream) { sw.WriteLine(sr.ReadLine()); lineNumber += 1; if (lineNumber % 100000 == 0) Console.Write("\rLine# " + lineNumber.ToString("000000000000")); } sw.Flush(); sw.Close();

从File.ReadAllBytes（byte ）中删除字节顺序标记

XDocument：将XML保存到没有BOM的文件

通过StringBuilder将字节顺序标记添加到字符串

如何使用C＃从XmlTextWriter中删除BOM？

如何删除存在于某些文本中的任何UTF-8 BOM，而不是在某些文本的开头

XmlReader在UTF-8 BOM上中断