C＃：使用System.Speech命名空间将WAV文件转录为文本（语音到文本）

如何使用.NET语音命名空间类将WAV文件中的音频转换为文本格式，我可以在屏幕上显示或保存到文件？

我正在寻找一些教程样本。

UPDATE

在这里找到了代码示例。但是当我尝试它时会得到不正确的结果。下面是我采用的vb代码示例。（实际上我不介意lang，只要它的vb / c＃…）。它没有给我正确的结果。我假设如果我们把正确的语法 – 即我们在录音中所期望的单词 – 我们应该得到它的文本输出。首先，我尝试了调用中的示例单词。它有时只打印那个（一个）字而不是其他字。然后我尝试了一些我们在录音中完全没有想到的词……不幸的是它也打印出来…… 🙁

Imports System Imports System.Speech.Recognition Public Class Form1 Dim WithEvents sre As SpeechRecognitionEngine Private Sub btnLiterate_Click(ByVal sender As System.Object, ByVal e As System.EventArgs) Handles btnLiterate.Click If TextBox1.Text.Trim.Length = 0 Then Exit Sub sre.SetInputToWaveFile(TextBox1.Text) Dim r As RecognitionResult r = sre.Recognize() If r Is Nothing Then TextBox2.Text = "Could not fetch result" Return End If TextBox2.Text = r.Text End Sub Private Sub Button1_Click(ByVal sender As System.Object, ByVal e As System.EventArgs) Handles Button1.Click TextBox1.Text = String.Empty Dim dr As DialogResult dr = OpenFileDialog1.ShowDialog() If dr = Windows.Forms.DialogResult.OK Then If Not OpenFileDialog1.FileName.Contains("wav") Then MessageBox.Show("Incorrect file") Else TextBox1.Text = OpenFileDialog1.FileName End If End If End Sub Public Sub New() ' This call is required by the Windows Form Designer. InitializeComponent() sre = New SpeechRecognitionEngine() End Sub Private Sub sre_LoadGrammarCompleted(ByVal sender As Object, ByVal e As System.Speech.Recognition.LoadGrammarCompletedEventArgs) Handles sre.LoadGrammarCompleted End Sub Private Sub sre_SpeechHypothesized(ByVal sender As Object, ByVal e As System.Speech.Recognition.SpeechHypothesizedEventArgs) Handles sre.SpeechHypothesized System.Diagnostics.Debug.Print(e.Result.Text) End Sub Private Sub sre_SpeechRecognitionRejected(ByVal sender As Object, ByVal e As System.Speech.Recognition.SpeechRecognitionRejectedEventArgs) Handles sre.SpeechRecognitionRejected System.Diagnostics.Debug.Print("Rejected: " & e.Result.Text) End Sub Private Sub sre_SpeechRecognized(ByVal sender As Object, ByVal e As System.Speech.Recognition.SpeechRecognizedEventArgs) Handles sre.SpeechRecognized System.Diagnostics.Debug.Print(e.Result.Text) End Sub Private Sub Form1_Load(ByVal sender As Object, ByVal e As System.EventArgs) Handles Me.Load Dim words As String() = New String() {"triskaidekaphobia"} Dim c As New Choices(words) Dim grmb As New GrammarBuilder(c) Dim grm As Grammar = New Grammar(grmb) sre.LoadGrammar(grm) End Sub End Class

更新（11月28日之后）

找到了一种加载默认语法的方法。它是这样的：

 sre.LoadGrammar(New DictationGrammar)

这里还有问题。认可并不准确。输出是垃圾。对于6分钟的文件，它可能提供5-6个字的文本，与语音文件完全无关。

System.Speech中的类用于文本到语音（主要是可访问性function）。

您正在寻找语音识别。自.Net 3.0以来，可以使用System.Speech.Recognition命名空间。它使用Windows桌面语音引擎。这可能会让你开始，但我想有更好的引擎。

语音识别非常复杂，很难做到正确，还有一些商业产品可用。

我意识到这是一个老问题，但在以后的问题和答案中有更好的信息。例如，请参阅在asp.net Web应用程序中转录语音转文本的最佳选项是什么？

您可以调用SetInputToWaveFile（）来读取音频文件，而不是调用SetInputToDefaultAudioDevice（）。

Windows Vista和Windows 7中的桌面识别引擎包含一个听写语法，如参考答案中所示。

您实际上需要自然语言工具包。在python我使用过NTLK http://www.nltk.org/

在.Net我刚刚找到Antelope https://stackoverflow.com/questions/1762040/natural-language-toolkit-equivalent-in-c

请参阅文章http://en.wikipedia.org/wiki/Speech_recognition

您应该使用SpeechRecognitionEngine 。要使用wave文件，请调用SetInputToWaveFile 。我希望我能帮助你更多，但我不是专家。

哦，如果你的话真的是triskaidekaphobia ，我不认为即使是人类语音识别引擎也会认识到……

我已经测试了你的代码，但它没有正确地抓取波形文件。它很吸引人

如果不是OpenFileDialog1.FileName.Contains（“wav”）那么MessageBox.Show（“不正确的文件”）Else TextBox1.Text = OpenFileDialog1.FileName End If

不是其他条件。我也尝试在字符串中使用.wav。

我还需要一个示例代码，用于将wav文件转录为不是来自Microphone的文本。如果您找到了一个好的解决方案，请在此处发布。

C＃：使用System.Speech命名空间将WAV文件转录为文本（语音到文本）

UPDATE

更新（11月28日之后）

如何列出所有类的变量

我应该如何“暂停”程序以向用户显示提示？

如何使用OpenXML SDK通过应用格式（格式化单元格值）获取单元格值

正则表达式匹配除给定列表之外的所有单词

极大的单行文件解析

Visual Studio 2010的HWnd

如何在Windows中覆盖最大32×32鼠标大小，就像这个程序一样

正则表达式分割线（csv文件）

尝试使用AxAcroPDFLib打开PDF

将计数和数据作为json返回到extjs网格中