3月特卖开启!从即日起至3月21日晚上,精选商品五折特惠。

Offline Speech Recognition

Ilgar Lunin - 代码插件 - 2021/09/20

Accurate offline speech recognition

  • 支持的平台
  • 支持的引擎版本
    4.25 - 4.27, 5.0 - 5.3
  • 下载类型
    引擎插件
    此产品包含一款代码插件,含有预编译的二进制文件以及与虚幻引擎集成的所有源代码,能够安装到您选择的引擎版本中,并根据每个项目的需求启动。

Allows you to recognize speech from more than 15 languages, without relying on any cloud service or subscription. Instead, a language server is a separate process on your machine, which talks with your game. The language server app is public ( https://github.com/IlgarLunin/vosk-language-server ), you can fork it and customize, distribute with your game, run it without any user interface.


Unreal engine client is dead simple communication with language server. It connects to it, records, and feeds your voice to the language server, the server sends recognized voices as text back to unreal.


This is streaming voice recognition, and you can implement simple conversations with your NPC without any user input except voice. "Ok robot, do this", "Ok robot, do that" etc.


Download latest language server: https://github.com/IlgarLunin/vosk-language-server/releases


Using language server as a separate app is optional! Your game itself can act like language server.


Visit discord and documentation for more info


Video demonstration: https://youtu.be/iJVCsuuC5A4

Example project for Unreal 5.3: here

技术细节

Features:

  • No dependencies on other paid cloud services
  • One time payment
  • The server can handle multiple clients at the same time
  • Easy to setup
  • No internet required


Code Modules:

  • VoskPlugin (Runtime)


Number of Blueprints: 0

Number of C++ Classes: 2

Network Replicated: No

Supported Development Platforms: Windows, Mac, Linux

Supported Target Build Platforms: Windows, Mac, Linux

Documentation: https://github.com/IlgarLunin/VoskPlugin-docs

Discord: https://discord.gg/Tkf7xe2