modeco80 ddd3483620 add boost asio to speech2
currently the main program is just some stackful coro example stuff that i used just to test

i'll add beast too in a sec but just. hang on
2024-07-20 06:37:09 -04:00

SAPIServer

Simple HTTP frontend API for Microsoft Speech API

Building

Requirements

  • .NET SDK
  • VS2022 lib pack (TODO: link)
  • mingw-w64 toolchain built with win32 thread model (pthread won't work)

You'll also need to run chattr +F on the Windows SDK header directories so the build works. If you're not on ext4, or you don't want to tune2fs, mount the whole thing with ciopfs and rename the headers to lowercase instead.

After that, running just should do the trick.

Running

Usage: SAPIServer.exe <port>

API Usage

List voices

GET /api/voices

{"voices":["LH Michael","LH Michelle","Microsoft Sam"]}
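
As a rough sketch of how a client might call this endpoint, the Python below fetches the voice list and extracts the names. The base URL (localhost on port 8080) and the helper names `parse_voices` and `list_voices` are assumptions for illustration only; use whatever port you passed to SAPIServer.exe.

```python
import json
import urllib.request

# Assumption: the server was started locally as "SAPIServer.exe 8080".
BASE_URL = "http://localhost:8080"


def parse_voices(body: bytes) -> list:
    """Extract the voice names from a /api/voices response body."""
    return json.loads(body)["voices"]


def list_voices(base_url: str = BASE_URL) -> list:
    """Send GET /api/voices and return the list of voice names."""
    with urllib.request.urlopen(f"{base_url}/api/voices") as resp:
        return parse_voices(resp.read())


if __name__ == "__main__":
    # Requires a running server; prints one voice name per line.
    for name in list_voices():
        print(name)
```

Parsing is kept separate from the network call so the response handling can be checked without a server running.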

Synthesize text

POST /api/synthesize 
Content-Type: application/json

{"text":"Lorem ipsum dolor sit amet...","voice":"Microsoft Sam"}

Returns the synthesized TTS audio as a WAV (waveform audio) file.
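
A minimal client sketch for the synthesize endpoint, under the same assumptions as before: the server is reachable at a base URL such as http://localhost:8080, and the helper names `build_synthesize_request` and `synthesize_to_file` as well as the output path are hypothetical. It posts the JSON body shown above and writes the response bytes to disk unchanged.

```python
import json
import urllib.request


def build_synthesize_request(base_url: str, text: str, voice: str) -> urllib.request.Request:
    """Build the POST /api/synthesize request with a JSON body."""
    payload = json.dumps({"text": text, "voice": voice}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/api/synthesize",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def synthesize_to_file(base_url: str, text: str, voice: str, out_path: str) -> None:
    """Send the request and save the returned audio bytes to out_path."""
    req = build_synthesize_request(base_url, text, voice)
    with urllib.request.urlopen(req) as resp, open(out_path, "wb") as out:
        out.write(resp.read())


if __name__ == "__main__":
    # Assumption: server started locally on port 8080; output file name is arbitrary.
    synthesize_to_file("http://localhost:8080", "Hello there", "Microsoft Sam", "hello.wav")
```

The voice name should be one of the strings returned by GET /api/voices.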