alekssamos / msspeech Goto Github PK

View Code? Open in Web Editor NEW

55.0 3.0 10.0 429 KB

not official API for Microsoft speech synthesis from Microsoft Edge web browser read aloud

Home Page: https://pypi.org/project/msspeech/

Python 95.47% Makefile 4.53%

speech-synthesis speech-synthesis-api speech-synthesis-library tts-api

msspeech's People

Contributors

Stargazers

Watchers

Forkers

ishine xyx208 homebrew-startup pendave gdtiti faithless35 18639522576 1zero8 leos-code pixu2019

msspeech's Issues

Does the CLI support reading text from a file?

There is a text file containing several sentences. When specifying this file through the CLI, multiple audio files will be outputted, with each sentence separated by a newline character, and each line representing one audio.

Option to use the Narrator voices?

In new Windows 11 builds, Narrator can use the exact same voices that are available in Edge, but offline, and seemingly more customizable. These voices install as store apps, so we don't need an internet connection either, like you seem to in order to use the Edge voices. Is it possible to get these integrated into this project?

(problem and solution) save the audio in data buffer

@alekssamos Thank you very much for this tool, I barely know it and it is perfect for some projects I am working on.

Problem:
When trying to save the audio file in a data buffer, the following error is displayed

msspeech/__init__.py, line 520, in _synthesize

     bc += await f.write(resp[1])

TypeError: object int can't be used in 'await' expression

My code

file = BytesIO()
await mss.synthesize(data.text.strip(), file)
file.seek(0)

Solution:
remove the await and it worked for me.

Doesn't work with SoundLoader (kivy)

I already tried to save the file as .mp3, .wav, .ogg, doesn't matter, kivy isn't able to load properly.
b'Unrecognized audio format'
from kivy.app import App
from kivy.core.audio import SoundLoader

class MusicPlayerApp(App):
def build(self):
# load the mp3 file using SoundLoader
sound = SoundLoader.load(R"D:\Documents\aaaa.ogg")
# play the audio file
sound.play()
return

if name == 'main':
MusicPlayerApp().run()

I can't use many new narators but exist on microsoft.

https://learn.microsoft.com/en-us/azure/cognitive-services/Speech-Service/language-support?tabs=tts
for voice in voices: if voice["Locale"] == "zh-CN": print("Chinese voices found: ", voice["ShortName"]) await mss.set_voice(voice["Name"])

I can't use:
such as
zh-CN-henan
zh-CN-liaoning
.........

the voices_list_plus.json is outdated, how can I update it myself?

Hello I find the voices_list_plus.json is outdated, as I can't find this voice for example:
zh-CN-YunjianNeural

https://raw.githubusercontent.com/alekssamos/msspeech/da5904e14e7e8b383f4230c57585eaa92271a4bb/msspeech/voices_list_plus.json

:)
How can I update it myself?

And where can I add custom SSML?

alekssamos / msspeech Goto Github PK

msspeech's People

Contributors

Stargazers

Watchers

Forkers

msspeech's Issues

Does the CLI support reading text from a file?

Option to use the Narrator voices?

(problem and solution) save the audio in data buffer

Doesn't work with SoundLoader (kivy)

I can't use many new narators but exist on microsoft.

the voices_list_plus.json is outdated, how can I update it myself?

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent