Version: Next

Class: TextToSpeechModule

Defined in: modules/natural_language_processing/TextToSpeechModule.ts:16

Module for Text to Speech (TTS) functionalities.

Methods

delete()

delete(): void

Defined in: modules/natural_language_processing/TextToSpeechModule.ts:223

Unloads the model from memory.

Returns

void
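
Because delete() unloads the model, it pairs naturally with try/finally so the memory is released even when synthesis throws. The following is a minimal sketch that relies only on the delete() signature documented above; the Deletable interface and the withTts helper are hypothetical names introduced for illustration and are not part of the library.

```typescript
// Hypothetical structural type: anything exposing the documented delete() method.
interface Deletable {
  delete(): void;
}

// Runs `work` with the given instance and always unloads it afterwards,
// whether `work` resolves or rejects.
async function withTts<M extends Deletable, R>(
  tts: M,
  work: (tts: M) => Promise<R>,
): Promise<R> {
  try {
    return await work(tts);
  } finally {
    tts.delete();
  }
}
```

A caller would pass the instance obtained from fromModelName() together with a callback that performs the synthesis.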

forward()

forward(input, speed?, phonemize?): Promise<Float32Array<ArrayBufferLike>>

Defined in: modules/natural_language_processing/TextToSpeechModule.ts:119

Synthesizes the provided input (text or IPA phonemes) into speech.

Parameters

input

string

The input text or phonemes to be synthesized.

speed?

number = 1.0

Playback speed multiplier (default: 1.0).

phonemize?

boolean = true

If true (default), treats input as text and converts it to phonemes. If false, input is treated as phonemes.

Returns

Promise<Float32Array<ArrayBufferLike>>

A promise resolving to the full audio waveform as a Float32Array.
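
The resolved Float32Array holds raw samples, so saving or playing it usually needs a conversion step. The sketch below shows one common option, conversion to 16-bit PCM. It assumes the samples are normalized to the range [-1, 1] and that the caller knows the model's sample rate; neither is stated on this page. The Synthesizer interface and all helper names are hypothetical and only mirror the forward() signature above.

```typescript
// Hypothetical structural type mirroring the documented forward() signature.
interface Synthesizer {
  forward(input: string, speed?: number, phonemize?: boolean): Promise<Float32Array>;
}

// Converts floating-point samples to 16-bit signed PCM.
// Assumption: samples are normalized to [-1, 1]; out-of-range values are clamped.
function floatToPcm16(samples: Float32Array): Int16Array {
  return Int16Array.from(samples, (value) => {
    const s = Math.max(-1, Math.min(1, value));
    return s < 0 ? Math.round(s * 0x8000) : Math.round(s * 0x7fff);
  });
}

// Length of the waveform in seconds for a caller-supplied sample rate.
function durationSeconds(samples: Float32Array, sampleRate: number): number {
  return samples.length / sampleRate;
}

// Synthesizes plain text (phonemize = true) and returns 16-bit PCM samples.
async function synthesizeToPcm16(
  tts: Synthesizer,
  text: string,
  speed: number = 1.0,
): Promise<Int16Array> {
  const waveform = await tts.forward(text, speed, true);
  return floatToPcm16(waveform);
}
```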

stream()

stream(input): AsyncGenerator<Float32Array<ArrayBufferLike>>

Defined in: modules/natural_language_processing/TextToSpeechModule.ts:134

Starts a streaming synthesis session. Yields audio chunks as they are generated.

Parameters

input

TextToSpeechStreamingInput

Input object containing optional speed, phonemize flag and stopAutomatically flag.

Returns

AsyncGenerator<Float32Array<ArrayBufferLike>>

An async generator yielding Float32Array audio chunks.

Yields

An audio chunk generated during synthesis.
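
Since stream() returns an async generator, chunks can be consumed with for await. The sketch below drains any async iterable of Float32Array chunks, optionally hands each chunk to a callback (for example, to enqueue it for playback), and joins the chunks into one waveform. The names concatChunks and collectStream are hypothetical helpers, not library functions.

```typescript
// Joins audio chunks into one contiguous waveform.
function concatChunks(chunks: Float32Array[]): Float32Array {
  const total = chunks.reduce((sum, chunk) => sum + chunk.length, 0);
  const out = new Float32Array(total);
  let offset = 0;
  for (const chunk of chunks) {
    out.set(chunk, offset);
    offset += chunk.length;
  }
  return out;
}

// Drains an async iterable of chunks, such as the generator returned by stream().
// `onChunk` is called for every chunk as soon as it arrives.
async function collectStream(
  source: AsyncIterable<Float32Array>,
  onChunk?: (chunk: Float32Array) => void,
): Promise<Float32Array> {
  const chunks: Float32Array[] = [];
  for await (const chunk of source) {
    if (onChunk) onChunk(chunk);
    chunks.push(chunk);
  }
  return concatChunks(chunks);
}
```

Collecting every chunk defeats the latency benefit of streaming, so in practice the onChunk callback is where playback would be driven; the joined result is mainly useful for saving the audio afterwards.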

streamFlush()

streamFlush(): void

Defined in: modules/natural_language_processing/TextToSpeechModule.ts:202

Forces the currently buffered content to be partitioned for synthesis, even if it does not end with an end-of-sentence character. Call it after the final streamInsert() of an utterance to play out the remaining content without ending the stream.

Returns

void

streamInsert()

streamInsert(input): void

Defined in: modules/natural_language_processing/TextToSpeechModule.ts:193

Inserts new content (text or IPA phonemes) into the buffer to be processed in streaming mode. Trailing unterminated content stays in the buffer until streamFlush() or streamStop(false) releases it.

Parameters

input

string

The text or phoneme fragment to append to the streaming buffer.

Returns

void

streamStop()

streamStop(instant?): void

Defined in: modules/natural_language_processing/TextToSpeechModule.ts:212

Stops the ongoing streaming process, if any.

Parameters

instant?

boolean = true

If true (default), stops streaming as soon as possible. If false, drains the current buffer, force-flushing any trailing unterminated content, before stopping.

Returns

void
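
The three streaming controls can be combined as follows: insert fragments as they arrive, then either flush the tail and keep the session open, or stop gracefully with streamStop(false). This is a minimal sketch that assumes a session has already been started with stream(); the StreamControls interface and the feedFragments helper are hypothetical and only mirror the signatures documented above.

```typescript
// Hypothetical structural type mirroring the documented streaming methods.
interface StreamControls {
  streamInsert(input: string): void;
  streamFlush(): void;
  streamStop(instant?: boolean): void;
}

// Feeds text fragments (for example, tokens arriving from a language model)
// into an open streaming session.
function feedFragments(
  tts: StreamControls,
  fragments: readonly string[],
  endSession: boolean,
): void {
  for (const fragment of fragments) {
    tts.streamInsert(fragment);
  }
  if (endSession) {
    // Drain the buffer, including any unterminated tail, then stop.
    tts.streamStop(false);
  } else {
    // Release the unterminated tail but keep the stream open for more input.
    tts.streamFlush();
  }
}
```

Calling streamStop() with no argument instead would use the default instant = true and stop as soon as possible, which may leave buffered content unspoken.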

fromModelName()

static fromModelName(config, onDownloadProgress?): Promise<TextToSpeechModule>

Defined in: modules/natural_language_processing/TextToSpeechModule.ts:30

Creates a Text to Speech instance.

Parameters

config

TextToSpeechModelConfig

Configuration object containing model and voice sources.

onDownloadProgress?

(progress) => void

Optional callback to monitor download progress, receiving a value between 0 and 1.

Returns

Promise<TextToSpeechModule>

A Promise resolving to a TextToSpeechModule instance.
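
The onDownloadProgress callback receives a fraction between 0 and 1, which is often converted to a percentage for display. The sketch below builds such a callback and reports only when the rounded percentage changes, to avoid flooding the UI. The makeProgressReporter helper is a hypothetical name, not part of the library.

```typescript
// Builds a callback suitable for onDownloadProgress.
// `report` is invoked with a whole-number percentage each time it changes.
function makeProgressReporter(
  report: (percent: number) => void,
): (progress: number) => void {
  let last = -1;
  return (progress: number) => {
    // Clamp defensively in case a value falls slightly outside [0, 1].
    const clamped = Math.max(0, Math.min(1, progress));
    const percent = Math.round(clamped * 100);
    if (percent !== last) {
      last = percent;
      report(percent);
    }
  };
}
```

The returned function would be passed as the second argument of fromModelName(), alongside a TextToSpeechModelConfig supplied by the caller.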
