LAUSR: text speech

Fast Griffin Lim based waveform generation strategy for text-to-speech synthesis

Sign Up to like & get
recommendations!
0 Published in 2020 at "Multimedia Tools and Applications"

DOI: 10.1007/s11042-020-09321-7

Abstract: The performance of text-to-speech (TTS) systems heavily depends on spectrogram to waveform generation, also known as the speech reconstruction phase. The time required for the same is known as synthesis delay. In this paper, an… read more here.

Keywords: text speech; speech; speech synthesis; waveform generation ... See more keywords

The Effect of Simultaneous Text on the Recall of Noise-Degraded Speech

Sign Up to like & get
recommendations!
1 Published in 2017 at "Journal of Experimental Psychology: Human Perception and Performance"

DOI: 10.1037/xhp0000360

Abstract: Written and spoken language utilize the same processing system, enabling text to modulate speech processing. We investigated how simultaneously presented text affected speech recall in babble noise using a retrospective recall task. Participants were presented… read more here.

Keywords: text speech; speech; degraded speech; recall ... See more keywords

The Effects of Shared e-Book Reading With Dynamic Text and Speech Output on the Single-Word Reading Skills of Young Children With Developmental Disabilities.

Sign Up to like & get
recommendations!
1 Published in 2020 at "Language, speech, and hearing services in schools"

DOI: 10.1044/2020_lshss-20-00009

Abstract: Purpose This study investigated the use of a new software feature, namely, dynamic text with speech output, on the acquisition of single-word reading skills by six children with developmental disabilities during shared e-book reading experiences… read more here.

Keywords: children developmental; text speech; word; reading ... See more keywords

Exploring Universal Text-to-Speech Use in Assessment Among Student Sub-Populations

Sign Up to like & get
recommendations!
0 Published in 2025 at "Applied Measurement in Education"

DOI: 10.1080/08957347.2025.2474958

Abstract: ABSTRACT Text-to-speech (TTS) is increasingly being built into large-scale assessment platforms and offered to any students who choose to use it, particularly when decoding skills are not a part of the measured construct. There is… read more here.

Keywords: use; assessment; exploring universal; text speech ... See more keywords

Japanese Neural Incremental Text-to-Speech Synthesis Framework With an Accent Phrase Input

Sign Up to like & get
recommendations!
2 Published in 2023 at "IEEE Access"

DOI: 10.1109/access.2023.3251657

Abstract: Work in the development of neural incremental text-to-speech (iTTS), which is attracting increasing attention, has recently pursued low-latency processing by generating speech on the fly before reading complete sentences. Most current state-of-the-art iTTS systems use… read more here.

Keywords: neural incremental; text speech; speech; incremental text ... See more keywords

Emotional Text-To-Speech in Japanese Using Artificially Augmented Dataset

Sign Up to like & get
recommendations!
0 Published in 2024 at "IEEE Access"

DOI: 10.1109/access.2024.3495694

Abstract: This study explores the feasibility of using artificial emotional speech datasets generated by existing artificial voice-generating software as an alternative to human-generated datasets for emotional speech synthesis. Focusing on the Japanese language, we assess the… read more here.

Keywords: emotional text; speech japanese; text speech; speech ... See more keywords

Incremental Text-to-Speech Synthesis Using Pseudo Lookahead With Large Pretrained Language Model

Sign Up to like & get
recommendations!
0 Published in 2021 at "IEEE Signal Processing Letters"

DOI: 10.1109/lsp.2021.3073869

Abstract: This letter presents an incremental text-to-speech (TTS) method that performs synthesis in small linguistic units while maintaining the naturalness of output speech. Incremental TTS is generally subject to a trade-off between latency and synthetic speech… read more here.

Keywords: language model; pseudo lookahead; method; text speech ... See more keywords

A Controllable Multi-Lingual Multi-Speaker Multi-Style Text-to-Speech Synthesis With Multivariate Information Minimization

Sign Up to like & get
recommendations!
1 Published in 2022 at "IEEE Signal Processing Letters"

DOI: 10.1109/lsp.2021.3125259

Abstract: In this letter, we propose a multivariate information minimization method that disentangles three or more latent representations. We show that control factors can be disentangled by minimizing interactive dependency, which can be expressed as a… read more here.

Keywords: multi; information minimization; information; multivariate information ... See more keywords

SNAC: Speaker-Normalized Affine Coupling Layer in Flow-Based Architecture for Zero-Shot Multi-Speaker Text-to-Speech

Sign Up to like & get
recommendations!
1 Published in 2022 at "IEEE Signal Processing Letters"

DOI: 10.1109/lsp.2022.3226655

Abstract: Zero-shot multi-speaker text-to-speech (ZSM-TTS) models aim to generate a speech sample with the voice characteristic of an unseen speaker. The main challenge of ZSM-TTS is to increase the overall speaker similarity for unseen speakers. One… read more here.

Keywords: text speech; speech; multi speaker; speaker ... See more keywords

Text-to-Speech With Lip Synchronization Based on Speech-Assisted Text-to-Video Alignment and Masked Unit Prediction

Sign Up to like & get
recommendations!
0 Published in 2025 at "IEEE Signal Processing Letters"

DOI: 10.1109/lsp.2025.3537949

Abstract: Text-to-speech (TTS) with lip synchronization (TTSLS) is the task of generating a speech signal synchronized with the lip movements in a video given the text transcription and the video without speech. Previous approaches to TTSLS… read more here.

Keywords: unit; video; lip; alignment ... See more keywords

Distribution-Preserving Steganography Based on Text-to-Speech Generative Models

Sign Up to like & get
recommendations!
1 Published in 2022 at "IEEE Transactions on Dependable and Secure Computing"

DOI: 10.1109/tdsc.2021.3095072

Abstract: Steganography is the art and science of hiding secret messages in public communication so that the presence of secret messages cannot be detected. There are two distribution-preserving steganographic frameworks, one is sampler-based and the other… read more here.

Keywords: text speech; preserving steganography; distribution preserving; generative models ... See more keywords

LAUSR

You are not signed in:

Sign Up!