Articles with "speech text" as a keyword



Improving Speech to Text Alignment Based on Repetition Detection for Dysarthric Speech

Published in 2020 at "Circuits, Systems, and Signal Processing"

DOI: 10.1007/s00034-020-01419-5

Abstract: Alignment of transcription to the speech finds applications in video subtitling, human–computer interaction by means of natural language communication, etc. In spite of many advancements, alignment of transcription to speech remains a challenging task and… read more here.

Keywords: speech text; speech; repetition detection; dysarthric speech ... See more keywords

Applications of speech-to-text recognition and computer-aided translation for facilitating cross-cultural learning through a learning activity: issues and their solutions

Published in 2018 at "Educational Technology Research and Development"

DOI: 10.1007/s11423-017-9556-8

Abstract: In this study, 21 university students, who represented thirteen nationalities, participated in an online cross-cultural learning activity. The participants were engaged in interactions and exchanges carried out on Facebook® and Skype® platforms, and their multilingual… read more here.

Keywords: learning activity; cross cultural; cultural learning; cat ... See more keywords

Speech-to-text intervention to support text production among students with writing difficulties: a single-case study in Nordic countries

Published in 2024 at "Disability and Rehabilitation: Assistive Technology"

DOI: 10.1080/17483107.2024.2351488

Abstract: Studies report that speech-to-text applications (STT) may support students with writing difficulties in text production. However, existing research is sparse, shows mixed results, and lacks information on STT interventions and their applicability in schools.… read more here.

Keywords: stt; speech text; intervention; production ... See more keywords

A 28-nm 1.3-mW Speech-to-Text Accelerator for Edge AI Devices

Published in 2024 at "IEEE Journal of Solid-State Circuits"

DOI: 10.1109/jssc.2024.3389965

Abstract: Speech-to-text conversion has been extensively deployed for a variety of applications. To implement speech-to-text conversion on energy-constrained edge devices, a hybrid algorithm is adopted in this work. A bidirectional recurrent neural network (BRNN), composed of… read more here.

Keywords: speech text; tex math; network; inline formula ... See more keywords

Factors in Emotion Recognition With Deep Learning Models Using Speech and Text on Multiple Corpora

Published in 2022 at "IEEE Signal Processing Letters"

DOI: 10.1109/lsp.2022.3151551

Abstract: Emotion recognition performance of deep learning models is influenced by multiple factors such as acoustic condition, textual content, style of emotion expression (e.g. acted, natural), etc. In this paper, multiple factors are analysed by training… read more here.

Keywords: speech text; corpora; learning models; deep learning ... See more keywords

RSD-GAN: Regularized Sobolev Defense GAN Against Speech-to-Text Adversarial Attacks

Published in 2022 at "IEEE Signal Processing Letters"

DOI: 10.1109/lsp.2022.3208528

Abstract: This letter introduces a new synthesis-based defense algorithm for counteracting a variety of adversarial attacks developed for challenging the performance of cutting-edge speech-to-text transcription systems. Our algorithm implements a Sobolev-based GAN and proposes… read more here.

Keywords: rsd gan; adversarial attacks; speech text; defense ... See more keywords

Generative Semantic Communications for Robust Speech-to-Text Translation

Published in 2025 at "IEEE Transactions on Wireless Communications"

DOI: 10.1109/twc.2025.3590671

Abstract: In this article, we propose a robust semantic communication system for speech transmission, named Ross-S2T, to execute the speech-to-text translation (S2TT) transmission efficiently. First, a deep semantic encoder is developed to directly convert speech in… read more here.

Keywords: transmission; speech; language; speech text ... See more keywords

A multicriteria comparison of end-to-end and cascade speech-to-text translation models

Published in 2025 at "Bulletin of Electrical Engineering and Informatics"

DOI: 10.11591/eei.v14i4.9241

Abstract: This paper presents a thorough examination of two prominent speech-to-text translation (STT) models: the end-to-end (E2E) model and the cascade model. STT is a critical technology in today’s multilingual society, facilitating communication across language barriers.… read more here.

Keywords: speech text; translation; model; cascade ... See more keywords

Evaluation of Lithuanian Speech-to-Text Transcribers

Published in 2025 at "Informatica"

DOI: 10.15388/25-infor591

Abstract: For more than two decades, Lithuanian speech recognition has been researched solely in Lithuania due to the need for deep knowledge of Lithuanian. AI advancements now allow high-quality speech-to-text systems to be built without native… read more here.

Keywords: lithuanian speech; speech; evaluation lithuanian; speech text ... See more keywords

3D Avatar Approach for Continuous Sign Movement Using Speech/Text

Published in 2021 at "Applied Sciences"

DOI: 10.3390/app11083439

Abstract: Sign language is a visual language for communication used by hearing-impaired people with the help of hand and finger movements. Indian Sign Language (ISL) is a well-developed and standard way of communication for hearing-impaired people… read more here.

Keywords: speech text; sentence; sign; sign language ... See more keywords

Domain Adaptation Speech-to-Text for Low-Resource European Portuguese Using Deep Learning

Published in 2023 at "Future Internet"

DOI: 10.3390/fi15050159

Abstract: Automatic speech recognition (ASR), commonly known as speech-to-text, is the process of transcribing audio recordings into text, i.e., transforming speech into the respective sequence of words. This paper presents a deep learning ASR system optimization… read more here.

Keywords: speech text; european portuguese; speech; domain ... See more keywords