Access the full text.
Sign up today, get DeepDyve free for 14 days.
Geoffrey Meltzner, J. Heaton, Yunbin Deng, G. Luca, Serge Roy, Joshua Kline (2017)
Silent Speech Recognition as an Alternative Communication Device for Persons With LaryngectomyIEEE/ACM Transactions on Audio, Speech, and Language Processing, 25
T. Celin, G. Anushiya, MemberIEEE Nagarajan, SeniorMemberIEEE Vijayalakshmi, T.A.MariyaCelin (2019)
A Weighted Speaker-Specific Confusion Transducer-Based Augmentative and Alternative Speech Communication Aid for Dysarthric SpeakersIEEE Transactions on Neural Systems and Rehabilitation Engineering, 27
R. Fraile, Juan Godino-Llorente, N. Sáenz-Lechón, V. Osma-Ruiz, J. Gutiérrez-Arriola (2013)
Characterization of dysphonic voices by means of a filterbank-based spectral analysis: sustained vowels and running speech.Journal of voice : official journal of the Voice Foundation, 27 1
(2020)
common voice datasets
D. Childers, Ke Wu, K. Bae, D. Hicks (1988)
Automatic recognition of gender by voiceICASSP-88., International Conference on Acoustics, Speech, and Signal Processing
Dong Wang, Lantian Li, Ying Shi, Yixiang Chen, Zhiyuan Tang (2017)
Deep Factorization for Speech Signal2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Rohan Khanna, Daegun Oh, Youngwook Kim (2019)
Through-Wall Remote Human Voice Recognition Using Doppler Radar With Transfer LearningIEEE Sensors Journal, 19
Michael Wand, J. Schmidhuber (2016)
Deep Neural Network Frontend for Continuous EMG-Based Speech Recognition
Li Chai, Jun Du, Qing-Feng Liu, Chin-Hui Lee (2021)
A Cross-Entropy-Guided Measure (CEGM) for Assessing Speech Recognition Performance and Optimizing DNN-Based Speech EnhancementIEEE/ACM Transactions on Audio, Speech, and Language Processing, 29
D. Dov, R. Talmon, I. Cohen (2016)
Kernel Method for Voice Activity Detection in the Presence of TransientsIEEE/ACM Transactions on Audio, Speech, and Language Processing, 24
Myungjong Kim, Younggwan Kim, Hoirin Kim (2015)
Automatic Intelligibility Assessment of Dysarthric Speech Using Phonologically-Structured Sparse Linear ModelIEEE/ACM Transactions on Audio, Speech, and Language Processing, 23
Jiangyan Yi, J. Tao, Zhengqi Wen, Ye Bai (2019)
Language-Adversarial Transfer Learning for Low-Resource Speech RecognitionIEEE/ACM Transactions on Audio, Speech, and Language Processing, 27
Reza Lotfidereshgi, P. Gournay (2017)
Biologically inspired speech emotion recognition2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Adriana Stan, Yoshitaka Mamiya, J. Yamagishi, P. Bell, O. Watts, R. Clark, Simon King (2016)
ALISA: An automatic lightly supervised speech segmentation and alignment toolComput. Speech Lang., 35
Yuanyuan Liu, Tan Lee, P. Ching, T. Law, K. Lee (2017)
Acoustic Assessment of Disordered Voice with Continuous Speech Based on Utterance-Level ASR Posterior Features
(2020)
A review of deep learning architectures for speech and audio processing
D.G.K. Childers, K.S. Wu, D.M. Bae, D.M. Hicks (1988)
Automatic recognition of gender by voice” in proc
Zhong-Qiu Wang, Deliang Wang (2016)
A Joint Training Framework for Robust Automatic Speech RecognitionIEEE/ACM Transactions on Audio, Speech, and Language Processing, 24
(2020)
Medical speech, transcription, and intent audio utterances paired with text for common medical symptoms
U. Shanthamallu, A. Spanias, C. Tepedelenlioğlu, M. Stanley (2017)
A brief survey of machine learning methods and their sensor and IoT applications2017 8th International Conference on Information, Intelligence, Systems & Applications (IISA)
(2021)
Complete perspective on speech recognition
A. Maddali (2020)
Functional Analysis and Hybrid Optimal Cepstrum Approach for Gender Classification Using Machine LearningInternational Journal of Emerging Trends in Engineering Research
(2020)
User voice management and power spectrum analysis for voice recognition systems
(2018)
An unmanned speech cognizant for medical application
B. Schultz, V. Tarigoppula, Gustavo Noffs, Sandra Rojas, Anneke Walt, D. Grayden, A. Vogel (2021)
Automatic speech recognition in neurodegenerative diseaseInternational Journal of Speech Technology, 24
S. Deena, Madina Hasan, Mortaza Doulaty, O. Saz, Thomas Hain (2019)
Recurrent Neural Network Language Model Adaptation for Multi-Genre Broadcast Speech Recognition and AlignmentIEEE/ACM Transactions on Audio, Speech, and Language Processing, 27
Sadeka Ali, Shariful Islam (2012)
GENDER RECOGNITION SYSTEM USING SPEECH SIGNAL, 2
Shilpi Shukla, Madhu Jain (2021)
A novel stochastic deep resilient network for effective speech recognitionInternational Journal of Speech Technology, 24
Lauri Tavi (2018)
Classifying females’ stressed and neutral voices using acoustic–phonetic analysis of vowels: an exploratory investigation with emergency callsInternational Journal of Speech Technology
Currently, the design, technological features of voices, and their analysis of various applications are being simulated with the requirement to communicate at a greater distance or more discreetly. The purpose of this study is to explore how voices and their analyses are used in modern literature to generate a variety of solutions, of which only a few successful models exist.Design/methodologyThe mel-frequency cepstral coefficient (MFCC), average magnitude difference function, cepstrum analysis and other voice characteristics are effectively modeled and implemented using mathematical modeling with variable weights parametric for each algorithm, which can be used with or without noises. Improvising the design characteristics and their weights with different supervised algorithms that regulate the design model simulation.FindingsDifferent data models have been influenced by the parametric range and solution analysis in different space parameters, such as frequency or time model, with features such as without, with and after noise reduction. The frequency response of the current design can be analyzed through the Windowing techniques.Original valueA new model and its implementation scenario with pervasive computational algorithms’ (PCA) (such as the hybrid PCA with AdaBoost (HPCA), PCA with bag of features and improved PCA with bag of features) relating the different features such as MFCC, power spectrum, pitch, Window techniques, etc. are calculated using the HPCA. The features are accumulated on the matrix formulations and govern the design feature comparison and its feature classification for improved performance parameters, as mentioned in the results.
International Journal of Pervasive Computing and Communications – Emerald Publishing
Published: Nov 8, 2024
Keywords: Short time autocorrelation; HPCA; MFCC; Long- short-term memory (LSTM); Decision-tree (DT); Automatic speech recognition (ASR)
Read and print from thousands of top scholarly journals.
Already have an account? Log in
Bookmark this article. You can see your Bookmarks on your DeepDyve Library.
To save an article, log in first, or sign up for a DeepDyve account if you don’t already have one.
Copy and paste the desired citation format or use the link below to download a file formatted for EndNote
Access the full text.
Sign up today, get DeepDyve free for 14 days.
All DeepDyve websites use cookies to improve your online experience. They were placed on your computer when you launched this website. You can change your cookie settings through your browser.