Improving the Sound Classification Accuracy Using CNN-LSTM and MFCC with Audio Augmentation for Diagnosing Respiratory Disease

dc.contributor.authorManop Phankokkruad
dc.contributor.authorSirirat Wacharawichanant
dc.date.accessioned2026-05-08T19:21:31Z
dc.date.issued2025-5-23
dc.description.abstractAudio is vital information data for understanding various situations. A multitude of sound features can be explained by analysis through the audio signals. Numerous classification methods have been developed to study audio classification. This work studies the improvement of audio classification for the diagnosis of respiratory disease through the integration of audio data augmentation and CNN in conjunction with LSTM (CNN-LSTM). Furthermore, this paper focuses on audio data augmentation and feature extraction in the deep learning approach. This study proposed the CNN-LSTM model to diagnose respiratory disease by learning from the different audio datasets. The results reveal that the CNN-LSTM model attained an accuracy of 81.48%, precision of 0.8340, sensitivity of 0.6948, and F1-score of 0.7225. Considering the achieved F1-score, the CNN-LSTM model demonstrates a high level of diagnotic accuracy. Therefore, all evaluation evaluation parameters collectively indicate the robust performance of the proposed disease classification model.
dc.identifier.doi10.1109/icaibd64986.2025.11082034
dc.identifier.urihttps://dspace.kmitl.ac.th/handle/123456789/18092
dc.subjectPhonocardiography and Auscultation Techniques
dc.subjectMusic and Audio Processing
dc.subjectSpeech and Audio Processing
dc.titleImproving the Sound Classification Accuracy Using CNN-LSTM and MFCC with Audio Augmentation for Diagnosing Respiratory Disease
dc.typeArticle

Files

Collections