International Journal of Innovative Research in Computer Science and Technology
Year: 2026, Volume: 14, Issue: 1
First page : ( 79) Last page : ( 88)
Online ISSN : 2350-0557.
Supritha P O
, Omkar Mahale
, Shalya Gaonkar
, Shetty Aditya Udaya
, Sooraj Devadiga
DOI: 10.55524/ijircst.2026.14.1.10 |
DOI URL: https://doi.org/10.55524/ijircst.2026.14.1.10
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0)http://creativecommons.org/licenses/by/4.0
Article Tools: Print the Abstract | Indexing metadata | How to cite item | Email this article | Post a Comment
Supritha P O , Omkar Mahale, Shalya Gaonkar, Shetty Aditya Udaya, Sooraj Devadiga
Pronunciation accuracy is a fundamental factor in effective language learning; however, many existing systems face difficulties in delivering real-time error analysis without relying on computationally intensive acoustic model training. This paper introduces an AI-driven pronunciation mistake detection system developed using Google Gemini 1.5 Flash, a low-latency multimodal large language model capable of directly processing spoken input. Unlike conventional approaches based on MFCC features or task-specific deep learning pipelines, the proposed system employs prompt-guided reasoning combined with algorithmic scoring methods to detect pronunciation errors at the word, phoneme, and prosodic levels. Learner speech is transmitted to the Gemini API, which generates a structured pronunciation analysis that includes phoneme-level interpretations and word-level discrepancies. These outputs are further processed by a custom scoring framework to evaluate pronunciation quality and produce clear, actionable feedback. Experimental evaluation using diverse English utterances demonstrates the system’s effectiveness in identifying vowel–consonant substitutions, omitted syllables, and stress-related errors. The findings underscore the potential of LLM-based audio reasoning as a lightweight, scalable, and real-time solution for automated pronunciation assessment.
Assistant Professor, CSE, SDMIT, Ujire, India
No. of Downloads: 27 | No. of Views: 221
Suchetha N V, Anisha Upadhayaya H S, Anushree U Rao, H N Swati, Mallana Gowda G S.
January 2026 - Vol 14, Issue 1
Dhawal Jain, Omkar S, Pragathi, Sanidhya M Jain, Vishal Raju Angadi.
January 2026 - Vol 14, Issue 1
Supritha P O, Prajwal, Saikumar Laxman Pujari, Siddartha R, Adarsh Shendage.
January 2026 - Vol 14, Issue 1
