Please use this identifier to cite or link to this item: http://10.9.150.37:8080/dspace//handle/atmiyauni/2294
Title: Development Of A Model To Analyze & Interpret Vernacular Voice Recognition Of Gujarati Dialects
Authors: Shah, Meerabahen M.
Kavathiya, Hiren R.
Issue Date: Dec-2024
Publisher: Department of Computer Science, Faculty of Science Atmiya University.
Citation: Shah, Meerabahen M.; Kavathiya, Hiren R. (2024). Development Of A Model To Analyze & Interpret Vernacular Voice Recognition Of Gujarati Dialects. Department of Computer Science, Faculty of Science Atmiya University.
Abstract: The development of voice recognition systems tailored to vernacular dialects holds transformative potential for enhancing accessibility and inclusivity in technology. This thesis focuses on creating a voice recognition model specifically designed for vernacular Gujarati dialects, addressing the unique linguistic and phonetic challenges inherent in regional variations of the language. The key part of this research was to gather a diverse and representative spoken Gujarati corpora sourced via varied public repositories, which includes radio broadcast, interview, folk song, community recording and public availability speech corpora. This dataset includes a variety of dialectal variation in phonology, syntax and usage to guarantee robustness and inclusivity to the development of the models. A dialect-specific recognition system using advanced techniques in voice recognition system, including deep learning architectures the proposed framework and model was developed. The model is further enriched with dialectal linguistic features integrated to its architecture, phoneme based pretraining to increase recognition accuracy, and transfer learning to adapt general speech recognition systems to dialect specific nuances. The model was evaluated and found to achieve substantial improvement in phoneme recognition accuracy over baseline systems. The results show that modeling context-aware, high quality, diverse datasets are crucial to vernacular speech recognition. The system developed is there to provide practical applications for voice enabled user interface, digital accessibility and protection of linguistic diversity more specific examples of such languages which are least represented. This work contributes to the emerging area of regional language processing with an end-to-end framework that can be used for future work on low-resource languages and dialects and to build inclusive, ubiquitous and accessible technology solutions in multilingual communities.
URI: http://10.9.150.37:8080/dspace//handle/atmiyauni/2294
Appears in Collections:01. PhD. Thesis Computer Applications

Files in This Item:
File Description SizeFormat 
01. Title page.pdf138.85 kBAdobe PDFView/Open
02_Prelim pages.pdf963.37 kBAdobe PDFView/Open
03. Contents.pdf179.46 kBAdobe PDFView/Open
04. Abstract.pdf25.88 kBAdobe PDFView/Open
05. Chapter 1.pdf360.26 kBAdobe PDFView/Open
06. Chapter 2.pdf434.6 kBAdobe PDFView/Open
07. Chapter 3.pdf1.21 MBAdobe PDFView/Open
08. Chapter 4.pdf761.81 kBAdobe PDFView/Open
09. Chapter 5.pdf705.87 kBAdobe PDFView/Open
10. Chapter 6.pdf664.89 kBAdobe PDFView/Open
11_Publication.pdf11.57 MBAdobe PDFView/Open
Show full item record


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.