DC Field | Value | Language |
---|---|---|
dc.contributor.author | Shah, Meerabahen M. | - |
dc.contributor.author | Kavathiya, Hiren R. | - |
dc.date.accessioned | 2025-01-28T10:24:29Z | - |
dc.date.available | 2025-01-28T10:24:29Z | - |
dc.date.issued | 2024-12 | - |
dc.identifier.citation | Shah, Meerabahen M.; Kavathiya, Hiren R. (2024). Development Of A Model To Analyze & Interpret Vernacular Voice Recognition Of Gujarati Dialects. Department of Computer Science, Faculty of Science Atmiya University. | en_US |
dc.identifier.uri | http://10.9.150.37:8080/dspace//handle/atmiyauni/2294 | - |
dc.description.abstract | The development of voice recognition systems tailored to vernacular dialects holds transformative potential for enhancing accessibility and inclusivity in technology. This thesis focuses on creating a voice recognition model specifically designed for vernacular Gujarati dialects, addressing the unique linguistic and phonetic challenges inherent in regional variations of the language. The key part of this research was to gather a diverse and representative spoken Gujarati corpora sourced via varied public repositories, which includes radio broadcast, interview, folk song, community recording and public availability speech corpora. This dataset includes a variety of dialectal variation in phonology, syntax and usage to guarantee robustness and inclusivity to the development of the models. A dialect-specific recognition system using advanced techniques in voice recognition system, including deep learning architectures the proposed framework and model was developed. The model is further enriched with dialectal linguistic features integrated to its architecture, phoneme based pretraining to increase recognition accuracy, and transfer learning to adapt general speech recognition systems to dialect specific nuances. The model was evaluated and found to achieve substantial improvement in phoneme recognition accuracy over baseline systems. The results show that modeling context-aware, high quality, diverse datasets are crucial to vernacular speech recognition. The system developed is there to provide practical applications for voice enabled user interface, digital accessibility and protection of linguistic diversity more specific examples of such languages which are least represented. This work contributes to the emerging area of regional language processing with an end-to-end framework that can be used for future work on low-resource languages and dialects and to build inclusive, ubiquitous and accessible technology solutions in multilingual communities. | en_US |
dc.language.iso | en | en_US |
dc.publisher | Department of Computer Science, Faculty of Science Atmiya University. | en_US |
dc.title | Development Of A Model To Analyze & Interpret Vernacular Voice Recognition Of Gujarati Dialects | en_US |
dc.type | Thesis | en_US |
Appears in Collections: | 01. PhD. Thesis Computer Applications |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01. Title page.pdf | 138.85 kB | Adobe PDF | View/Open | |
02_Prelim pages.pdf | 963.37 kB | Adobe PDF | View/Open | |
03. Contents.pdf | 179.46 kB | Adobe PDF | View/Open | |
04. Abstract.pdf | 25.88 kB | Adobe PDF | View/Open | |
05. Chapter 1.pdf | 360.26 kB | Adobe PDF | View/Open | |
06. Chapter 2.pdf | 434.6 kB | Adobe PDF | View/Open | |
07. Chapter 3.pdf | 1.21 MB | Adobe PDF | View/Open | |
08. Chapter 4.pdf | 761.81 kB | Adobe PDF | View/Open | |
09. Chapter 5.pdf | 705.87 kB | Adobe PDF | View/Open | |
10. Chapter 6.pdf | 664.89 kB | Adobe PDF | View/Open | |
11_Publication.pdf | 11.57 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.