International Journal of Science and Research (IJSR)

Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064



Informative Article | Science and Technology | India | Volume 10 Issue 6, June 2021 | Popularity: 4.9 / 10


     

Multi-Modal Fusion for Enhanced Image and Speech Recognition in AI Systems

Ankur Tak


Abstract: This research investigates how multi-modal information, specifically images and speech, can be integrated to enhance the recognition capabilities of artificial intelligence (AI) systems. Adopting an interpretive philosophy and a deductive approach, the study explores the potential of dynamic attention mechanisms, semi-supervised learning, and cross-domain adaptation techniques. It follows a descriptive research design based on secondary data collected from reputable academic sources. The research critically evaluates the feasibility and applicability of hardware optimization for efficient multi-modal processing, considering factors such as specialized processors and parallel computing. The study analyzes dynamic attention mechanisms, emphasizing their role in allocating attention across modalities according to contextual relevance. It also examines semi-supervised learning techniques, which leverage both labeled and unlabeled data to improve recognition performance. Finally, cross-domain adaptation techniques are explored as a way to deploy multi-modal fusion models across diverse real-world scenarios.
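
The idea of allocating attention across modalities according to contextual relevance can be illustrated with a minimal sketch. It is not taken from the paper: the function names (`fuse`, `softmax`, `dot`), the context vector, and the toy feature values are all assumptions made for illustration, and scaled dot-product scoring is one common choice rather than the method the study necessarily describes.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def fuse(context, modality_features):
    """Weight each modality by its relevance to a context vector, then mix.

    context: hypothetical query vector (list of floats).
    modality_features: dict mapping a modality name to a feature vector
        of the same length as `context` (an assumption of this sketch).
    Returns (weights per modality, fused feature vector).
    """
    names = list(modality_features)
    scale = math.sqrt(len(context))
    # Relevance score: scaled dot product between context and each modality.
    scores = [dot(context, modality_features[n]) / scale for n in names]
    weights = softmax(scores)
    # Fused vector: attention-weighted sum of the modality features.
    fused = [
        sum(w * modality_features[n][i] for w, n in zip(weights, names))
        for i in range(len(context))
    ]
    return dict(zip(names, weights)), fused


# Toy example: the context aligns with the image features, so the image
# modality receives the larger attention weight.
weights, fused = fuse(
    context=[1.0, 0.0],
    modality_features={"image": [2.0, 0.0], "speech": [0.0, 2.0]},
)
```

In this toy case the weights change whenever the context vector changes, which is the sense in which the allocation is dynamic; a trained system would learn the context and feature projections rather than take them as fixed inputs.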


Keywords: AI systems, knowledge, connecting, integrating, multi-modal classification, aural, visual information


Edition: Volume 10 Issue 6, June 2021


Pages: 1780 - 1788


DOI: https://www.doi.org/10.21275/SR231208202748







Ankur Tak, "Multi-Modal Fusion for Enhanced Image and Speech Recognition in AI Systems", International Journal of Science and Research (IJSR), Volume 10 Issue 6, June 2021, pp. 1780-1788, https://www.ijsr.net/getabstract.php?paperid=SR231208202748, DOI: https://www.doi.org/10.21275/SR231208202748



Similar Articles

Masters Thesis, Science and Technology, Philippines, Volume 11 Issue 7, July 2022

Pages: 1858 - 1879

Level of Knowledge and Attitude on Comprehensive Sexuality Education: Basis for Designing Career and Life Skills Based Instructional Materials for Senior High School

Ryan Jason P. Cruz, Elisa N. Chua

Informative Article, Science and Technology, India, Volume 9 Issue 6, June 2020

Pages: 1919 - 1924

Energy and Sustainability: CMS - Powered Communication Strategies

Bhargav Reddy Piduru

Informative Article, Science and Technology, India, Volume 9 Issue 7, July 2020

Pages: 1999 - 2003

Optimizing Resource Utilization in Kubernetes: Definitive Best Practices for Efficient Cluster Management

Dinesh Reddy Chittibala

Masters Thesis, Science and Technology, Philippines, Volume 12 Issue 5, May 2023

Pages: 1806 - 1809

Conventional and Pragmatic Approaches in Teaching Science in Bulan III District

April Grace A. Gutlay

Informative Article, Science and Technology, India, Volume 7 Issue 9, September 2018

Pages: 1653 - 1656

Accelerating Software Quality: A Comprehensive Guide to Automation Testing for Java Applications

Vandana Sharma
