International Journal of Innovative Research in Engineering and Management
Year: 2026, Volume: 13, Issue: 3
First page : ( 38) Last page : ( 46)
Online ISSN : 2350-0557
Alimpia Roy
DOI: 10.55524/ijirem.2026.13.3.5 |
DOI URL: https://doi.org/10.55524/ijirem.2026.13.3.5
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0) (http://creativecommons.org/licenses/by/4.0)
Article Tools: Print the Abstract | Indexing metadata | How to cite item | Email this article | Post a Comment
Alimpia Roy , Gurpreet Singh
The Korean Wave (Hallyu) has significantly influenced consumer behaviour in India, yet understanding the nuanced sentiment patterns that drive cross-cultural product adoption remains challenging. Traditional consumer behaviour studies rely primarily on surveys and unimodal text analysis, which fail to capture the rich multimodal nature of social media discourse. In this paper, we propose CultureFuse, a novel multimodal sentiment analysis framework that leverages both textual and visual modalities to analyse Indian consumers’ perceptions of Korean products. Our framework employs multilingual BERT for textual feature extraction and category conditioned visual encoding, combined through a Cross-Modal Attention Fusion (CMAF) mechanism with adaptive gating. We construct Hallyu India-MM, a curated multimodal dataset of 133 consumer review samples spanning Korean beauty, food, fashion, and entertainment products popular among Indian consumers. Through rigorous 5-fold cross-validation, our multimodal CultureFuse approach achieves 90.9% accuracy in sentiment classification, substantially outperforming text only BERT (72.1%), TF-IDF+SVM (60.1%), and late fusion (78.1%) baselines. The learned adaptive gating mechanism assigns 62.1% weight to textual features and 37.9% to visual features on average, with category-dependent variation. Our per-category analysis reveals that visual features are particularly discriminative for beauty and fashion categories (100% accuracy), while food products remain more challenging across all modalities (73.3%). This work bridges multimodal AI and consumer behaviour research, demonstrating that cross-modal attention fusion substantially improves the understanding of cross-cultural consumption patterns.
MSc Scholar, Computer Science, Woosong University, Daejeon, KR
No. of Downloads: 4 | No. of Views: 16
