Multimodal Reinforcement Learning for Dynamic Cross-Media Advertising Budget Allocation Via DDPG and PPO Combined with Meta-Learning and Adversarial Training

Wei Gao, Xiaoxin Meng, Yuyang Zhang

Abstract


This paper proposes a reinforcement learning framework to address the instability in cross-media advertising budgets, which often fail to adapt to dynamic user behavior and bidding fluctuations. The framework combines multimodal feature fusion—incorporating ad images, copy, and user behavior—and adaptive policy optimization using Deep Deterministic Policy Gradient (DDPG) and Proximal Policy Optimization (PPO). To enhance robustness across different platforms, adversarial training and meta-learning are used to adapt to shifting user feedback distributions. The policy reallocates budgets in real time, optimized through small-scale A/B testing. Experiments with over 100 million ad impressions and 5 million users show a 50% increase in ROI on social platforms, a 22.6% decrease in cost per acquisition, and near-full budget utilization on e-commerce platforms. These results highlight the effectiveness of multimodal reinforcement learning in improving cross-media resource allocation and advertising outcomes.


Full Text:

PDF

References


Ref

Lim W M, Gupta S, Aggarwal A, et al. How do digital natives perceive and react toward online advertising? Implications for SMEs[J]. Journal of Strategic Marketing, 2024, 32(8): 1071-1105.

Lina L F, Ahluwalia L. Customers’ impulse buying in social commerce: The role of flow experience in personalized advertising[J]. Jurnal Manajemen Maranatha, 2021, 21(1): 1-8.

Hair Jr J F, Sarstedt M. Data, measurement, and causal inferences in machine learning: opportunities and challenges for marketing[J]. Journal of Marketing Theory and Practice, 2021, 29(1): 65-77.

Alshaketheep K, Mansour A M, Al-Ma’aitah I M, et al. Leveraging AI predictive analytics for marketing strategy: The mediating role of management awareness[J]. Journal of System and Management Sciences, 2024, 14(2): 71-89.

Luangrath A W, Peck J, Hedgcock W, et al. Observing product touch: The vicarious haptic effect in digital marketing and virtual reality[J]. Journal of Marketing Research, 2022, 59(2): 306-326.

Rosário A, Raimundo R. Consumer marketing strategy and E-commerce in the last decade: a literature review[J]. Journal of theoretical and applied electronic commerce research, 2021, 16(7): 3003-3024.

Yones P C P, Muthaiyah S. eWOM via the TikTok application and its influence on the purchase intention of somethinc products[J]. Asia Pacific Management Review, 2023, 28(2): 174-184.

Hayes J L, Brinson N H, Bott G J, et al. The influence of consumer–brand relationship on the personalized advertising privacy calculus in social media[J]. Journal of Interactive Marketing, 2021, 55(1): 16-30.

Varnukhov A Y, Nazarov D M. Product matching in digital marketplaces: Multimodal model based on the transformer architecture[J]. Бизнес-информатика, 2025, 19(2 (eng)): 7-24.

Yang Q, Farseev A, Ongpin M, et al. Fusing Predictive and Large Language Models for Actionable Recommendations In Creative Marketing[J]. ACM Transactions on Information Systems, 2025, 43(5): 1-31.

Zeller M, Behley J, Heidingsfeld M, et al. Gaussian radar transformer for semantic segmentation in noisy radar data[J]. IEEE Robotics and Automation Letters, 2022, 8(1): 344-351.

Zhu Y, Wang X, Li Q, et al. Botspot++: A hierarchical deep ensemble model for bots install fraud detection in mobile advertising[J]. ACM Transactions on Information Systems (TOIS), 2021, 40(3): 1-28.

Wang S, Hu C, Jia G. Deep Learning-Based Saliency Assessment Model for Product Placement in Video Advertisements[J]. Journal of Advanced Computing Systems, 2024, 4(5): 27-41.

Ullah F, Ullah S, Srivastava G, et al. IDS-INT: Intrusion detection system using transformer-based transfer learning for imbalanced network traffic[J]. Digital Communications and Networks, 2024, 10(1): 190-204.

Liu X, Qi F. Research on advertising content recognition based on convolutional neural network and recurrent neural network[J]. International Journal of Computational Science and Engineering, 2021, 24(4): 398-404.

Zhang K, Xing S, Chen Y. Research on Cross-Platform Digital Advertising User Behavior Analysis Framework Based on Federated Learning[J]. Artificial Intelligence and Machine Learning Review, 2024, 5(3): 41-54.

Zhang Q, Chang W, Li C, et al. Attention-based spatial-temporal graph transformer for traffic flow forecasting[J]. Neural Computing and Applications, 2023, 35(29): 21827-21839.

Li Z, Huang W, Wang L, et al. CNN and Transformer interaction network for hyperspectral image classification[J]. International Journal of Remote Sensing, 2023, 44(18): 5548-5573.

Wang X, Qiao Y, Xiong J, et al. Advanced network intrusion detection with tabtransformer[J]. Journal of Theory and Practice of Engineering Science, 2024, 4(03): 191-198.

Kalidindi A, Arrama M B. A tabtransformer based model for detecting botnet-attacks on internet of things using deep learning[J]. Journal of Theoretical and Applied Information Technology, 2023, 101(13): 5206-5218.

Häglund E, Björklund J. AI-driven contextual advertising: Toward relevant messaging without personal data[J]. Journal of Current Issues & Research in Advertising, 2024, 45(3): 301-319.

Hu F, Pei W, Wu Y, et al. Star-transformer based semantic enhanced union relation extraction[J]. The Journal of Supercomputing, 2025, 81(10): 1-24.

Gruetzemacher R, Paradice D. Deep transfer learning & beyond: Transformer language models in information systems research[J]. ACM Computing Surveys (CSUR), 2022, 54(10s): 1-35.

Xie J, Chen Z. Hierarchical transformer with spatio-temporal context aggregation for next point-of-interest recommendation[J]. ACM Transactions on Information Systems, 2023, 42(2): 1-30.

Zhou F, Xu X, Trajcevski G, et al. A survey of information cascade analysis: Models, predictions, and recent advances[J]. ACM Computing Surveys (CSUR), 2021, 54(2): 1-36.

Lorenz-Spreen P, Oswald L, Lewandowsky S, et al. A systematic review of worldwide causal and correlational evidence on digital media and democracy[J]. Nature human behaviour, 2023, 7(1): 74-101.

Ihzaturrahma N, Kusumawati N. Influence of integrated marketing communication to brand awareness and brand image toward purchase intention of local fashion product[J]. International Journal of Entrepreneurship and Management Practices, 2021, 4(15): 23-41.

Afful-Dadzie E, Afful-Dadzie A, Egala S B. Social media in health communication: A literature review of information quality[J]. Health Information Management Journal, 2023, 52(1): 3-17.

Andriani R, Santika I D A D M. Verbal and Non-Verbal Signs in Facial Wash Advertisements: A Semiotic Analysis[J]. Yavana Bhasha: Journal of English Language Education, 2021, 4(2): 24-29.

Hu H, Lin Z, Hu Q, et al. Multi-source information fusion based DLaaS for traffic flow prediction[J]. IEEE Transactions on Computers, 2023, 73(4): 994-1003.

Liu J, Li T, Ji S, et al. Urban flow pattern mining based on multi-source heterogeneous data fusion and knowledge graph embedding[J]. IEEE Transactions on Knowledge and Data Engineering, 2021, 35(2): 2133-2146.

Li M, Wang F, Jia X, et al. Multi-source data fusion for economic data analysis[J]. Neural Computing and Applications, 2021, 33(10): 4729-4739.

Yang F, Zhang P. MSIF: Multi-source information fusion based on information sets[J]. Journal of Intelligent & Fuzzy Systems, 2023, 44(3): 4103-4112.

Tong G, Li Z, Peng H, et al. Multi-source features fusion single stage 3D object detection with transformer[J]. IEEE robotics and automation letters, 2023, 8(4): 2062-2069.




DOI: https://doi.org/10.31449/inf.v49i30.11440

Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.