Nguyen, Khang. “Feature Extraction Semilearning and Augmented Representation for Image Captioning in Crowd Scenes.” VNUHCM Journal of Science and Technology Development 26, no. 4 (December 31, 2023): 3128–3138. Accessed July 16, 2025. http://stdj.scienceandtechnology.com.vn/index.php/stdj/article/view/4028.