Nguyen, Khang. “Feature Extraction Semilearning and Augmented Representation for Image Captioning in Crowd Scenes.” Science and Technology Development Journal 26, no. 4 (December 31, 2023): 3128–3138. Accessed May 16, 2024. http://stdj.scienceandtechnology.com.vn/index.php/stdj/article/view/4028.