2026

DiFlowDubber: Discrete Flow Matching for Automated Video Dubbing via Cross-Modal Alignment and SynchronizationCVPR
DiFlowDubber: Discrete Flow Matching for Automated Video Dubbing via Cross-Modal Alignment and Synchronization

Ngoc-Son Nguyen, Thanh V. T. Tran, Jeongsoo Choi, Hieu-Nghia Huynh-Nguyen, Truong-Son Hy, Van Nguyen

Findings of the Conference on Computer Vision and Pattern Recognition (Findings CVPR) 2026

DiFlowDubber: Discrete Flow Matching for Automated Video Dubbing via Cross-Modal Alignment and SynchronizationCVPR
DiFlowDubber: Discrete Flow Matching for Automated Video Dubbing via Cross-Modal Alignment and Synchronization

Ngoc-Son Nguyen, Thanh V. T. Tran, Jeongsoo Choi, Hieu-Nghia Huynh-Nguyen, Truong-Son Hy, Van Nguyen

Findings of the Conference on Computer Vision and Pattern Recognition (Findings CVPR) 2026

2025

DuFal: Dual-Frequency-Aware Learning for High-Fidelity Extremely Sparse-view CBCT ReconstructionTMLR
DuFal: Dual-Frequency-Aware Learning for High-Fidelity Extremely Sparse-view CBCT Reconstruction

Cuong Tran Van, Trong-Thang Pham, Ngoc-Son Nguyen, Duy Minh Ho Nguyen, Ngan Le

Transactions on Machine Learning Research (TMLR) 2025 J2C Certification

DuFal: Dual-Frequency-Aware Learning for High-Fidelity Extremely Sparse-view CBCT ReconstructionTMLR
DuFal: Dual-Frequency-Aware Learning for High-Fidelity Extremely Sparse-view CBCT Reconstruction

Cuong Tran Van, Trong-Thang Pham, Ngoc-Son Nguyen, Duy Minh Ho Nguyen, Ngan Le

Transactions on Machine Learning Research (TMLR) 2025 J2C Certification

Flamed-TTS: Flow Matching Attention-Free Models for Efficient Generating and Dynamic Pacing Zero-shot Text-to-SpeechUnder Review
Flamed-TTS: Flow Matching Attention-Free Models for Efficient Generating and Dynamic Pacing Zero-shot Text-to-Speech

Hieu-Nghia Huynh-Nguyen, Huynh Nguyen Dang, Ngoc-Son Nguyen, Van Nguyen

Under Review 2025

Flamed-TTS: Flow Matching Attention-Free Models for Efficient Generating and Dynamic Pacing Zero-shot Text-to-SpeechUnder Review
Flamed-TTS: Flow Matching Attention-Free Models for Efficient Generating and Dynamic Pacing Zero-shot Text-to-Speech

Hieu-Nghia Huynh-Nguyen, Huynh Nguyen Dang, Ngoc-Son Nguyen, Van Nguyen

Under Review 2025

DiFlow-TTS: Compact and Low-Latency Zero-Shot Text-to-Speech with Factorized Discrete Flow MatchingUnder Review
DiFlow-TTS: Compact and Low-Latency Zero-Shot Text-to-Speech with Factorized Discrete Flow Matching

Ngoc-Son Nguyen, Thanh V. T. Tran, Hieu-Nghia Huynh-Nguyen, Truong-Son Hy, Van Nguyen

Under Review 2025

DiFlow-TTS: Compact and Low-Latency Zero-Shot Text-to-Speech with Factorized Discrete Flow MatchingUnder Review
DiFlow-TTS: Compact and Low-Latency Zero-Shot Text-to-Speech with Factorized Discrete Flow Matching

Ngoc-Son Nguyen, Thanh V. T. Tran, Hieu-Nghia Huynh-Nguyen, Truong-Son Hy, Van Nguyen

Under Review 2025

Precise Video-to-Audio Generation with Cross-Modal Alignment in Latent SpaceUnder Review
Precise Video-to-Audio Generation with Cross-Modal Alignment in Latent Space

Thanh V. T. Tran, Ngoc-Son Nguyen, Luong Tran, Long-Khanh Pham, Paarth Neekhara, Shehzeen Samarah Hussain, Van Nguyen

Under Review 2025

Precise Video-to-Audio Generation with Cross-Modal Alignment in Latent SpaceUnder Review
Precise Video-to-Audio Generation with Cross-Modal Alignment in Latent Space

Thanh V. T. Tran, Ngoc-Son Nguyen, Luong Tran, Long-Khanh Pham, Paarth Neekhara, Shehzeen Samarah Hussain, Van Nguyen

Under Review 2025

CT-ScanGaze: A Dataset and Baselines for 3D Volumetric Scanpath ModelingICCV
CT-ScanGaze: A Dataset and Baselines for 3D Volumetric Scanpath Modeling

Trong Thang Pham, Akash Awasthi, Saba Khan, Esteban Duran Marti, Tien-Phat Nguyen, Khoa Vo, Minh Tran, Ngoc-Son Nguyen, Cuong Tran, Yuki Ikebe, Anh Totti Nguyen, Anh Nguyen, Zhigang Deng, Carol C Wu, Hien Nguyen, Ngan Le

Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) 2025 Highlight

CT-ScanGaze: A Dataset and Baselines for 3D Volumetric Scanpath ModelingICCV
CT-ScanGaze: A Dataset and Baselines for 3D Volumetric Scanpath Modeling

Trong Thang Pham, Akash Awasthi, Saba Khan, Esteban Duran Marti, Tien-Phat Nguyen, Khoa Vo, Minh Tran, Ngoc-Son Nguyen, Cuong Tran, Yuki Ikebe, Anh Totti Nguyen, Anh Nguyen, Zhigang Deng, Carol C Wu, Hien Nguyen, Ngan Le

Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) 2025 Highlight

OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow MatchingACL
OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching

Nghia Huynh Nguyen Hieu, Ngoc-Son Nguyen, Huynh Nguyen Dang, Thieu Vo, Truong-Son Hy, Van Nguyen

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL) 2025

OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow MatchingACL
OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching

Nghia Huynh Nguyen Hieu, Ngoc-Son Nguyen, Huynh Nguyen Dang, Thieu Vo, Truong-Son Hy, Van Nguyen

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL) 2025

2024

LiteGPT: Large Vision-Language Model for Joint Chest X-ray Localization and Classification TaskarXiv
LiteGPT: Large Vision-Language Model for Joint Chest X-ray Localization and Classification Task

Khai Le-Duc*, Ryan Zhang*, Ngoc-Son Nguyen*, Tan-Hanh Pham, Anh Dao, Ba Hung Ngo, Anh Totti Nguyen, Truong-Son Hy (* equal contribution)

Preprint 2024

LiteGPT: Large Vision-Language Model for Joint Chest X-ray Localization and Classification TaskarXiv
LiteGPT: Large Vision-Language Model for Joint Chest X-ray Localization and Classification Task

Khai Le-Duc*, Ryan Zhang*, Ngoc-Son Nguyen*, Tan-Hanh Pham, Anh Dao, Ba Hung Ngo, Anh Totti Nguyen, Truong-Son Hy (* equal contribution)

Preprint 2024

Advancing Vietnamese Visual Question Answering with Transformer and Convolutional IntegrationElsevier Journal
Advancing Vietnamese Visual Question Answering with Transformer and Convolutional Integration

Ngoc-Son Nguyen, Van Son Nguyen, Tung Le

Journal Computers and Electrical Engineering 2024 Q1, IF = 4.9

Advancing Vietnamese Visual Question Answering with Transformer and Convolutional IntegrationElsevier Journal
Advancing Vietnamese Visual Question Answering with Transformer and Convolutional Integration

Ngoc-Son Nguyen, Van Son Nguyen, Tung Le

Journal Computers and Electrical Engineering 2024 Q1, IF = 4.9