2026

DiFlowDubber: Discrete Flow Matching for Automated Video Dubbing via Cross-Modal Alignment and SynchronizationCVPR
DiFlowDubber: Discrete Flow Matching for Automated Video Dubbing via Cross-Modal Alignment and Synchronization

Ngoc-Son Nguyen, Thanh V. T. Tran, Jeongsoo Choi, Hieu-Nghia Huynh-Nguyen, Truong-Son Hy, Van Nguyen

Proceedings of Findings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (Findings of CVPR) 2026

Also accepted at CVPR 2026 Workshop: "Sight and Sound"

DiFlowDubber: Discrete Flow Matching for Automated Video Dubbing via Cross-Modal Alignment and SynchronizationCVPR
DiFlowDubber: Discrete Flow Matching for Automated Video Dubbing via Cross-Modal Alignment and Synchronization

Ngoc-Son Nguyen, Thanh V. T. Tran, Jeongsoo Choi, Hieu-Nghia Huynh-Nguyen, Truong-Son Hy, Van Nguyen

Proceedings of Findings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (Findings of CVPR) 2026

Also accepted at CVPR 2026 Workshop: "Sight and Sound"

2025

DuFal: Dual-Frequency-Aware Learning for High-Fidelity Extremely Sparse-view CBCT ReconstructionTMLR
DuFal: Dual-Frequency-Aware Learning for High-Fidelity Extremely Sparse-view CBCT Reconstruction

Cuong Tran Van, Trong-Thang Pham, Ngoc-Son Nguyen, Duy Minh Ho Nguyen, Ngan Le

Transactions on Machine Learning Research (TMLR) 2025 J2C Certification

DuFal: Dual-Frequency-Aware Learning for High-Fidelity Extremely Sparse-view CBCT ReconstructionTMLR
DuFal: Dual-Frequency-Aware Learning for High-Fidelity Extremely Sparse-view CBCT Reconstruction

Cuong Tran Van, Trong-Thang Pham, Ngoc-Son Nguyen, Duy Minh Ho Nguyen, Ngan Le

Transactions on Machine Learning Research (TMLR) 2025 J2C Certification

CT-ScanGaze: A Dataset and Baselines for 3D Volumetric Scanpath ModelingICCV
CT-ScanGaze: A Dataset and Baselines for 3D Volumetric Scanpath Modeling

Trong Thang Pham, Akash Awasthi, Saba Khan, Esteban Duran Marti, Tien-Phat Nguyen, Khoa Vo, Minh Tran, Ngoc-Son Nguyen, Cuong Tran, Yuki Ikebe, Anh Totti Nguyen, Anh Nguyen, Zhigang Deng, Carol C Wu, Hien Nguyen, Ngan Le

Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) 2025 Highlight

CT-ScanGaze: A Dataset and Baselines for 3D Volumetric Scanpath ModelingICCV
CT-ScanGaze: A Dataset and Baselines for 3D Volumetric Scanpath Modeling

Trong Thang Pham, Akash Awasthi, Saba Khan, Esteban Duran Marti, Tien-Phat Nguyen, Khoa Vo, Minh Tran, Ngoc-Son Nguyen, Cuong Tran, Yuki Ikebe, Anh Totti Nguyen, Anh Nguyen, Zhigang Deng, Carol C Wu, Hien Nguyen, Ngan Le

Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) 2025 Highlight

OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow MatchingACL
OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching

Nghia Huynh Nguyen Hieu, Ngoc-Son Nguyen, Huynh Nguyen Dang, Thieu Vo, Truong-Son Hy, Van Nguyen

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL) 2025

OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow MatchingACL
OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching

Nghia Huynh Nguyen Hieu, Ngoc-Son Nguyen, Huynh Nguyen Dang, Thieu Vo, Truong-Son Hy, Van Nguyen

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL) 2025

2024

Advancing Vietnamese Visual Question Answering with Transformer and Convolutional IntegrationElsevier Journal
Advancing Vietnamese Visual Question Answering with Transformer and Convolutional Integration

Ngoc-Son Nguyen, Van Son Nguyen, Tung Le

Journal Computers and Electrical Engineering 2024 Q1, IF = 4.9

Advancing Vietnamese Visual Question Answering with Transformer and Convolutional IntegrationElsevier Journal
Advancing Vietnamese Visual Question Answering with Transformer and Convolutional Integration

Ngoc-Son Nguyen, Van Son Nguyen, Tung Le

Journal Computers and Electrical Engineering 2024 Q1, IF = 4.9

Preprints & Under Review

Flamed-TTS: Flow Matching Attention-Free Models for Efficient Generating and Dynamic Pacing Zero-shot Text-to-SpeechUnder Review
Flamed-TTS: Flow Matching Attention-Free Models for Efficient Generating and Dynamic Pacing Zero-shot Text-to-Speech

Hieu-Nghia Huynh-Nguyen, Huynh Nguyen Dang, Ngoc-Son Nguyen, Van Nguyen

Under Review 2025

Flamed-TTS: Flow Matching Attention-Free Models for Efficient Generating and Dynamic Pacing Zero-shot Text-to-SpeechUnder Review
Flamed-TTS: Flow Matching Attention-Free Models for Efficient Generating and Dynamic Pacing Zero-shot Text-to-Speech

Hieu-Nghia Huynh-Nguyen, Huynh Nguyen Dang, Ngoc-Son Nguyen, Van Nguyen

Under Review 2025

DiFlow-TTS: Compact and Low-Latency Zero-Shot Text-to-Speech with Factorized Discrete Flow MatchingUnder Review
DiFlow-TTS: Compact and Low-Latency Zero-Shot Text-to-Speech with Factorized Discrete Flow Matching

Ngoc-Son Nguyen, Thanh V. T. Tran, Hieu-Nghia Huynh-Nguyen, Truong-Son Hy, Van Nguyen

Under Review 2025

DiFlow-TTS: Compact and Low-Latency Zero-Shot Text-to-Speech with Factorized Discrete Flow MatchingUnder Review
DiFlow-TTS: Compact and Low-Latency Zero-Shot Text-to-Speech with Factorized Discrete Flow Matching

Ngoc-Son Nguyen, Thanh V. T. Tran, Hieu-Nghia Huynh-Nguyen, Truong-Son Hy, Van Nguyen

Under Review 2025

Precise Video-to-Audio Generation with Cross-Modal Alignment in Latent SpaceUnder Review
Precise Video-to-Audio Generation with Cross-Modal Alignment in Latent Space

Thanh V. T. Tran, Ngoc-Son Nguyen, Luong Tran, Long-Khanh Pham, Paarth Neekhara, Shehzeen Samarah Hussain, Van Nguyen

Under Review 2025

Precise Video-to-Audio Generation with Cross-Modal Alignment in Latent SpaceUnder Review
Precise Video-to-Audio Generation with Cross-Modal Alignment in Latent Space

Thanh V. T. Tran, Ngoc-Son Nguyen, Luong Tran, Long-Khanh Pham, Paarth Neekhara, Shehzeen Samarah Hussain, Van Nguyen

Under Review 2025

LiteGPT: Large Vision-Language Model for Joint Chest X-ray Localization and Classification TaskarXiv
LiteGPT: Large Vision-Language Model for Joint Chest X-ray Localization and Classification Task

Khai Le-Duc*, Ryan Zhang*, Ngoc-Son Nguyen*, Tan-Hanh Pham, Anh Dao, Ba Hung Ngo, Anh Totti Nguyen, Truong-Son Hy (* equal contribution)

Preprint 2024

LiteGPT: Large Vision-Language Model for Joint Chest X-ray Localization and Classification TaskarXiv
LiteGPT: Large Vision-Language Model for Joint Chest X-ray Localization and Classification Task

Khai Le-Duc*, Ryan Zhang*, Ngoc-Son Nguyen*, Tan-Hanh Pham, Anh Dao, Ba Hung Ngo, Anh Totti Nguyen, Truong-Son Hy (* equal contribution)

Preprint 2024