Publications -

Interspeech

DiFlow-TTS: Compact and Low-Latency Zero-Shot Text-to-Speech with Discrete Flow Matching

Ngoc-Son Nguyen, Thanh V. T. Tran, Hieu-Nghia Huynh-Nguyen, Truong-Son Hy, Van Nguyen

Interspeech (Long Paper track) 2026

CVPR

DiFlowDubber: Discrete Flow Matching for Automated Video Dubbing via Cross-Modal Alignment and Synchronization

Ngoc-Son Nguyen, Thanh V. T. Tran, Jeongsoo Choi, Hieu-Nghia Huynh-Nguyen, Truong-Son Hy, Van Nguyen

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Findings 2026

Also accepted at CVPR 2026 Workshop: "Sight and Sound"

CVPR

DiFlowDubber: Discrete Flow Matching for Automated Video Dubbing via Cross-Modal Alignment and Synchronization

Ngoc-Son Nguyen, Thanh V. T. Tran, Jeongsoo Choi, Hieu-Nghia Huynh-Nguyen, Truong-Son Hy, Van Nguyen

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Findings 2026

Also accepted at CVPR 2026 Workshop: "Sight and Sound"

TMLR

DuFal: Dual-Frequency-Aware Learning for High-Fidelity Extremely Sparse-view CBCT Reconstruction

Cuong Tran Van, Trong-Thang Pham, Ngoc-Son Nguyen, Duy Minh Ho Nguyen, Ngan Le

Transactions on Machine Learning Research (TMLR) 2025 J2C Certification

[Paper] [Code] [Video]

TMLR

DuFal: Dual-Frequency-Aware Learning for High-Fidelity Extremely Sparse-view CBCT Reconstruction

Cuong Tran Van, Trong-Thang Pham, Ngoc-Son Nguyen, Duy Minh Ho Nguyen, Ngan Le

Transactions on Machine Learning Research (TMLR) 2025 J2C Certification

[Paper] [Code] [Video]

ICCV

CT-ScanGaze: A Dataset and Baselines for 3D Volumetric Scanpath Modeling

Trong Thang Pham, Akash Awasthi, Saba Khan, Esteban Duran Marti, Tien-Phat Nguyen, Khoa Vo, Minh Tran, Ngoc-Son Nguyen, Cuong Tran, Yuki Ikebe, Anh Totti Nguyen, Anh Nguyen, Zhigang Deng, Carol C Wu, Hien Nguyen, Ngan Le

Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) 2025 Highlight

ICCV

CT-ScanGaze: A Dataset and Baselines for 3D Volumetric Scanpath Modeling

Trong Thang Pham, Akash Awasthi, Saba Khan, Esteban Duran Marti, Tien-Phat Nguyen, Khoa Vo, Minh Tran, Ngoc-Son Nguyen, Cuong Tran, Yuki Ikebe, Anh Totti Nguyen, Anh Nguyen, Zhigang Deng, Carol C Wu, Hien Nguyen, Ngan Le

Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) 2025 Highlight

ACL

OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching

Nghia Huynh Nguyen Hieu, Ngoc-Son Nguyen, Huynh Nguyen Dang, Thieu Vo, Truong-Son Hy, Van Nguyen

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL) 2025

ACL

OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching

Nghia Huynh Nguyen Hieu, Ngoc-Son Nguyen, Huynh Nguyen Dang, Thieu Vo, Truong-Son Hy, Van Nguyen

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL) 2025

Elsevier Journal

Advancing Vietnamese Visual Question Answering with Transformer and Convolutional Integration

Ngoc-Son Nguyen, Van Son Nguyen, Tung Le

Journal Computers and Electrical Engineering 2024 Q1, IF = 4.9

Elsevier Journal

Advancing Vietnamese Visual Question Answering with Transformer and Convolutional Integration

Ngoc-Son Nguyen, Van Son Nguyen, Tung Le

Journal Computers and Electrical Engineering 2024 Q1, IF = 4.9

Under Review

Flamed-TTS: Flow Matching Attention-Free Models for Efficient Generating and Dynamic Pacing Zero-shot Text-to-Speech

Hieu-Nghia Huynh-Nguyen, Huynh Nguyen Dang, Ngoc-Son Nguyen, Van Nguyen

Under Review 2025

Under Review

Flamed-TTS: Flow Matching Attention-Free Models for Efficient Generating and Dynamic Pacing Zero-shot Text-to-Speech

Hieu-Nghia Huynh-Nguyen, Huynh Nguyen Dang, Ngoc-Son Nguyen, Van Nguyen

Under Review 2025

arXiv

LiteGPT: Large Vision-Language Model for Joint Chest X-ray Localization and Classification Task

Khai Le-Duc*, Ryan Zhang*, Ngoc-Son Nguyen*, Tan-Hanh Pham, Anh Dao, Ba Hung Ngo, Anh Totti Nguyen, Truong-Son Hy (* equal contribution)

Preprint 2024

arXiv

LiteGPT: Large Vision-Language Model for Joint Chest X-ray Localization and Classification Task

Khai Le-Duc*, Ryan Zhang*, Ngoc-Son Nguyen*, Tan-Hanh Pham, Anh Dao, Ba Hung Ngo, Anh Totti Nguyen, Truong-Son Hy (* equal contribution)

Preprint 2024