📝 Publications

  • ✉️ means Corresponding Author; * means Equal Contribution

🤖 LLMs & MLLMs

  1. arXiv 2025 Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction, Nex-AGI Team
  2. ICASSP 2025 DocVideoQA: Towards Comprehensive Understanding of Document-Centric Videos through Question Answering, Haochen Wang, Kai Hu, Liangcai Gao$^✉️$
  3. arXiv 2025 (Cutting-edge Project) DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning, DeepSeek AI
  4. arXiv 2024 (Cutting-edge Project) DeepSeek-V3 Technical Report, DeepSeek AI
  5. arXiv 2024 (Cutting-edge Project) DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding, Zhiyu Wu*, Xiaokang Chen*, Zizheng Pan*, Xingchao Liu*, Wen Liu*, Damai Dai, Huazuo Gao, Yiyang Ma, Chengyue Wu, Bingxuan Wang, Zhenda Xie, Yu Wu, Kai Hu, Jiawei Wang, Yaofeng Sun, Yukun Li, Yishi Piao, Kang Guan, Aixin Liu, Xin Xie, Yuxiang You, Kai Dong, Xingkai Yu, Haowei Zhang, Liang Zhao, Yisong Wang, Chong Ruan$^✉️$
  6. ICDAR 2024 DocTabQA: Answering Questions from Long Documents Using Tables, Haochen Wang, Kai Hu, Haoyu Dong, Liangcai Gao$^✉️$

📄 Document Intelligence

  1. Pattern Recognition 2025 (SCI Q1 Journal) UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure Analysis, Jiawei Wang$^✉️$, Kai Hu, Qiang Huo
  2. ICDAR 2024 (Oral) DLAFormer: An End-to-End Transformer For Document Layout Analysis, Jiawei Wang*$^✉️$, Kai Hu*$^✉️$, Qiang Huo
  3. ICDAR 2024 (Oral) Dynamic Relation Transformer for Contextual Text Block Detection, Jiawei Wang*$^✉️$, Shunchi Zhang*$^✉️$, Kai Hu*$^✉️$, Chixiang Ma, Zhuoyao Zhong, Lei Sun, Qiang Huo
  4. ICDAR 2024 (Oral) UniVIE: A Unified Label Space Approach to Visual Information Extraction from Form-Like Documents, Kai Hu$^✉️$, Jiawei Wang, Weihong Lin, Zhuoyao Zhong, Lei Sun, Qiang Huo
  5. Pattern Recognition 2024 (SCI Q1 Journal) Mathematical formula detection in document images: A new dataset and a new approach, Kai Hu$^✉️$, Zhuoyao Zhong, Lei Sun, Qiang Huo
  6. Pattern Recognition 2024 (SCI Q1 Journal) Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure Analysis, Jiawei Wang$^✉️$, Kai Hu, Zhuoyao Zhong, Lei Sun, Qiang Huo
  7. ICDAR 2023 A Hybrid Approach to Document Layout Analysis for Heterogeneous Document Images, Zhuoyao Zhong$^✉️$, Jiawei Wang, Haiqing Sun, Kai Hu, Erhan Zhang, Lei Sun, Qiang Huo
  8. AAAI 2023 (Oral) A Question-Answering Approach to Key Value Pair Extraction from Form-like Document Images, Kai Hu*, Zhuoyuan Wu*, Zhuoyao Zhong$^✉️$, Weihong Lin, Lei Sun, Qiang Huo
  9. ICDAR 2021 (Best Paper Award) ViBERTgrid: A Jointly Trained Multi-modal 2D Document Representation for Key Information Extraction from Documents, Weihong Lin*$^✉️$, Qifang Gao*, Lei Sun, Zhuoyao Zhong, Kai Hu, Qin Ren, Qiang Huo

📚 Academic Services

  • ICDAR Reviewer (2023, 2024, 2025, 2026)
  • Pattern Recognition Reviewer (2025)
  • AAAI Reviewer (2025)