Tan Wang      

Currently, Tan Wang is a first-year Ph.D. student at MreaL Lab of Nanyang Technological University (NTU), supervised by Prof. Zhang Hanwang. His research interests include but not limit to Visual Reasoning, Causal Inference and Vision & Language.

Before that, He obtained the honoured bachelor degree in Department of EIE from University of Electronic Science and Technology of China (UESTC) in 2020. He was a research assistant at Center for Future Media , supervised by Prof. Xing Xu and Prof. Yang Yang. He also had a close research collaboration with Prof. Alan Hanjalic at TU Delft.

Email  /  CV  /  Github

News

  • [2020/04]   2 Journal papers are accepted by TNNLS 2020.
  • [2020/02]   1 paper with Prof. Hanwang Zhang is accepted by CVPR 2020.
  • [2019/07]   1 paper with Prof. Alan Hanjalic is accepted by ACM MM 2019 Oral.

  • Education

    University of Electronic Science and Technology of China (UESTC), China
    Honours Degree in Electronic Information Engineering      • Sep. 2016 - Jun. 2020
    GPA: 92.98/100,   Ranking: 2/284 (Overall) or 1/415 (first 2 years)
    Supervisors: Prof. Xing Xu and Prof. Yang Yang.    Collaborated with Prof. Alan Hanjalic

    Chiba University, Japan
    Exchange Program        • Aug. 2017
    Sakura Science Club Scholarship awardee. Funded by Japan Science and Technology Agency (JST).

    Nanyang Technological University (NTU), Singapore
    First-year Ph.D. in MreaL Lab, School of Computer Science and Engineering      • Aug. 2020 - Present
    Supervisor: Prof. Zhang Hanwang

    Research Experience

    Center For Future Media, UESTC
    Research Assistant       • Mar. 2018 - Jun. 2020
    Advisors:   Prof. Xing Xu and Prof. Yang Yang.   Collaborated with Prof. Alan Hanjalic

  • Proposed several novel methods for cross-modal retrieval which achieves the state-of-the-art performance on image-text matching.
  • Combined the GCN with Visual Question Generation Task and further boost the performance on an unexplored challenging task zero-shot VQA.
  • Complete 3 works and make the submission.

  • MReal Lab, NTU
    Research Assistant       • July. 2019 - Aug. 2020
    Advisors:   Prof. Hanwang Zhang

    Publication & Manuscript
    Visual Commonsense R-CNN
    Tan Wang, Jianqiang Huang, Hanwang Zhang, Qianru Sun
    IEEE International Conference on Computer Vision and Pattern Recognition, CVPR 2020, [Paperlink], [Code], [Zhihu]
    Area: Visual and Language, Causal Reasoning, Self-supervised Learning

    In this paper, we present a novel un-/self-supervised feature representation learning method, Visual Commonsense Region-based Convolutional Neural Network (VC R-CNN), to serve as an improved visual region encoder for Vision & Language high-level tasks.

    Visual Commonsense Representation Learning via Causal Inference (Abstact Version of VC R-CNN)
    Tan Wang, Jianqiang Huang, Hanwang Zhang, Qianru Sun
    IEEE International Conference on Computer Vision and Pattern Recognition MVM Workshop, CVPRW 2020, [Paperlink], [Code], [Zhihu]
    (Oral Presentation)
    Area: Visual and Language, Causal Reasoning, Self-supervised Learning
    Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking
    Tan Wang, Xing Xu, Yang Yang, Alan Hanjalic, Heng Tao Shen
    ACM International Conference on Multimedia, MM 2019, Nice, France, October 2019, [Paperlink], [Code]
    (Oral Presentation, 4.96% acceptance rate)
    Area: Visual and Language, Image-text matching

    In this paper, we propose a novel framework for image-text matching that achieves remarkable matching performance with acceptable model complexity and much less time consuming.

    Cross-Modal Attention with Semantic Consistence for Image-Text Matching
    Xing Xu*, Tan Wang*, Yang Yang, Lin Zuo, Fumin Shen, Heng Tao Shen
    IEEE Transactions on Neural Networks and learning systems, TNNLS 2020
    Area: Visual and Language, Image-text matching

    In this paper, we propose a novel hybrid matching approach named Cross-modal Attention with Semantic Consistence (CASC) for image-text matching, which is a joint framework that performs cross-modal attention for local alignment and multi-label prediction for global semantic consistence.

    Cross-Modal Attention with Semantic Consistence for Image-Text Matching
    Xing Xu*, Tan Wang*, Yang Yang, Alan Hanjalic, Heng Tao Shen
    IEEE Transactions on Neural Networks and learning systems, TNNLS 2020
    Area: Visual and Language, Image-text matching

    We propose an innovative answer-centric approach to focus on the relevant image regions only to reduce the complexity on VQG task.

    Honors & Scholarships

  • Outstanding Graduates of Sichuan Province (Top 1% student),  2020
  • Outstanding Undergraduate Thesis Award (Top 2% student),  2020
  • National Scholarship (Top 2% student),  2018
  • National Scholarship (Top 2% student),  2017
  • Tang Lixin Sponsored Elite Scholarship (Only 60 awardees pre year in UESTC),  2017
  • Best Freshman Award (Top 1 student per year in Department),  2016
  • Honor Student Scholarship (Top 10 students per year in Department),  2018
  • Outstanding Student Scholarship (Top 10% student),  2017~2019

  • Leadership Experience
    Lecture Group of EE Department
    Founder & President       • Oct. 2017 - Sep. 2018

  • Organized academic forum, sharing sessions, Q&A meetings more than 30 times, serving over 1000 students on studying and future planing.
  • The team grows to 30 people and won the Outstanding Student Organisation prize in 2018.

  • Innovative Entrepreneurship Project of UESTC
    Team Leader       • Sep. 2017 - Mar. 2018

  • This project focus on the pedestrian detection in low-light condition with excellent conclusion. We combine the recent pedestrian detection models with the low-light image enhancement algorithm based on Laplace operator.
  • Responsible for the code implementation and project promotion.

  • Personal Interests

    DOTA1: My first and most playing PC game which accompanied me in my whole middle and high school. And I got about 1350 score on the '11' Battle Platform Ladder Tournament. :)

    Running: During my college, I offen run a long distance for the pleasure releasing. And I have participated in the Chengdu Shuangyi Marathon in 2018.


    This awesome template borrowed from this guy~