Jingdong Zhang

I am currently a third-year Ph.D. student majoring in Computer Science at Texas A&M University, advised by Prof. Wenping Wang and Prof. Xin Li.

I received my Bachelor of Engineering degree at Fudan University in 2023. I also worked as a research assistant at HKUST CSE with Prof. Dan Xu. Previously, I worked with Prof. Tao Chen and Prof. Jiayuan Fan.

My research interests lie in Computer Vision and Graphics.

  • For computer graphics, I work on 3D asset generation and neural rendering.
  • For computer vision, I work on representation learning and scene understanding, especially under multi-tasking scenarios.
Email: jdzhang [at] tamu [dot] edu
Wechat: Triumph0929

News

Education

Texas A&M University

Department of Computer Science and Engineering

Ph.D. Student

August 2023 - Present
Fudan University

Intelligent Science and Technology (Excellent Class)

Undergraduate Student

September 2019 - June 2023

Internship

Tencent America

Research Intern, working on high-quality 3D asset generation.

May 2024 - Aug 2024
Adobe

Research Intern, working on generative soft inpainting.

May 2025 - Aug 2025

Publications

UniSER: A Foundation Model for Unified Soft Effects Removal
Jingdong Zhang, Lingzhi Zhang, Qing Liu, Mang Tik Chiu, Connelly Barnes, Yizhou Wang, Haoran You, Xiaoyang Liu, Yuqian Zhou, Zhe Lin, Eli Shechtman, Sohrab Amirghodsi, Xin Li, Wenping Wang, Xiaohang Zhan
(Project in progress/Preprint)    [arxiv]

Abstract: We propose a foundational image soft effect removal (SER) model with: i) a large, curated pair-wise dataset with diverse soft effects (e.g. lens flare, haze, shadows, and reflections), ii) fine-grained user control with spatial masks and strength control, iii) generalize on zero-shot unseen effects, iv) add or enhance effects.

SPGen: Spherical Projection as Consistent and Flexible Representation for Single Image 3D Shape Generation
SIGGRAPH Asia, 2025    [arxiv]    [project]

Abstract: SPGen leverages Spherical Projection (SP) to generate high-quality 3D shapes with i) Consistency: SP maps ensure view-consistent and unambiguous 3D reconstruction, ii) Flexibility: Supports arbitrary topologies, iii) Efficiency: Inherit powerful 2D diffusion priors and enables efficient finetuning.

SolidGS: Consolidating Gaussian Surfel Splatting for Sparse-View Surface Reconstruction
Arxiv, 2024    [arxiv]    [project]

Abstract: We present SolidGS, which reconstructs a consolidated Gaussian field from sparse inputs. Given only three input views, our approach enables high-precision and detailed mesh extraction, and high-quality novel view synthesis, achieved within just three minutes.

Multi-Task Label Discovery via Hierarchical Task Tokens for Partially Annotated Dense Predictions
ACMMM (ACM MultiMedia), 2025    [arxiv]    [code]

Abstract: This research proposes a new approach to multi-task dense predictions with partially labeled data. We introduce hierarchical task tokens (HiTTs) to capture multi-level representations. The global task tokens conduct cross-task interactions and transfer knowledge from labeled to unlabeled tasks.

BridgeNet: Comprehensive and Effective Feature Interactions via Bridge Feature for Multi-task Dense Predictions
TPAMI (IEEE TPAMI), 2025    [paper]    [code]

Abstract: This work introduces a novel BridgeNet for multi-task learning on dense predictions. It uses a Bridge Feature Extractor (BFE) to create strong bridge features and a Task Pattern Propagation (TPP) to solve the task-pattern entanglement issue, resulting in task-specific features with higher quality.

Rethinking Cross-Domain Pedestrian Detection: A Background-Focused Distribution Alignment Framework for Instance-Free One-Stage Detectors
TIP (IEEE TIP), 2023    [paper]    [code]

Abstract: We introduce a new approach for cross-domain pedestrian one-stage detectors. The paper identifies a foreground-background misalignment issue in image-level feature alignment, and a novel framework, Background-Focused Distribution Alignment (BFDA) is proposed to address this issue.

Research Experience

Selected Awards

Academic Service

Miscellaneous

I love photography and road trips.