Research Focus

Foundation models for visual perception address the core tasks of recognizing and localizing objects. This line of research develops visual backbone networks and object detection models that serve as foundational architectures for general-purpose visual perception.


Representative Works:

High-Accuracy, High-Efficiency Object Detection Foundation Model

[Figure: R-FCN]

[Figure: Deformable DETR]


Visual Backbone Networks Centered on Deformable Convolutions, Large-Scale General-Purpose Visual Foundation Models

  • Deformable Convolutional Networks v1/v2

    [6th Most Influential Paper at ICCV 2017]

    [Included in the PyTorch Vision (torchvision) Operator Library]

    [Figure: Deformable Convolutional Networks]

[Figure: DCN v3]
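
To illustrate the idea behind the deformable convolutions listed above, here is a minimal sketch in plain Python. It assumes the basic v1-style formulation, in which each tap of a 3x3 kernel reads the input at its regular grid position plus a fractional offset, using bilinear interpolation. The function names (`bilinear_sample`, `deformable_conv_at`) and the hand-picked offsets are hypothetical and chosen only for illustration. In the actual method the offsets are predicted by the network, and this sketch is not the torchvision operator.

```python
import math


def bilinear_sample(image, y, x):
    """Sample a 2D list-of-lists image at fractional (y, x).

    Positions outside the image contribute zero.
    """
    h, w = len(image), len(image[0])
    y0, x0 = math.floor(y), math.floor(x)
    value = 0.0
    for yy in (y0, y0 + 1):
        for xx in (x0, x0 + 1):
            if 0 <= yy < h and 0 <= xx < w:
                # Bilinear weight falls off linearly with distance to the pixel.
                weight = (1 - abs(y - yy)) * (1 - abs(x - xx))
                value += weight * image[yy][xx]
    return value


def deformable_conv_at(image, weights, offsets, cy, cx):
    """Output of a 3x3 deformable kernel centered at (cy, cx).

    weights: 9 kernel weights in row-major order.
    offsets: 9 (dy, dx) pairs, one per kernel tap (hand-picked here,
             learned in the real method).
    """
    out = 0.0
    k = 0
    for ky in (-1, 0, 1):
        for kx in (-1, 0, 1):
            dy, dx = offsets[k]
            out += weights[k] * bilinear_sample(image, cy + ky + dy, cx + kx + dx)
            k += 1
    return out


# Toy 4x4 input whose value at (row, col) is 4 * row + col.
image = [[float(r * 4 + c) for c in range(4)] for r in range(4)]
weights = [1.0 / 9] * 9                # simple averaging kernel
zero_offsets = [(0.0, 0.0)] * 9        # no deformation
shifted_offsets = [(0.0, 0.5)] * 9     # every tap shifted half a pixel to the right

regular = deformable_conv_at(image, weights, zero_offsets, 1, 1)
shifted = deformable_conv_at(image, weights, shifted_offsets, 1, 1)
print(regular, shifted)
```

With all offsets set to zero, the sketch reduces to an ordinary 3x3 convolution at that location. Non-zero offsets move the sampling points off the regular grid, which is what allows the kernel to adapt its receptive field to the image content.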


Doctoral Degree in Engineering

Jifeng DAI