Tianheng Cheng
Deep high-resolution representation learning for visual recognition
MMDetection: Open MMLab detection toolbox and benchmark
An end-to-end automatic cloud database tuning system using deep reinforcement learning
Boundary-preserving Mask R-CNN
MapTR: Structured modeling and learning for online vectorized HD map construction
Sparse instance activation for real-time instance segmentation
Polar parametrization for vision-based surround-view 3D detection
Efficient and robust 2D-to-BEV representation learning via geometry-guided kernel transformer
BoxTeacher: Exploring High-Quality Pseudo Labels for Weakly Supervised Instance Segmentation
Bayesian cycle-consistent generative adversarial networks via marginalizing latent sampling
Perceive, interact, predict: Learning dynamic and static clues for end-to-end motion prediction
Lane graph as path: Continuity-preserving path-wise modeling for online lane graph construction
Vision-based uneven BEV representation learning with polar rasterization and surface estimation
Knowledge mining with scene text for fine-grained recognition
Symphonize 3D Semantic Scene Completion with Contextual Instance Queries
ProRes: Exploring Degradation-aware Visual Prompt for Universal Image Restoration
VMA: Divide-and-conquer vectorized map annotation system for large-scale driving scene
AziNorm: Exploiting the radial symmetry of point cloud for azimuth-normalized 3D perception
MobileInst: Video Instance Segmentation on the Mobile
YOLO-World: Real-Time Open-Vocabulary Object Detection
