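A curated list of the articles published in the paper recommendation column on the 自动驾驶之心 platform.
Paper recommendation column: it follows the latest papers in autonomous driving perception. Each paper is broken down into four parts (main idea, main contributions, network design, experimental results) and written up as a Chinese-language article, giving Chinese-speaking researchers and followers of 自动驾驶之心 a quick overview of recent work.

Date | Paper name | Chinese title (翻译名字) | Recommendation article link | Paper link | Code link |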
---|---|---|---|---|---|
2023-05-29 | Deep Radar Inverse Sensor Models for Dynamic Occupancy Grid Maps (Preprint)* | 用于动态占用网格地图的深度毫米波雷达逆传感器模型(Inverse Sensor Models) | https://zhuanlan.zhihu.com/p/632717882 | https://arxiv.org/pdf/2305.12409.pdf | |
2023-05-29 | Curricular Object Manipulation in LiDAR-based Object Detection | CVPR 2023 基于LiDAR的目标检测中的Curricular Object Manipulation | https://zhuanlan.zhihu.com/p/632927353 | https://arxiv.org/pdf/2304.04248.pdf | |
2023-05-27 | MonoATT: Online Monocular 3D Object Detection with Adaptive Token Transformer | CVPR 2023 MonoATT:在线单目3D目标检测与自适应Token Transformer | https://zhuanlan.zhihu.com/p/632577292 | https://arxiv.org/pdf/2303.13018.pdf | |
2023-05-23 | PVO: Panoptic Visual Odometry | CVPR 2023 PVO:全景视觉里程计 | https://zhuanlan.zhihu.com/p/631643495 | https://arxiv.org/pdf/2207.01610.pdf | https://zju3dv.github.io/pvo/ |
2023-05-22 | Dense Distinct Query for End-to-End Object Detection | CVPR 2023 用于端到端目标检测的稠密Distinct Query | https://zhuanlan.zhihu.com/p/625613243 | https://arxiv.org/pdf/2303.12776.pdf | |
2023-05-21 | Referring Multi-Object Tracking | CVPR 2023 Referring多目标跟踪(旷视科技) | https://zhuanlan.zhihu.com/p/631012208 | https://arxiv.org/pdf/2303.03366.pdf | https://github.com/wudongming97/RMOT |
2023-05-15 | EFEM: Equivariant Neural Field Expectation Maximization for 3D Object Segmentation Without Scene Supervision | CVPR 2023 EFEM:基于等变神经场期望最大化的无场景监督三维目标分割 | https://zhuanlan.zhihu.com/p/628708768 | https://arxiv.org/pdf/2303.15440.pdf | |
2023-05-14 | 3D Semantic Segmentation in the Wild: Learning Generalized Models for Adverse-Condition Point Clouds | CVPR 2023 3D语义分割in the Wild: 学习不利条件点云的泛化模型 | https://zhuanlan.zhihu.com/p/628708057 | https://arxiv.org/pdf/2304.00690.pdf | https://github.com/xiaoaoran/SemanticSTF |
2023-05-14 | Hierarchical Supervision and Shuffle Data Augmentation for 3D Semi-Supervised Object Detection | CVPR 2023 基于Hierarchical监督和Shuffle数据增强的半监督三维目标检测 | https://zhuanlan.zhihu.com/p/626264665 | https://arxiv.org/pdf/2304.01464.pdf | |
2023-05-13 | Exploiting the Complementarity of 2D and 3D Networks to Address Domain-Shift in 3D Semantic Segmentation | CVPR 2023 利用2D和3D网络的互补性,解决三维语义分割中的域偏移问题 | https://zhuanlan.zhihu.com/p/628707262 | https://arxiv.org/pdf/2304.02991.pdf | |
2023-05-05 | MSeg3D: Multi-modal 3D Semantic Segmentation for Autonomous Driving | CVPR 2023 MSeg3D:用于自动驾驶的多模态3D语义分割(浙江大学最新) | https://zhuanlan.zhihu.com/p/626843023 | https://arxiv.org/pdf/2303.08600.pdf | |
2023-05-05 | Hierarchical Supervision and Shuffle Data Augmentation for 3D Semi-Supervised Object Detection | CVPR 2023 基于等级监督和Shuffle数据增强的半监督3D目标检测 | https://zhuanlan.zhihu.com/p/627095580 | https://arxiv.org/pdf/2304.01464.pdf | https://github.com/azhuantou/HSSDA |
2023-05-01 | Renderable Neural Radiance Map for Visual Navigation | CVPR 2023 用于视觉导航的可绘制神经辐射Map | https://zhuanlan.zhihu.com/p/626201215 | https://arxiv.org/pdf/2303.00304.pdf | |
2023-04-28 | MixTeacher: Mining Promising Labels with Mixed Scale Teacher for Semi-Supervised Object Detection | CVPR 2023 Mix Teacher:半监督目标检测新方法 | https://zhuanlan.zhihu.com/p/625756689 | https://arxiv.org/pdf/2303.09061.pdf | |
2023-05-04 | Rotation-Invariant Transformer for Point Cloud Matching | CVPR 2023 用于点云匹配的旋转不变Transformer | https://zhuanlan.zhihu.com/p/624188832 | https://arxiv.org/pdf/2303.08231.pdf | |
2023-04-27 | ACL-SPC: Adaptive Closed-Loop system for Self-Supervised Point Cloud Completion | CVPR 2023 ACL-SPC:用于自监督点云补全的自适应Closed-Loop系统 | https://zhuanlan.zhihu.com/p/625456198 | https://arxiv.org/pdf/2303.01979.pdf | |
2023-04-25 | SCPNet: Semantic Scene Completion on Point Cloud | CVPR 2023 SCPNet:点云上的语义场景补全 | https://zhuanlan.zhihu.com/p/624187098 | https://arxiv.org/pdf/2303.06884.pdf | |
2023-04-24 | Binarizing Sparse Convolutional Networks for Efficient Point Cloud Analysis | CVPR 2023 用于高效点云分析的稀疏卷积网络二值化 | https://zhuanlan.zhihu.com/p/623709104 | https://arxiv.org/pdf/2303.15493.pdf | |
2023-04-20 | PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection | CVPR 2023 PiMAE:用于3D目标检测的点云和图像交互式自动编码器 | https://zhuanlan.zhihu.com/p/623529429 | https://arxiv.org/pdf/2303.08129.pdf | |
2023-04-15 | 3D Video Object Detection with Learnable Object-Centric Global Optimization | CVPR 2023 基于可学习目标中心全局优化的3D视频目标检测 | https://zhuanlan.zhihu.com/p/621614451 | https://arxiv.org/pdf/2303.15416.pdf | https://github.com/jiaweihe1996/BA-Det |
2023-04-15 | LinK: Linear Kernel for LiDAR-based 3D Perception | CVPR 2023 LinK:基于lidar的3D感知的线性Kernel | https://zhuanlan.zhihu.com/p/622237858 | https://arxiv.org/pdf/2303.16094.pdf | |
2023-04-14 | Towards Domain Generalization for Multi-view 3D Object Detection in Bird-Eye-View | CVPR 2023 面向鸟瞰多视图三维目标检测的域泛化 | https://zhuanlan.zhihu.com/p/620518090 | https://arxiv.org/pdf/2303.01686.pdf | |
2023-04-11 | NeuralPCI: Spatio-temporal Neural Field for 3D Point Cloud Multi-frame Non-linear Interpolation | CVPR 2023 基于时空神经辐射场的三维点云多帧非线性插值 | https://zhuanlan.zhihu.com/p/619200995 | https://arxiv.org/pdf/2303.15126.pdf | https://github.com/ispc-lab/NeuralPCI |
2023-04-12 | Weakly Supervised Monocular 3D Object Detection using Multi-View Projection and Direction Consistency | CVPR 2023 基于多视图投影和方向一致性的弱监督单目3D检测 | https://zhuanlan.zhihu.com/p/621462564 | https://arxiv.org/pdf/2303.08686.pdf | https://github.com/weakmono3d/weakmono3d |
2023-04-09 | TBP-Former: Learning Temporal Bird’s-Eye-View Pyramid for Joint Perception and Prediction in Vision-Centric Autonomous Driving | CVPR 2023 TBP-Former: 最新基于BEV的以视觉为中心的联合感知和预测网络 | https://zhuanlan.zhihu.com/p/620518461 | https://arxiv.org/pdf/2303.09998.pdf | https://github.com/MediaBrain-SJTU/TBP-Former |
2023-04-08 | SimpleNet: A Simple Network for Image Anomaly Detection and Localization | CVPR 2023 SimpleNet:一个简单的图像异常检测和定位网络 | https://zhuanlan.zhihu.com/p/619199955 | https://arxiv.org/pdf/2303.15140.pdf | https://github.com/DonaldRR/SimpleNet |
2023-04-06 | Learning to Retain while Acquiring: Combating Distribution-Shift in Adversarial Data-Free Knowledge Distillation | CVPR 2023 Learning to Retain while Acquiring:对抗Adversarial Data-Free知识蒸馏中的分布偏移 | https://zhuanlan.zhihu.com/p/617924951 | https://arxiv.org/pdf/2302.14290.pdf | |
2023-04-03 | Viewpoint Equivariance for Multi-View 3D Object Detection | CVPR 2023 多视图3D目标检测中的viewpoint equivariance | https://zhuanlan.zhihu.com/p/619170916 | https://arxiv.org/pdf/2303.14548.pdf | https://github.com/TRI-ML/VEDet |
2023-03-29 | ISBNet: a 3D Point Cloud Instance Segmentation Network with Instance-aware Sampling and Box-aware Dynamic Convolution | CVPR 2023 ISBNet:一种基于实例感知采样和box感知动态卷积的三维点云实例分割网络 | https://zhuanlan.zhihu.com/p/617923193 | https://arxiv.org/pdf/2303.00246.pdf | |
2023-03-29 | Hidden Gems: 4D Radar Scene Flow Learning Using Cross-Modal Supervision | CVPR 2023 Hidden Gems: 使用跨模态监督的4D雷达场景流学习 | https://zhuanlan.zhihu.com/p/617733380 | https://arxiv.org/pdf/2303.00462.pdf | https://github.com/Toytiny/CMFlow |
2023-03-24 | Multimodal Industrial Anomaly Detection via Hybrid Fusion | CVPR 2023 多模态融合的工业异常检测 | https://zhuanlan.zhihu.com/p/615572115 | https://arxiv.org/pdf/2303.00601.pdf | https://github.com/nomewang/M3DM |
2023-03-22 | Token Contrast for Weakly-Supervised Semantic Segmentation | CVPR 2023 基于Token对比的弱监督语义分割 | https://zhuanlan.zhihu.com/p/615570599 | https://arxiv.org/pdf/2303.01267.pdf | https://github.com/rulixiang/ToCo |
2023-03-21 | Delivering Arbitrary-Modal Semantic Segmentation | CVPR 2023 提供任意模态语义分割 | https://zhuanlan.zhihu.com/p/615573285 | https://arxiv.org/pdf/2303.01480.pdf | https://jamycheung.github.io/DELIVER.html |
2023-03-14 | PointCert: Point Cloud Classification with Deterministic Certified Robustness Guarantees | CVPR 2023 PointCert: 一种鲁棒的点云分类网络 | https://zhuanlan.zhihu.com/p/614021909 | https://arxiv.org/pdf/2303.01959.pdf | |
2023-03-11 | MixVPR: Feature Mixing for Visual Place Recognition | MixVPR:用于视觉场所识别的特征混合 | https://zhuanlan.zhihu.com/p/613070820 | https://arxiv.org/pdf/2303.02190v1.pdf | |
2023-03-09 | BSH-Det3D: Improving 3D Object Detection with BEV Shape Heatmap | BSH-Det3D:改进三维目标检测与BEV shape heatmap | https://zhuanlan.zhihu.com/p/612497856 | https://arxiv.org/pdf/2303.02000.pdf | |
2023-03-05 | Efficient Context Integration through Factorized Pyramidal Learning for Ultra-Lightweight Semantic Segmentation | 基于分解金字塔学习的上下文集成用于超轻量级语义分割 | https://zhuanlan.zhihu.com/p/611424753 | https://arxiv.org/pdf/2302.11785.pdf | |
2023-02-22 | Uncertainty-Aware AB3DMOT by Variational 3D Object Detection | 变分三维目标检测的不确定性感知算法 | https://zhuanlan.zhihu.com/p/608180167 | https://arxiv.org/pdf/2302.05923.pdf | |
2023-02-19 | On the Adversarial Robustness of Camera-based 3D Object Detection | 基于camera的三维目标检测的对抗鲁棒性研究 | https://zhuanlan.zhihu.com/p/607565836 | https://arxiv.org/pdf/2301.10766.pdf | |
2023-02-17 | Generalized Few-Shot 3D Object Detection of LiDAR Point Cloud for Autonomous Driving | 自动驾驶激光雷达点云的Generalized Few-Shot三维目标检测 | https://zhuanlan.zhihu.com/p/607076049 | https://arxiv.org/pdf/2302.03914v1.pdf | |
2023-02-15 | Variational Voxel Pseudo Image Tracking | 变分voxel伪图像跟踪 | https://zhuanlan.zhihu.com/p/606685230 | https://arxiv.org/pdf/2302.05914v1.pdf | |
2023-02-12 | LiDAR-CS Dataset: LiDAR Point Cloud Dataset with Cross-Sensors for 3D Object Detection | LiDAR-CS Dataset:用于3D目标检测的跨传感器激光雷达点云数据集 | https://zhuanlan.zhihu.com/p/605624779 | https://arxiv.org/pdf/2301.12515v1.pdf | https://github.com/LiDAR-Perception/LiDAR-CS |
2023-02-10 | Generating Evidential BEV Maps in Continuous Driving Space | 连续驾驶空间中生成Evidential BEV Maps | https://zhuanlan.zhihu.com/p/605288916 | https://arxiv.org/pdf/2302.02928v1.pdf | |
2023-02-06 | Self-Supervised Image-to-Point Distillation via Semantically Tolerant Contrastive Loss | 基于语义容忍对比损失的自监督图像到点云蒸馏(Distillation) | https://zhuanlan.zhihu.com/p/603993309 | https://arxiv.org/pdf/2301.05709v1.pdf | |
2023-02-04 | BIDIRECTIONAL PROPAGATION FOR CROSS-MODAL 3D OBJECT DETECTION | ICLR 2023 跨模态三维目标检测的双向传播 | https://zhuanlan.zhihu.com/p/603432375 | https://arxiv.org/pdf/2301.09077v1.pdf | https://github.com/Eaphan/BiProDet |
2023-02-02 | SwinDepth: Unsupervised Depth Estimation using Monocular Sequences via Swin Transformer and Densely Cascaded Network | SwinDepth:基于Swin Transformer和密集级联网络的单目序列无监督深度估计 | https://zhuanlan.zhihu.com/p/602756208 | https://arxiv.org/pdf/2301.06715v1.pdf | |
2023-01-29 | SensorX2car: Sensors-to-car calibration for autonomous driving in road scenarios | SensorX2car:道路场景中自动驾驶的传感器到车体标定(calibration) | https://zhuanlan.zhihu.com/p/601700023 | https://arxiv.org/pdf/2301.07279.pdf | https://github.com/OpenCalib/SensorX2car |
2023-01-27 | PTA-Det: Point Transformer Associating Point cloud and Image for 3D Object Detection | PTA-Det:用于三维目标检测的点云与图像关联点Transformer | https://zhuanlan.zhihu.com/p/601157599 | https://arxiv.org/pdf/2301.07301.pdf | |
2023-01-25 | BSNet: Lane Detection via Draw B-spline Curves Nearby | BSNet:基于B-spline曲线的车道线检测 | https://zhuanlan.zhihu.com/p/600924538 | https://arxiv.org/pdf/2301.06910.pdf | |
2023-01-25 | OA-BEV: Bringing Object Awareness to Bird’s-Eye-View Representation for Multi-Camera 3D Object Detection | OA-BEV:将目标感知引入多摄像机3D目标检测的鸟瞰视图表示 | https://zhuanlan.zhihu.com/p/600909451 | https://arxiv.org/pdf/2301.05711.pdf | |
2023-01-22 | Object Detection in 3D Point Clouds via Local Correlation-Aware Point Embedding | 基于局部相关感知点嵌入的三维点云目标检测 | https://zhuanlan.zhihu.com/p/600492379 | https://arxiv.org/pdf/2301.04613v1.pdf | |
2023-01-22 | Street-View Image Generation from a Bird’s-Eye View Layout | 基于鸟瞰布局的街景图像生成 | https://zhuanlan.zhihu.com/p/600448216 | https://arxiv.org/pdf/2301.04634v1.pdf | |
2023-01-14 | POLICY PRE-TRAINING FOR AUTONOMOUS DRIVING VIA SELF-SUPERVISED GEOMETRIC MODELING | 基于自监督几何建模的自动驾驶策略预训练 | https://zhuanlan.zhihu.com/p/599014144 | https://arxiv.org/pdf/2301.01006.pdf | https://github.com/OpenDriveLab/PPGeo |
2023-01-13 | Super Sparse 3D Object Detection | 超稀疏三维目标检测 | https://zhuanlan.zhihu.com/p/598713876 | https://arxiv.org/pdf/2301.02562.pdf | https://github.com/tusen-ai/SST |
2023-01-12 | PanDepth: Joint Panoptic Segmentation and Depth Completion | PanDepth:联合全景分割与深度补全 | https://zhuanlan.zhihu.com/p/598488004 | https://arxiv.org/pdf/2212.14180v1.pdf | https://github.com/juanb09111/PanDepth |
2023-01-10 | Cross Modal Transformer: Towards Fast and Robust 3D Object Detection | 基于坐标编码的3D目标检测跨模态Transformer | https://zhuanlan.zhihu.com/p/597516255 | https://arxiv.org/pdf/2301.01283.pdf | https://github.com/junjie18/CMT |
2023-01-10 | An Integrated LiDAR-SLAM System for Complex Environment with Noisy Point Clouds | 一种用于含噪声点云复杂环境的集成LiDAR-SLAM系统 | https://zhuanlan.zhihu.com/p/597516768 | https://arxiv.org/pdf/2212.05705.pdf | |
2023-01-04 | CC-3DT: Panoramic 3D Object Tracking via Cross-Camera Fusion | CC-3DT:基于跨相机融合的全景三维目标跟踪 | https://zhuanlan.zhihu.com/p/596563671 | https://arxiv.org/pdf/2212.01247.pdf | https://www.vis.xyz/pub/cc-3dt/ |
2023-01-03 | Estimation of Appearance and Occupancy Information in Bird’s Eye View from Surround Monocular Images | 从环视单目图像估计鸟瞰视野中的外观和占用信息 | https://zhuanlan.zhihu.com/p/596339740 | https://arxiv.org/pdf/2211.04557.pdf | https://uditsinghparihar.github.io/APP_OCC/ |
2022-12-24 | SSDA3D: Semi-supervised Domain Adaptation for 3D Object Detection from Point Cloud | SSDA3D:用于点云三维目标检测的半监督域自适应算法 | https://zhuanlan.zhihu.com/p/594080232 | https://arxiv.org/pdf/2212.02845.pdf | https://github.com/yinjunbo/SSDA3D |
2022-12-22 | CONTEXT-AWARE DATA AUGMENTATION FOR LIDAR 3D OBJECT DETECTION | 激光雷达三维目标检测中的上下文感知数据增强 | https://zhuanlan.zhihu.com/p/593623415 | https://zhuanlan.zhihu.com/p/593623415 | |
2022-12-20 | SceneRF: Self-Supervised Monocular 3D Scene Reconstruction with Radiance Fields | SceneRF:基于辐射场的自监督单目三维场景重建 | https://zhuanlan.zhihu.com/p/593193238 | https://arxiv.org/pdf/2212.02501.pdf | https://astra-vision.github.io/SceneRF/ |
2022-12-20 | Lite-Mono: A Lightweight CNN and Transformer Architecture for Self-Supervised Monocular Depth Estimation | Lite-Mono:一种用于自监督单目深度估计的轻量级CNN和Transformer体系结构 | https://zhuanlan.zhihu.com/p/593062025 | https://arxiv.org/pdf/2211.13202.pdf | https://github.com/noahzn/Lite-Mono |
2022-12-18 | 3D Object Aided Self-Supervised Monocular Depth Estimation | 三维目标辅助自监督单目深度估计 | https://zhuanlan.zhihu.com/p/592641555 | https://arxiv.org/pdf/2212.01768.pdf | |
2022-12-14 | Gaussian Radar Transformer for Semantic Segmentation in Noisy Radar Data | 高斯Radar Transformer在Radar数据语义分割中的应用 | https://zhuanlan.zhihu.com/p/591880664 | https://arxiv.org/pdf/2212.03690.pdf | |
2022-12-13 | Robust Point Cloud Segmentation with Noisy Annotations | 带噪声标注的鲁棒点云分割 | https://zhuanlan.zhihu.com/p/591596771 | https://arxiv.org/pdf/2212.03242.pdf | https://github.com/pleaseconnectwifi/PNAL |
2022-12-27 | Wild-Places: A Large-Scale Dataset for Lidar Place Recognition in Unstructured Natural Environments | Wild-Places:非结构化自然环境中大规模数据集的激光雷达位置识别 | https://zhuanlan.zhihu.com/p/594752961 | https://arxiv.org/pdf/2211.12732.pdf | https://csiro-robotics.github.io/Wild-Places/ |
2022-12-08 | Structured Knowledge Distillation Towards Efficient and Compact Multi-View 3D Object Detection | 面向高效紧凑的multi-view三维目标检测的结构化知识蒸馏 | https://zhuanlan.zhihu.com/p/590374321 | https://arxiv.org/pdf/2211.08398.pdf | |
2022-12-07 | Progressive Learning with Cross-Window Consistency for Semi-Supervised Semantic Segmentation | 基于跨窗口一致性的半监督语义分割的渐进学习算法 | https://zhuanlan.zhihu.com/p/590084676 | https://arxiv.org/pdf/2211.12425.pdf | |
2022-12-07 | Structural Knowledge Distillation for Object Detection | 目标检测的结构化知识蒸馏 | https://zhuanlan.zhihu.com/p/590083270 | https://arxiv.org/pdf/2211.13133.pdf | |
2022-12-02 | SAILOR: Scaling Anchors via Insights into Latent Object Representation | SAILOR:通过对潜在目标表示的洞察来缩放anchor | https://zhuanlan.zhihu.com/p/588504784 | https://arxiv.org/pdf/2210.07811.pdf | |
2022-12-02 | XC: Exploring Quantitative Use Cases for Explanations in 3D Object Detection | XC:探索三维目标检测中解释的定量用例 | https://zhuanlan.zhihu.com/p/588504549 | https://arxiv.org/pdf/2210.11590.pdf | |
2022-11-30 | PAI3D: Painting Adaptive Instance-Prior for 3D Object Detection | PAI3D:用于3D目标检测的自适应实例先验Painting | https://zhuanlan.zhihu.com/p/587986901 | https://arxiv.org/pdf/2211.08055.pdf | |
2022-11-30 | Hyperbolic Cosine Transformer for LiDAR 3D Object Detection* | 用于激光雷达3D目标检测的双曲余弦Transformer | https://zhuanlan.zhihu.com/p/587987589 | https://arxiv.org/ftp/arxiv/papers/2211/2211.05580.pdf | |
2022-11-28 | You Only Label Once: 3D Box Adaptation from Point Cloud to Image via Semi-Supervised Learning | 只标注一次! You Only Label Once:基于半监督学习的点云到图像的3D box自适应 | https://zhuanlan.zhihu.com/p/587542491 | https://arxiv.org/pdf/2211.09302.pdf | |
2022-11-26 | PointSee: Image Enhances Point Cloud | PointSee:使用图像增强点云 | https://zhuanlan.zhihu.com/p/586824434 | https://arxiv.org/pdf/2211.01664.pdf | |
2022-11-26 | Multi-Camera Multi-Object Tracking on the Move via Single-Stage Global Association Approach | 基于单阶段全局关联的多摄像机多目标运动跟踪 | https://zhuanlan.zhihu.com/p/586818421 | https://arxiv.org/pdf/2211.09663.pdf | |
2022-11-25 | Recursive Cross-View: Use Only 2D Detectors to Achieve 3D Object Detection without 3D Annotations | Recursive Cross-View:仅使用2D检测器实现无需3D标注的3D目标检测 | https://zhuanlan.zhihu.com/p/586586304 | https://arxiv.org/ftp/arxiv/papers/2211/2211.07108.pdf | |
2022-11-25 | ImLiDAR: Cross-Sensor Dynamic Message Propagation Network for 3D Object Detection | ImLiDAR:用于三维目标检测的跨传感器动态消息传播网络 | https://zhuanlan.zhihu.com/p/586585740 | https://arxiv.org/pdf/2211.09518.pdf | |
2022-11-23 | DeepMLE: A Robust Deep Maximum Likelihood Estimator for Two-view Structure from Motion | DeepMLE:一种基于SFM的双视结构鲁棒深度极大似然估计器 | https://zhuanlan.zhihu.com/p/586134028 | https://arxiv.org/pdf/2210.05517.pdf | |
2022-11-23 | CAGroup3D: Class-Aware Grouping for 3D Object Detection on Point Clouds | CAGroup3D:基于类感知的点云三维目标检测分组算法 | https://zhuanlan.zhihu.com/p/586131844 | https://arxiv.org/pdf/2210.04264.pdf | https://github.com/Haiyang-W/CAGroup3D |
2022-11-21 | Boosting Monocular 3D Object Detection with Object-Centric Auxiliary Depth Supervision | 以物体为中心的辅助深度监督增强单目3D目标检测 | https://zhuanlan.zhihu.com/p/585504648 | https://arxiv.org/pdf/2210.16574.pdf | |
2022-11-21 | Multi-Camera Calibration Free BEV Representation for 3D Object Detection | 用于三维目标检测的多摄像机无标定BEV表示 | https://zhuanlan.zhihu.com/p/585506429 | https://arxiv.org/pdf/2210.17252.pdf | |
2022-11-14 | Li3DeTr: A LiDAR based 3D Detection Transformer | Li3DeTr:一种基于激光雷达的三维检测Transformer | https://zhuanlan.zhihu.com/p/583415796 | https://arxiv.org/pdf/2210.15365.pdf | |
2022-11-14 | NeRF-SLAM: Real-Time Dense Monocular SLAM with Neural Radiance Fields | NeRF-SLAM: 具有神经辐射场的实时密集单目SLAM | https://zhuanlan.zhihu.com/p/583419503 | https://arxiv.org/pdf/2210.13641.pdf | |
2022-11-13 | TripletTrack: 3D Object Tracking using Triplet Embeddings and LSTM | TripletTrack:基于三元组嵌入和LSTM的三维目标跟踪 | https://zhuanlan.zhihu.com/p/583070856 | https://arxiv.org/pdf/2210.16204.pdf | |
2022-11-13 | MSF3DDETR: Multi-Sensor Fusion 3D Detection Transformer for Autonomous Driving | MSF3DDETR: 用于自动驾驶的多传感器融合3D检测Transformer | https://zhuanlan.zhihu.com/p/583068183 | https://arxiv.org/pdf/2210.15316.pdf | |
2022-11-09 | VP-SLAM: A Monocular Real-time Visual SLAM with Points, Lines and Vanishing Points | VP-SLAM:一种具有点、线和消失点的单目实时视觉SLAM | https://zhuanlan.zhihu.com/p/581976777 | https://arxiv.org/pdf/2210.12756.pdf | |
2022-11-09 | Strong-TransCenter: Improved Multi-Object Tracking based on Transformers with Dense Representations | Strong-TransCenter:改进的基于稠密表示的Transformer的多目标跟踪 | https://zhuanlan.zhihu.com/p/581975219 | https://arxiv.org/pdf/2210.13570.pdf | https://github.com/amitgalor18/STC_Tracker |
2022-11-09 | High-Resolution Depth Estimation for 360° Panoramas through Perspective and Panoramic Depth Images Registration | 通过透视与全景深度图像配准实现360°全景图的高分辨率深度估计 | https://zhuanlan.zhihu.com/p/581970766 | https://arxiv.org/pdf/2210.10414.pdf | |
2022-11-03 | CenterLineDet: CenterLine Graph Detection for Road Lanes with Vehicle-mounted Sensors by Transformer for HD Map Generation | CenterLineDet:基于transformer的车载传感器的车道中心线图检测(用于高清地图创建) | https://zhuanlan.zhihu.com/p/580182205 | https://arxiv.org/pdf/2209.07734.pdf | https://tonyxuqaq.github.io/projects/CenterLineDet/ |
2022-11-03 | CurveFormer: 3D Lane Detection by Curve Propagation with Curve Queries and Attention | CurveFormer:基于曲线传播和曲线查询的三维车道检测 | https://zhuanlan.zhihu.com/p/580184768 | https://arxiv.org/pdf/2209.07989.pdf | |
2022-11-03 | Domain Adaptive Object Detection for Autonomous Driving under Foggy Weather | 大雾天气下自动驾驶的域自适应目标检测 | https://zhuanlan.zhihu.com/p/580188194 | https://arxiv.org/pdf/2210.15176.pdf | https://github.com/jinlong17/DA-Detect |
2022-11-03 | Row-wise LiDAR Lane Detection Network with Lane Correlation Refinement | 基于车道相关细化的行内(Row-wise)激光雷达车道检测网络 | https://zhuanlan.zhihu.com/p/580187274 | https://arxiv.org/pdf/2210.08745.pdf | |
2022-10-31 | Rethinking the compositionality of point clouds through regularization in the hyperbolic space | NeurIPS 2022 通过双曲空间中的正则化重新思考点云的组成性 | https://zhuanlan.zhihu.com/p/579179736 | https://arxiv.org/pdf/2209.10318v1.pdf | https://github.com/diegovalsesia/HyCoRe |
2022-10-31 | Integrated Decision and Control for High-Level Automated Vehicles by Mixed Policy Gradient and Its Experiment Verification | 混合策略梯度对高级自动化车辆的集成决策与控制及其实验验证 | https://zhuanlan.zhihu.com/p/579176804 | https://arxiv.org/pdf/2210.10613v1.pdf | |
2022-10-31 | Sim-to-Real via Sim-to-Seg: End-to-end Off-road Autonomous Driving Without Real Data | CoRL 2022 Sim-to-Real via Sim-to-Seg:没有真实数据的端到端越野自动驾驶 | https://zhuanlan.zhihu.com/p/579178473 | https://arxiv.org/pdf/2210.14721v1.pdf | |
2022-10-31 | Let Images Give You More: Point Cloud Cross-Modal Training for Shape Analysis | NeurIPS 2022 让图像给你更多: 形状分析的点云交叉模态训练 | https://zhuanlan.zhihu.com/p/579180596 | https://arxiv.org/pdf/2210.04208v1.pdf | https://github.com/ZhanHeshen/PointCMT |
2022-10-28 | SWFormer: Sparse Window Transformer for 3D Object Detection in Point Clouds | SWFormer:点云3D目标检测的稀疏窗口Transformer | https://zhuanlan.zhihu.com/p/577985802 | https://arxiv.org/pdf/2210.07372v1.pdf | |
2022-10-27 | BoundED: Neural Boundary and Edge Detection in 3D Point Clouds via Local Neighborhood Statistics | BoundED: 基于局部邻域统计的3D点云神经边界和边缘检测 | https://zhuanlan.zhihu.com/p/577683400 | https://arxiv.org/pdf/2210.13305v1.pdf | |
2022-10-27 | Dual-Curriculum Teacher for Domain-Inconsistent Object Detection in Autonomous Driving | DucTeacher:自动驾驶域不一致下的目标检测 | https://zhuanlan.zhihu.com/p/577683900 | https://arxiv.org/pdf/2210.08748v1.pdf | |
2022-10-25 | An Efficient FPGA Accelerator for Point Cloud | 一种高效的点云FPGA加速器 | https://zhuanlan.zhihu.com/p/577253942 | https://arxiv.org/pdf/2210.07803.pdf | |
2022-10-24 | Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection | 3D目标检测中的同质多模态特征融合与交互(ECCV2022) | https://zhuanlan.zhihu.com/p/576470649 | https://arxiv.org/pdf/2210.09615.pdf | |
2022-10-21 | CramNet: Camera-Radar Fusion with Ray-Constrained Cross-Attention for Robust 3D Object Detection | CramNet:基于射线约束交叉注意的鲁棒三维目标检测的Camera-Radar融合 | https://zhuanlan.zhihu.com/p/576042183 | https://arxiv.org/pdf/2210.09267.pdf | |
2022-10-21 | Instance Segmentation with Cross-Modal Consistency | 具有跨模态一致性的实例分割 | https://zhuanlan.zhihu.com/p/576037478 | https://arxiv.org/pdf/2210.08113.pdf | |
2022-10-14 | Towards Efficient 3D Object Detection with Knowledge Distillation | 通过知识蒸馏实现高效的3D目标检测 | https://zhuanlan.zhihu.com/p/573732965 | https://arxiv.org/pdf/2205.15156.pdf | https://github.com/CVMI-Lab/SparseKD |
2022-10-14 | CrossDTR: Cross-view and Depth-guided Transformers for 3D Object Detection | CrossDTR:基于多目深度引导的3D目标检测 | https://zhuanlan.zhihu.com/p/572556344 | https://arxiv.org/pdf/2209.13507.pdf | https://github.com/sty61010/CrossDTR |
2022-10-12 | Unsupervised confidence for LiDAR depth maps and applications | IROS 2022 激光雷达深度图的无监督置信度及其应用 | https://zhuanlan.zhihu.com/p/573009801 | https://arxiv.org/pdf/2210.03118v1.pdf | https://github.com/andreaconti/lidar-confidence |
2022-10-11 | TIME WILL TELL: NEW OUTLOOKS AND A BASELINE FOR TEMPORAL MULTI-VIEW 3D OBJECT DETECTION | SOLOFusion:时空多视图3D目标检测的新基线 | https://zhuanlan.zhihu.com/p/572649410 | https://arxiv.org/pdf/2210.02443v1.pdf | https://github.com/Divadi/SOLOFusion |
2022-10-10 | LOPR: Latent Occupancy PRediction using Generative Models | LOPR: 使用生成模型进行潜在occupancy预测 | https://zhuanlan.zhihu.com/p/572294360 | https://arxiv.org/pdf/2210.01249v1.pdf | https://github.com/sisl/LOPR |
2022-10-09 | D-Align: Dual Query Co-attention Network for 3D Object Detection Based on Multi-frame Point Cloud Sequence | D-Align: 基于多帧点云序列的三维目标检测双查询协同attention网络 | https://zhuanlan.zhihu.com/p/571955426 | https://arxiv.org/pdf/2210.00087v1.pdf | |
2022-10-06 | DirectTracker: 3D Multi-Object Tracking Using Direct Image Alignment and Photometric Bundle Adjustment | DirectTracker: 使用直接图像对齐和光度束调整的3D多目标跟踪 | https://zhuanlan.zhihu.com/p/570955927 | https://arxiv.org/pdf/2209.14965v1.pdf | https://cvg.cit.tum.de/research/vslam/directtracker |
2022-09-29 | ERASE-Net: Efficient Segmentation Networks for Automotive Radar Signals | ERASE-Net: 自动驾驶Radar数据的高效分割网络 | https://zhuanlan.zhihu.com/p/569510004 | https://arxiv.org/pdf/2209.12940v1.pdf | |
2022-09-28 | Exploring Attention GAN for Vehicle Motion Prediction | 探索用于车辆运动预测的attention GAN | https://zhuanlan.zhihu.com/p/569147057 | https://arxiv.org/pdf/2209.12674v1.pdf | https://github.com/Cram3r95/mapfe4mp |
2022-09-28 | Attitude-Guided Loop Closure for Cameras with Negative Plane | 负平面相机的姿态引导闭环 | https://zhuanlan.zhihu.com/p/569151652 | https://arxiv.org/ | https://github.com/flysoaryun/LF-VISLAM |
2022-09-26 | R3LIVE++: A Robust, Real-time, Radiance reconstruction package with a tightly-coupled LiDAR-Inertial-Visual state Estimator | R3LIVE++:一个鲁棒实时的重建package!具有紧密耦合的激光雷达惯性视觉状态估计器 | https://zhuanlan.zhihu.com/p/568436911 | https://arxiv.org/pdf/2209.03666v1.pdf | https://github.com/hku-mars/r3live |
2022-09-23 | GANet: Goal Area Network for Motion Forecasting | GANet:运动预测的目标区域网络 | https://zhuanlan.zhihu.com/p/567605999 | https://arxiv.org/pdf/2209.09723v1.pdf | |
2022-09-22 | A Dual-Cycled Cross-View Transformer Network for Unified Road Layout Estimation and 3D Object Detection in the Bird’s-Eye-View | BEV中统一道路布局估计和3D目标检测的Transformer网络 | https://zhuanlan.zhihu.com/p/567239638 | https://arxiv.org/pdf/2209.08844v1.pdf | |
2022-09-22 | Effective Adaptation in Multi-Task Co-Training for Unified Autonomous Driving | 统一自动驾驶多任务协同训练的有效适应 | https://zhuanlan.zhihu.com/p/567235140 | https://arxiv.org/pdf/2209.08953v1.pdf | |
2022-09-20 | GATraj: A Graph- and Attention-based Multi-Agent Trajectory Prediction Model | GATraj:基于图和注意力的多智能体轨迹预测模型 | https://zhuanlan.zhihu.com/p/566497492 | https://arxiv.org/pdf/2209.07857v1.pdf | https://github.com/mengmengliu1998/gatraj |
2022-09-19 | CRAFT: Camera-Radar 3D Object Detection with Spatio-Contextual Fusion Transformer | CRAFT:毫米波雷达与相机融合3D目标检测 | https://zhuanlan.zhihu.com/p/566114804 | https://arxiv.org/pdf/2209.06535v1.pdf | |
2022-09-15 | SVNet: Where SO(3) Equivariance Meets Binarization on Point Cloud Representation | 3DV2022 高效鲁棒!SVNet:SO(3) 等变在点云表示上遇到二值化 | https://zhuanlan.zhihu.com/p/564844717 | https://arxiv.org/pdf/2209.05924v1.pdf | https://github.com/hellozhuo/svnet |
2022-09-15 | CenterFormer: Center-based Transformer for 3D Object Detection | ECCV2022 oral CenterFormer:用于 3D 目标检测的Transformer | https://zhuanlan.zhihu.com/p/564838907 | https://arxiv.org/pdf/2209.05588v1.pdf | https://github.com/TuSimple/centerformer |
2022-09-14 | Multi-modal Streaming 3D Object Detection | 多模态流式3D目标检测 | https://zhuanlan.zhihu.com/p/564467078 | https://arxiv.org/pdf/2209.04966v1.pdf | |
2022-09-13 | Real-time 3D Single Object Tracking with Transformer | 使用 Transformer 进行实时3D单目标跟踪 | https://zhuanlan.zhihu.com/p/563331685 | https://arxiv.org/pdf/2209.00860v1.pdf | https://github.com/shanjiayao/PTT |
2022-09-13 | MSMDFusion: Fusing LiDAR and Camera at Multiple Scales with Multi-Depth Seeds for 3D Object Detection | MSMDFusion:将多尺度激光雷达和相机与多深度种子融合以进行3D目标检测 | https://zhuanlan.zhihu.com/p/563331218 | https://arxiv.org/pdf/2209.03102v1.pdf | https://github.com/SxJyJay/MSMDFusion |
2022-09-12 | LESS: Label-Efficient Semantic Segmentation for LiDAR Point Clouds | ECCV2022 LESS:LiDAR 点云的标签高效语义分割 | https://zhuanlan.zhihu.com/p/563532266 | https://cseweb.ucsd.edu/~mil070/projects/ECCV2022/paper.pdf | |
2022-09-09 | CAMO-MOT: Combined Appearance-Motion Optimization for 3D Multi-Object Tracking with Camera-LiDAR Fusion | CAMO-MOT:基于LiDAR-Camera 融合的3D 多目标跟踪优化方法 | https://zhuanlan.zhihu.com/p/562755238 | https://arxiv.org/pdf/2209.02540v2.pdf | |
2022-09-08 | DeepInteraction: 3D Object Detection via Modality Interaction | DeepInteraction:通过模态交互进行 3D 对象检测 | https://zhuanlan.zhihu.com/p/562386666 | https://arxiv.org/pdf/2208.11112v2.pdf | https://github.com/fudan-zvg/DeepInteraction |

This repository was mainly written by Rujia Wang.
If you have any questions about the paper list, please do not hesitate to email me or open an issue on GitHub.