Objective 6D pose detection is a key technology for enabling autonomous grasping in robots. Currently, traditional point‒pair feature (PPF) methods face three major challenges: 1) excessive sensitivity to sensor noise, severe occlusions, and background clutter; 2) reduced matching performance when the workpiece has numerous repetitive features; 3) slow recognition speed due to the need to search many point pairs and compute transformation relationships. This study proposes a point-pair feature-based 6D pose detection method designed for robotic sorting system grasping tasks. Methods Firstly, multi-plane feature workpieces were screened based on distributions of model plane points, and their boundary features were extracted for 6D pose detection. Model point pairs were extracted from multi-view points to remove redundant point pairs and improve the recognition speed of algorithms. Secondly, to further enhance recognition speed, a method was employed to extract model point pairs from multiple viewpoints, which helped in eliminating redundant point pairs that did not contribute to the detection process. Thirdly, the point-to-point characteristics between scenes and models were matched, and a fast voting scheme was employed to obtain pose hypothesis sets for targets in a disordered scene. Then, a pose verification and screening method was introduced to eliminate duplicate and mismatched poses, which was essential for realizing a rough estimation of multi-instance poses for the targeted workpieces. Finally, an algorithm called Iterative Closest Points (ICPs) was utilized to refine the rough estimates and achieve a more accurate estimation of the targeted poses. Results and Discussions Experimental results showed that in the context of disordered simulation scenes, the proposed method demonstrated a single recognition time of 1.2 seconds, with an average translation deviation of 1 mm and an average rotation error of 1.56°. These results indicated a high level of precision and efficiency in pose detection. In an actual scenario, this method achieved an average recognition success rate of 95.8%, with an average single recognition time of 1.1 seconds. The high success rate and rapid speed highlighted that this method has favorable practical applicability in robotic sorting tasks. Therefore, this study highlighted that the proposed 6D pose detection method significantly outperformed the original PPF algorithm in terms of recognition speed, while also improving the accuracy of pose estimation. This advancement was crucial for the reliable and efficient operation of robotic systems in precision grasping applications. Finally, this 6D pose detection method not only ensured recognition efficiency but also accounted for the accuracy of pose estimation. This meant that recognition speed was significantly improved compared to the original PPF algorithm, providing a strong guarantee for the realization of accurate robotic grasping. Conclusions The research presents a comprehensive approach to enhancing 6D pose detection in disordered sorting scenarios, representing a significant advancement in robotic vision and grasping technologies. The proposed method is verified as effective in both simulated environments and real-world working conditions. In addition, it demonstrates superior performance compared to existing approaches, supporting more accurate analysis in 6D pose detection applications.
DrostB, UlrichM, NavabN,et al.Model globally,match locally:Efficient and robust 3D object recognition[C]//Proceedings of the 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.San Francisco:IEEE,2010:998‒1005. doi:10.1109/cvpr.2010.5540108
[2]
ChoiC, TaguchiY, TuzelO,et al.Voting-based pose estimation for robotic assembly using a 3D sensor[C]//Proceedings of the 2012 IEEE International Conference on Robotics and Automation.Saint Paul:IEEE,2012:1724‒1731. doi:10.1109/icra.2012.6225371
[3]
HinterstoisserS, LepetitV, RajkumarN,et al.Going further with point pair features[M]//Leibe B,Matas J,Sebe N,et al,eds.Lecture Notes in Computer Science.Cham:Springer International Publishing,2016:834‒848. doi:10.1007/978-3-319-46487-9_51
[4]
YuMenghui, CuiXining, WuLinqigao,et al.3D recognition algorithm based on curvature point pair features[J].Laser & Optoelectronics Progress,2023,60(12):222‒229.
LiuDiyi, AraiS, MiaoJiaqi,et al.Point pair feature-based pose estimation with multiple edge appearance models (PPF‒MEAM) for robotic Bin picking[J].Sensors,2018,18(8):2719. doi:10.3390/s18082719
[7]
GuoJianwei, XingXuejun, QuanWeize,et al.Efficient center voting for object detection and 6D pose Estimation in 3D Ppoint cloud[J]. IEEE Transactions on Image Processing,2021,30:5072‒5084. doi:10.1109/tip.2021.3078109
[8]
LabbéY, CarpentierJ, AubryM,et al.CosyPose:Consistent multi-view multi-object 6D pose estimation[M]//Lecture Notes in Computer Science.Cham:Springer International Publishing,2020:574‒591. doi:10.1007/978-3-030-58520-4_34
[9]
WongJ M, KeeV, LeT,et al.SegICP:Integrated deep semantic segmentation and pose estimation[C]//Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).Vancouver:IEEE,2017:5784‒5789. doi:10.1109/iros.2017.8206470
[10]
BirdalT, IlicS.Point pair features based object detection and pose estimation revisited[C]//Proceedings of the 2015 International Conference on 3D Vision.Lyon:IEEE,2015:527‒535. doi:10.1109/3dv.2015.65
[11]
ZhaoYinshuai, WuQingxiao, FuShuangfei,et al.Efficient planar object recognition and localization system based on boundary point pair feature[J].Application Research of Computers,2019,36(2):601‒605.
WuChenghei, JiangS Y, SongKaitai.CAD-based pose estimation for random Bin-picking of multiple objects using a RGB‒D camera[C]//Proceedings of the 2015 15th International Conference on Control,Automation and Systems.Busan:IEEE,2015:1645‒1649. doi:10.1109/iccas.2015.7364621
[14]
YanWu, XuZhihao, ZhouXuefeng,et al.Fast object pose estimation using adaptive threshold for Bin-picking[J].IEEE Access,2020,8:63055‒63064. doi:10.1109/access.2020.2983173
[15]
LuJun, WeiPanyi, WangWei.3D target recognition algorithm based on point-pair features of key points[J].Transactions of Beijing Institute of Technology,2022,42(2):200‒207.
BirdalT, IlicS.Point pair features based object detection and pose estimation revisited[C]//Proceedings of the 2015 International Conference on 3D Vision.Lyon:IEEE,2015:527‒535. doi:10.1109/3dv.2015.65
[18]
ParkK, PattenT, VinczeM.Pix2Pose:Pixel-wise coordinate regression of objects for 6D pose estimation[C]//Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision.Seoul:IEEE,2019:7667‒7676. doi:10.1109/iccv.2019.00776
[19]
LiangHongzhuo, MaXiaojian, LiShuang,et al.PointNetGPD:Detecting grasp configurations from point sets[C]//Proceedings of the 2019 International Conference on Robotics and Automation.Montreal:IEEE,2019:3629‒3635. doi:10.1109/icra.2019.8794435
[20]
LanYuqing, DuanYao, LiuChenyi,et al.ARM3D:Attention-based relation module for indoor 3D object detection[J].Computational Visual Media,2022,8(3):395‒414. doi:10.1007/s41095-021-0252-6
[21]
ZengLong, LvWei jie, DongZhi kai,et al.PPR‒net:Accurate 6-D pose estimation in stacked scenarios[J].IEEE Transactions on Automation Science and Engineering,2022,19(4):3139‒3151. doi:10.1109/tase.2021.3108800
[22]
FischlerM A, BollesR C.Random sample consensus:A paradigm for model fitting with applications to image analysis and automated cartography[M]//Readings in Computer Vision.Amsterdam:Elsevier,1987:726‒740. doi:10.1016/b978-0-08-051581-6.50070-2
[23]
DonadiI, PrettoA.KVN:Keypoints voting network with differentiable RANSAC for stereo pose estimation[EB/OL]//(2023‒07‒21)[2024‒06‒01].11543.doi:10.1109/lra.2024.3367508/mm2
[24]
JiangJie, LingSirui. Parallel Voting RANSAC and its implementation on FPGA[J].Journal of Electronics & Informetion Technology,2013,39(2),96‒100.
LiuKeping, GaoRunze, LiYan,et al.Fast and accurate edge extraction algorithm of stacked workpiece point cloud[C]//Proceedings of the 2021 IEEE International Conference on Real-time Computing and Robotics.Xi’ning:IEEE,2021:75‒80. doi:10.1109/rcar52367.2021.9517364
[27]
XiaoZhengtao, GaoJian, WuDongqing,et al.A uniform downsampling method for three-dimensional point clouds based on voxel grids[J].Machinery Design & Manufacture,2023(8):180‒184.
HeYisheng, WangYao, FanHaoqiang,et al.FS6D:Few-shot 6D pose estimation of novel objects[C]//Proceedings of the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition.New Orleans:IEEE,2022:6804‒6814. doi:10.1109/cvpr52688.2022.00669
[30]
PengSida, LiuYuan, HuangQixing,et al.PVNet:Pixel-wise voting network for 6DoF pose estimation[C]//Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Long Beach:IEEE,2019:4556‒4565. doi:10.1109/cvpr.2019.00469
[31]
WangChen, XuDanfei, ZhuYuke,et al.DenseFusion:6D object pose estimation by iterative dense fusion[C]//Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Long Beach:IEEE,2019:3338‒3347. doi:10.1109/cvpr.2019.00346