Surgery

Evaluation of single-stage vision models for pose estimation of surgical instruments.

April 29, 2023 Surgery

Abstract

Multiple applications in open surgical environments may benefit from adoption of markerless computer vision depending on associated speed and accuracy requirements. The current work evaluates vision models for 6-degree of freedom pose estimation of surgical instruments in RGB scenes. Potential use cases are discussed based on observed performance.Convolutional neural nets were developed with simulated training data for 6-degree of freedom pose estimation of a representative surgical instrument in RGB scenes. Trained models were evaluated with simulated and real-world scenes. Real-world scenes were produced by using a robotic manipulator to procedurally generate a wide range of object poses.CNNs trained in simulation transferred to real-world evaluation scenes with a mild decrease in pose accuracy. Model performance was sensitive to input image resolution and orientation prediction format. The model with highest accuracy demonstrated mean in-plane translation error of 13 mm and mean long axis orientation error of 5[Formula: see text] in simulated evaluation scenes. Similar errors of 29 mm and 8[Formula: see text] were observed in real-world scenes.6-DoF pose estimators can predict object pose in RGB scenes with real-time inference speed. Observed pose accuracy suggests that applications such as coarse-grained guidance, surgical skill evaluation, or instrument tracking for tray optimization may benefit from markerless pose estimation.© 2023. CARS.

Show Full Text

Evaluation of single-stage vision models for pose estimation of surgical instruments.

Researchers

Journal

Modalities

Models

Abstract

Semantic Segmentation of Pancreatic Cancer in Endoscopic Ultrasound Images Using Deep Learning Approach.

How the use of the artificial intelligence could improve surgical skills in urology: state of the art and future perspectives.

Deep Learning for Image Super-resolution: A Survey.

Multi-task recurrent convolutional network with correlation loss for surgical video analysis.

Flexible needle puncture path planning for liver tumors based on deep reinforcement learning.

Deep Learning-Based Haptic Guidance for Surgical Skills Transfer.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply