Skip to main content
CenXiv.org
This website is in trial operation, support us!
We gratefully acknowledge support from all contributors.
Contribute
Donate
cenxiv logo > cs > arXiv:2506.02768

Help | Advanced Search

Computer Science > Robotics

arXiv:2506.02768 (cs)
[Submitted on 3 Jun 2025 ]

Title: Geometric Visual Servo Via Optimal Transport

Title: 基于最优传输的几何视觉伺服

Authors:Ethan Canzini, Simon Pope, Ashutosh Tiwari
Abstract: When developing control laws for robotic systems, the principle factor when examining their performance is choosing inputs that allow smooth tracking to a reference input. In the context of robotic manipulation, this involves translating an object or end-effector from an initial pose to a target pose. Robotic manipulation control laws frequently use vision systems as an error generator to track features and produce control inputs. However, current control algorithms don't take into account the probabilistic features that are extracted and instead rely on hand-tuned feature extraction methods. Furthermore, the target features can exist in a static pose thus allowing a combined pose and feature error for control generation. We present a geometric control law for the visual servoing problem for robotic manipulators. The input from the camera constitutes a probability measure on the 3-dimensional Special Euclidean task-space group, where the Wasserstein distance between the current and desired poses is analogous with the geometric geodesic. From this, we develop a controller that allows for both pose and image-based visual servoing by combining classical PD control with gravity compensation with error minimization through the use of geodesic flows on a 3-dimensional Special Euclidean group. We present our results on a set of test cases demonstrating the generalisation ability of our approach to a variety of initial positions.
Abstract: 在为机器人系统开发控制律时,考察其性能的主要因素是选择能够实现平滑跟踪参考输入的输入。 在机器人操作的背景下,这涉及将物体或末端执行器从初始姿态移动到目标姿态。 机器人操作控制律经常使用视觉系统作为误差生成器来跟踪特征并产生控制输入。 然而,当前的控制算法并未考虑提取的概率特征,而是依赖于手工调整的特征提取方法。 此外,目标特征可能处于静态姿态,从而允许结合姿态和特征误差用于控制生成。 我们提出了一种几何控制律,用于解决机器人操作器的视觉伺服问题。 来自摄像机的输入构成了三维特殊欧几里得任务空间群上的概率测度,在该群中,当前姿态与期望姿态之间的 Wasserstein 距离类似于几何测地线。 基于此,我们开发了一种控制器,通过结合经典PD控制与重力补偿,并利用三维特殊欧几里得群上的测地流来最小化误差,从而实现了姿态和基于图像的视觉伺服。 我们在一组测试用例上展示了我们的结果,证明了我们的方法对多种初始位置的泛化能力。
Comments: 19 pages, 5 figures
Subjects: Robotics (cs.RO) ; Systems and Control (eess.SY)
Cite as: arXiv:2506.02768 [cs.RO]
  (or arXiv:2506.02768v1 [cs.RO] for this version)
  https://doi.org/10.48550/arXiv.2506.02768
arXiv-issued DOI via DataCite

Submission history

From: Eytan Canzini [view email]
[v1] Tue, 3 Jun 2025 11:38:09 UTC (13,742 KB)
Full-text links:

Access Paper:

    View a PDF of the paper titled
  • View Chinese PDF
  • View PDF
  • HTML (experimental)
  • TeX Source
  • Other Formats
license icon view license
Current browse context:
cs
< prev   |   next >
new | recent | 2025-06
Change to browse by:
cs.RO
cs.SY
eess
eess.SY

References & Citations

  • NASA ADS
  • Google Scholar
  • Semantic Scholar
a export BibTeX citation Loading...

BibTeX formatted citation

×
Data provided by:

Bookmark

BibSonomy logo Reddit logo

Bibliographic and Citation Tools

Bibliographic Explorer (What is the Explorer?)
Connected Papers (What is Connected Papers?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)

Code, Data and Media Associated with this Article

alphaXiv (What is alphaXiv?)
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub (What is DagsHub?)
Gotit.pub (What is GotitPub?)
Hugging Face (What is Huggingface?)
Papers with Code (What is Papers with Code?)
ScienceCast (What is ScienceCast?)

Demos

Replicate (What is Replicate?)
Hugging Face Spaces (What is Spaces?)
TXYZ.AI (What is TXYZ.AI?)

Recommenders and Search Tools

Influence Flower (What are Influence Flowers?)
CORE Recommender (What is CORE?)
IArxiv Recommender (What is IArxiv?)
  • Author
  • Venue
  • Institution
  • Topic

arXivLabs: experimental projects with community collaborators

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.

Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.

Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack

京ICP备2025123034号