Skip to main content
CenXiv.org
This website is in trial operation, support us!
We gratefully acknowledge support from all contributors.
Contribute
Donate
cenxiv logo > cs > arXiv:2212.00618

Help | Advanced Search

Computer Science > Robotics

arXiv:2212.00618 (cs)
[Submitted on 1 Dec 2022 ]

Title: Safe Reinforcement Learning with Probabilistic Control Barrier Functions for Ramp Merging

Title: 带有概率控制屏障函数的匝道合并安全强化学习

Authors:Soumith Udatha, Yiwei Lyu, John Dolan
Abstract: Prior work has looked at applying reinforcement learning and imitation learning approaches to autonomous driving scenarios, but either the safety or the efficiency of the algorithm is compromised. With the use of control barrier functions embedded into the reinforcement learning policy, we arrive at safe policies to optimize the performance of the autonomous driving vehicle. However, control barrier functions need a good approximation of the model of the car. We use probabilistic control barrier functions as an estimate of the model uncertainty. The algorithm is implemented as an online version in the CARLA (Dosovitskiy et al., 2017) Simulator and as an offline version on a dataset extracted from the NGSIM Database. The proposed algorithm is not just a safe ramp merging algorithm but a safe autonomous driving algorithm applied to address ramp merging on highways.
Abstract: 先前的工作已经研究了将强化学习和模仿学习方法应用于自动驾驶场景,但要么算法的安全性受到影响,要么效率受到影响。 通过将控制屏障函数嵌入到强化学习策略中,我们得出了安全策略,以优化自动驾驶车辆的性能。 然而,控制屏障函数需要对车辆模型有良好的近似。 我们使用概率控制屏障函数作为模型不确定性的估计。 该算法在CARLA(Dosovitskiy等,2017)模拟器中实现为在线版本,并在从NGSIM数据库提取的数据集上实现为离线版本。 所提出的算法不仅仅是一个安全的匝道合并算法,而是一个应用于解决高速公路上匝道合并的安全自动驾驶算法。
Comments: Safe Learning for Autonomous Driving Workshop, ICML 2022
Subjects: Robotics (cs.RO) ; Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as: arXiv:2212.00618 [cs.RO]
  (or arXiv:2212.00618v1 [cs.RO] for this version)
  https://doi.org/10.48550/arXiv.2212.00618
arXiv-issued DOI via DataCite

Submission history

From: Soumith Udatha [view email]
[v1] Thu, 1 Dec 2022 16:14:40 UTC (1,357 KB)
Full-text links:

Access Paper:

    View a PDF of the paper titled
  • View Chinese PDF
  • View PDF
  • TeX Source
license icon view license
Current browse context:
cs.RO
< prev   |   next >
new | recent | 2022-12
Change to browse by:
cs
cs.AI
cs.LG

References & Citations

  • NASA ADS
  • Google Scholar
  • Semantic Scholar
a export BibTeX citation Loading...

BibTeX formatted citation

×
Data provided by:

Bookmark

BibSonomy logo Reddit logo

Bibliographic and Citation Tools

Bibliographic Explorer (What is the Explorer?)
Connected Papers (What is Connected Papers?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)

Code, Data and Media Associated with this Article

alphaXiv (What is alphaXiv?)
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub (What is DagsHub?)
Gotit.pub (What is GotitPub?)
Hugging Face (What is Huggingface?)
Papers with Code (What is Papers with Code?)
ScienceCast (What is ScienceCast?)

Demos

Replicate (What is Replicate?)
Hugging Face Spaces (What is Spaces?)
TXYZ.AI (What is TXYZ.AI?)

Recommenders and Search Tools

Influence Flower (What are Influence Flowers?)
CORE Recommender (What is CORE?)
IArxiv Recommender (What is IArxiv?)
  • Author
  • Venue
  • Institution
  • Topic

arXivLabs: experimental projects with community collaborators

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.

Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.

Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack

京ICP备2025123034号