严正声明:近期,《海洋技术学报》编辑部发现某些中介组织利用假冒网站、电子信箱、电话等方式,以可以快速刊发稿件为由,向作者提供假冒录用通知并收取费用。对此,我刊发布严正声明,详见动态信息公告。
融合梯度自适应A*与DA-Q学习的多无人船亚中尺度涡边缘搜索算法
A Submesoscale Eddy Boundary Search Algorithm for USVs Based on Gradient-Adaptive A-Star and Dynamic-Award Q-Learning
  
DOI:10.3969/j.issn.1003-2029.2025.06.003
中文关键词:  亚中尺度涡  无人船  A*算法  DA-Q学习
英文关键词:submesoscale eddy  Unmanned Surface Vehicle  A-star algorithm  DA-Qlearning
基金项目:山东省自然科学基金资助项目(ZR2021QD006)
作者单位
李雯雯,马毅,张飞飞,李忠伟 自然资源部第一海洋研究所中国石油大学(华东)海洋与空间信息学院 
摘要点击次数: 82
全文下载次数: 39
中文摘要:
      亚中尺度涡在海洋能量传输和生态调控中具有重要作用,其精准实时监测面临挑战。针对传统监测手段时效性不足、空间分辨率有限和观测平台灵活性差等问题,本文基于高机动性的多无人船平台,提出一种融合梯度自适应A*与动态奖励驱动的强化学习算法(Dynamic-Award O-leamning,DA-学习)的多无人船亚中尺度涡边缘搜索算法。针对亚中尺度涡边缘形态复杂、环境动态变化快且信息不完整等具象化场景,算法利用梯度自适应A*引导多无人船迅速向涡旋区域收敛,并在搜索边缘过程中,构建温度梯度变化与历史路径信息融合的动态奖励机制,驱动DA-0学习算法实现无人船自主适应环境变化并灵活调整航向。基于模拟数据与仿真海表面温度数据,设计了涵盖不同季节、无人船数量和初始配置的多组对比实验,分析了不同环境、不同涡旋类型和涡旋畸变条件下的算法性能和鲁棒性。结果表明:多无人船搜索亚中尺度涡边缘策略在路径长度与搜索效率方面优于单船方案;在部署方式上,多点分布式部署进一步提升了搜索覆盖能力与任务响应效率,优于单点母船释放式部署。本文结果可为多无人船在复杂海洋环境中的亚中尺度涡监测应用提供技术支撑。
英文摘要:
      Submesoscale eddies play significant roles in oceanic energy transfer and ecological regulation, yet their accurate and real-time monitoring remains challenging. To address the limitations of conventional monitoring methods -such as insufficient timeliness, limited spatial resolution, and poor flexibility of observation platforms, this paper proposes a multi-Unmanned Surface Vehicle (multi-USV) edge search algorithm for submesoscale eddies based on a highly maneuverable multi -USV platform. The algorithm integrates a gradient-adaptive A-star algorithm and a dynamic reward-driven reinforcement learning method (Dynamic-Award Q-learning, DA-Q learning). Targeting the complex morphology of submesoscale eddy edges, rapidly changing dynamic environments, and incomplete information in realistic scenarios, the algorithm employs gradient-adaptive A-star to guide multiple USVs to quickly converge toward the eddy region. During the edge search process, a dynamic reward mechanism combining temperature gradient variations and historical path information is established to drive the DA-Q learning algorithm, enabling USVs to autonomously adapt to environmental changes and flexibly adjust their courses. Using both simulated data and simulated sea surface temperature data, multiple sets of comparative experiments were designed, covering different seasons, numbers of USVs, and initial configurations. The performance and robustness of the algorithm under various environmental conditions, eddy types, and eddy distortion scenarios were analyzed. Results show that the multi -USV strategy for submesoscale eddy edge search outperforms the single-USV approach in both path length and search efficiency. In terms of deployment strategy, distributed multi-point deployment further enhances search coverage and mission response efficiency compared to single mothervessel release deployment. The findings of this study can provide technical support for multi-USV monitoring of submesoscale eddies in complex marine environments.
查看全文  查看/发表评论  下载PDF阅读器