深度混合型邻域搜索模型求解CVRP问题

doi:10.13232/j.cnki.jnju.2023.06.012

南京大学学报(自然科学版) ›› 2023, Vol. 59 ›› Issue (6): 1023–1033.doi: 10.13232/j.cnki.jnju.2023.06.012

• • 上一篇

深度混合型邻域搜索模型求解CVRP问题

杨笑笑, 陈智斌()

昆明理工大学理学院，昆明，650000

收稿日期:2023-09-14 出版日期:2023-11-30 发布日期:2023-12-06
通讯作者: 陈智斌 E-mail:chenzhibin311@126.com
基金资助:
国家自然科学基金(11761042)

Deep hybrid neighborhood search model solves the CVRP

Xiaoxiao Yang, Zhibin Chen()

Faculty of Science，Kunming University of Science and Technology，Kunming，650000，China

Received:2023-09-14 Online:2023-11-30 Published:2023-12-06
Contact: Zhibin Chen E-mail:chenzhibin311@126.com

摘要/Abstract

摘要：

邻域搜索算法的关键是邻域结构的选择，但每次迭代搜索的时间较长，缺少在解空间内自主搜索的能力.利用深度强化学习（DRL）模型对邻域搜索算法进行改进，设计了一个新的深度混合型邻域搜索（DHNS）模型来求解带容量的车辆路径问题（CVRP）.首先，利用贪婪算法为DRL模型提供初始解；其次，采用指针网络以及Transformer混合编码，利用不同网络的优势，深层次地提取节点特征信息；最后，将修复算子的修复过程转至DHNS模型，自动完成邻域搜索修复解的过程，扩大解空间的自主搜索能力.同时，针对混合编码中复杂传输机制以及解码输出误导性信息的问题，进一步在编码和解码过程中添加AOA （Attention on Attention）机制.AOA负责筛选有价值的信息，过滤不相关或误导性信息，有效刻画了注意力结果和查询之间的相关性，并对节点间的关系进行建模.实验结果表明，DHNS模型在100规模CVRP的优化效果上，优于现有DRL模型和部分传统算法.采用CVRPlib数据集中的算例对该算法的效能进行验证，结果表明，采用DHNS模型能够极大地提升路径问题的优化效能.

关键词: 深度混合型邻域搜索模型, 深度强化学习, 混合模型, AOA机制

Abstract:

The key to the neighborhood search algorithm is the selection of neighborhood structure，but the search time of each iteration is long，and the ability to search autonomously in the solution space is lacking. In this paper，the deep reinforcement learning (DRL) model is used to improve the neighborhood search algorithm，and a new deep hybrid neighborhood search (DHNS) model is designed to solve the capacitated vehicle routing problem (CVRP). Firstly，the greedy algorithm is used to provide the initial solution for the DRL model. Secondly，the pointer network and Transformer hybrid encoder are used to take advantage of different networks to extract node feature information at a deep level. Finally，the repair process of the repair operator is transferred to the DHNS model，and the process of neighborhood search repair solution is automatically completed，expanding the ability to solution space for autonomous search. At the same time，aiming at the complex transmission mechanism in hybrid encoder and the problem of decoding misleading information，the AOA （Attention on Attention） mechanism is further added in the encoding and decoding process. AOA is responsible for screening valuable information，filtering out irrelevant or misleading information，effectively characterizing the correlation between attention results and queries，and modeling the relationship among nodes. Experimental results show that the DHNS model is superior to the existing DRL model and some traditional algorithms in the optimization effect of 100?scale CVRP. The efficiency of the algorithm is verified by using an example in the CVRPlib dataset，and the results show that the DHNS model greatly improve the optimization efficiency of the routing problem.

Key words: deep hybrid neighborhood search, deep reinforcement learning, hybrid model, Attention on Attention

中图分类号:

杨笑笑, 陈智斌. 深度混合型邻域搜索模型求解CVRP问题[J]. 南京大学学报(自然科学版), 2023, 59(6): 1023–1033.

Xiaoxiao Yang, Zhibin Chen. Deep hybrid neighborhood search model solves the CVRP[J]. Journal of Nanjing University(Natural Sciences), 2023, 59(6): 1023–1033.

图/表 9

图1

图2

图3

表1

DHNS模型在CVRP问题上的优化结果比较"

模型	CVRP20			CVRP50			CVRP100
模型	Obj	Gap	Time	Obj	Gap	Time	Obj	Gap	Time
Random CW	6.81	11.64%	-	12.25	18.07%	-	18.96	21.18%	-
LKH3	6.12	0.00%	2 h	10.38	0.00%	7 h	15.65	0.00%	13 h
ALNS	6.69	9.31%	1 s	11.24	8.28%	2 s	17.33	10.7%	5 s
OR⁃Tools^*	6.42	4.84%	2 min	11.22	8.12%	12 min	17.14	9.34%	1 h
RL(BS)^*	6.40	4.39%	27 min	11.15	7.46%	39 min	16.96	8.39%	74 min
AM (sampling)^*	6.25	1.91%	6 min	10.62	2.40%	28 min	16.23	3.72%	2 h
AM⁃D (greedy)^*	6.28	2.95%	$≪$ 1 s	10.78	3.85%	$≪$ 1 s	16.40	4.79%	$≪$ 1 s
NeuRewriter^*	6.16	0.48%	22 min	10.51	1.25%	35 min	16.10	2.88%	66 min
POMO	6.35	3.42%	$≪$ 1 s	10.74	3.52%	1 s	16.15	3.00%	3 s
NLNS	6.14	0.61%	1 h	10.55	1.65%	2 h	16.11	2.99%	3 h
DACT	6.13	0.24%	11 min	10.39	0.18%	32 min	15.71	0.38%	1.5 h
MDAM(BS)	6.14	0.26%	3 min	10.50	1.18%	9 min	16.03	2.49%	31 min
DPDP^*	-	-	-	-	-	-	15.69	0.26%	6 h
DGTM	6.13	0.13%	2 s	10.39	0.15%	5 s	15.68	0.19%	20 s
DHNS	6.12	0.06%	3 min	10.39	0.09%	9 min	15.67	0.12%	20 min

表1

图4

图5

图6

表2

表3

参考文献 21

1	Cook W J, Cunningham W H, Pulleyblank W R, et al. Combinatorial optimization. New York, USA： Wiley?Interscience,2010：11-22.
2	Augerat P， Belenguer J M， Benavent E，et al. Separating capacity constraints in the CVRP using tabu search. European Journal of Operational Research，1998，106(2-3)：546-557.
3	代婉玉，张丽娟，吴佳峰，等. 改进TEB算法的局部路径规划算法研究. 计算机工程与应用，2022，58(8)：283-288.
	Dai W Y， Zhang L J， Wu J F，et al. Research on local path planning algorithm based on improved TEB algorithm. Computer Engineering and Applications，2022，58(8)：283-288.
4	Yogatama D， Blunsom P， Dyer C，et al. Learning to compose words into sentences with reinforcement learning. 2016，arXiv：.
5	Kool W， Van Hoof H， Attention Welling M.，learn to solve routing problems. 2019，arXiv：.
6	王扬，陈智斌，吴兆蕊，等. 强化学习求解组合最优化问题的研究综述. 计算机科学与探索，2022，16(2)：261-279.
	Wang Y， Chen Z B， Wu Z R，et al. Review of reinforcement learning for combinatorial optimization problem. Journal of Frontiers of Computer Science and Technology，2022，16(2)：261-279.
7	Huang L， Wang W M， Chen J，et al. Attention on attention for image captioning∥Proceedings of the IEEE/CVF International Conference on Computer Vision. Seoul，Korea (South)：IEEE，2019：4633-4642.
8	Vaswani A， Shazeer N， Parmar N，et al. Attention is all you need∥Proceedings of the 31st International Conference on Neural Information Processing Systems. Long Beach，CA，USA：Curran Associates Inc.，2017：6000-6010.
9	Vinalys O， Fortunato M， Jaitly N. Pointer networks∥Proceedings of the 28th International Conference on Neural Information Processing Systems. Montreal，Canada：MIT Press，2015：2692-2700.
10	Nazari M， Oroojlooy A， Taká? A，et al. Reinforcement learning for solving the vehicle routing problem∥Proceedings of the 32nd International Conference on Neural Information Processing Systems. Montréal，Canada：Curran Associates Inc.，2018：9861-9871.
11	Bresson X， Laurent T. The transformer network for the traveling salesman problem. 2021，arXiv:2103. 03012.
12	王扬，陈智斌. 一种求解CVRP的动态图转换模型. 计算机工程与科学，2023，45(5)：859-868.
	Wang Y， Chen Z B. A dynamic graph transformer model for solving CVRP. Computer Engineering and Science，2023，45(5)：859-868.
13	Kwon Y D， Choo J， Kim B，et al. Pomo：Policy optimization with multiple optima for reinforcement learning∥Proceedings of the 34th International Conference on Neural Information Processing Systems. Vancouver，Canada：Curran Associates Inc.，2020：21188-21198.
14	Wu Y X， Song W， Cao Z G，et al. Learning improvement heuristics for solving routing problems. IEEE Transactions on Neural Networks and Learning Systems，2022，33(9)：5057-5069.
15	王原，陈名，邢立宁，等. 用于求解旅行商问题的深度智慧型蚁群优化算法. 计算机研究与发展，2021，58(8)：1586-1598.
	Wang Y， Chen M， Xing L N，et al. Deep intelligent ant colony optimization for solving travelling salesman problem. Journal of Computer Research and Development，2021，58(8)：1586-1598.
16	Ma Y N, Li J W, Cao Z G, et al. Learning to iteratively solve routing problems with dual?aspect collaborative transformer. Advances in Neural Information Processing Systems,2021(34)：11096-11107.
17	Hottung A， Tierney K. Neural large neighborhood search for the capacitated vehicle routing problem. 2020，arXiv：.
18	Qin Z， Sun W X， Deng H，et al. cosFormer：Rethinking softmax in attention. 2022，arXiv：2202. 08791.
19	Chen X Y， Tian Y D. Learning to perform local rewriting for combinatorial optimization∥Proceedings of the 33rd International Conference on Neural Information Processing Systems. Vancouver，Canada：Curran Associates Inc.，2019：6281-6292.
20	Xin L， Song W， Cao Z G，et al. Multi?decoder attention model with embedding glimpse for solving vehicle routing problems∥Proceedings of the 35th AAAI Conference on Artificial Intelligence. Palo Alto，CA，USA：AAAI Press，2021：12042-12049.
21	Kool W， Van Hoof H， Gromicho J，et al. Deep policy dynamic programming for vehicle routing problems. 2021，arXiv：.

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed

实例	最优	OR⁃Tools	AM	Wu et al	NLNS	POMO	DACT	DHNS
平均间隙	0.00%	8.06%	31.62%	14.27%	11.67%	6.10%	3.41%	3.07%
X⁃n101⁃k25	27591	29405	37702	29716	29845	28595	27996	27999
X⁃n106⁃k14	26362	27343	28473	27642	27688	26850	26855	26809
X⁃n110⁃k13	14791	16149	15443	15927	15247	15094	14810	14099
X⁃n115⁃k10	12747	13320	13745	14445	14256	13191	12961	12875
X⁃n120⁃k6	13332	14242	13937	15486	13986	13615	13649	13602
X⁃n125⁃k30	55539	58665	75067	60423	57896	59504	58560	56912
X⁃n129⁃k18	28940	31361	30176	32126	31045	29221	29678	29665
X⁃n134⁃k13	10916	13275	13619	12669	12430	11377	11203	11188
X⁃n139⁃k10	13590	15223	14251	15627	14652	13900	13873	13886
X⁃n143⁃k7	15700	17470	17397	18872	18689	16166	16257	16106
X⁃n148⁃k46	43448	46836	79514	50563	49692	52085	44413	44104
X⁃n153⁃k22	21220	22919	37938	26088	27103	23800	22606	22394
X⁃n157⁃k13	16876	17309	21330	19771	19862	17347	17403	17289
X⁃n162⁃k11	14138	15030	15085	16847	15426	14812	14508	14520
X⁃n167⁃k10	20557	22477	22285	24365	22359	21390	21270	21412
X⁃n172⁃k51	45607	50505	87809	51108	52968	55636	47162	47366
X⁃n176⁃k26	47812	52111	58178	57131	58023	52722	50647	50654
X⁃n181⁃k23	25569	26321	27520	27173	27179	26101	26201	26055
X⁃n186⁃k15	24145	26017	25757	28422	26896	24664	25345	24452
X⁃n190⁃k8	16980	18088	36383	20145	20356	18551	18123	18102
X⁃n195⁃k51	44225	50311	79276	51763	48562	48307	46153	46012
X⁃n200⁃k36	58578	61009	76477	64200	62495	61513	62011	61280

[1]	方明月, 冯早, 朱雪峰. 基于半监督聚类方法的管道运行状态识别研究[J]. 南京大学学报(自然科学版), 2023, 59(3): 435-445.
[2]	常芳芳, 陈祺航, 刘云龙. 局部可观测环境下未来信息辅助的无模型深度强化学习[J]. 南京大学学报(自然科学版), 2022, 58(5): 796-804.
[3]	王扬, 陈智斌, 杨笑笑, 吴兆蕊. 深度强化学习结合图注意力模型求解TSP问题[J]. 南京大学学报(自然科学版), 2022, 58(3): 420-429.
[4]	刘玲珊, 熊轲, 张煜, 张锐晨, 樊平毅. 信息年龄受限下最小化无人机辅助无线供能网络的能耗：一种基于DQN的方法[J]. 南京大学学报(自然科学版), 2021, 57(5): 847-856.
[5]	吴礼福, 徐行. 融合韵律与动态倒谱特征的语音疲劳度检测[J]. 南京大学学报(自然科学版), 2021, 57(4): 709-714.
[6]	王旻,林志斌,卢晶. 适用于虚拟低音音质的客观评价方法研究[J]. 南京大学学报(自然科学版), 2019, 55(5): 796-803.
[7]	朱　尧，毛晓蛟，杨育彬* . 基于多特征混合模型的视觉目标跟踪[J]. 南京大学学报(自然科学版), 2016, 52(4): 762-.

深度混合型邻域搜索模型求解CVRP问题

Deep hybrid neighborhood search model solves the CVRP

RichHTML

PDF (PC)

摘要/Abstract

引用本文

使用本文

图/表 9

参考文献 21

相关文章 7

Metrics

本文评价

推荐阅读 0