深度强化学习结合图注意力模型求解TSP问题
王扬, 陈智斌, 杨笑笑, 吴兆蕊

Deep reinforcement learning combined with graph attention model to solve TSP
Yang Wang, Zhibin Chen, Xiaoxiao Yang, Zhaorui Wu
表1 不同模型在TSP问题上的优化结果比较
Table 1 Optimization results for TSP problem by different models
模型20⁃TSP50⁃TSP100⁃TSP
花费间隙时间花费间隙时间花费间隙时间
Concorde[9]3.830.00%5 min5.690.00%13 min7.760.00%1 h
Gurobi[8]3.830.00%7 s5.690.00%2 min7.760.00%17 min
OR⁃Tools3.860.94%1 min5.852.87%5 min8.063.86%23 min
LKH3[12]3.830.00%42 s5.690.00%6 min7.760.00%25 min
2⁃opt [1]3.953.13%1 s6.117.38%7 s8.509.53%33 s
Farthest Insertion[11]3.891.56%1 s5.974.92%2 s8.347.47%10 s
Nearest Neighbor[1]4.4816.9%1 s6.9421.9%3 s9.6824.7%7 s
Kool et al(Greedy)[19]3.850.34%≪1 s5.801.76%2 s8.124.53%6 s
Kool et al(Sampling)[19]3.840.08%5 min5.730.52%24 min7.942.26%1 h
Costa et al[17]3.830.00%15 min5.710.12%29 min7.830.87%41 min
Wu et al[22]3.830.00%1 h5.700.20%1.5 h7.871.42%2 h
Kwon et al(single trajec)[5]3.830.12%≪1 s5.741.03%3 s7.841.12%8 s
Kwon et al(no augment)[5]3.830.04%≪1 s5.710.35%10 s7.790.50%54 s
Kwon et al(8×augment)[5]3.830.00%16 s5.690.05%1 min7.770.14%7 min
Ours (single trajec)3.830.13%≪1 s5.730.70%5 s7.841.08%8 s
Ours (no augment)3.830.05%≪1 s5.700.28%10 s7.790.47%55 s
Ours (8×augment)3.830.00%17 s5.690.03%1 min7.770.12%7 min
Ours (4×augment)3.830.00%9 s5.690.01%42 s7.760.09%3 min