Deep reinforcement learning combined with a graph attention model to solve the TSP

Yang Wang, Zhibin Chen, Xiaoxiao Yang, Zhaorui Wu

Table 4 Time cost of our model in the training and inference stages

Stage	TSP20	TSP50	TSP100
Training	3 h	24 h	136 h
Inference (single trajectory)	≪1 s	5 s	8 s
Inference (no augmentation)	≪1 s	10 s	55 s
Inference (8× augmentation)	17 s	1 min	7 min
Inference (4× augmentation)	9 s	42 s	3 min