Hyperheuristic Frameworks for Combinatorial Optimization Problems using Deep Reinforcement Learning

Sivakumaran, Akilavan

dc.contributor.author	Sivakumaran, Akilavan
dc.date.accessioned	2024-08-24T00:03:57Z
dc.date.available	2024-08-24T00:03:57Z
dc.date.issued	2024-06-17
dc.date.submitted	2024-06-17T10:01:54Z
dc.identifier	ENERGI399I 0 O ORD 2024 VÅR
dc.identifier.uri	https://hdl.handle.net/11250/3148401
dc.description.abstract	Many metaheuristic frameworks exist for solving different combinatorial optimization problems. Although they formulate general strategies that can be applied to many problems, they often rely on problem-specific implementations. Hyperheuristic frameworks attempt to fully generalize the solution method by relying only on general search information for decision making. Adding Deep Reinforcement Learning (DRL) to a hyperheuristic framework makes it possible to learn complex relations between different actions and the state of the search. When DRL is used to select heuristics, it is important to limit the opportunity for reward hacking by designing the reward function to be as representative of the objective as possible. This thesis proposes two hyperheuristic frameworks using DRL, with a new reward function for heuristic selection that is based on the percentage improvement compared to the initial solution. Deep Reinforcement Learning Hyperheuristic Plus (DRLH+) combines this DRL heuristic selection with the acceptance strategy of simulated annealing. Dual-Network Deep Reinforcement Learning Hyperheuristic (D^2RLH) combines the DRL heuristic selection with a second DRL agent for acceptance. The frameworks are tested by solving instances of the Pickup and Delivery Problem with Time Windows, and consistently perform well on large problem sizes. The new reward function is shown to improve upon the reward function of Deep Reinforcement Learning Hyperheuristic (DRLH) by making gradual and consistent improvements throughout the search, and it is able to adjust the strategy to account for extended searches.
dc.language.iso	eng
dc.publisher	The University of Bergen
dc.rights	Copyright the Author. All rights reserved
dc.subject	Pickup and Delivery Problem with Time Windows
dc.subject	Reward function
dc.subject	Hyperheuristics
dc.subject	Reward hacking
dc.subject	Heuristics
dc.subject	PDPTW
dc.subject	Deep Reinforcement Learning
dc.subject	DRLH+
dc.title	Hyperheuristic Frameworks for Combinatorial Optimization Problems using Deep Reinforcement Learning
dc.type	Master thesis
dc.date.updated	2024-06-17T10:01:54Z
dc.rights.holder	Copyright the Author. All rights reserved
dc.description.degree	Master's thesis in Energy
dc.description.localcode	ENERGI399I
dc.description.localcode	5MAMN-ENER
dc.subject.nus	752903
fs.subjectcode	ENERGI399I
fs.unitcode	12-44-0

Files in this item

Name: 49825374.pdf
Size: 1.495Mb
Format: PDF
Description: master thesis
