Show simple item record

dc.contributor.authorSivakumaran, Akilavan
dc.date.accessioned2024-08-24T00:03:57Z
dc.date.available2024-08-24T00:03:57Z
dc.date.issued2024-06-17
dc.date.submitted2024-06-17T10:01:54Z
dc.identifierENERGI399I 0 O ORD 2024 VÅR
dc.identifier.urihttps://hdl.handle.net/11250/3148401
dc.description.abstractMany metaheuristic frameworks exist for solving different combinatorial optimization problems. Despite formulating general strategies that can be applied to many problems, they often rely on problem-specific implementation. Hyperheuristic frameworks attempt to fully generalize the solution method by only relying on general search information for decision making. The addition of Deep Reinforcement Learning (DRL) in a hyperheuristic framework provides the opportunity of learning complex relations between different actions and the state of the search. When it is used for selection of heuristics, it is important to mitigate the opportunity for reward hacking by carefully designing the reward function to be as representative of our objective as possible. This thesis proposes two hyperheuristic frameworks using DRL, with a new reward function for heuristic selection that is based on the percentage improvement compared to the initial solution. Deep Reinforcement Learning Hyperheuristic Plus (DRLH+) combines this DRL heuristic selection with the acceptance strategy of simulated annealing. Dual-Network Deep Reinforcement Learning Hyperheuristic (D^2RLH) combines the DRL heuristic selection with a second DRL agent for acceptance. The frameworks are tested by solving instances of the Pickup and Delivery Problem with Time Windows, and consistently perform well on large problem sizes. The reward function is shown to improve upon the reward function of Deep Reinforcement Learning Hyperheuristic (DRLH) by making gradual and consistent improvements throughout the search, and is able to adjust the strategy to account for extended searches.
dc.language.isoeng
dc.publisherThe University of Bergen
dc.rightsCopyright the Author. All rights reserved
dc.subjectPickup and Delivery Problem with Time Windows
dc.subjectReward function
dc.subjectHyperheuristics
dc.subjectReward hacking
dc.subjectHeuristics
dc.subjectPDPTW
dc.subjectDeep Reinforcement Learning
dc.subjectDRLH+
dc.titleHyperheuristic Frameworks for Combinatorial Optimization Problems using Deep Reinforcement Learning
dc.typeMaster thesis
dc.date.updated2024-06-17T10:01:54Z
dc.rights.holderCopyright the Author. All rights reserved
dc.description.degreeMasteroppgave i energi
dc.description.localcodeENERGI399I
dc.description.localcode5MAMN-ENER
dc.subject.nus752903
fs.subjectcodeENERGI399I
fs.unitcode12-44-0


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record