Reinforcement Learning for Lifelong Multi-Agent Pathfinding in AutoStore system
Master thesis
View/ Open
Date
2023-05-25Metadata
Show full item recordCollections
- Department of Informatics [1001]
Abstract
AutoStore (AS) uses a cubic system for warehouse automation, utilizing robots to retrieve and organize objects in a three-dimensional grid of bins in a Manhattan geometry environment. Lifelong Multi-Agent Pathfinding (LMAPF) is a crucial element to the system’s effectiveness, where robots are assigned a new goal upon completion of a given goal. Solving LMAPF in the cubic system requires coordination and agile decision-making, which is challenging for a system with a large set of agents. PRIMAL2 is a distributed Reinforcement Learning (RL) framework for LMAPF that allows agents to learn decentralized policies for planning paths online in partially observable environments. This thesis extends PRIMAL2 specifically for the AS cube environment. The main differences for the AS environment are; the z-axis of bins, low-density of unreachable cells, sparsely occupied with agents, and no corridors. Agents spend time changing velocity, changing x/y driving direction, and delivering or retrieving bins to their assigned goal. We modify the PRIMAL2 architecture by altering observation and action space to improve agent co- ordination and local behavior for the specific AS environment. Furthermore, we devise a data transformer to convert real-world historical warehouse operations, sourced from multiple AS clients, from a continuous environment to a discrete representation suitable for training simulations. Our simulated implementation of PRIMAL2 on AS environments indicates that Machine Learning (ML) constitutes a viable approach to LMAPF in AS systems, converging to a model adept at efficiently solving and accomplishing tasks. The model exhibits generalizability, performing well across environments with varying agent densities and configurations.
Description
Postponed access: the file will be accessible after 2026-05-01
Publisher
The University of BergenCopyright
Copyright the Author. All rights reservedRelated items
Showing items related by title, author, creator and subject.
-
Deep Learning and Deep Reinforcement Learning for Graph Based Applications
Hasibi, Ramin (Doctoral thesis, 2024-01-26)Dyp læring har gitt state-of-the-art ytelse i mange applikasjoner som datasyn, tekstanalyse, biologi, osv. Suksessen med dyp læring har også hjulpet fremveksten av dyp forsterkende læring for optimal beslutningstaking og ... -
Online learning through Reinforcement learning in a high-fidelity physics simulator
Hystad, Erik (Master thesis, 2022-11-21) -
Projective Simulation compared to reinforcement learning
Bjerland, Øystein Førsund (Master thesis, 2015-06-01)This thesis explores the model of projective simulation (PS), a novel approach for an artificial intelligence (AI) agent. The model of PS learns by interacting with the environment it is situated in, and allows for simulating ...