ISSN : 2005-0461(Print)
ISSN : 2287-7975(Online)
ISSN : 2287-7975(Online)
시스템 특성함수 기반 평균보상 TD() 학습을 통한 유한용량 Fab 스케줄링 근사화
Capacitated Fab Scheduling Approximation using Average Reward TD() Learning based on System Feature Functions
Abstract
In this paper, we propose a logical control-based actor-critic algorithm as an efficient approach for the approximation of the capacitated fab scheduling problem. We apply the average reward temporal-difference learning method for estimating the relative value functions of system states, while avoiding deadlock situation by Banker's algorithm. We consider the Intel mini-fab re-entrant line for the evaluation of the suggested algorithm and perform a numerical experiment by generating some sample system configurations randomly. We show that the suggested method has a prominent performance compared to other well-known heuristics
- SOGOBO_2011_v34n4_189[1].pdf414.7KB