Journal Search Engine

ISSN : 2005-0461(Print)
ISSN : 2287-7975(Online)

Journal of Society of Korea Industrial and Systems Engineering Vol.34 No.4 pp.189-196
DOI :

시스템 특성함수 기반 평균보상 TD() 학습을 통한 유한용량 Fab 스케줄링 근사화

최진영

아주대학교 산업정보시스템공학부

Capacitated Fab Scheduling Approximation using Average Reward TD() Learning based on System Feature Functions

Jin-Young Choi

Division of Industrial and Information Systems Engineering, Ajou University

[$AuthorMark7$]

Abstract

In this paper, we propose a logical control-based actor-critic algorithm as an efficient approach for the approximation of the capacitated fab scheduling problem. We apply the average reward temporal-difference learning method for estimating the relative value functions of system states, while avoiding deadlock situation by Banker's algorithm. We consider the Intel mini-fab re-entrant line for the evaluation of the suggested algorithm and perform a numerical experiment by generating some sample system configurations randomly. We show that the suggested method has a prominent performance compared to other well-known heuristics

Key Words : Fab Scheduling Problem; Actor-critic; Temporal-difference; Average Reward; Banker's Algorithm; Feature Functions

: SOGOBO_2011_v34n4_189[1].pdf414.7KB

시스템 특성함수 기반 평균보상 TD() 학습을 통한 유한용량 Fab 스케줄링 근사화

Capacitated Fab Scheduling Approximation using Average Reward TD() Learning based on System Feature Functions

Abstract

Reference