Real-time rescheduling of production systems using relational reinforcement learning
Palavras-chave:
Learning, Rescheduling, Relational Modeling, Agile Manufacturing.Resumo
Most scheduling methodologies developed until now have laid down good theoretical foundations, but there is still the need for real-time rescheduling methods that can work effectively in disruption management. In this work, a novel approach for automatic generation of rescheduling knowledge using Relational Reinforcement Learning (RRL) is presented. Relational representations of schedule states and repair operators enable to encode in a compact way and use in real-time rescheduling knowledge learned through intensive simulations of state transitions. An industrial example where a current schedule must be repaired following the arrival of a new order is discussed using a prototype application –SmartGantt®- for interactive rescheduling in a reactive way. SmartGantt® demonstrates the advantages of resorting to RRL and abstract states for real-time rescheduling. A small number of training episodes are required to define a repair policy which can handle on the fly events such as order insertion, resource break-down, raw material delay or shortage and rush order arrivals using a sequence of operators to achieve a selected goal.
10.13084/2175-8018.v03n06a09Referências
ADHITYA, A.; SRINIVASAN, R.; KARIMI, I. A. Heuristic rescheduling of crude oil operations to manage abnormal supply chain events. AIChE Journal, v. 53, n. 2, p. 397-422 2007.
BLOCKEEL, H.; DE RAEDT, L. Top-down induction of first order logical decision trees. Artificial Intelligence, v. 101, n. 1-2, p. 285-297, 1998.
BLOCKEEL, H.; DE RAEDT, L.; JACOBS, N.; DEMOEN, B. Scaling up inductive logic programming by learning from interpretations. Data Mining and Knowledge Discovery v. 3, n. 1, p. 59-93, 1999.
CROONENBORGHS, T. Model-assisted approaches to relational reinforcement learning. Ph.D. Dissertation, Department of Computer Science, K. U. Leuven, Leuven, Belgium 2009.
DE RAEDT, L. Logical and relational learning. Springer-Verlag, Berlin, 2008.
DRIESSENS, K., DŽEROSKI, S. Integrating guidance into relational reinforcement learning. Machine Learning, v. 57, n. 3, p, 271-304, 2004.
DRIESSENS, K.; RAMON, J. Relational instance based regression for relational reinforcement learning. In: 20th International Conference on Machine Learning, 123, AAAI Press, Washington, 2003.
DRIESSENS, K.; RAMON, J.; BLOCKEEL, H. Speeding up Relational Reinforcement Learning through the use of an Incremental First Order Decision Tree Learner. In: DE RAEDT, L. and FLACH, P. (eds.) 13th European Conference on Machine Learning, vol. 2167, 97, Springer, Heidelberg (2001)
DRIESSENS, K.; RAMON, J.; GÄRTNER, T. Graph Kernels and Gaussian processes for relational reinforcement learning. Machine Learning, v. 64, n. 1-3, p. 91-119, 2006.
DŽEROSKI, S.; DE RAEDT, L.; DRIESSENS, K. Relational reinforcement learning. Machine Learning, v. 43, n. 1-2, p. 7-52, 2001.
GÄRTNER, T. Kernels for structured data. Series in Machine Perception and Artificial Intelligence, v. 72, World Scientific Publishing, Singapore, 2008.
Henning, G. Production scheduling in the process industries: current trends, emerging challenges and opportunities. Computer Aided Chemical Engineering, v. 27, p. 23-28 2009.
HENNING, G., CERDA, J. Knowledge-based predictive and reactive scheduling in industrial environments. Computers and Chemical Engineering, v. 24, n. 9, p. 2315-2338, 2000.
MARTINEZ, E. Solving batch process scheduling/planning tasks using reinforcement learning. Computers & Chemical Engineering, v. 23, 527-530, 1999.
MÉNDEZ, C.; CERDÁ, J.; HARJUNKOSKI, I.; GROSSMANN, I.; FAHL, M. State-of-the-art review of optimization methods for short-term scheduling of batch processes. Computers and Chemical Engineering, v. 30, n. 6, p. 913-946, 2006.
MIYASHITA, K. Learning scheduling control through reinforcements. International Transactions in Operational Research (Pergamon Press), v. 7, n. 2, p. 125-138, 2000.
MIYASHITA, K.; SYCARA, K. CABINS: a framework of knowledge acquisition and iterative revision for schedule improvement and iterative repair. Artificial Intelligence, v. 76, n. 1-2 p. 377-426, 1995.
MUSIER, R.; EVANS, L. An approximate method for the production scheduling of industrial batch processes with parallel units. Computers and Chemical Engineering, v. 13, n. 1-2, p. 229-238, 1989.
PALOMBARINI, J.; MARTÍNEZ, E. Learning to repair plans and schedules using a relational (deictic) representation. Brazilian Journal of Chemical Engineering, v. 27, n. 3, p. 413- 427, 2010.
SUTTON, R.; BARTO, A. Reinforcement learning: an introduction. MIT Press, Boston, 1998.
VAN OTTERLO, M. The logic of adaptive behavior: knowledge representation and algorithms for adaptive sequential decision making under uncertainty in first-order and relational domains. IOS Press, Amsterdam, 2009.
VIEIRA, G.; HERRMANN, J.; LIN, E. Rescheduling Manufacturing Systems: a Framework of Strategies, Policies and Methods. Journal of Scheduling, v. 6, n. 1, p. 39-62, 2003.
ZWEBEN, M.; DAVIS, E.; DOUN, B.; DEALE, M. Iterative repair of scheduling and rescheduling. IEEE Transactions On Systems, Man, And Cybernetics, v. 23, n. 6, p. 1588-1596, 1993.
Downloads
Publicado
Edição
Seção
Licença
Os artigos publicados são de propriedade do IJIE – Iberoamerican Journal of Industrial Engineering, Revista Iberoamericana de Engenharia Industrial, Revista Iberoamericana de Ingeniería Industrial. Os autores são os responsáveis pelos conteúdos dos artigos. O IJIE não se responsabiliza ou endossa as opiniões emitidas pelos autores dos textos publicados, salientando que as opiniões são de exclusiva responsabilidade dos autores.
O periódico se reserva o direito de introduzir alterações no original, visando a manter a homogeneidade e a qualidade da publicação, respeitando, no entanto, o estilo e as opiniões dos autores. Essas alterações serão editoriais (correções gramaticais e adequações estilísticas) e não substanciais, de forma que não modifiquem o sentido do texto. As provas finais não serão enviadas aos autores, sendo o artigo publicado com os ajustes necessários.
Conforme citado neste site, no item “Caráter do IJIE”, os artigos são de uso gratuito, com atribuições próprias em aplicações educacionais e não-comerciais. Uma nova publicação do mesmo texto, de iniciativa de seu autor ou de terceiros, fica sujeita à expressa menção da precedência de sua publicação neste periódico, citando-se a edição e a data dessa publicação.