{"id":1928,"date":"2021-07-14T09:30:00","date_gmt":"2021-07-14T07:30:00","guid":{"rendered":"http:\/\/cirpicme.org\/?page_id=1928"},"modified":"2021-07-13T18:58:48","modified_gmt":"2021-07-13T16:58:48","slug":"test-driven-reward-function-for-reinforcement-learning-a-contribution-towards-applicable-machine-learning-algorithms-for-production-systems","status":"publish","type":"page","link":"https:\/\/cirpicme.org\/index.php\/production-systems-networks\/test-driven-reward-function-for-reinforcement-learning-a-contribution-towards-applicable-machine-learning-algorithms-for-production-systems\/","title":{"rendered":"Test-driven reward function for reinforcement learning: a contribution towards applicable machine learning algorithms for production systems"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\"><em>by <em>Florian Jaensch, Karl Kuebler, Elmar Schwarz, Alexander Verl<\/em><\/em> <em>(Germany)<\/em><\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Abstract<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Reinforcement Learning algorithms find more and more application in fields where complex tasks need to be solved. The automation of production systems is one of those fields. Normally, programming a control system defines the automation strategy. Previous contributions by the authors have shown that a so-called agent is able to learn automation strategies for production systems using a Reinforcement Learning setup. However, developing the reward function for the agent can be challenging and requires Reinforcement Learning domain knowledge. This paper introduces a novel approach that combines Test-Driven Development with Reinforcement Learning in order to address the problem of designing a suitable reward function. In the presented approach, predefined test cases are used to derive rewards for the agent. 
The use of an automated test framework allows for continuous learning sequences until all test cases are passed. An application example of a robot cell is used to demonstrate the novel approach and verify its suitability and usability. This first application shows promising results and motivates further examination in additional fields of application.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Keywords<\/strong>: Virtual commissioning simulation, Test-driven development, Reinforcement learning<\/p>\n\n\n\n<div style=\"height:20px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Video presentation<\/strong><\/p>\n\n\n\n<figure class=\"wp-block-video\"><video height=\"1080\" style=\"aspect-ratio: 1920 \/ 1080;\" width=\"1920\" controls src=\"http:\/\/cirpicme.org\/wp-content\/uploads\/2021\/07\/Florian_Jaensch.mp4\"><\/video><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\"><\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Presenting author<\/strong><\/p>\n\n\n\n<figure class=\"wp-block-table is-style-stripes\"><table class=\"has-subtle-light-gray-background-color has-background\"><tbody><tr><td><\/td><td><\/td><td><\/td><\/tr><tr><td><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" width=\"150\" height=\"150\" class=\"wp-image-2255\" style=\"width: 150px;\" src=\"https:\/\/i0.wp.com\/cirpicme.org\/wp-content\/uploads\/2021\/07\/Florian_Jaensch_Photo.jpg?resize=150%2C150\" alt=\"\" srcset=\"https:\/\/i0.wp.com\/cirpicme.org\/wp-content\/uploads\/2021\/07\/Florian_Jaensch_Photo.jpg?w=220&amp;ssl=1 220w, https:\/\/i0.wp.com\/cirpicme.org\/wp-content\/uploads\/2021\/07\/Florian_Jaensch_Photo.jpg?resize=150%2C150&amp;ssl=1 150w\" sizes=\"auto, (max-width: 150px) 100vw, 150px\" \/><\/td><td><strong>Name:<\/strong><br><br><strong>Affiliation:<\/strong><br><br><strong>Email:<\/strong><\/td><td>Florian Jaensch<br><br>University of Stuttgart, 
Germany<br><br>florian.jaensch@isw.uni-stuttgart.de<\/td><\/tr><tr><td><\/td><td><\/td><td><\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\"><\/p>\n","protected":false},"excerpt":{"rendered":"<p>by Florian Jaensch, Karl Kuebler, Elmar Schwarz, Alexander Verl (Germany) Abstract Reinforcement Learning algorithms find more and more application in fields where complex tasks need to be solved. The automation of production systems is one of those fields. Normally, programming a control system defines the automation strategy. Previous contributions by&#8230;<\/p>\n<p> <a class=\"continue-reading-link\" href=\"https:\/\/cirpicme.org\/index.php\/production-systems-networks\/test-driven-reward-function-for-reinforcement-learning-a-contribution-towards-applicable-machine-learning-algorithms-for-production-systems\/\"><span>Continue reading<\/span><i class=\"crycon-right-dir\"><\/i><\/a> <\/p>\n","protected":false},"author":9,"featured_media":0,"parent":2301,"menu_order":0,"comment_status":"open","ping_status":"closed","template":"","meta":{"nf_dc_page":"","om_disable_all_campaigns":false,"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"footnotes":""},"class_list":["post-1928","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/cirpicme.org\/index.php\/wp-json\/wp\/v2\/pages\/1928","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/cirpicme.org\/index.php\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/cirpicme.org\/index.php\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/cirpicme.org\/index.php\/wp-json\/wp\/v2\/users\/9"}],"replies":[{"embeddable":true,"href":"https:\/\/cirpicme.org\/index.php\/wp-json\/wp\/v2\/comments?post=1928"}],"version-history":[{"count":3,"href":"https:\/\/cirpicme.org\/index.php\/wp-json\/wp\/v2\/pages\/1928\/revisions"}],"predeces
sor-version":[{"id":2261,"href":"https:\/\/cirpicme.org\/index.php\/wp-json\/wp\/v2\/pages\/1928\/revisions\/2261"}],"up":[{"embeddable":true,"href":"https:\/\/cirpicme.org\/index.php\/wp-json\/wp\/v2\/pages\/2301"}],"wp:attachment":[{"href":"https:\/\/cirpicme.org\/index.php\/wp-json\/wp\/v2\/media?parent=1928"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}