دانلود مقاله انگلیسی رایگان:بهبود یادگیری تقویتی با برنامه درسی - 2020
دانلود بهترین مقالات isi همراه با ترجمه فارسی
دانلود مقاله انگلیسی یادگیری تقویتی رایگان
  • Improved reinforcement learning with curriculum Improved reinforcement learning with curriculum
    Improved reinforcement learning with curriculum

    سال انتشار:

    2020


    عنوان انگلیسی مقاله:

    Improved reinforcement learning with curriculum


    ترجمه فارسی عنوان مقاله:

    بهبود یادگیری تقویتی با برنامه درسی


    منبع:

    Sciencedirect - Elsevier - Expert Systems With Applications, 158 (2020) 113515. doi:10.1016/j.eswa.2020.113515


    نویسنده:

    Joseph West a,⇑, Frederic Maire a, Cameron Browne b, Simon Denman a


    چکیده انگلیسی:

    Humans tend to learn complex abstract concepts faster if examples are presented in a structured manner. For instance, when learning how to play a board game, usually one of the first concepts learned is how the game ends, i.e. the actions that lead to a terminal state (win, lose or draw). The advantage of learning endgames first is that once the actions leading to a terminal state are understood, it becomes possible to incrementally learn the consequences of actions that are further away from a terminal state – we call this an end-game-first curriculum. The state-of-the-art machine learning player for general board games, AlphaZero by Google DeepMind, does not employ a structured training curriculum. Whilst Deepmind’s approach is effective, their method for generating experiences by self-play is resource intensive, costing literally millions of dollars in computational resources. We have developed a new method called the endgame- first training curriculum, which, when applied to the self-play/experience-generation loop, reduces the required computational resources to achieve the same level of learning. Our approach improves performance by not generating experiences which are expected to be of low training value. The end-gamefirst curriculum enables significant savings in processing resources and is potentially applicable to other problems that can be framed in terms of a game.
    Keywords: Curriculum learning | Reinforcement learning | Monte Carlo tree search | General game playing


    سطح: متوسط
    تعداد صفحات فایل pdf انگلیسی: 15
    حجم فایل: 2060 کیلوبایت

    قیمت: رایگان


    توضیحات اضافی:




اگر این مقاله را پسندیدید آن را در شبکه های اجتماعی به اشتراک بگذارید (برای به اشتراک گذاری بر روی ایکن های زیر کلیک کنید)

تعداد نظرات : 0

الزامی
الزامی
الزامی
rss مقالات ترجمه شده rss مقالات انگلیسی rss کتاب های انگلیسی rss مقالات آموزشی
logo-samandehi