• Login
    View Item 
    •   DSpace Home
    • Faculty of Technology, Art and Design
    • TKD - Department of Computer Science
    • View Item
    •   DSpace Home
    • Faculty of Technology, Art and Design
    • TKD - Department of Computer Science
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    The Hierarchical Continuous Pursuit Learning Automation for Large Numbers of Actions

    Thumbnail
    Abstract
    Although the field of Learning Automata (LA) has made significant progress in the last four decades, the LA-based methods to tackle problems involving environments with a large number of actions are, in reality, relatively unresolved. The extension of the traditional LA (fixed structure, variable structure, discretized, and pursuit) to problems within this domain cannot be easily established when the number of actions is very large. This is because the dimensionality of the action probability vector is correspondingly large, and consequently, most components of the vector will, after a relatively short time, have values that are smaller than the machine accuracy permits, implying that they will never be chosen. This paper pioneers a solution that extends the continuous pursuit paradigm to such large-actioned problem domains. The beauty of the solution is that it is hierarchical, where all the actions offered by the environment reside as leaves of the hierarchy. Further, at every level, we merely require a two-action LA which automatically resolves the problem of dealing with arbitrarily small action probabilities. Additionally, since all the LA invoke the pursuit paradigm, the best action at every level trickles up towards the root. Thus, by invoking the property of the “max” operator, in which, the maximum of numerous maxima is the overall maximum, the hierarchy of LA converges to the optimal action. Apart from reporting the theoretical properties of the scheme, the paper contains extensive experimental results which demonstrate the power of the scheme and its computational advantages. As far as we know, there are no comparable results in the field of LA.
    URI
    https://hdl.handle.net/10642/7238
    Collections
    • TKD - Department of Computer Science
    Date
    2018-05-22
    Author
    Yazidi, Anis
    Zhang, Xuan
    Lei, Jiao
    Oommen, John
    Show full item record
    HierCPA_conf_submitted.pdf (151.6Kb)

    Related items

    Showing items related by title, author, creator and subject.

    • Thumbnail

      Arts-based learning in vocational education: Using arts-based approaches to enrich vocational pedagogy and didactics and to enhance professional competence and identity 

      Meltzer, Cecilie; Schwencke, Eva (SAGE Publications, 2019)
      This article discusses in what way arts-based learning can complement and enrich vocational pedagogy and didactics. It examines how artwork and artistic, educational practices can enhance professional and vocational skills, ...
    • Thumbnail

      Learning transfer through industrial simulator training: Petroleum industry case 

      Komulainen, Tiina M.; Sannerud, Arne Ronny (Cogent OA, 2018-12-03)
      Efficient teamwork skills, high level of complex process knowledge and a vast set of operational abilities are essential for safe and economical operation in process industries. During the past decade, human factors have ...
    • Thumbnail

      Two-timescale learning automata for solving stochastic nonlinear resource allocation problems 

      Yazidi, Anis; Hammer, Hugo Lewi; Jonassen, Tore Møller (Springer Verlag, 2017)
      This papers deals with the the Stochastic Non-linear Fractional Equality Knapsack (NFEK) problem which is a fundamental resource allocation problem based on incomplete and noisy information [2,3]. The NFEK problem arises ...

    copyright © 2017 Oslo and Akershus University College of Applied Sciences
    Contact Us | Send Feedback
    Powered by KnowledgeArc
     

     

    Browse

    All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

    My Account

    LoginRegister

    Statistics

    View Usage Statistics

    copyright © 2017 Oslo and Akershus University College of Applied Sciences
    Contact Us | Send Feedback
    Powered by KnowledgeArc