<?xml version="1.0" encoding="UTF-8" ?><xml><records><record><database name="!wdg&apos;s ref list_v8.enl" path="/Users/gray/Documents/!wdg&apos;s ref list_v8.enl">!wdg&apos;s ref list_v8.enl</database><source-app name="EndNote" version="10.0">EndNote</source-app><rec-number>2022</rec-number><ref-type name="Journal Article">17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Gray, Wayne D.</style></author><author><style face="normal" font="default" size="100%">Schoelles, Michael J.</style></author><author><style face="normal" font="default" size="100%">Sims, Chris R.</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Adapting to the task environment: Explorations in expected value</style></title><secondary-title><style face="normal" font="default" size="100%">Cognitive Systems Research</style></secondary-title></titles><periodical><full-title><style face="normal" font="default" size="100%">Cognitive Systems Research</style></full-title></periodical><pages><style face="normal" font="default" size="100%">27-40</style></pages><volume><style face="normal" font="default" size="100%">6</style></volume><number><style face="normal" font="default" size="100%">1</style></number><dates><year><style face="normal" font="default" size="100%">2005</style></year></dates><publisher><style face="normal" font="default" size="100%">Elsevier</style></publisher><abstract><style face="normal" font="default" size="100%">Small variations in how a task is designed can lead humans to tradeoff one set of strategies for another. In this paper we discuss our failure to model such tradeoffs in the Blocks World task using ACT-R’s default mechanism for selecting the best production among competing productions. ACT-R’s selection mechanism, its expected value equation, has had many successes (see, for example, Anderson &amp; Lebiere, 1998) and a recognized strength of this approach is that, across a wide variety of tasks, it tends to produce models that adapt to their task environment about as fast as humans adapt. (This congruence with human behavior is in marked contrast to other popular ways of computing the utility of alternative choices; for example, Reinforcement Learning or most Connectionist learning methods.) We believe that the failure to model the Blocks World task stems from the requirement in ACT-R that all actions must be counted as a binary success or failure. In Blocks World, as well as in many other circumstances, actions can be met with mixed success or partial failure. Working within ACT-R’s expected value equation we replace the binary success/failure judgment with three variations on a scalar one. We then compare the performance of each alternative with ACT-R’s default scheme and with the human data. We conclude by discussing the limits and generality of our attempts to replace ACT-R’s binary scheme with a scalar credit assignment mechanism.</style></abstract><notes><style face="normal" font="default" size="100%">The work reported was supported by a grant from the Office of Naval Research ONR #N000140310046. Additional support was provided by a subcontract to Rensselaer Polytechnic Institute from contract #MDA-904-03-C-0408 to Booz Allen Hamilton from the Advanced Research and Development Activity (ARDA). Thanks to Wai-Tat Fu for running human subjects as well as many other contributions to this project.</style></notes><urls><pdf-urls><url><style face="normal" font="default" size="100%">internal-pdf://GSS05_CSRj-3004078336/GSS05_CSRj.pdf</style></url></pdf-urls></urls><research-notes><style face="normal" font="default" size="100%">ONR/ARDA</style></research-notes></record></records></xml>
