CHI 97 Electronic Publications: Late-Breaking/Short Talks

CHI 97 Electronic Publications: Late-Breaking/Short Talks

LICAI+: A Comprehension-Based Model of Learning for Display-Based Human-Computer Interaction

Muneo Kitajima
National Institute of Bioscience and Human-Technology
1-1 Higashi Tsukuba Ibaraki 305, JAPAN
Tel: +81 (298) 54-6730
E-mail: kitajima@nibh.go.jp

Peter G. Polson
Institute of Cognitive Science
University of Colorado
Boulder, CO 80309-0345, USA
Tel: +1 (303) 492-5622
E-mail: ppolson@psych.colorado.edu

ABSTRACT

This paper describes a model of comprehension-based learning, LICAI+, an extension to the comprehension-based model of display-based HCI, LICAI [5], that simulates a user who performs tasks given as instructions. LICAI+ models users' learning of task performance by incorporating a process for encoding events during the task performance. A simulation of encoding and recalling events is described.

Keywords

cognitive model; learning; display-based human-computer interaction; construction-integration theory.

ABSTRACT

Keywords

INTRODUCTION
THE LICAI+ MODEL

The Construction-Integration Architecture
Comprehending Hints and Exploration
Encoding Results of Performing Actions

SIMULATION: ENCODING AND RECALLING

Recalling Memory of Events

REFERENCES

INTRODUCTION

This paper describes a model of how experienced users of an environment (e.g., Mac OS or Windows 95) learn to use a novel graphing application like Cricket Graph III. We assume that the user has an explicit task goal (e.g., Hide Legend), and that she generates the correct action sequence by being given hints of the form "Pull-down the Options menu" or successfully discovering steps by exploration. After successfully performing the task for the first time, the user is asked to perform the task again after a delay. The task goal and the displays generated by the application interface are the only retrieval cues given to the user.

In a previous paper [5], we described LICAI, LInked model of Comprehension-based Action planing and Instruction taking. It simulates the cognitive processes involved in comprehending and following hints and successfully performing steps by exploration. This paper extends LICAI by incorporating learning processes. We want to understand the relationship between users' recall performance and their representation of the task and the application displays. Franzke [1] found that both the probabilities of successful exploration and later recall were determined by overlap of users task goal descriptions with the labels of correct menu choices.

THE LICAI+ MODEL

LICAI+ incorporates learning mechanisms derived from the memory for text [3]. We start by describing LICAI for comprehending hints and successful exploration (See [4] and [5] for details). Then, we describe how LICAI+ encodes successful actions, and later retrieves them using a task goal and application displays as retrieval cues.

The Construction-Integration Architecture

LICAI is comprehension-based model of instruction following and exploration. The cognitive processes specified in LICAI are implemented using the construction-integration (C-I) architecture developed by Kintsch [2] which has been applied successfully to model cognitive processes involved in text comprehension [2], word problem solving [2], and action planning [4].

In the construction phase, a C-I cycle generates a connectionist network that represents alternative meanings of a sentence or alternative actions that can be performed on a given step in a computer-based task, and the knowledge necessary to select among the alternatives. The integration phase uses spreading activation to implement a constraint satisfaction process that selects a contextually appropriate alternative consistent with the users' goals. The nodes in the network are propositions. Links in the network are established by common arguments of propositions; when two nodes share a common argument, they are connected. The constraint satisfaction process is controlled by the pattern of interconnections.

Comprehending Hints and Exploration

Kitajima and Polson [5] describe in detail the processes in LICAI that simulate comprehension of hints like "Pull-down the Options menu." This hint specifies actions to be performed on an object on the screen. LICAI transforms the propositional representation of the hint into a representation that controls the action planning process, a do-it goal.

However, even without hints, the action planning process can discover correct actions by exploration. A task goal like Hide Legend does not contain any information about the correct action sequence. However, the action planning process can discover the correct action sequence if labels of screen objects like a correct menu choice overlap with the task goal. If the interface to a graphing program had a Hide menu and Legend was a menu item, the action planning process would be able to discover the correct actions utilizing knowledge about the interface stored in long-term memory. For example, the model knows that Hide is a menu label, that menus can be pointed at, that press-and-hold is a legal action, and that press-and-hold will pull-down the menu.

Encoding Results of Performing Actions

LICAI+ defines an encoding process that operates after LICAI's action planning process. This encoding process comprehends the results of the action just executed by the model, generating a memory trace as by-product of the C-I cycle that comprehends the result of the last action.

When an action causes a significant display change (i.e., appearance of a pull-down menu, of an application window, etc.), a specialized comprehension schema is used for constructing a propositional representation of the event. The comprehension schema generates a proposition including the following arguments: 1) the current task-goal, 2) the current do-it goal, 3) the label of the acted-on object, and 4) its current state. This proposition is incorporated into the network during the construction phase. During the integration phase, the amount of activation received by this proposition is determined by the pattern of links in the network which is determined by overlapping labels. The pattern of activation then determines the strength of the memory trace of this event.

SIMULATION: ENCODING AND RECALLING

We modeled learning of the steps for the task Hide Legend using Cricket Graph III. We assume that a user is an experienced user of the Macintosh OS who has had no experience with this application. Rodriguez and Polson [6] have shown such users cannot perform the first two steps of the task (Pull-down the Options Menu, Select Show Graph Items...) without hints. We gave the simulation hints that enabled it to perform these two steps successfully. LICAI+ comprehended the hints and converted them to do-it goals, followed by execution of the first two steps. Display changes triggered the encoding processes. Figure 1 shows the elements in the network that participated in the encoding process.

Releasing on "Show Graph Items..." caused a dialog box to appear for showing or hiding elements of the graph like the legend. Each element had a check box next to an associated label (e.g., Legend). The graph item described by a label was hidden by clearing the associated check box and clicking on the button labeled "OK." These two steps can be successfully performed by the model and subjects by exploration. We simulated doing these steps by exploration where the original task goal Hide Legend controlled the process. The resulting encoding of the outcome of an action has no do-it goal (see Figure 1).

Step No. Task Goal Do-It Goal Label of Acted-on Object State of Acted-on Object
1 Hide Legend select 'Options' 'Options' Highlighted
2 Hide Legend select 'Show Graph Items...' 'Show Graph Items...' not visible
3 Hide Legend none 'Legend' Legend Check Box Cleared
4 Hide Legend none 'OK' not visible
Figure 1. The elements in the network used for encoding events during Hide Legend task using Cricket Graph III.

Step No.	Task Goal	Do-It Goal	Label of Acted-on Object	State of Acted-on Object
1	Hide Legend	select 'Options'	'Options'	Highlighted
2	Hide Legend	select 'Show Graph Items...'	'Show Graph Items...'	not visible
3	Hide Legend	none	'Legend'	Legend Check Box Cleared
4	Hide Legend	none	'OK'	not visible

Recalling Memory of Events

Retention of the four steps performed during the learning session was tested by using the task goal, Hide Legend, and the initial display as retrieval cues. Figure 2 shows the activation values of the encoded events in the recall session. The most highly activated event representation is the representation of Step 3, clearing the legend check box.

Actions Involved in Events	Activation Values
Pulling-Down "Options"	0.0275
Releasing "Show Graph Items..."	0.0330
Clearing Legend Check-Box	0.1631
Clicking "OK"	0.0329

Figure 2. Retrieval of events from memory cued by the task description, "Hide Legend" and the initial display.

The memory retrieval process, like the action planning process is dominated by the overlap between the goal and labels of objects on the display. Observe that the model retrieved this step even though the object to be acted on, the check box, was not yet on the display. Additional cognitive processes would be required for selecting the encoding of the first step of the task, pulling down the Options menu. This suggests that performance of the actions that were hinted in the learning session can be still difficult to recall consistent with Franzke [1].

To conclude, the encoding process in LICAI+ generates weak memory traces for the actions on objects that do not have labels that overlap with the task goal. This result suggests that subjects will have difficulty recalling the steps from memory in a task that required hints (i.e., cannot be successfully performed by exploration.)

REFERENCES

Franzke, M. Turning research into practice: Characteristics of display-based interaction. In Proceedings of CHI '95 (Denver CO, May 1995), ACM Press, 421-428.
Kintsch, W. The role of knowledge in discourse comprehension: A construction-integration model. Psychological Review, 95, (1988), 163-182.
Kintsch, W., and Welsch, D. M. The construction-integartion model: A framework for studying memory for text. In W. E. Hockley and S. Lewandowsky (Eds.), Relating theory and data: Essays on human memory (pp. 367-385). Erlbaum, Hillsdale NJ, 1991.
Kitajima, M., and Polson, P. G. A comprehension-based model of correct performance and errors in skilled, display-based human-computer interaction. International Journal of Human-Computer Systems, 43, (1995), 65-99.
Kitajima, M., and Polson, P. G. A comprehension-based model of exploration. Human-Computer Interaction. (in press).
Rodriguez, M., and Polson, P. G. Expert exploration. 1996. unpublished.

CHI 97 Electronic Publications: Late-Breaking/Short Talks