[Next] [Previous] [Top] [Contents]

The Possible Incomensurability of Utilities and the Learning of Goals - Bruce Edmonds

2. Arguments Against the Possibility


One might seek to claim that in any particular situation for any particular agent there must be a mapping onto a single utility function, but it is entirely unclear why this should be necessarily true. This may be readily believable in some situations, but that is very far from the assumption that this must be possible in all situations.

It may also be argued that agents do choose actions in such situations. This argument can be put in two forms.

This leaves the possibility that the direction of explanation was intended to be in the reverse direction, i.e. that the observed actions were being used to explain the motivations of the agent. In this case, one is left with the question: "Why assume that the motivation can be reduced to one utility function?". There are several possible answers.

In order to make this possibility of incommensurability more real, let us consider an example. Imagine two people playing chess. Each player's goal is to win and not lose. In the early stages of the game the players have to make decisions as to choices of action without there being any possibility of being able to work out all the consequences of their actions. However, this does not mean that they make arbitrary moves - typically they will judge the possible chess positions using a number of ad hoc indicators (number of pieces, how far up the board their pawns have progressed, how many central squares they control, time they have left on the clock etc.). Since the goal of winning is too abstract to motivate any planning at this stage of the game, maximising these indicators take on the role of effective goals (ones that plans and actions can be meaningfully traced back to).

Now, is there any meaningful sense in which these indicators can be said to be commensurable? Could the actions of these players (at this stage of the game) be characterised in terms of a model of optimising a single utility function? I think the answer to both questions is "no". While it is true that a certain player in a certain position may judge there to be a certain trade-off between these various indicators, this trade-off may vary substantially for different positions, and against different players. It might even vary between different matches even if the game has reached the same position against the same player! This strongly suggests that the particular trade-off is result of the decision making process rather than representing its motivation in any meaningful way. Likewise a model of decision making in terms of a process of maximising a single utility that is so context-dependent is unlikely to have any explanatory or predictive value.


The Possible Incomensurability of Utilities and the Learning of Goals - Bruce Edmonds - 05 SEP 97
[Next] [Previous] [Top] [Contents]

Generated with CERN WebMaker