[Next] [Previous] [Up] [Top] [Contents]
3.1 The Set-up
3.1.1 The environment
Each agent gets the most utility for going when less than 7 of the other agents go (0.7), they get a fixed utility (0.5) if they do not go and the lowest utility for going when it is crowded (0.4). In this way there is no fixed reward for any particular action because the utility gained from going depends on whether too many other agents also go. In this way there is no fixed goal for the agent's learning, but it is relative to the other agent's behaviour. Thus it is in each agent's interest to discoordinate their action with a majority of the others. It is impossible for all agents to gain the maximum utility, there is always some conflict to provide a potential for continual dynamics.
Social Embeddedness and Agent Development - Bruce Edmonds - 30 OCT 98
[Next] [Previous] [Up] [Top] [Contents]
Generated with CERN WebMaker