The objective of this work is to determine whether, and how, learning agents can benefit from exchanging information during learning in problems where each team uses a different learning algorithm. Recent studies have exposed several problems, such as lack of coordination, exchange of useless information, and difficulty in choosing adequate advisors. In this work we propose new solutions and test them in two different domains (predator-prey and traffic control). Our solutions involve hybrid algorithms derived from Q-Learning and Evolutionary Algorithms. Results indicate that some combinations of learning algorithms are better suited to the use of external information than others, and that the difference between the results achieved with and without communication is problem dependent. The results also show that, in situations where communication is useful, the gain in quality and learning time can be significant if the right combination of techniques is used to process external information.
Intelligent Decision Technologies – IOS Press
Published: Jan 1, 2008