Print Email Facebook Twitter Learning the Problem Representation for Improving Negotiation Strategies Title Learning the Problem Representation for Improving Negotiation Strategies Author Fledderus, Eddy (TU Delft Electrical Engineering, Mathematics and Computer Science) Contributor Murukannaiah, P.K. (mentor) Renting, B.M. (mentor) Zhang, X. (graduation committee) Degree granting institution Delft University of Technology Programme Computer Science and Engineering Project CSE3000 Research Project Date 2022-06-23 Abstract The domains of the negotiation can vary significantly. It is possible that a domain is very cooperative, where both agents can receive a high utility; the opposite is also possible, where the domain is very competitive and the agents cannot both get a high utility. In the same manner, the agents can have different strategies leading to a complicated problem with no obvious solution.This research seeks to represent the differences in negotiation domains to improve a machine learning based agent to help the agent generalize these domains. To achieve this several ways of representing the domain have been explored.First is the shared domain information. With this representation, the agent uses information about the amount of issues, values and possible bids there are. Second is the private domain information, in this representation, the agent uses different calculations to get a view of how favorable the domain is in terms of utility. Last is the derived information, this is the representation where the agent learns about the domain by interaction with the environment or the opposing agent.From the experiments, a conclusion could be made that a part of these representations had a positive impact on the final utility of the agent. The shared domain information had a considerable improvement over the base agent with the features having a non-negligible impact on the negotiation. The derived information also had a considerable impact on the final outcome. Subject Machine learningnegotiating agentsReinforcement Learning (RL) To reference this document use: http://resolver.tudelft.nl/uuid:5856ca4f-74f3-40c7-b187-57632e0f4824 Bibliographical note https://github.com/brenting/negotiation_PPO Implementation discussed in paper can be found here Part of collection Student theses Document type bachelor thesis Rights © 2022 Eddy Fledderus Files PDF RP_Report_2_.pdf 361.95 KB TXT Base.txt 9.23 KB Close viewer /islandora/object/uuid:5856ca4f-74f3-40c7-b187-57632e0f4824/datastream/OBJ1/view