Title:
Domain Adaptation in Reinforcement Learning
Domain Adaptation in Reinforcement Learning
dc.contributor.advisor | Isbell, Charles L. | |
dc.contributor.author | Sood, Srijan | |
dc.contributor.committeeMember | Essa, Irfan | |
dc.contributor.department | Computer Science | |
dc.date.accessioned | 2019-05-30T16:23:36Z | |
dc.date.available | 2019-05-30T16:23:36Z | |
dc.date.created | 2017-08 | |
dc.date.issued | 2017-08 | |
dc.date.submitted | August 2017 | |
dc.date.updated | 2019-05-30T16:23:36Z | |
dc.description.abstract | Reinforcement learning is a powerful mechanism for training artificial and real-world agents to perform tasks. Typically, one can define a task for an agent by simply specifying rewards that reflect the agent’s performance. However, each time the task changes, one must develop a new reward specification. Our work aims to remove the necessity of designing rewards in tasks consisting of visual inputs. When humans are learning to complete tasks, we often look to other sources for inspiration or instruction. Even if the representation is different from our own, we can adapt our own representation to the task representation. This motivates our own work, where we present tasks to an agent that are from an environment different than its own. We compare the cross-domain goal representation with the agents representation to form Cross-Domain Perceptual Reward (CDPR) functions and show that these enable the agent to successfully complete its task. | |
dc.description.degree | Undergraduate | |
dc.format.mimetype | application/pdf | |
dc.identifier.uri | http://hdl.handle.net/1853/61358 | |
dc.language.iso | en_US | |
dc.publisher | Georgia Institute of Technology | |
dc.subject | Reinforcement Learning | |
dc.subject | Machine Learning | |
dc.subject | Artificial Intelligence | |
dc.subject | Algorithms | |
dc.title | Domain Adaptation in Reinforcement Learning | |
dc.type | Text | |
dc.type.genre | Undergraduate Thesis | |
dspace.entity.type | Publication | |
local.contributor.advisor | Isbell, Charles L. | |
local.contributor.corporatename | College of Computing | |
local.contributor.corporatename | School of Computer Science | |
local.contributor.corporatename | Undergraduate Research Opportunities Program | |
local.relation.ispartofseries | Undergraduate Research Option Theses | |
relation.isAdvisorOfPublication | 3f357176-4c4b-402c-8b61-ec18ffb083a6 | |
relation.isOrgUnitOfPublication | c8892b3c-8db6-4b7b-a33a-1b67f7db2021 | |
relation.isOrgUnitOfPublication | 6b42174a-e0e1-40e3-a581-47bed0470a1e | |
relation.isOrgUnitOfPublication | 0db885f5-939b-4de1-807b-f2ec73714200 | |
relation.isSeriesOfPublication | e1a827bd-cf25-4b83-ba24-70848b7036ac | |
thesis.degree.level | Undergraduate |