no code implementations • 20 Aug 2023 • Bowei Xu, Hao Chen, Zhan Ma
Unlike direct observation-to-action mapping, Karma recurrently maintains a multi-dimensional time series of observations, returns, and actions as input and employs causal sequence modeling via a decision transformer to determine the next action.