no code implementations • 31 Dec 2022 • Xiaofa Liu, Jianqin Yin, Yuan Sun, Zhicheng Zhang, Jin Tang
Unlike most existing methods with offline feature generation, our method directly takes frames as input and further models motion evolution on two different temporal scales. Therefore, we solve the complexity problems of the two stages of modeling and the problem of insufficient temporal and spatial information of a single scale.