no code implementations • 11 Jul 2022 • Liwei Guo, Wonkyo Choe, Felix Xiaozhu Lin
Yet, the unprecedented size of an NLP model stresses both latency and memory, creating a tension between the two key resources of a mobile device.
Management