no code implementations • 7 May 2024 • Guanqiao Qu, Zheng Lin, Fangming Liu, Xianhao Chen, Kaibin Huang
To this end, we formulate a parameter-sharing model placement problem to maximize the cache hit ratio in multi-edge wireless networks by balancing the fundamental tradeoff between storage efficiency and service latency.
1 code implementation • 16 Dec 2023 • Aodong Chen, Fei Xu, Li Han, Yuan Dong, Li Chen, Zhi Zhou, Fangming Liu
GPUs have become the \emph{defacto} hardware devices for accelerating Deep Neural Network (DNN) inference workloads.
no code implementations • 6 Jul 2021 • Zimu Zheng, Qiong Chen, Chuang Hu, Dan Wang, Fangming Liu
We then show that task allocation with task importance for MTL (TATIM) is a variant of the NP-complete Knapsack problem, where the complicated computation to solve this problem needs to be conducted repeatedly under varying contexts.