no code implementations • 28 Jul 2020 • Caojin Zhang, Yicun Liu, Yuanpu Xie, Sofia Ira Ktena, Alykhan Tejani, Akshay Gupta, Pranay Kumar Myana, Deepak Dilipkumar, Suvadip Paul, Ikuhiro Ihara, Prasang Upadhyaya, Ferenc Huszar, Wenzhe Shi
The large model size usually entails a cost, in the range of millions of dollars, for storage and communication with the inference services.
no code implementations • 15 Jul 2019 • Sofia Ira Ktena, Alykhan Tejani, Lucas Theis, Pranay Kumar Myana, Deepak Dilipkumar, Ferenc Huszar, Steven Yoo, Wenzhe Shi
The focus of this paper is to identify the best combination of loss functions and models that enable large-scale learning from a continuous stream of data in the presence of delayed labels.
no code implementations • 9 Jan 2018 • Igor Gitman, Deepak Dilipkumar, Ben Parr
The basic idea of both of these algorithms is to make each step of the gradient descent proportional to the current weight norm and independent of the gradient magnitude.
no code implementations • 8 Dec 2017 • Ben Parr, Deepak Dilipkumar, Yu-An Liu
Nintendo's Super Smash Bros. Melee fighting game can be emulated on modern hardware allowing us to inspect internal memory states, such as character positions.