Publications
Heavy-Tailed Universality Predicts Trends in Test Accuracies for Very Large Pre-Trained Deep Neural Networks.
Proceedings of 2020 SDM Conference.
(2020). Inefficiency of K-FAC for Large Batch Size Training.
Proceedings of the AAAI-20 Conference.
(2020). Q-BERT: Hessian Based Ultra Low Precision Quantization of BERT.
Proceedings of the AAAI-20 Conference.
(2020). ANODEV2: A Coupled Neural ODE Evolution Framework.
Proceedings of the 2019 NeurIPS Conference.
(2019). Distributed estimation of the inverse Hessian by determinantal averaging.
Proceedings of the 2019 NeurIPS Conference.
(2019). GPU Accelerated Sub-Sampled Newton's Method.
Proceedings of the 2019 SDM Conference. 702-710.
(2019). HAWQ: Hessian AWare Quantization of Neural Networks with Mixed-Precision.
Proceedings of ICCV 2019.
(2019). Minimax experimental design: Bridging the gap between statistical and worst-case approaches to least squares regression.
Proceedings of 2019 COLT.
(2019). Statistical Mechanics Methods for Discovering Knowledge from Modern Production Quality Neural Networks.
Proceedings of the 25th Annual SIGKDD. 3239-3240.
(2019). Sub-Sampled Newton Methods.
Mathematical Programming. 293-326.
(2019). Traditional and Heavy-Tailed Self Regularization in Neural Network Models.
Proceeding of the 36th ICML Conference. 4284-4293.
(2019). Trust Region Based Adversarial Attack on Neural Networks.
Proceedings of the 32nd CVPR Conference. 11350-11359.
(2019). Accelerating Large-Scale Data Analysis by Offloading to High-Performance Computing Libraries using Alchemist.
Proceedings of the 24th Annual SIGKDD. 293-301.
(2018). Alchemist: An Apache Spark <=> MPI Interface.
Concurrency and Computation: Practice and Experience (Special Issue of the Cray User Group, CUG 2018), e5026.
(2018). Error Estimation for Randomized Least-Squares Algorithms via the Bootstrap.
Proceedings of the 35th ICML Conference. 3223-3232.
(2018). Group Collaborative Representation for Image Set Classification.
International Journal of Computer Vision. 1-26.
(2018). Hessian-based Analysis of Large Batch Training and Robustness to Adversaries.
Proceedings of the 2018 NeurIPS Conference. 4954-4964.
(2018).
(2018). A Short Introduction to Local Graph Clustering Methods and Software.
Abstracts of the 7th International Conference on Complex Networks and Their Applications.
(2018). DCAR: A Discriminative and Compact Audio Representation for Audio Processing.
IEEE Transactions on Multimedia. PP(99),
(2017).
(2016).
(2016).
A discriminative and compact audio representation for event detection.
Proceedings of the 2016 ACM Conference on Multimedia (MM '16). 57-61.
(2016).
(2016). Feature-distributed sparse regression: a screen-and-clean approach.
Proceedings of the 2016 NIPS Conference.
(2016).