English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Paper

No DNN Left Behind: Improving Inference in the Cloud with Multi-Tenancy

MPS-Authors

Samanta,  Amit
Group J. Mace, Max Planck Institute for Software Systems, Max Planck Society;

Shrinivasan,  Suhas
Group J. Mace, Max Planck Institute for Software Systems, Max Planck Society;

/persons/resource/persons231523

Kaufmann,  Antoine
Group P. Druschel, Max Planck Institute for Software Systems, Max Planck Society;

/persons/resource/persons231493

Mace,  Jonathan
Group J. Mace, Max Planck Institute for Software Systems, Max Planck Society;

External Resource
No external resources are shared
Fulltext (restricted access)
There are currently no full texts shared for your IP range.
Fulltext (public)

arXiv:1901.06887.pdf
(Preprint), 421KB

Supplementary Material (public)
There is no public supplementary material available
Citation

Samanta, A., Shrinivasan, S., Kaufmann, A., & Mace, J. (2019). No DNN Left Behind: Improving Inference in the Cloud with Multi-Tenancy. doi:10.48550/arXiv.1901.06887.


Cite as: https://hdl.handle.net/21.11116/0000-000A-7835-4
Abstract
With the rise of machine learning, inference on deep neural networks (DNNs)
has become a core building block on the critical path for many cloud
applications. Applications today rely on isolated ad-hoc deployments that force
users to compromise on consistent latency, elasticity, or cost-efficiency,
depending on workload characteristics. We propose to elevate DNN inference to
be a first class cloud primitive provided by a shared multi-tenant system, akin
to cloud storage, and cloud databases. A shared system enables cost-efficient
operation with consistent performance across the full spectrum of workloads. We
argue that DNN inference is an ideal candidate for a multi-tenant system
because of its narrow and well-defined interface and predictable resource
requirements.