Paper Session
in
Workshop: Cloud Intelligence: AI/ML for Efficient and Manageable Cloud Services
Technical Paper Session
Abstract:
A Survey of Multi-Tenant Deep Learning Inference on GPU Fuxun Yu, Yongbo Yu (George Mason University); Di Wang (Microsoft); Minjia Zhang (Microsoft AI and Research); Longfei Shangguan (Microsoft); Chenchen Liu (University of Maryland, Baltimore County), Tolga Soyata (GMU); Xiang Chen (George Mason University)
CWP: A Machine Learning based Approach to Detect Unknown Cloud Workload Derssie Mebratu, Mohammad Hossain, Niranjan Hasabnis, Jun Jin, Gaurav Chaudhary, Noah Shen (Intel)
Multi-level Explanation of Deep Reinforcement Learning-based Scheduling Shaojun Zhang (USYD); Chen Wang (DATA61, CSIRO); Albert Zomaya (The University of Sydney)
Chat is not available.