Skip to yearly menu bar Skip to main content


Paper Session
in
Workshop: Cloud Intelligence: AI/ML for Efficient and Manageable Cloud Services

Technical Paper Session


Abstract:

A Survey of Multi-Tenant Deep Learning Inference on GPU Fuxun Yu, Yongbo Yu (George Mason University); Di Wang (Microsoft); Minjia Zhang (Microsoft AI and Research); Longfei Shangguan (Microsoft); Chenchen Liu (University of Maryland, Baltimore County), Tolga Soyata (GMU); Xiang Chen (George Mason University)

CWP: A Machine Learning based Approach to Detect Unknown Cloud Workload Derssie Mebratu, Mohammad Hossain, Niranjan Hasabnis, Jun Jin, Gaurav Chaudhary, Noah Shen (Intel)

Multi-level Explanation of Deep Reinforcement Learning-based Scheduling Shaojun Zhang (USYD); Chen Wang (DATA61, CSIRO); Albert Zomaya (The University of Sydney)

Chat is not available.