(51 events)
Timezone: »
Toggle Poster Visibility
Poster
QuClassi: A Hybrid Deep Neural Network Architecture based on Quantum State Fidelity
Poster
FROTE: Feedback Rule-Driven Oversampling for Editing Models
Poster
SLA-Driven ML INFERENCE FRAMEWORK FOR CLOUDS WITH HETEROGENEOUS ACCELERATORS
Poster
Revelio: ML-Generated Debugging Queries for Finding Root Causes in Distributed Systems
Poster
Gyro Dropout: Maximizing Ensemble Effect in Neural Network Training
Poster
Towards the Co-design of Neural Networks and Accelerators
Poster
SRIFTY: Swift and Thrifty Distributed Neural Network Training on the Cloud
Poster
Pathways: Asynchronous Distributed Dataflow for ML
Poster
Sequential Aggregation and Rematerialization: Distributed Full-batch Training of Graph Neural Networks on Large Graphs
Poster
VirtualFlow: Decoupling Deep Learning Models from the Underlying Hardware
Poster
TyXe: Pyro-based Bayesian neural nets for Pytorch
Poster
torch.fx: Practical Program Capture and Transformation for Deep Learning in Python
Poster
GPU Semiring Primitives for Sparse Neighborhood Methods
Poster
Randomness in Neural Network Training: Characterizing the Impact of Tooling
Poster
NURD: Negative-Unlabeled Learning for Online Datacenter Straggler Prediction
Poster
ULPPACK: Fast Sub-8-bit Matrix Multiply on Commodity SIMD Hardware
Poster
A Tale of Two Models: Constructing Evasive Attacks on Edge Models
Poster
LightSecAgg: a Lightweight and Versatile Design for Secure Aggregation in Federated Learning
Poster
The CoRa Tensor Compiler: Compilation for Ragged Tensors with Minimal Padding
Poster
Hydrozoa: Dynamic Hybrid-Parallel DNN Training on Serverless Containers
Poster
Bit-serial Weight Pools: Compression and Arbitrary Precision Execution of Neural Networks on Resource Constrained Processors
Poster
URSABench: A System for Comprehensive Benchmarking of Bayesian Deep Neural Network Models and Inference methods
Poster
QuadraLib: A Performant Quadratic Neural Network Library for Architecture Optimization and Design Exploration
Poster
BNS-GCN: Efficient Full-Graph Training of Graph Convolutional Networks with Partition-Parallelism and Random Boundary Node Sampling
Poster
Collapsible Linear Blocks for Super-Efficient Super Resolution
Poster
Learning Compressed Embeddings for On-Device Inference
Poster
Improving Model Training with Multi-fidelity Hyperparameter Evaluation
Poster
REX: Revisiting Budgeted Training with an Improved Schedule
Poster
Synthesizing Optimal Parallelism Placement and Reduction Strategies on Hierarchical Systems for Deep Learning
Poster
HALOS: Hashing Large Output Space for Cheap Inference
Poster
Random Offset Block Embedding (ROBE) for compressed embedding tables in deep learning recommendation systems
Poster
Apollo: Automatic Partition-based Operator Fusion through Layer by Layer Optimization
Poster
Matchmaker: Data Drift Mitigation in Machine Learning for Large-Scale Systems
Poster
dPRO: A Generic Performance Diagnosis and Optimization Toolkit for Expediting Distributed DNN Training
Poster
Accelerating Training and Inference of Graph Neural Networks with Fast Sampling and Pipelining
Poster
Sustainable AI: Environmental Implications, Challenges and Opportunities
Poster
ML-EXray: Visibility into ML Deployment on the Edge
Poster
MLPerf Mobile Inference Benchmark: An Industry-Standard Open-Source Machine Learning Benchmark for On-Device AI
Poster
On the Utility of Gradient Compression in Distributed Training Systems
Poster
TAGLETS: A System for Automatic Semi-Supervised Learning with Auxiliary Data
Poster
Understanding GNN Computational Graph: A Coordinated Computation, IO, and Memory Perspective
Poster
A Transferable Approach for Partitioning Machine Learning Models on Multi-Chip-Modules
Poster
mmSampler: Efficient Frame Sampler for Multimodal Video Retrieval
Poster
Graphiler: Optimizing Graph Neural Networks with Message Passing Data Flow Graph
Poster
TorchSparse: Efficient Point Cloud Inference Engine
Poster
Bolt: Bridging the Gap between Auto-tuners and Hardware-native Performance
Poster
PAPAYA: Practical, Private, and Scalable Federated Learning
Poster
Efficient Strong Scaling Through Burst Parallel Training
Poster
DietCode: Automatic Optimization for Dynamic Tensor Programs
Poster
Plumber: Diagnosing and Removing Performance Bottlenecks in Machine Learning Data Pipelines
Poster
AccMPEG: Optimizing Video Encoding for Accurate Video Analytics