Personalized Recommendation Systems and Algorithms

Workshop

Personalized Recommendation Systems and Algorithms

Udit Gupta · Carole-Jean Wu · Gu-Yeon Wei · David Brooks

Fri 9 Apr, 6:15 a.m. PDT

[ Abstract ] Workshop Website

Personalized recommendation is the task of recommendation content to users based on their preferences and history. Providing personalized content is crucial for many emerging applications including health care, fitness, education, food, and entertainment. Today, accurate and efficient recommendation of items power many Internet services such as online search, marketing, e-commerce, and video streaming. In fact, recent estimates show that recommendation systems drive many Internet businesses. In 2018, estimates show that recommendation systems drove up-to 35% of Amazon’s revenue, 75% of movies watched on Netflix, and 60% of videos on Youtube. In addition, the fraction of cycles devoted to serving personalized recommendation models in Facebook’s datacenter -- recommendation accounts for 80% of all AI inference cycles.

While the machine learning and systems research community has devoted significant effort to optimize AI and in particular deep neural networks, the majority of work studies AI-enabled perception, speech recognition, and natural language processing. As a result, efforts across machine learning and systems researchers have primarily focused on convolutional neural networks (CNNs) and recurrent neural networks (RNNs). However, not all services use CNNs and RNNs. In fact, as deep learning forms the backbone of many Internet services, AI for personalized recommendation is arguably one of the most impactful, widely used, and understudied applications of DNNs.

In addition to their importance, modern deep learning solutions for personalized recommendation impose unique compute, memory access, and storage requirements compared to CNNs and RNNs. However, in 2019, less than 2% of research papers were devoted to optimizing systems for recommendation engines.

To address this underinvestment from the research community, we propose a venue to discuss, share, and foster research into personalized recommendation systems and algorithms.

Chat is not available.

Timezone: America/Los_Angeles

Schedule

Fri 6:15 a.m. - 6:30 a.m.	Welcome to the 3rd PeRSonAl workshop ( Introduction ) >	Udit Gupta · Carole-Jean Wu 🔗
Fri 6:30 a.m. - 7:00 a.m.	Explainable ML for Recommender Systems: Challenges and Opportunities ( Invited Talk 1 ) >	Himabindu Lakkaraju 🔗
Fri 7:00 a.m. - 7:30 a.m.	A Memory-centric Approach in Designing System Architectures for Personalized Recommendations ( Invited Talk 2 ) >	Minsoo Rhu 🔗
Fri 7:30 a.m. - 7:45 a.m.	MERCI: Efficient Embedding Reduction on Commodity Hardware via Sub-Query Memoization ( Contributed Talk 1 ) >	Yejin Lee 🔗
Fri 7:45 a.m. - 8:00 a.m.	Erasure Coding Based Fault Tolerance for Recommendation Model Training ( Contributed Talk 2 ) >	Kaige Liu 🔗
Fri 8:00 a.m. - 8:15 a.m.	Elliot: A Comprehensive and Rigorous Framework For Reproducible Recommender Systems Evaluation ( Contributed Talk 3 ) >	Vito W Anelli · Claudio Pomo 🔗
Fri 8:15 a.m. - 8:30 a.m.	Optimizing Deep Learning Recommender Systems Training on CPU Cluster Architectures ( Contributed Talk 4 ) >	Dhiraj Kalamkar 🔗
Fri 8:30 a.m. - 8:45 a.m.	Main-Memory Acceleration for Bandwidth-Bound Deep Learning Inference ( Contributed Talk 5 ) >	Benjamin Cho · Mattan Erez 🔗
Fri 8:45 a.m. - 9:00 a.m.	DeepRecSys: A System for Optimizing End-To-End At-scale Neural Recommendation Inference ( Contributed Talk 6 ) > link Link	Udit Gupta 🔗
Fri 10:00 a.m. - 11:00 a.m.	From Recommender Systems to Natural Language Processing and Back Again ( Keynote ) >	Julian McAuley 🔗
Fri 11:00 a.m. - 11:30 a.m.	Revisiting Recommender Systems on the GPU ( Invited Talk 3 ) >	Even Oldridge 🔗
Fri 12:00 p.m. - 12:30 p.m.	Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale ( Invited Talk 4 ) >	Summer Deng 🔗
Fri 12:30 p.m. - 1:00 p.m.	Pushing the Limits of Recommender Training Speed: An MLPerf Experience ( Invited Talk 5 ) >	Tayo Oguntebi 🔗
Fri 1:00 p.m. - 1:15 p.m.	Cross-Stack Workload Characterization of Deep Recommendation Systems ( Contributed Talk 7 ) >	Samuel Hsia 🔗
Fri 1:15 p.m. - 1:30 p.m.	Accelerated Learning by Exploiting Popular Choices ( Contributed Talk 8 ) >	Muhammad Adnan 🔗
Fri 1:30 p.m. - 1:45 p.m.	Towards Disaggregated Memory Recommenders ( Contributed Talk 9 ) >	Talha Imran 🔗
Fri 1:45 p.m. - 2:00 p.m.	Scalability, Latency, Flexibility: The Case for Similarity Search as a Service ( Contributed Talk 10 ) >	Amir Sadoughi 🔗
Fri 2:00 p.m. - 2:15 p.m.	Capacity-Driven Scale-Out Neural Recommendation: Enabling the Growing Scale of Recommendation ( Contributed Talk 11 ) >	Michael Lui 🔗
Fri 2:15 p.m. - 2:30 p.m.	Training with Multi-Layer Embeddings for Model Reduction ( Contributed Talk 12 ) >	Benjamin Ghaemmaghami · Zihao Deng 🔗
Fri 2:30 p.m. - 2:45 p.m.	Towards Automated Neural Interaction Discovery for Click-Through Rate Prediction ( Contributed Talk 13 ) > link Link	Qingquan Song 🔗
Fri 2:45 p.m. - 3:00 p.m.	Closing session ( Closing ) >	Udit Gupta · Carole-Jean Wu 🔗