Loading [MathJax]/extensions/MathZoom.js
Self-Attentive Sequential Recommendation | IEEE Conference Publication | IEEE Xplore

Self-Attentive Sequential Recommendation


Abstract:

Sequential dynamics are a key feature of many modern recommender systems, which seek to capture the 'context' of users' activities on the basis of actions they have perfo...Show More

Abstract:

Sequential dynamics are a key feature of many modern recommender systems, which seek to capture the 'context' of users' activities on the basis of actions they have performed recently. To capture such patterns, two approaches have proliferated: Markov Chains (MCs) and Recurrent Neural Networks (RNNs). Markov Chains assume that a user's next action can be predicted on the basis of just their last (or last few) actions, while RNNs in principle allow for longer-term semantics to be uncovered. Generally speaking, MC-based methods perform best in extremely sparse datasets, where model parsimony is critical, while RNNs perform better in denser datasets where higher model complexity is affordable. The goal of our work is to balance these two goals, by proposing a self-attention based sequential model (SASRec) that allows us to capture long-term semantics (like an RNN), but, using an attention mechanism, makes its predictions based on relatively few actions (like an MC). At each time step, SASRec seeks to identify which items are 'relevant' from a user's action history, and use them to predict the next item. Extensive empirical studies show that our method outperforms various state-of-the-art sequential models (including MC/CNN/RNN-based approaches) on both sparse and dense datasets. Moreover, the model is an order of magnitude more efficient than comparable CNN/RNN-based models. Visualizations on attention weights also show how our model adaptively handles datasets with various density, and uncovers meaningful patterns in activity sequences.
Date of Conference: 17-20 November 2018
Date Added to IEEE Xplore: 30 December 2018
ISBN Information:

ISSN Information:

Conference Location: Singapore
References is not available for this document.

I. Introduction

The goal of sequential recommender systems is to combine personalized models of user behavior (based on historical activities) with some notion of ‘context’ on the basis of users' recent actions. Capturing useful patterns from sequential dynamics is challenging, primarily because the dimension of the input space grows exponentially with the number of past actions used as context. Research in sequential recommendation is therefore largely concerned with how to capture these high-order dynamics succinctly.

Select All
1.
S. Rendle, C. Freudenthaler and L. Schmidt-Thieme, "Factorizing personalized markov chains for next-basket recommendation" in WWW, 2010.
2.
B. Hidasi, A. Karatzoglou, L. Baltrunas and D. Tikk, "Session-based recommendations with recurrent neural networks" in ICLR, 2016.
3.
A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, et al., "Attention is all you need" in NIPS, 2017.
4.
Y. Hu, Y. Koren and C. Volinsky, "Collaborative filtering for implicit feedback datasets" in ICDM, 2008.
5.
S. Rendle, C. Freudenthaler, Z. Gantner and L. Schmidt-Thieme, "BPR: bayesian personalized ranking from implicit feedback" in UAI, 2009.
6.
F. Ricci, L. Rokach, B. Shapira and P. Kantor, Recommender systems handbook, US:Springer, 2011.
7.
Y. Koren and R. Bell, "Advances in collaborative filtering" in Recommender Systems Handbook, Springer, 2011.
8.
S. Kabbur, X. Ning and G. Karypis, "Fism: factored item similarity models for top-n recommender systems" in SIGKDD, 2013.
9.
S. Zhang, L. Yao and A. Sun, Deep learning based recommender system: A survey and new perspectives, vol. abs/1707. 07435, 2017.
10.
S. Wang, Y. Wang, J. Tang, K. Shu, S. Ranganath and H. Liu, "What your images reveal: Exploiting visual contents for point-of-interest recommendation" in WWW, 2017.
11.
W. Kang, C. Fang, Z. Wang and J. McAuley, "Visually-aware fashion recommendation and design with generative image models" in ICDM, 2017.
12.
H. Wang, N. Wang and D. Yeung, "Collaborative deep learning for recommender systems" in SIGKDD, 2015.
13.
D. H. Kim, C. Park, J. Oh, S. Lee and H. Yu, "Convolutional matrix factorization for document context-aware recommendation" in RecSys, 2016.
14.
X. He, L. Liao, H. Zhang, L. Nie, X. Hu and T. Chua, "Neural collaborative filtering" in WWW, 2017.
15.
S. Sedhain, A. K. Menon, S. Sanner and L. Xie, "Autorec: Autoencoders meet collaborative filtering" in WWW, 2015.
16.
Y. Koren, "Collaborative filtering with temporal dynamics" in Communications of the ACM, 2010.
17.
C. Wu, A. Ahmed, A. Beutel, A. J. Smola and H. Jing, "Recurrent recommender networks" in WSDM, 2017.
18.
L. Xiong, X. Chen, T.-K. Huang, J. Schneider and J. G. Carbonell, "Temporal collaborative filtering with bayesian probabilistic tensor factorization" in SDM, 2010.
19.
R. He, W. Kang and J. McAuley, "Translation-based recommendation" in RecSys, 2017.
20.
R. He, C. Fang, Z. Wang and J. McAuley, "Vista: A visually socially and temporally-aware model for artistic recommendation" in RecSys, 2016.
21.
R. He and J. McAuley, "Fusing similarity models with markov chains for sparse sequential recommendation" in ICDM, 2016.
22.
J. Tang and K. Wang, "Personalized top-n sequential recommendation via convolutional sequence embedding" in WSDM, 2018.
23.
H. Jing and A. J. Smola, "Neural survival recommender" in WSDM, 2017.
24.
Q. Liu, S. Wu, D. Wang, Z. Li and L. Wang, "Context-aware sequential recommendation" in ICDM, 2016.
25.
A. Beutel, P. Covington, S. Jain, C. Xu, J. Li, V. Gatto, et al., "Latent cross: Making use of context in recurrent recommender systems" in WSDM, 2018.
26.
B. Hidasi and A. Karatzoglou, "Recurrent neural networks with top-k gains for session-based recommendations", CoRR, vol. abs/1706. 03847, 2017.
27.
K. Xu, J. Ba, R. Kiros, K. Cho, A. C. Courville, R. Salakhutdinov, et al., "Show attend and tell: Neural image caption generation with visual attention" in ICML, 2015.
28.
D. Bahdanau, K. Cho and Y. Bengio, "Neural machine translation by jointly learning to align and translate" in ICLR, 2015.
29.
J. Chen, H. Zhang, X. He, L. Nie, W. Liu and T. Chua, "Attentive collaborative filtering: Multimedia recommendation with item-and component-level attention" in SIGIR, 2017.
30.
J. Xiao, H. Ye, X. He, H. Zhang, F. Wu and T. Chua, "Attentional factorization machines: Learning the weight of feature interactions via attention networks" in IlCAI, 2017.
Contact IEEE to Subscribe

References

References is not available for this document.