About Tenrec
Tenrec is a large-scale benchmark dataset for recommeder systems. It was collected from two different feeds recommendation apps of Tencent with four scenarios. Tenrec has five characteristics: (1) it is large-scale, containing around 5 million users and 140 million interactions; (2) it has not only positive user feedback, but also true negative feedback (vs. one-class recommendation); (3) it contains overlapped users and items across four different scenarios; (4) it contains various types of user positive feedback, in forms of clicks, likes, shares, and follows, etc; (5) it contains additional features beyond the user IDs and item IDs.
If you use this dataset for your research, please cite this paper: @article{yuan2022tenrec, title={Tenrec: A Large-scale Multipurpose Benchmark Dataset for Recommender Systems}, author={Yuan, Guanghu and Yuan, Fajie and Li, Yudong and Kong, Beibei and Li, Shujie and Chen, Lei and Yang, Min and Yu, Chenyun and Hu, Bo and Li, Zang and others}, journal={arXiv preprint arXiv:2210.10629}, year={2022} }
*Email Fajie & Guanghu if you want to launch a new leaderboard for an important RS task using Tenrec.
Guanghu Yuan, Fajie Yuan, Yudong Li, Beibei Kong, Shujie Li, Lei Chen, Min Yang, Chenyun Yu, Bo Hu, Zang Li, Yu Xu, Xiaohu Qie. Tenrec: A Large-Scale Multipurpose Benchmark Dataset for Recommender Systems NeurIPS 2022.
Download
The Tenrec dataset can be used for research purposes under Tenrec Dataset link. Before you download the dataset, please read these terms. And Code link.
Note
If you use Tenrec (with our training, validation and testing set) and have new SOTA results, we are happy to update them on the leaderboard. In this case, you should provide (1) your algorithm code; (2) all your hyper-parameters; (3) a readme file tells other researchers how to run your code. We will append them on the leaderboard website, and make sure your models are evaluated with a fair comparison or common practice. We are also happy to create new leaderboard if you use Tenrec to perform new tasks, just email us.
Leaderboard
CTR-1M(Separate Embedding)
Rank | Model | AUC |
---|---|---|
1 Aug. 07, 2017 |
NFM | 0.7957 |
2 Jun. 03, 2021 |
DCN-v2 | 0.7932 |
3 Jul. 19, 2018 |
xDeepFM | 0.7931 |
4 Mar. 13, 2017 |
DeepFM | 0.7930 |
5 Aug. 15, 2017 |
AFM | 0.7928 |
6 Aug. 14, 2017 |
DCN | 0.7927 |
7 Sept. 15, 2016 |
Wide & Deep | 0.7919 |
CTR-1M(Shared Embedding)
Rank | Model | AUC |
---|---|---|
1 Aug. 07, 2017 |
NFM | 0.7924 |
2 Jun. 03, 2021 |
DCN-v2 | 0.7922 |
3 Jul. 19, 2018 |
xDeepFM | 0.7922 |
4 AUG. 14, 2017 |
AFM | 0.7921 |
5 MAR. 13, 2017 |
DeepFM | 0.7920 |
6 |
DIEN | 0.7918 |
7 AUG. 14, 2017 |
DCN | 0.7911 |
8 |
DIN | 0.7910 |
9 SEPT. 15, 2016 |
Wide & Deep | 0.7910 |
CTR-5M
Rank | Model | AUC |
---|---|---|
1 Jul. 19, 2018 |
xDeepFM | 0.8235 |
2 Mar. 13, 2017 |
DeepFM | 0.8235 |
3 Sept. 15, 2016 |
Wide & Deep | 0.8234 |
4 Aug. 07, 2017 |
NFM | 0.8231 |
5 Aug. 15, 2017 |
AFM | 0.8226 |
Session based Recommendation 1M
Rank | Model | NDCG@20 |
---|---|---|
1 Jan. 30, 2019 |
NextItNet | 0.0199 |
2 Dec. 30, 2018 |
SASRec | 0.0194 |
3 Aug. 27, 2017 |
GRU4Rec | 0.0192 |
4 Nov. 03, 2019 |
BERT4Rec | 0.0185 |
Session based Recommendation 5M
Rank | Model | NDCG@20 |
---|---|---|
1 Jan. 30, 2019 |
NextItNet | 0.0214 |
2 Dec. 30, 2018 |
SASRec | 0.0201 |
3 Nov. 03, 2019 |
BERT4Rec | 0.0191 |
Multi-task Learning
Rank | Model | click-AUC | like-AUC |
---|---|---|---|
1 Jun. 27, 2018 |
ESMM | 0.7940 | 0.9110 |
2 Sept. 22, 2020 |
PLE | 0.7822 | 0.9103 |
3 Jul. 19, 2018 |
MMOE | 0.7900 | 0.9020 |
Transfer Learning
Rank | Model | NDCG@20 |
---|---|---|
1 Jan. 30, 2019 |
NextItNet | 0.0489 |
2 Dec. 30, 2018 |
SASRec | 0.0479 |
User Profile Prediction
Rank | Model | Age-ACC | Gender-ACC |
---|---|---|---|
1 Nov. 03, 2019 |
BERT4Rec | 0.69903 | 0.90082 |
2 Jul. 25, 2020 |
PeterRec | 0.69712 | 0.90036 |
3 -- |
DNN | 0.67875 | 0.88531 |
Cold-Start
Rank | Model | NDCG@20 |
---|---|---|
1 Nov. 03, 2019 |
BERT4Rec | 0.0239 |
2 Jul. 25, 2020 |
PeterRec | 0.0221 |
Cold-Start 0.3
Rank | Model | NDCG@20 |
---|---|---|
1 Nov. 03, 2019 |
BERT4Rec | 0.0137 |
2 Jul. 25, 2020 |
PeterRec | 0.0133 |
Cold-Start 0.7
Rank | Model | NDCG@20 |
---|---|---|
1 Nov. 03, 2019 |
BERT4Rec | 0.0134 |
2 Jul. 25, 2020 |
PeterRec | 0.0132 |
Cold-Start 1
Rank | Model | NDCG@20 |
---|---|---|
1 Nov. 03, 2019 |
BERT4Rec | 0.0166 |
2 Jul. 25, 2020 |
PeterRec | 0.0165 |
Lifelong Learning
Rank | Model | Task1-NDCG@20 | Task2-NDCG@20 | Task3-NDCG@20 | Task4-NDCG@20 |
---|---|---|---|---|---|
1 Jul. 11, 2021 |
Conure-NextItNet | 0.0177 | 0.0095 | 0.0167 | 0.1074 |
2 Jul. 11, 2021 |
Conure-SASRec | 0.0172 | 0.0086 | 0.0166 | 0.0959 |
Model Compression
Rank | Model | Compress Para. | NDCG@20 |
---|---|---|---|
1 Jul. 25, 2020 |
Cp-NextItNet | 33.1% | 0.0195 |
2 Jul. 25, 2020 |
Cp-SASRec | 30.1% | 0.0191 |
Model Training Speedup
Rank | Model | Time Speedup | NDCG@20 |
---|---|---|---|
1 Jul. 11, 2021 |
Stack-NextItNet | 63.6% | 0.0202 |
2 Jul. 11, 2021 |
Stack-SASRec | 30% | 0.0196 |
Model Inference Speedup
Rank | Model | Time Speedup | NDCG@20 |
---|---|---|---|
1 May 18, 2021 |
Skip-NextItNet | 23.4% | 0.0472 |
2 May 18, 2021 |
Skip-SASRec | 32.5% | 0.0431 |
Top-N Recommendation with the random negative sampler
Rank | Model | NDCG@20 |
---|---|---|
1 Jul. 2, 2020/span> |
LightGCN | 0.0542 |
2 Aug. 07, 2009/span> |
NGCF | 0.0455 |
3 Jul. 18, 2019 |
MF | 0.0437 |
4 Apr. 03, 2017 |
NCF | 0.0403 |
Top-N Recommendation with the popularity negative sampler.
Rank | Model | NDCG@20 |
---|---|---|
1 Jul. 2, 2020/span> |
LightGCN | 0.0617 |
2 Aug. 07, 2009/span> |
NGCF | 0.0476 |
3 Jul. 18, 2019 |
MF | 0.0467 |
4 Apr. 03, 2017 |
NCF | 0.0405 |