[Paper Review] Self-supervised Graph Learning for Recommendation (SIGIR’21) & Do we really need graph augmentation? (SIGIR’22)

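Why I read these papers

Labeled data for supervised learning is always scarce, and at the same time model capacity has grown substantially, so there is a real risk of overfitting. (The larger the model capacity, the more labeled training data it needs; without it, the model overfits.) Self-supervised learning is one way to address this, so I reviewed studies that apply one of its techniques, contrastive learning, to GCNs!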

Reviewed Papers

Self-supervised Graph Learning for Recommendation (Wu et al., SIGIR’21)

Are Graph Augmentations Necessary? Simple Graph Contrastive Learning for Recommendation (Yu et al., SIGIR’22)

Self-supervised Learning on GCN Recsys

Motivation

Popularity bias
- High-degree items exert an outsized influence on training, which hampers recommendation of low-degree items
Noises in interaction
- Interactions with items the user does not actually prefer may be mixed into the data as noise

Idea

Use SSL to address the bias and noise issues above

Contrastive learning
- positive pair: original image and augmented image → trained to be similar!
- negative pair: original image and negative image → trained to be distinguishable!

→ improved performance on low-degree items & increased robustness

Contrastive Learning on GCN Recsys

Augmentation Types
- Node Dropout (ND)
- Edge Dropout (ED)
- Random Walk (RW): an extension of ED; the only difference is that a different mask is used at each layer
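
The three augmentation types above can be sketched on a toy edge list. This is only an illustrative sketch, not the authors' implementation: the function names, the `drop_ratio` parameter, and the toy graph are all assumptions made for the example.

```python
import random

def edge_dropout(edges, drop_ratio, rng):
    """ED: keep each user-item edge independently with probability 1 - drop_ratio."""
    return [e for e in edges if rng.random() >= drop_ratio]

def node_dropout(edges, nodes, drop_ratio, rng):
    """ND: drop each node with probability drop_ratio, together with all of its edges."""
    dropped = {n for n in nodes if rng.random() < drop_ratio}
    return [(u, i) for (u, i) in edges if u not in dropped and i not in dropped]

def random_walk_views(edges, drop_ratio, num_layers, rng):
    """RW: like ED, but an independent edge mask is sampled for every layer."""
    return [edge_dropout(edges, drop_ratio, rng) for _ in range(num_layers)]

# Hypothetical toy bipartite graph: users "u*" and items "i*".
edges = [("u1", "i1"), ("u1", "i2"), ("u2", "i2"), ("u3", "i3")]
nodes = {n for e in edges for n in e}
rng = random.Random(0)

ed_view = edge_dropout(edges, 0.5, rng)            # one mask shared by all layers
nd_view = node_dropout(edges, nodes, 0.5, rng)     # whole nodes removed
rw_views = random_walk_views(edges, 0.5, 3, rng)   # one subgraph per layer
```

Each call returns a perturbed copy of the edge list; two such copies of the same graph would serve as the two views that are contrasted.
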
Contrastive Learning
- Creates different views of the same node so that nodes can be contrasted with each other (node-to-node contrast)
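
A minimal sketch of such a node-level contrastive loss, written in the common InfoNCE form with cosine similarity and a temperature. The embeddings and the temperature value below are made-up assumptions for illustration; row k of the two views is treated as the positive pair, and the other rows as negatives.

```python
import math

def cosine(a, b):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def info_nce(view1, view2, temperature=0.2):
    """Mean InfoNCE loss: node k in view1 is pulled toward node k in view2
    and pushed away from every other node in view2."""
    total = 0.0
    for k, z in enumerate(view1):
        logits = [cosine(z, other) / temperature for other in view2]
        log_denominator = math.log(sum(math.exp(l) for l in logits))
        total += -(logits[k] - log_denominator)
    return total / len(view1)

# Hypothetical 2-d embeddings of two nodes under two augmented views.
view_a = [[1.0, 0.0], [0.0, 1.0]]
view_b_aligned = [[0.9, 0.1], [0.1, 0.9]]   # same node stays close across views
view_b_swapped = [[0.1, 0.9], [0.9, 0.1]]   # same node ends up far apart

aligned_loss = info_nce(view_a, view_b_aligned)
swapped_loss = info_nce(view_a, view_b_swapped)
```

The loss is small when the two views of a node agree and large when they do not, which is the "similar for positives, distinguishable for negatives" behaviour described earlier.
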
BPR Loss
- Learns the relative preference between items (i: positive item, j: negative item)
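
For a single (user, i, j) triple, the BPR loss can be sketched as below. Scoring a user-item pair by the inner product of their embeddings is an assumption of this example, and the embedding values are hypothetical.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def bpr_loss(user, pos_item, neg_item):
    """BPR loss for one triple: -log(sigmoid(score(u, i) - score(u, j)))."""
    diff = dot(user, pos_item) - dot(user, neg_item)
    # -log(sigmoid(diff)) is equal to log(1 + exp(-diff)).
    return math.log1p(math.exp(-diff))

# Hypothetical embeddings: the user vector is closer to item i than to item j.
user = [1.0, 0.0]
item_i = [1.0, 0.0]   # positive item
item_j = [0.0, 1.0]   # negative item

good_ranking = bpr_loss(user, item_i, item_j)   # i scored above j
bad_ranking = bpr_loss(user, item_j, item_i)    # roles swapped, j scored above i
```

The loss only depends on the score difference, so it decreases as the positive item is ranked further above the negative one.
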
Total Loss
- BPR loss + Contrastive loss + L2 Regularization
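
Written out, the sum above takes roughly the following form. The weights $\lambda_1$ and $\lambda_2$ and the symbol $\Theta$ for the model parameters are notation assumed here, since the post does not state how the three terms are weighted.

```latex
\mathcal{L} = \mathcal{L}_{\mathrm{BPR}} + \lambda_1 \, \mathcal{L}_{\mathrm{SSL}} + \lambda_2 \, \lVert \Theta \rVert_2^2
```

Here $\mathcal{L}_{\mathrm{SSL}}$ is the contrastive loss, so the recommendation objective and the self-supervised objective are trained jointly rather than in separate stages.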