? ? , ? e-commerce ? Big Data-
, ITsubbotnik . , .
EPAM , e-commerce . EPAM offline . , . , :
1.
, . . : , , ( ). : , - , . , , - . (multi-armed bandits).
2.
, . , . , ,
demographic based .
: , , ?
context based . , : , , , , . , , , . multi-armed bandits.
3.
, - . .
learning to rank sequential pattern mining .
4.
, . , , . , . , , ..
demographic based , ,
sequential pattern mining.
, , . apparel stores. -, . 50 . . , 300 . , , , .
: (71%). , . , , .
(62%) (83%). , , , , , . , , style guide 50% . , . , 50% , . , Occasion/event 100% .
, Top sellers list, New in, 0% . , : , , . , .
.
, .
1.
, , : , .. . , , .
, :
Neighborhood-based (Memory-Based) . , , , , . - , .
: User-Based Filtering (UBCF), Item-Based Filtering (IBCF), Slope-One.
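As a rough illustration of the neighborhood-based family named above, here is a minimal sketch of Item-Based Filtering (IBCF) with cosine similarity. The toy `ratings` data, the function names, and the choice of cosine similarity are assumptions made for this example only; they do not come from the article or from any particular library.

```python
from math import sqrt

# Hypothetical toy ratings: user -> {item: rating}. Not data from the article.
ratings = {
    "ann": {"jeans": 5, "tshirt": 4, "sneakers": 1},
    "bob": {"jeans": 4, "tshirt": 5},
    "cat": {"jeans": 1, "sneakers": 5, "boots": 4},
    "dan": {"sneakers": 4, "boots": 5},
}


def item_vectors(ratings):
    """Invert user -> item ratings into item -> user ratings."""
    items = {}
    for user, user_ratings in ratings.items():
        for item, r in user_ratings.items():
            items.setdefault(item, {})[user] = r
    return items


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    common = set(a) & set(b)
    if not common:
        return 0.0
    dot = sum(a[u] * b[u] for u in common)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b)


def recommend_ibcf(user, ratings, top_n=2):
    """Score unseen items by a similarity-weighted average of the user's own ratings."""
    items = item_vectors(ratings)
    seen = ratings[user]
    scores = {}
    for candidate, cand_vec in items.items():
        if candidate in seen:
            continue
        num = den = 0.0
        for item, r in seen.items():
            sim = cosine(cand_vec, items[item])
            num += sim * r
            den += abs(sim)
        if den > 0:
            scores[candidate] = num / den
    return sorted(scores, key=scores.get, reverse=True)[:top_n]


print(recommend_ibcf("bob", ratings))
```

User-Based Filtering follows the same pattern with the roles swapped: similarities are computed between users instead of items.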
Model Based . , ( ), .
:
o Matrix Factorization (MF): Singular Value Decomposition (SVD, SVD++, timeSVD++, MSVD), Non-Negative MF (ALS), Factorization Machines, Probabilistic Matrix Factorization (PMF)
o RBM (Restricted Boltzmann Machines).
o Incremental CF via Co-clustering (COCL, ECOCL)
o Probabilistic Principal Component Analysis (pPCA), Probabilistic Latent Semantic Analysis (pLSA), Latent Dirichlet Allocation (LDA), etc.
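To make the matrix factorization idea from the list above more concrete, here is a minimal sketch that learns latent user and item factors with plain stochastic gradient descent on the observed ratings. It is a simplified, regularized factorization, not an exact SVD and not any of the specific variants listed. The toy data, the hyperparameters, and the names `factorize`, `predict`, and `rmse` are assumptions made for this example.

```python
import random
from math import sqrt

# Hypothetical (user, item, rating) triples. Not data from the article.
data = [
    ("ann", "jeans", 5), ("ann", "tshirt", 4), ("ann", "sneakers", 1),
    ("bob", "jeans", 4), ("bob", "tshirt", 5),
    ("cat", "jeans", 1), ("cat", "sneakers", 5), ("cat", "boots", 4),
    ("dan", "sneakers", 4), ("dan", "boots", 5),
]


def factorize(ratings, n_factors=2, n_epochs=1000, lr=0.02, reg=0.02, seed=0):
    """Fit user factors P and item factors Q so that dot(P[u], Q[i]) approximates r."""
    rng = random.Random(seed)
    users = sorted({u for u, _, _ in ratings})
    items = sorted({i for _, i, _ in ratings})
    P = {u: [rng.uniform(0, 1) for _ in range(n_factors)] for u in users}
    Q = {i: [rng.uniform(0, 1) for _ in range(n_factors)] for i in items}
    for _ in range(n_epochs):
        for u, i, r in ratings:
            err = r - sum(pu * qi for pu, qi in zip(P[u], Q[i]))
            for f in range(n_factors):
                pu, qi = P[u][f], Q[i][f]
                # Gradient step on squared error with L2 regularization.
                P[u][f] += lr * (err * qi - reg * pu)
                Q[i][f] += lr * (err * pu - reg * qi)
    return P, Q


def predict(P, Q, user, item):
    """Predicted rating is the dot product of the two factor vectors."""
    return sum(pu * qi for pu, qi in zip(P[user], Q[item]))


def rmse(P, Q, ratings):
    """Root mean squared error on the given ratings."""
    se = sum((r - predict(P, Q, u, i)) ** 2 for u, i, r in ratings)
    return sqrt(se / len(ratings))


P, Q = factorize(data)
print(round(rmse(P, Q, data), 3))
print(round(predict(P, Q, "bob", "boots"), 2))  # a pair with no observed rating
```

The last line scores a user-item pair that was never observed, which is what such a model is used for in a recommender.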
, , . .
:
(cold start problem): , , .
2. Content based
, content based . . .
3. Demographic based
, .
, , . , . , , , . , , , .. knowledge based recommenders.
4. Knowledge based
, , . , : , , , . , . : , .
5. Context-aware
, , . . , , , . , , , . ? , , context-aware .
, Change Point Detection . , , . , : factorization machines, Bayesian Probabilistic Tensor Factorization.
6.
, .
. . , . : (exploration) (exploitation) ?
Exploration vs. Exploitation problem. . :
1. Active learning (AL)
2. Multi-armed bandits (MAB) algorithms: E-greedy, UCB, LinUCB, Thompson Sampling, Active Thompson Sampling (ATS)
3. Markov Decision Process (MDP)/Reinforcement Learning (RL)
4. Hybrid scoring approaches, which can be considered when a composition of models is used.
MAB
1. E-greedy : . : , .
2. Upper Confidence Bound (UCB) , . . . , , . exploration, exploitation, .
3. Thompson Sampling (TS) . , , MAB.
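The three bandit strategies listed above can be sketched in a few lines each. The example below is a minimal, self-contained simulation and not the setup used in the article. The click-through rates in `ctrs`, the class names, and the `simulate` helper are hypothetical. Rewards are assumed to be binary (click or no click), UCB is shown in its common UCB1 form, and Thompson Sampling uses a Beta(1, 1) prior.

```python
import math
import random


class MeanTracker:
    """Shared bookkeeping: pull counts and running mean reward per arm."""

    def __init__(self, n_arms):
        self.counts = [0] * n_arms
        self.values = [0.0] * n_arms

    def update(self, arm, reward):
        self.counts[arm] += 1
        self.values[arm] += (reward - self.values[arm]) / self.counts[arm]


class EpsilonGreedy(MeanTracker):
    """With probability epsilon explore a random arm, otherwise exploit the best mean."""

    def __init__(self, n_arms, rng, epsilon=0.1):
        super().__init__(n_arms)
        self.rng, self.epsilon = rng, epsilon

    def select(self):
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(len(self.counts))
        return max(range(len(self.values)), key=self.values.__getitem__)


class UCB1(MeanTracker):
    """Pick the arm with the highest mean plus confidence bonus."""

    def select(self):
        for arm, n in enumerate(self.counts):
            if n == 0:  # try every arm once before using the bound
                return arm
        total = sum(self.counts)
        return max(
            range(len(self.counts)),
            key=lambda a: self.values[a] + math.sqrt(2 * math.log(total) / self.counts[a]),
        )


class ThompsonSampling:
    """Sample a plausible click rate per arm from its Beta posterior, pick the max."""

    def __init__(self, n_arms, rng):
        self.rng = rng
        self.successes = [1] * n_arms  # Beta(1, 1) prior
        self.failures = [1] * n_arms

    def select(self):
        samples = [self.rng.betavariate(a, b) for a, b in zip(self.successes, self.failures)]
        return max(range(len(samples)), key=samples.__getitem__)

    def update(self, arm, reward):
        if reward:
            self.successes[arm] += 1
        else:
            self.failures[arm] += 1


def simulate(policy, ctrs, n_rounds, rng):
    """Run the policy against Bernoulli arms and return pulls per arm."""
    pulls = [0] * len(ctrs)
    for _ in range(n_rounds):
        arm = policy.select()
        reward = 1 if rng.random() < ctrs[arm] else 0
        policy.update(arm, reward)
        pulls[arm] += 1
    return pulls


ctrs = [0.05, 0.10, 0.60]  # hypothetical click-through rates per recommendation slot
rng = random.Random(42)
for policy in (EpsilonGreedy(3, rng), UCB1(3), ThompsonSampling(3, rng)):
    print(type(policy).__name__, simulate(policy, ctrs, 5000, rng))
```

In each case the pulls should concentrate on the arm with the highest click rate. The strategies differ in how much traffic they spend on the weaker arms before getting there.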
, , , , . UCB Thompson Sampling + Probabilistic Matrix Factorization.
, -. , . , : implicit explicit feedback.
Implicit feedback , .
explicit feedback . ? Explicit feedback , , , . implicit feedback , , . , , , . , implicit . , , . , , .
Spotify
Spotify . : , . . , Spotify NLP- (natural language processing) . , . NLP , , , . , . , , , . content based , deep learning. .
open source? : .
open source? : .
Spark MLlib, RankSys, LensKit. Waffles, C++.
, , machine learning . PredictionIO.
, , Recommender Systems Handbook, 2nd Edition, , .
https://habrahabr.ru/post/335114/