-

   rss_rss_hh_new

 - e-mail

 

 -

 LiveInternet.ru:
: 17.03.2011
:
:
: 51

:


: ,

, 07 2017 . 20:01 +
? ? , ? e-commerce ? Big Data- , ITsubbotnik . , .








EPAM , e-commerce . EPAM offline . , . , :

1.
, . . : , , ( ). : , - , . , , - . (multi-armed bandits).

2.
, . , . , , demographic based .

: , , ? context based . , : , , , , . , , , . multi-armed bandits.

3.
, - . . learning to rank sequential pattern mining .

4.
, . , , . , . , , .. demographic based , , sequential pattern mining.



, , . apparel stores. -, . 50 . . , 300 . , , , .



: (71%). , . , , .

(62%) (83%). , , , , , . , , style guide 50% . , . , 50% , . , Occasion/event 100% .

, Top sellers list, New in, 0% . , : , , . , .

.





, .

1.
, , : , .. . , , .

, :

Neighborhood-based (Memory-Based) . , , , , . - , .
: User-Based Filtering (UBCF), Item-Based Filtering (IBCF), Slope-One.

Model Based . , ( ), .
:
o Matrix Factorization (MF): Singular Value Decomposition (SVD, SVD++, timeSVD++, MSVD), Non-Negative MF (ALS), Factorization Machines, Probabilistic Matrix Factorization (PMF)
o RBM (Restricted Bolzman Machines).
o Incremental CF via Co-clustering (COCL, ECOCL)
o Probabilistic Principle Component Analysis (pPCA), Probabilistic Latent Semantic Analysis (pLSA), Latent Dirichlet Allocation (LDA) and etc.

, , . .

: (cold start problem): , , .

2. ontent based
, content based . . .

3. Demographic based
, .

, , . , . , , , . , , , .. knowledge based recommenders.

4. Knowledge based
, , . , : , , , . , . : , .

5. ontext-aware
, , . . , , , . , , , . ? , , context-aware .

, Change Point Detection . , , . , : factorization machines, Byesian Probablistic Tensor Factorization.

6.
, .


. . , . : (exploration) (exploitation) ? Exploration vs. Exploitation problem. . :
1. Active learning (AL)
2. Multi-armed bandits (MAB) algorithms: E-greedy, UCB, LinUCB, Tomson Sampling, Active Thompson Sampling (ATS)
3. Markov Decision Process (MDP)/Reinforcement Learning (RL)
4. Hybrid scoring approaches could be considered models composition used.

MAB





1. E-greedy : . : , .

2. Upper Confidence Bound (UCB) , . . . , , . exploration, exploitation, .

3. Tomson Sampling (TS) . , , MAB.
, , , , . UCB Tompson Sampling + Probablistic Matrix Factorization.



, -. , . , : implicit explicit feedback. Implicit feedback , . explicit feedback . ? Explicit feedback , , , . implicit feedback , , . , , , . , implicit . , , . , , .

Spotify



Spotify . : , . . , Spotify NLP- (natural language processing) . , . NLP , , , . , . , , , . content based , deep learning. .



open source? : .

open source? : .

Spark MLLib, RankSys, LensKit. Waffles, ++.
, , machine learning . PredictionIO.

, , Recommendation System Handbook, 2nd Edition, , .
Original source: habrahabr.ru (comments, light).

https://habrahabr.ru/post/335114/

:  

: [1] []
 

:
: 

: ( )

:

  URL