Giter Site home page Giter Site logo

ranking_papers's Introduction

Search Ranking Papers

Collection of papers that I have personally found helpful for search ranking / recommendation systems (updating).

Great talks on search ranking / recommendation system

Personalization at Amazon Music (ICML 2019)

Reinforcement Learning for Recommender Systems: A Case Study on Youtube (2019)

Applying Deep Learning to Airbnb Search (Qcon 2019)

Artwork Personalization at Netflix (Data Council 2018)

Measurement and analysis of predictive feed ranking models on Instagram (Scale 2017)

Detecting place visits at scale (Scale 2017)

Deep Learning for Personalized Search and Recommender Systems (KDD 2017)

Facebook AI Research: An Introduction to Faiss and Similarity Search (2020)

My personal notes / ideas on papers

Deep Neural Networks for YouTube Recommendations

  • "Training examples are generated from all YouTube watches (even those embedded on other sites) rather than just watches on the recommendations we produce." The model is predicting watch time per impression, but it is unclear how YouTube constructs its training data (impression data). For example,
    • If a video is at the bottom of the recommendation list and user may not scroll down, does this count as an impression?
    • If a video is shown as relevant videos of an opened video, does this count as an impression?

Applying Deep Learning To Airbnb Search

  • They tried multi-task learning that optimizes for both booking and long view, and found out that long views increased by a lot while bookings remained neutral. This multi-task learning can be useful for advertisement modeling, where we would optimize for both clicks and conversions.

Amazon Search: The Joy of Ranking Products

  • "To manage the size of the training set, we sample unseen examples." An idea inspired by this statement:
    • For pairwise formulation, a good idea might be sampling the possibly unseen examples. For example, user makes a click on item at position 5, then items displayed after position 5 may not be seen and can be sampled.

Online Controlled Experiments at Large Scale

  • โ€œWe recently ran a slowdown experiment where we slowed 10% of users by 100msec (milliseconds) and another 10% by 250msec for two weeks. The results showed that performance absolutely matters a lot today: every 100msec improves revenue by 0.6%.โ€
    • Speed wins.

Word2vec algorithm (C implementation / gensim implementation)

syn0: random initialization
syn1neg: zero initialization
for central_word in [pick_a_central_word]:
  for context_word in [context_words_based_on_the_central_word]:
    neu1e = 0
    for d in range(0, negative + 1):
      if d == 0:
        word = context_word, label = 1
      if d > 0:
        word = negative_word, label = 0
      dot_product = syn0[central_word] * syn1neg[word]
      gradient = (label - sigmoid(dot_product)) * alpha
      syn1neg[word] += gradient * syn0[central_word]
      neu1e += gradient * syn1neg[word]
    syn0[central_word] += neu1e

Deep Learning Recommendation Model for Personalization and Recommendation Systems

  • Model Structure

  • Model Structure of this specific model (pyton dlrm_s_pytorch.py --arch-sparse-feature-size=16 --arch-mlp-bot="13-512-256-64-16" --arch-mlp-top="512-256-1" --data-generation=dataset --data-set=kaggle --raw-data-file=./dac/train.txt --loss-function=bce --round-targets=True --learning-rate=0.1 --mini-batch-size=128)

FAISS

  • Speed optimization: k-means clustering, find nearest centroid and check this cluster
  • Memory optimization: dimension reduction such as PCA, Product Quantization

ranking_papers's People

Contributors

liyinxiao avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.