Giter Site home page Giter Site logo

ustcml / recstudio Goto Github PK

View Code? Open in Web Editor NEW
164.0 5.0 20.0 2.71 MB

A highly-modularized and recommendation-efficient recommendation library based on PyTorch.

Home Page: http://recstudio.org.cn

License: MIT License

Python 92.80% Jupyter Notebook 7.20%
collaborative-filtering ctr-prediction deep-learning factorization-machines knowledge-graph matrix-factorization pytorch recommender-system sequential-recommendation graph-neural-networks

recstudio's People

Contributors

echobelbo avatar hr-jin avatar pepsi2222 avatar shadowtinker avatar xiuchen519 avatar xuhwang avatar yusanshi avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

recstudio's Issues

Wrong entry names in some dataset config files

Inter_feat with ratings lower than low_rating_thres are filtered out in the following code.

def _filter(self, min_user_inter, min_item_inter):
self._filter_ratings(self.config.get('low_rating_thres', None))

However, the corresponding entries are misspelled as low_rating_threshold in the following dataset config files:

  • amazon-beauty
  • amazon-books
  • amazon-electronics
  • gowalla
  • ml-10m
  • ml-20m
  • tmall
  • yelp

Failed to open http://recstudio.org.cn/

无法打开官网,因此获取不到官方文档的信息。
正在使用这个库,有些内容不是很明白,希望能获得帮助,或者提供文档。

AE models output invalid results on part of datasets

I find MultiVAE and MultiDAE both output nan recall on ml-1m while performing correctly on ml-100k and gowalla. But BPR (MF model) and LightGCN (graph model) are normal on all three datasets. So I guess it may be a problem with AE models.

Feature names in different tables are not allowed to be the same.

When there are two same feature names in two different tables (e.g. one in the user information table and one in the item information table), there would be hidden problems.

For example, there is a column named category in both user.csv and item.csv. When I want to get the values of both two columns, the value of category would be overwritten

float seq (vector) feature supported in dataset

In dataset, sometimes we would use dataset which contains vector type features. For example, in if we want to use embeddings generated by language model, the feature would be embedding. Therefore, maybe we need the support of float sequence type.

Where to find LSH Sampling

In your paper, ref "Table 5: Samplers in RecStudio", you mentioned that LSH based samplers have been implemented. But I cannot find them in your code.

InfoNCE loss

The temperature hyperparameter seems to be missing from InfoNCE loss function in RecStudio.

Out-Of-Memory Error in Validation Phase

Why there will be a sudden Cuda memory usage increase in the validation phase?
The batch size of the validation phase set in the config file is smaller than the training phase, but there will be a sudden Cuda memory usage increase in the validation phase, which causes the OOM Error.
Specifically, when the model runs the code in run.pymodel.evaluate will cost more Cuda memory than model.fit, could you please help me solve this problem? Thanks for your attention.

No feature scaling.

It seems like there is no feature scaling in dataset.py (when the field type is float).

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.