By: Joaquin Gomez, Michael O'Brien, and Tam Trinh
This project explores the Recipe NLG dataset, containing over 2 million observations of recipes. Natural language processing (NLP) is used to group clean the texts and group the recipes into topics. The topics are then used to recommend recipes based on inputed ingredients.
Presentation slides: https://docs.google.com/presentation/d/1-ZcwNFXDKa4H1y-YWDyM9rByilVFttw7RPTyOHkWn3c/edit?usp=sharing
This project was developed as capstone projects in June 2022 under the supervision of NYC Data Science Academy.
The dataset was downloaded from: https://recipenlg.cs.put.poznan.pl/ Sited as: @inproceedings{bien-etal-2020-recipenlg, title = "{R}ecipe{NLG}: A Cooking Recipes Dataset for Semi-Structured Text Generation", author = "Bie{'n}, Micha{\l} and Gilski, Micha{\l} and Maciejewska, Martyna and Taisner, Wojciech and Wisniewski, Dawid and Lawrynowicz, Agnieszka", booktitle = "Proceedings of the 13th International Conference on Natural Language Generation", month = dec, year = "2020", address = "Dublin, Ireland", publisher = "Association for Computational Linguistics", url = "https://www.aclweb.org/anthology/2020.inlg-1.4", pages = "22--28" }