- code.ipynb -> contains the main code for our model
- code.py -> same as code.ipynb
- report -> report of our project
- test_X -> testing data for our trained model
- test_Y -> desired output for test_X
- zomato.csv -> complete dataset for our model
https://www.kaggle.com/shrutimehta/zomato-restaurants-data#zomato.csv
Our dataset consists of various attributes such as ‘Restaurant ID’, ‘Restaurant Name’, ‘Country Code’, ‘City’, ‘Address’, ‘Locality’, ‘Locality Verbose’, ‘Longitude’, ‘Latitude’, ‘Cuisines’, ‘Average Cost for two’, ‘Currency’, ‘Has Table booking’, ‘Has Online delivery’, ‘Is delivering now’, ‘Switch to order menu’, ‘Price range’, ‘Aggregate rating’, ‘Rating color’, ‘Rating text’ and ‘Votes’. We thought of all the attributes that would be important for deciding the rating of a restaurant and finally used ‘Average Cost for two’, ‘Price range’ and ‘Votes’ to find the rating of the restaurants which resulted in good accuracy of the model.
1)After using random forest using these ‘Average Cost for two' and ‘Votes’ , we get an accuracy of 0.93.
2)After using random forest using these ‘Average Cost for two' ,'Price Range' and ‘Votes’ , we get an accuracy of 0.935.