Description
The data for this project can be found on Kaggle in the New York City Taxi Fare Prediction competition held by Google Cloud. The entire training set consists of about 55 million rows of NYC taxi fare…
Summary
- NYC Taxi Fare Prediction Rider Fare Prediction in The Big Apple The Data Loading the Data The data for this project can be found on Kaggle in the New York City Taxi Fare Prediction competition held by Google Cloud.
- Along with the datetime attributes that were added, there are a few features that can be added using the pick-up and drop-off points from the data.
- Another alternative would be LightGBM which many others have used as well in the Kaggle competition.
- Using the same data cleaning processes earlier, I decided to put the XGBoost model to test using the entire dataset.