Context This dataset includes taxi trips for 2016, reported to the City of Chicago in its role as a regulatory agency. To protect privacy but allow for aggregate analyses, the Taxi ID is consistent for any given taxi medallion number but does not show the number, Census Tracts are suppressed in some cases, and times are rounded to the nearest 15 minutes. Due to the data reporting process, not all trips are reported but the City believes that most are. See http://digital.cityofchicago.org/index.php/chicago-taxi-data-released for more information about this dataset and how it was created. Content Please see the data dictionary for details of specific fields. We also shrunk the original files by roughly two thirds by dropping redundant columns and remapping several others to use shorter IDs. For example, the taxi_id column used to be a 128 character string. We’ve replaced it with an integer containing at most four digits. The redundant columns were unique_key, pickup_location, and dropoff_location. The remapped columns were taxi_id, company, pickup_census_tract, dropoff_census_tract, pickup_latitude, pickup_longitude, dropoff_latitude, and dropoff_longitude. The original versions of those columns can be unpacked using the column_remapping.json. Acknowledgements This dataset was kindly made publically available by the City of Chicago at: https://data.cityofchicago.org/Transportation/Taxi-Trips/wrvz-psew Please note that this site provides applications using data that has been modified for use from its original source, www.cityofchicago.org, the official website of the City of Chicago. The City of Chicago makes no claims as to the content, accuracy, timeliness, or completeness of any of the data provided at this site. The data provided at this site is subject to change at any time. It is understood that the data provided at this site is being used at one’s own risk. Inspiration - How centralized is Chicago? In other words, what portion of trips are to or from downtown? - Chicago has an extensive metro system. Are taxis competing with the trains by covering similar routes or supplementing public transit by getting people to and from train stations? Use this dataset with BigQuery You can use Kernels to analyze, share, and discuss this data on Kaggle, but if you’re looking for real-time updates and bigger data, check out the data on BigQuery, too: https://cloud.google.com/bigquery/public-data/chicago-taxi. BigQuery hosts the full version of this dataset, which extends from 2013 through the present.