
Data Exploration of Ford GoBike System is the final project of the Udacity Data Analyst Nanodegree. The dataset includes information about individual rides made in a bike-sharing system covering the greater San Francisco Bay area. The task was to investigate this dataset and draw insights from it.


emmanuelbogun/Data-Exploration-of-Ford-GoBike-System


Data Exploration of Ford GoBike System

by Emmanuel Olorunbogun

Dataset

Ford GoBike is a regional public bicycle sharing system in the San Francisco Bay Area, California. This data set includes information about individual rides made in a bike-sharing system covering the greater San Francisco Bay area. The data was downloaded through the link: https://video.udacity-data.com/topher/2020/October/5f91cf38_201902-fordgobike-tripdata/201902-fordgobike-tripdata.csv

This dataset contains 183,412 rows and 16 columns. Each column is described below.

Column Description:


duration_sec : the trip duration in seconds.
start_time : the start time of the ride.
end_time : the end time of the ride.
start_station_id : id to identify the station where the ride started.
start_station_name : name of the start station.
start_station_latitude : latitude of the start station.
start_station_longitude : longitude of the start station.
end_station_id : id to identify the station where the ride ended.
end_station_name : name of the end station.
end_station_latitude : latitude of the end station.
end_station_longitude : longitude of the end station.
bike_id : id to identify each bike.
user_type : shows whether the user has a subscription (Subscriber) or not (Customer).
member_birth_year : year of birth of the user.
member_gender : gender of the user.
bike_share_for_all_trip : shows whether the trip was made under the Bike Share for All program.
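
As a quick check of the column list above, the file can be loaded with pandas and compared against the 16 expected column names. This is a minimal sketch, not the project's own loading code: `load_trips` and `EXPECTED_COLUMNS` are hypothetical names introduced here for illustration.

```python
import pandas as pd

# Column names as listed in the description above.
EXPECTED_COLUMNS = [
    "duration_sec", "start_time", "end_time",
    "start_station_id", "start_station_name",
    "start_station_latitude", "start_station_longitude",
    "end_station_id", "end_station_name",
    "end_station_latitude", "end_station_longitude",
    "bike_id", "user_type", "member_birth_year",
    "member_gender", "bike_share_for_all_trip",
]


def load_trips(source):
    """Read the trip CSV from a path, URL, or file-like object.

    Hypothetical helper: raises ValueError if any described column is absent.
    """
    df = pd.read_csv(source)
    missing = [c for c in EXPECTED_COLUMNS if c not in df.columns]
    if missing:
        raise ValueError(f"missing expected columns: {missing}")
    return df
```

Calling `load_trips` on a local copy of the CSV and printing `df.shape` gives the row and column counts to compare with the figures quoted above.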

Wrangling the dataset involved converting some columns to more suitable data types and removing unwanted columns. Some new columns were also added:

  • Day_of_week: to store day of the week.
  • duration_minutes: to store the duration in minutes.
  • age: age of riders.
  • time: hour of the day.
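
The four added columns could be derived along these lines. This is a sketch under stated assumptions rather than the notebook's actual code: the function name `add_derived_columns` is hypothetical, and age is assumed to be computed relative to 2019 because the file name indicates February 2019 data.

```python
import pandas as pd


def add_derived_columns(df: pd.DataFrame, reference_year: int = 2019) -> pd.DataFrame:
    """Return a copy of df with Day_of_week, duration_minutes, age and time added.

    reference_year is an assumption (the file name suggests February 2019).
    """
    out = df.copy()
    # Convert the timestamp strings to datetime so the .dt accessor works.
    out["start_time"] = pd.to_datetime(out["start_time"])
    out["end_time"] = pd.to_datetime(out["end_time"])

    out["Day_of_week"] = out["start_time"].dt.day_name()    # e.g. "Thursday"
    out["duration_minutes"] = out["duration_sec"] / 60       # seconds to minutes
    out["age"] = reference_year - out["member_birth_year"]   # NaN if birth year is missing
    out["time"] = out["start_time"].dt.hour                  # hour of the day, 0 to 23
    return out
```

Rows with a missing `member_birth_year` keep a missing `age`; whether to drop or keep them is a separate wrangling decision.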

Summary of Findings


1. Riders aged 25 - 40 took the highest number of trips.
2. About 75% of the riders are male.
3. 90% of the riders are Subscribers.
4. Most trips lasted only about 5 - 12.5 minutes.
5. Thursdays have the highest number of trips.
6. Subscribers complete at least three times as many rides as Customers each day.
7. Weekends have fewer trips but a higher average ride time than weekdays.
8. The average ride time for riders who shared bikes and those who didn't was almost the same.
9. The average age of female Subscribers is 33.3, and of female Customers 32.7.
10. The average age of male Subscribers is 34.5, and of male Customers 33.9.
11. The average age of Subscribers of the other gender is 36, and of Customers 34.9.
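
Figures such as the gender and user-type proportions and the average ages per group can be computed with simple pandas aggregations. The sketch below assumes a wrangled DataFrame that already has the `age` column; the function names are hypothetical and not taken from the project.

```python
import pandas as pd


def percentage_share(df: pd.DataFrame, column: str) -> pd.Series:
    """Percentage of rows per category in the given column (missing values ignored)."""
    return (df[column].value_counts(normalize=True) * 100).round(1)


def mean_age_by_gender_and_user_type(df: pd.DataFrame) -> pd.Series:
    """Average rider age for each (member_gender, user_type) pair."""
    return df.groupby(["member_gender", "user_type"])["age"].mean().round(1)
```

For example, `percentage_share(df, "member_gender")` and `percentage_share(df, "user_type")` correspond to findings 2 and 3, and `mean_age_by_gender_and_user_type(df)` to findings 9 through 11.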

Key Insights for Presentation

For the explanatory phase, the focus is on the characteristics of riders and how ride duration varies across:

  • User types
  • Gender
  • Hours of the day
  • Days of the week

The key insights for the presentation were as follows:
1. The gender proportion of users in the Ford GoBike System.
2. The proportion of riders by user type.
3. Duration of an average trip.
4. Daily usage of bikes by user type.
5. Daily average ride time by the user type.
6. Busiest hour of the day for riders.
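
The daily and hourly breakdowns behind insights 4 to 6 can be sketched as follows. This assumes the wrangled columns `Day_of_week`, `duration_minutes` and `time` already exist; the helper names are hypothetical, and the original presentation may have built these views differently.

```python
import pandas as pd


def trips_per_day_by_user_type(df: pd.DataFrame) -> pd.DataFrame:
    """Count of trips for each day of the week, split by user type."""
    return pd.crosstab(df["Day_of_week"], df["user_type"])


def mean_duration_per_day_by_user_type(df: pd.DataFrame) -> pd.DataFrame:
    """Average ride time in minutes for each day of the week, split by user type."""
    return df.pivot_table(
        index="Day_of_week",
        columns="user_type",
        values="duration_minutes",
        aggfunc="mean",
    )


def busiest_hour(df: pd.DataFrame) -> int:
    """Hour of the day (0 to 23) with the most trip starts."""
    return int(df["time"].value_counts().idxmax())
```

The two tables are indexed alphabetically by day name; reordering the index to run Monday through Sunday makes them easier to plot.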
