Project Overview
As part of the Google Data Analytics Professional Certificate Capstone project, I conducted an in-depth analysis of Cyclistic's bike-sharing data to understand the differences between casual riders and annual members, with the goal of developing actionable marketing strategies to convert casual riders into annual members.
Client
Cyclistic Bike-Share (Case Study)
Tools Used
R, Tableau, Visual Studio Code, Excel
Business Problem
Cyclistic, a bike-share company based in Chicago with over 5,800 bicycles and 600 docking stations, wanted to maximize the number of annual memberships, which are more profitable than casual ridership. The finance team identified that converting casual riders to annual members would be key to the company's future growth. My task was to analyze historical bike trip data to understand how annual members and casual riders use Cyclistic bikes differently.
Dataset Description
I analyzed 12 months of historical bike trip data from February 2023 to February 2024, containing over 5.8 million records with information about:
- Trip start and end times
- Trip start and end stations
- Rider type (casual or member)
- Bike type used
All personal customer information had been removed for privacy. The data needed to be cleaned to remove maintenance trips (rides shorter than 1 minute or longer than one day).
Technical Approach
Data Preparation & Processing
- Created a consolidated dataset by importing and combining 12 monthly CSV files into a single dataset for analysis
- Used R programming to clean, transform, and analyze the data
- Created additional columns for date, month, day, year, day of week, and ride length to facilitate deeper analysis
- Removed geographical coordinates but kept station names for location-based insights
- Filtered out maintenance trips (shorter than 1 minute or longer than 24 hours) and removed duplicate entries
- Processed over 5.8 million records, cleaning approximately 2.89 million rows to obtain a clean dataset of 3 million records
Analysis Methodology
- Used R for initial data aggregation, statistical analysis, and pattern recognition
- Calculated average ride lengths by day of week and member type
- Compared usage patterns between casual riders and annual members
- Analyzed seasonal trends and peak usage times
- Created visualizations in R showing ride duration and frequency by user type
- Exported key data to Tableau for more advanced visualizations and interactive dashboards
Initial R code used to load data
Data cleaning workflow showing the transformation process
Results & Insights
My analysis revealed significant differences in how casual riders and annual members use Cyclistic bikes:
Key Findings
- Ride Duration: Casual members ride almost twice as long per trip (26.28 min) compared to annual members (14.77 min)
- Weekly Patterns: Annual members use bikes consistently throughout the weekdays, while casual riders show higher activity on weekends
- Daily Patterns: Annual members show peak usage during morning and evening commute hours, suggesting they use bikes for work transportation
- Seasonal Trends: Both user types show highest activity during summer months (April to August), with winter seeing significant decline in usage
- Hourly Patterns: Annual members have two distinct peaks at 8am and 5pm, while casual riders' usage gradually increases throughout the day
Average trip duration by member type and day of week
Number of rides by day of week and member type
Monthly ridership trends showing seasonal patterns
Hourly usage patterns showing different peak times
Tableau Visualizations
I created two different Tableau visualization sets to communicate my findings:
- Version 1: Focused on ride length differences between user types with comparative bar charts highlighting the stark differences in usage patterns
- Version 2: Provided deeper insights with ride length by month, day of week, and number of trips started per hour, revealing seasonal patterns and peak usage times
Tableau dashboard showing ride length comparisons
Tableau story with multiple visualizations of usage patterns
Recommendations & Conclusion
Marketing Recommendations
Based on my analysis, I developed three strategic recommendations to convert casual riders to annual members:
-
Weekend-Focused Membership Benefits
Offer new annual membership plans that include special weekend discounts or extended ride times on weekends to appeal to casual riders who predominantly use bikes on weekends.
-
Flexible Membership Tiers
Introduce 3-month and 6-month membership options as stepping stones to annual membership, with pricing structured to make the annual membership the best value. Include special winter discounts to address the seasonal usage drop.
-
Strategic Promotional Timing
Deploy targeted marketing campaigns during peak casual rider hours and seasons, offering limited-time discount deals during high-traffic periods to capture attention when most casual riders are using the service.
Implementation Strategy
To effectively implement these recommendations, Cyclistic should:
- Test weekend discounts (1-2 months): Pilot special weekend rates for annual members to gauge impact on conversion.
- Launch flexible membership tiers (2-3 months): Introduce short-term membership options while maintaining value proposition of annual membership.
- Deploy targeted marketing (ongoing): Time promotional messaging to coincide with peak casual ridership hours and seasons.
- Measure conversion metrics (3-6 months): Track conversion rates from casual to member status to evaluate effectiveness.
- Refine approach (6-12 months): Adjust strategies based on performance data and seasonal patterns.
Skills Demonstrated
This project showcased my abilities in:
- Data cleaning and preparation: Processing large datasets and handling data quality issues
- R programming: Using R for data manipulation, statistical analysis, and visualization
- Data visualization: Creating effective visualizations in both R and Tableau
- Strategic thinking: Translating data insights into actionable business recommendations
- Communication: Presenting complex findings in a clear, concise manner for stakeholders