Skip to content

Cyberoctane29/Cyclistic-Bike-Share--Analyzing-Rider-Behavior

Repository files navigation

Cyclistic Bike Share: Analyzing Rider Behavior - Capstone Project

Overview

Welcome to my capstone project repository for the Google Data Analytics Professional Certificate! This project showcases my ability to apply data analytics techniques using SQL, spreadsheets, R programming, Python, and Power BI to derive insights from real-world data. The repository presents the complete analytical workflow, including data cleaning, validation, exploration, visualization, dashboard development, and actionable business recommendations.

The project demonstrates an end-to-end analytics workflow spanning data cleaning, validation, exploratory analysis, visualization, dashboard development, and business-focused recommendations.

Quick Access

Note: The Power BI (.pbix) file is provided through the project drive folder due to GitHub file size limitations.

Project Description

This project analyzes bike-sharing data from Cyclistic, a fictional bike-share company, to understand rider behavior and identify trends. The project leverages SQL for data analysis, spreadsheets for data cleaning and transformation, R for visualization, Python for data validation and exploratory analysis, and Power BI for interactive dashboard development and business-focused reporting.

The analysis involves:

  • Data Cleaning & Processing: Using SQL and spreadsheets for data cleaning and preparation.
  • Analysis: SQL queries for exploring trends, patterns, and insights within the data.
  • Visualizations: Using R to generate meaningful visualizations to communicate the analysis results effectively.
  • Data Validation & Exploration: Using Python and Jupyter Notebook to assess schema consistency, validate data quality, and explore dataset characteristics.
  • Interactive Dashboard Development: Using Power BI to transform analytical findings into an interactive reporting solution with business-focused insights and recommendations.

What’s Included

  • Jupyter Notebook (.ipynb): Complete Cyclistic case study documenting the full analytical workflow, including data preparation, SQL queries and results, R-based visualizations using ggplot2, exploratory analysis, key findings, and business recommendations.
  • Data Files: SQL query results and processed spreadsheet data used throughout the analysis and visualizations.
  • Knitted HTML & PDF Reports: Full case study reports containing code, explanations, findings, visualizations, and recommendations.
  • Power BI Dashboard File (.pbix): Interactive multi-page dashboard analyzing rider behavior, ride locations, and membership growth opportunities.
  • Dashboard PDF Export: Shareable static version of the complete dashboard for offline viewing.
  • Dashboard Walkthrough Video: Demonstration of dashboard functionality, navigation, and interactivity.
  • Data Validation & Exploration Notebook: Jupyter notebook documenting data quality checks, schema validation, exploratory analysis, and validation procedures performed throughout dashboard development.
  • Dashboard Assets: Supporting screenshots, PDFs, and project resources used in dashboard documentation and presentation.

Power BI Dashboard Overview

The Power BI dashboard was developed to explore behavioral differences between annual members and casual riders and support strategic membership growth recommendations.

The dashboard includes:

  • Executive Overview
  • Rider Behavior Analysis
  • Ride Location Analysis
  • Strategic Recommendations

The report combines exploratory analysis, data validation, interactive visualizations, and business-focused storytelling.

Dashboard Pages

Executive Overview

Cyclistic Executive Overview Page-1

Rider Behavior Analysis

Cyclistic Rider Behavior Analysis Page-2

Ride Location Analysis

Cyclistic Ride Location Analysis Page-3

Strategic Recommendations

Cyclistic Strategic Recommendations Page-4

Key Findings & Recommendations

The analysis revealed several meaningful differences between annual members and casual riders, providing opportunities for targeted membership growth strategies.

Key Findings

  • Annual members demonstrate stronger weekday commuting behavior.
  • Casual riders show higher weekend activity and leisure-oriented usage patterns.
  • Casual riders generally take longer rides than annual members.
  • Seasonal trends have a greater impact on casual riders than members.
  • Station usage preferences vary between rider segments, indicating differences in travel purpose and location demand.

Business Recommendations

  • Target casual riders during peak leisure and seasonal riding periods.
  • Promote membership benefits at frequently used casual rider stations.
  • Develop seasonal marketing campaigns focused on frequent casual riders.
  • Introduce incentives designed to encourage casual-to-member conversion.
  • Leverage location-based marketing strategies to improve membership adoption.

Technologies Used

  • SQL: For data cleaning, processing, and analysis.
  • Spreadsheets: For additional data processing and manipulation.
  • R Programming: For creating visualizations to highlight trends and insights.
  • Python (Pandas & NumPy): For exploratory analysis, validation, and data quality assessment.
  • Jupyter Notebook: For documenting validation workflows and exploratory investigations.
  • Power BI: For dashboard development, interactive reporting, and business-focused data storytelling.

Repository Structure

├── Analysis Reports
├── Power BI Dashboard Assets
│   ├── Images
│   ├── PDFs
│   └── Supporting Files
├── Cyclistic_Bike_Share_Analyzing_Rider_Behavior.ipynb
├── Cyclistic Member vs Casual Rider Analysis Dashboard.pdf
├── Video - Cyclistic Member vs Casual Rider Analysis Dashboard.mp4
└── README.md

Conclusion

This project applies the Google Data Analytics Cyclistic case study framework to analyze rider behavior, identify meaningful differences between annual members and casual riders, and develop data-driven recommendations to support membership growth.

Through data validation, exploratory analysis, visualization, interactive dashboard development, and business-focused reporting, the project demonstrates an end-to-end analytics workflow spanning data preparation, validation, exploration, visualization, dashboard development, and data storytelling.

About

Analyzed Cyclistic bike-share data to uncover behavioral differences between casual riders and annual members using SQL, R, Python, and Power BI. Developed an interactive dashboard, performed data validation and exploratory analysis, and generated data-driven recommendations to support membership growth. For Kaggle notebook check:

Topics

Resources

Stars

Watchers

Forks

Contributors