git-pandas

A wrapper around GitPython that produces pandas DataFrames for analysis

Python BSD-3-Clause
190 Stars
25 Forks
2 Issues

Languages

Python 99.5%

Project Info

Created
October 27, 2015
Last Updated
July 6, 2025
License
BSD-3-Clause
Default Branch
master

About This Project

Git Pandas wraps GitPython to produce pandas DataFrames for Git repository analysis. It enables data-driven analysis of Git repositories with features for commit history, blame information, and project-level insights.

Features

  • Commit Analysis: Convert commit history into pandas DataFrames
  • Blame Data: Analyze file ownership and contribution patterns
  • Project Aggregation: Analyze multiple repositories together
  • Caching: Redis-backed caching for faster repeated analysis
  • Visualizations: Built-in punchcard and contribution charts

Installation

pip install git-pandas

Quick Start

from gitpandas import Repository

repo = Repository('/path/to/repo')

# Get commit history as DataFrame
commits = repo.commit_history()

# Analyze file blame
blame = repo.blame()

# Generate punchcard data
punchcard = repo.punchcard()
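
To make "punchcard data" concrete, the sketch below shows the general idea of bucketing commit times by weekday and hour, using only the Python standard library. It is an illustration under stated assumptions, not git-pandas's implementation: the function name punchcard_counts and the sample commit times are hypothetical, and the structure actually returned by repo.punchcard() may differ.

```python
from collections import Counter
from datetime import datetime, timezone


def punchcard_counts(commit_times):
    """Count commits per (weekday, hour) bucket; Monday is weekday 0.

    Hypothetical helper for illustration only, not part of git-pandas.
    """
    counts = Counter()
    for moment in commit_times:
        counts[(moment.weekday(), moment.hour)] += 1
    return counts


# Made-up commit times (UTC), not taken from any real repository.
sample_commits = [
    datetime(2015, 10, 27, 9, 15, tzinfo=timezone.utc),  # Tuesday
    datetime(2015, 10, 27, 9, 40, tzinfo=timezone.utc),  # Tuesday
    datetime(2015, 10, 28, 14, 5, tzinfo=timezone.utc),  # Wednesday
    datetime(2015, 11, 3, 9, 55, tzinfo=timezone.utc),   # Tuesday
]

counts = punchcard_counts(sample_commits)

# Print one line per non-empty bucket, ordered by weekday then hour.
for (weekday, hour), n in sorted(counts.items()):
    print(f"weekday={weekday} hour={hour:02d} commits={n}")
```

Each bucket corresponds to one cell of a punchcard chart. In a real analysis the timestamps would come from the repository's commit history rather than a hand-written list.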
