Git Pandas wraps GitPython to produce pandas DataFrames for Git repository analysis. It enables data-driven analysis of Git repositories with features for commit history, blame information, and project-level insights.
Features
- Commit Analysis: Convert commit history into pandas DataFrames
- Blame Data: Analyze file ownership and contribution patterns
- Project Aggregation: Analyze multiple repositories together
- Caching: Redis-backed caching for faster repeated analysis
- Visualizations: Built-in punchcard and contribution charts
Installation
pip install git-pandas
Quick Start
from gitpandas import Repository
repo = Repository('/path/to/repo')
# Get commit history as DataFrame
commits = repo.commit_history()
# Analyze file blame
blame = repo.blame()
# Generate punchcard data
punchcard = repo.punchcard()