Polished and well engineered. Punching above its star count.
StatsPAI is the first Agent-native Python library for causal inference and applied econometrics — unified API, broad cross-method coverage, structured result objects, machine-readable schemas, an MCP server, and R/Stata parity validation.
Outstanding work. A score of 99/100 puts this repo in a very small tier of truly well-engineered open source projects.
What to fix first
The highest-impact improvements for this repo.
- 1Install and run instructionsDocumentationIssue
Add a .env.example listing all required environment variables so contributors know what to set up.
- 2ReproducibilityEngineeringIssue
Add a Dockerfile, .nvmrc, or .python-version to pin the runtime version and make the environment reproducible.
Detailed breakdown
Documentation
97- README100
- README is present.
- README is well structured with multiple sections.
- README includes screenshots or visuals. Great for first impressions.
- README has code examples.
- README links to a live demo or deployed app.
- README includes status badges.
- Install and run instructions90
- README documents how to install the project.
- README documents how to run the project.
- No .env.example found (−10 pts).Add a .env.example listing all required environment variables so contributors know what to set up.
- License100
- Licensed under MIT.
- Contributing guide100
- Contributing guide is detailed and thorough.
- Contributing guide includes setup/install instructions.
- Contributing guide describes code style expectations.
- Contributing guide explains how to run tests.
- Contributing guide describes the PR/review workflow.
- Contributing guide includes code examples.
- Code of conduct present.
Engineering
99- Tests100
- Test files detected (src/statspai/diagnostics/late_test.py).
- Pytest is fully configured in pyproject.toml with testpaths and test files detected.
- CI/CD100
Not applicable?
- CI is configured (.github/workflows/build-wheels.yml).
- CI workflow runs tests.
- CI runs on pull requests, not just on pushes to main.
- CI workflow runs a lint or format check.
- CI runs type checking (tsc, mypy, cargo check, etc.).
- CI reports or uploads test coverage.
- CI caches dependencies for faster runs.
- CI tests across multiple environments or versions.
- Linting and formatting100
- Linter or formatter configured (.flake8).
- Reproducibility90
- Lockfile present (rust/statspai_hdfe/Cargo.lock). Installs are reproducible.
- No Dockerfile or runtime version pin found. Adding one earns +10 pts.Add a Dockerfile, .nvmrc, or .python-version to pin the runtime version and make the environment reproducible.
- Dependabot covers 3 ecosystems (pip, github-actions, cargo). Dependencies stay current.
- Issue and PR templates100
- Issue or PR templates present.
- Security policy present.
Project health
100- Dependency manifest100
- Dependency manifest found (pyproject.toml).
- pyproject.toml has a [project] table with package metadata.
- pyproject.toml includes a description.
- pyproject.toml specifies requires-python, preventing installs on incompatible versions.
- pyproject.toml has a [build-system] table. The package can be built and published.
- Repository metadata100
- Repository has a description.
- Primary language detected: Python.
- pyproject.toml [project] metadata is complete (description, authors, urls).
- Activity100
- Actively maintained (pushed within the last month).
- 243 stars.
- Housekeeping100
- .gitignore present.
Repository health signals
Activity, community, and responsiveness at scan time
Activity
- —Commits (30d / 90d)
- 44Forks
- 26Releaseslatest 2mo ago
Community
- —Community health
- —authors own >50% of commits
- 243Watchers
Responsiveness
- 6hMedian issue response
- <1hMedian PR merge time
- 3Open issues
Repository files43 root entries
- .cov_decomp
- .coverage_campaign
- .examples_campaign
- .githubGood: CI is configured (.github/workflows/build-wheels.yml).Good: Dependabot covers 3 ecosystems (pip, github-actions, cargo). Dependencies stay current.Good: Issue or PR templates present.
- .tier_eg_campaign
- .tierd_campaign
- benchmarks
- docs
- examples
- papers
- plans
- rustGood: Lockfile present (rust/statspai_hdfe/Cargo.lock). Installs are reproducible.
- schemas
- scripts
- specs
- srcGood: Test files detected (src/statspai/diagnostics/late_test.py).
- StatsPAI_full_data_analysis_skill
- test-notebooks
- tests
- tools
- .codespellrc
- .flake8Good: Linter or formatter configured (.flake8).
- .gitignoreGood: .gitignore present.
- .pre-commit-config.yaml
- .zenodo.json
- CHANGELOG_GITHUB.md
- CHANGELOG.md
- CITATION.cff
- CODE_OF_CONDUCT.mdGood: Code of conduct present.
- CONTRIBUTING.mdGood: Contributing guide is detailed and thorough.Good: Contributing guide includes setup/install instructions.Good: Contributing guide describes code style expectations.Good: Contributing guide explains how to run tests.Good: Contributing guide describes the PR/review workflow.Good: Contributing guide includes code examples.
- CONTRIBUTORS.md
- LICENSEGood: Licensed under MIT.
- MANIFEST.in
- MIGRATION.md
- mkdocs.yml
- paper.bib
- paper.md
- pyproject.tomlGood: Dependency manifest found (pyproject.toml).
- README_CN.mdGood: README is present.Good: README is well structured with multiple sections.Good: README includes screenshots or visuals. Great for first impressions.Good: README has code examples.Good: README links to a live demo or deployed app.Good: README includes status badges.Good: README documents how to install the project.Good: README documents how to run the project.
- README.md
- SECURITY.mdGood: Security policy present.
- SUPPORT.md
- test_results_full_suite.md