apache/spark

0

/ 100

Scala43,481Apache-2.05d ago
Grade a repo

Popular and well-maintained. A little polish away from elite status.

Apache Spark - A unified analytics engine for large-scale data processing

Documentation

91

Contributing guide5pt61

Contributing guide is too short for full depth credit (−6 pts). 400+ words earns the full +12 pts.

Add setup instructions, code style notes, and how to run tests.

Install and run instructions9pt90

README documents how to install the project.

README12pt100

README is present.

License6pt100

Licensed under Apache-2.0.

Engineering

73

Issue and PR templates6pt0

No issue or PR templates found (−100 pts).

Add .github/ISSUE_TEMPLATE/ with bug_report.md and feature_request.md to guide contributors. It dramatically improves issue quality.

Tests18pt80

Test files detected (R/pkg/inst/tests).

Reproducibility6pt80

Lockfile present (dev/package-lock.json). Installs are reproducible.

CI/CD14pt83

CI is configured (.github/workflows/build_and_test.yml).

Linting and formatting5pt100

Linter or formatter configured (dev/checkstyle.xml).

Project health

100

Dependency manifest6pt100

Dependency manifest found (pom.xml).

Repository metadata5pt100

Repository has a description.

Activity5pt100

Actively maintained (pushed within the last month).

Housekeeping3pt100

.gitignore present.

Repository files50 root entries
  • .github
    Good: CI is configured (.github/workflows/build_and_test.yml).
  • .mvn
  • assembly
  • bin
  • binder
    Good: Environment pinned via binder/Dockerfile.
  • build
  • common
  • conf
  • connector
  • core
  • data
  • dev
    Good: Linter or formatter configured (dev/checkstyle.xml).
    Good: Lockfile present (dev/package-lock.json). Installs are reproducible.
  • docs
  • examples
  • graphx
  • hadoop-cloud
  • launcher
  • licenses
  • licenses-binary
  • mllib
  • mllib-local
  • project
  • python
  • R
    Good: Test files detected (R/pkg/inst/tests).
  • repl
  • resource-managers
  • sbin
  • sql
  • streaming
  • tools
  • udf
  • ui-test
  • .asf.yaml
  • .gitattributes
  • .gitignore
    Good: .gitignore present.
  • .nojekyll
  • .pre-commit-config.yaml
  • .sbtopts
  • AGENTS.md
  • CLAUDE.md
  • CONTRIBUTING.md
    Issue: Contributing guide is too short for full depth credit (−6 pts). 400+ words earns the full +12 pts.Fix: Add setup instructions, code style notes, and how to run tests.
    Issue: Contributing guide lacks a setup section (−12 pts).Fix: Show new contributors how to get a local dev environment running.
    Issue: Contributing guide lacks a code style section (−8 pts).Fix: Describe your linting/formatting rules and how to run them.
    Issue: Contributing guide lacks a testing section (−8 pts).Fix: Show contributors how to run the test suite (e.g. npm test, pytest, cargo test).
    Good: Contributing guide describes the PR/review workflow.
    Issue: Contributing guide has no code examples (−5 pts).Fix: Add code blocks showing example commands for setup, running tests, and submitting a PR.
  • LICENSE
    Good: Licensed under Apache-2.0.
  • LICENSE-binary
  • NOTICE
  • NOTICE-binary
  • pom.xml
    Good: Dependency manifest found (pom.xml).
  • pyproject.toml
  • README.md
    Good: README is present.
    Good: README is well structured with multiple sections.
    Good: README includes screenshots or visuals. Great for first impressions.
    Good: README has code examples.
    Good: README links to a live demo or deployed app.
    Good: README includes status badges.
    Good: README documents how to install the project.
    Good: README documents how to run the project.
  • scalastyle-config.xml
  • SECURITY.md
    Good: Security policy present.