Skip to content

Feat: Discovery Engine - The Spider #8937

@tobiu

Description

@tobiu

This task focuses on implementing the "Spider" discovery mechanism to expand the contributor index beyond the top repositories.

Requirements

  1. Spider Logic: Update gh_discovery.mjs to traverse the following graph of the initial "seed" candidates.
  2. Depth Control: Implement a depth limit (e.g., 1 or 2 layers) to prevent infinite crawling.
  3. Filtration: Apply a "Pulse Check" (e.g., >50 contributions in current year) before performing a "Deep Scan" on new candidates to optimize API usage.
  4. Deduplication: Ensure users are not scanned twice.

Acceptance Criteria

  • The script discovers and indexes high-contribution users who may not be in the top 50 repositories but are followed by top contributors.
  • The resources/data.json file is populated with these new candidates.

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions