New Case Study:See how Anthropic automated 95% of dependency reviews with Socket.Learn More →

github-repo-crawler

Package Overview

Dependencies

Advanced tools

Install Socket

Detect and block malicious and high-risk dependencies

Install

Package was removed

Sorry, it seems this package was removed from the registry

github-repo-crawler

The organisation Crawler uses the [GitHub GraphQL API](https://docs.GitHub.com/en/GraphQL) to extract all of a GitHub organisation's repositories and their dependencies. The current version of the dependencies are found using `DependencyGraphManifest` Gra

1.0.0
unpublished
latest
npm

Version published: 3 years ago

Weekly downloads: 0

Maintainers: 1

Weekly downloads

Created: 3 years ago

Source

Organisation Crawler

The organisation Crawler uses the GitHub GraphQL API to extract all of a GitHub organisation's repositories and their dependencies. The current version of the dependencies are found using DependencyGraphManifest GraphQL object. The latest versions of the dependencies are found using calls to the npm API.

The information is outputted to a response.json file which has a schema .

The current goal is to have the crawler run onn the the frontend

TODO

Improvements

Pagination
Rate Limiter for GitHub
Default Branch

[o] Error handling
- invalid or expired token
Fetch all dependencies, currently cut off limit is 250 for each manifest file
Dedicated GraphQL file
Slack alert if a once a dependency is two majors behind `

Potential

Store results into into a database

Package Managers support

JavaScript
Python

Repository Setup

Before you commit, please configure pre-commit with:

pre-commit install

Now, every time you commit, it will run hooks to fix various styling and linting problem.

Running pre-commit hooks manually

pre-commit run --all-files

Skipping pre-commit hooks

Please avoid doing this at all cost.

git commit -n -m "Your commit message"

The -n allows you to skip git hooks.

Usage

Create a personal access token on GitHub.
Put your token inside the a .env file, and add to .gitignore
run npm install to install the necessary dependencies
run tsc -w or npx tsc -w (if npm is not installed globally on your device) to let TypeScript compile the code and watch for changes. This will create the build folder
run node ./build/index.js in a new terminal tab to execute the crawler

If you want to play around, simply edit the code (tsc will compile it automatically as long as tsc -w is running), finally run node ./build/index.js again.

Output schema

RepoMap: Map<ID, Object> = {
    ID: {
        name: ID= "NAME",
        version: SemVer = VERSION,
        link: string = "LINK",
        internal: bool = IS_INTERNAL
        archived: bool = IS_ARCHIVED
    },
    ...
}
DependencyList: Object[] = [
    {
        id: ID = ID,
        dependencies: [int, SemVer] = [
            [ID, VERSION],
            ...
        ]
    },
    ...
]

FAQs

What is github-repo-crawler?

Is github-repo-crawler popular?

Is github-repo-crawler well maintained?

Package last updated on 18 Jun 2022

Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install