
github-repo-gpt-scraper
Welcome to the GitHub Repo GPT Scraper! This powerful tool is designed to help you effortlessly scrape GitHub repositories in order to create an OpenAI GPT based on your code! It works with either a public GitHub repository URL or a local directory (defaulting to the current working directory if no URL is passed).
Scrape a GitHub Repository:
npx github-repo-gpt-scraper --url=https://github.com/user/repo --out=repo.json
Replace https://github.com/user/repo with the URL of the repository you wish to scrape.
Scrape the Current Working Directory:
npx github-repo-gpt-scraper --out=repo.json
This will scrape all the files in your current working directory, excluding files ignored by the .gitignore file in that directory, common lockfiles, and binary files.
Filter Files with Include and Exclude Options:
Use the --include option to specify a glob pattern for files you want to include. Use the --exclude option to specify a glob pattern for files you want to exclude.
Example:
npx github-repo-gpt-scraper --include="src/**/*.ts" --out=repo.json
Or:
npx github-repo-gpt-scraper --exclude="tests/**" --out=repo.json
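To get a feel for which files patterns like the ones above would select, here is a rough TypeScript sketch. It is not the scraper's actual implementation: the README does not say which glob library or matching rules the tool uses, so the hand-rolled `globToRegExp` helper, the `selectFiles` function, and the way include and exclude are combined are all assumptions made for illustration.

```typescript
// Illustrative only: a tiny glob matcher supporting "**", "*", and "?".
// The real tool's glob semantics may differ.
function globToRegExp(glob: string): RegExp {
  let pattern = "";
  let i = 0;
  while (i < glob.length) {
    const ch = glob.charAt(i);
    if (glob.startsWith("**/", i)) {
      pattern += "(?:.*/)?"; // zero or more whole directories
      i += 3;
    } else if (glob.startsWith("**", i)) {
      pattern += ".*"; // anything, including slashes
      i += 2;
    } else if (ch === "*") {
      pattern += "[^/]*"; // anything within a single path segment
      i += 1;
    } else if (ch === "?") {
      pattern += "[^/]"; // exactly one non-slash character
      i += 1;
    } else {
      pattern += ch.replace(/[.+^${}()|[\]\\]/g, "\\$&"); // escape regex metacharacters
      i += 1;
    }
  }
  return new RegExp(`^${pattern}$`);
}

// Hypothetical option names mirroring the --include and --exclude flags.
interface FilterOptions {
  include?: string;
  exclude?: string;
}

// Assumption: a file is kept if it matches include (when given)
// and does not match exclude (when given).
function selectFiles(filePaths: string[], options: FilterOptions): string[] {
  const include = options.include ? globToRegExp(options.include) : null;
  const exclude = options.exclude ? globToRegExp(options.exclude) : null;
  return filePaths.filter(
    (p) => (include === null || include.test(p)) && (exclude === null || !exclude.test(p))
  );
}

const samplePaths = [
  "src/index.ts",
  "src/lib/util.ts",
  "src/index.js",
  "tests/index.test.ts",
  "README.md",
];

const onlyTs = selectFiles(samplePaths, { include: "src/**/*.ts" });
const noTests = selectFiles(samplePaths, { exclude: "tests/**" });
console.log(onlyTs);
console.log(noTests);
```

With these sample paths, the include pattern keeps only the two TypeScript files under src, and the exclude pattern drops only the file under tests.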
Visit https://chat.openai.com/create and click the "Configure" tab.
Under "Knowledge," click "Upload files" and select the JSON file output by the scraper.
Add the following basic instructions to the "Instructions" field:
You are the creator of the codebase documented in the attached file and an expert in all of its code and the dependencies it uses. All of the user's questions will relate to this code, so reference it heavily. Give factual, detailed answers and help the user make updates to the code in as efficient a manner as possible while explaining more complex points to them along the way.
The simple instructions above cover the essentials and seem to work pretty well, but feel free to experiment with your own!
The tool outputs a JSON file (repo.json in the above examples) containing the path, URL, and content of each file scraped. I haven't yet experimented with different ways of formatting the file data (or adding supplemental info) and their impact on GPTs, but I'd be eager to hear about anyone's findings if they do so!
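If you want to inspect the output before uploading it, something like the following TypeScript sketch may help. The exact schema is not documented here, so the top-level array and the key names `path`, `url`, and `content` are guesses based on the description above; `parseScrapeOutput`, `largestFiles`, and the sample data are hypothetical.

```typescript
// Assumed shape of one entry; the key names are guesses, not a documented schema.
interface ScrapedFile {
  path: string;
  url: string;
  content: string;
}

// Parse the JSON text and check that every entry has the three expected string fields.
function parseScrapeOutput(json: string): ScrapedFile[] {
  const data: unknown = JSON.parse(json);
  if (!Array.isArray(data)) {
    throw new Error("expected a top-level array");
  }
  return data.map((entry: unknown, index: number) => {
    const e = entry as Partial<ScrapedFile> | null;
    if (
      !e ||
      typeof e.path !== "string" ||
      typeof e.url !== "string" ||
      typeof e.content !== "string"
    ) {
      throw new Error(`entry ${index} is missing path, url, or content`);
    }
    return { path: e.path, url: e.url, content: e.content };
  });
}

// Return the `count` entries with the longest content, largest first.
function largestFiles(files: ScrapedFile[], count: number): ScrapedFile[] {
  return [...files].sort((a, b) => b.content.length - a.content.length).slice(0, count);
}

// Made-up sample standing in for the contents of repo.json.
const sampleJson = JSON.stringify([
  {
    path: "README.md",
    url: "https://github.com/user/repo/blob/main/README.md",
    content: "# Demo",
  },
  {
    path: "src/index.ts",
    url: "https://github.com/user/repo/blob/main/src/index.ts",
    content: "export const x = 1;",
  },
]);

const scrapedFiles = parseScrapeOutput(sampleJson);
const biggest = largestFiles(scrapedFiles, 1);
console.log(biggest.map((f) => `${f.path}: ${f.content.length} chars`));
```

In practice you would read the text of repo.json from disk and pass it to `parseScrapeOutput` in place of the sample string. Listing the largest entries is one quick way to spot files worth excluding on the next run.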
Contributions are welcome! Open a PR 😎
This project is licensed under the MIT License.
Happy Scraping and GPTs'ing! 🚀🤖
FAQs
The npm package github-repo-gpt-scraper receives a total of 4 weekly downloads. As such, its popularity is classified as not popular.
The package's version release cadence and project activity are rated as not healthy because the last version was released a year ago. It has 1 open source maintainer collaborating on the project.