Research
Security News
Malicious npm Packages Inject SSH Backdoors via Typosquatted Libraries
Socket’s threat research team has detected six malicious npm packages typosquatting popular libraries to insert SSH backdoors.
fetch-filecache-for-crawling
Advanced tools
Implementation of a `fetch` that extends the implementation from `node-fetch` to add an HTTP cache using a local cache folder for crawling purpose.
Node.js module that exports a fetch
function that extends the implementation
from node-fetch
to add an HTTP cache using a local cache folder.
The code was developed for a particular scenario with specific requirements in mind, and no attempts were made to generalize them. Publication as an npm package is mostly intended to ease reuse by a couple of specific projects.
Typically, the module is intended to be used for crawling purpose and makes the following assumptions, which do not hold true in other cases:
Run npm install fetch-filecache-for-crawling
const fetch = require('fetch-filecache-for-crawling');
// URLs to crawl, some of which may be identical
let urls = [
'https://caniuse.com/data.json',
'https://caniuse.com/data.json'
]
Promise.all(urls.map(url =>
fetch(url, { logToConsole: true })
.then(response => response.json())
.then(json => console.log(Object.keys(json.data).length +
' entries in Can I Use'))
)).catch(err => console.error(err));
On top of usual fetch
options, the following optional parameters can be
passed to fetch
in the options
parameter to change default behavior:
cacheFolder
: the name of the cache folder to use. By default, the code caches all files in a folder named .cache
.resetCache
: set to true
to empty the cache folder when the application starts. Defaults to false
. Note that the cache folder will only be reset once, regardless of whether the parameter is set to true
in subsequent calls to fetch
.avoidNetworkRequests
: set to true
to consider that responses in the cache folder are always valid when they exist. Defaults to false
, which means that the method will send a conditional HTTP request to check whether the response in the cache is still valid.forceRefresh
: set to true
to force another network fetch on a URL that has already been fetched during this crawl. This can be particularly useful when a fetch returns a temporary HTTP error if you want to try again later on. Defaults to false
, meaning that, during the lifetime of an application, the response in the cache is automatically returned when the underlying URL has already been fetched.logToConsole
: set to true
to output progress messages to the console. Defaults to false
. All messages start with the ID of the request to be able to distinguish between them.For instance, you may do:
const fetch = require('fetch-filecache-for-crawling');
fetch('https://www.w3.org/', {
resetCache: true,
cacheFolder: 'mycache',
logToConsole: true
}).then(response => {});
If a config.json
file exists in the current folder, the code will try to parse it as JSON and will look for the above parameters in that file. Configuration parameters provided in the options
parameter take precedence over those defined in config.json
.
The code is available under an MIT license.
FAQs
Implementation of a `fetch` that extends the implementation from `node-fetch` to add an HTTP cache using a local cache folder for crawling purpose.
The npm package fetch-filecache-for-crawling receives a total of 351 weekly downloads. As such, fetch-filecache-for-crawling popularity was classified as not popular.
We found that fetch-filecache-for-crawling demonstrated a not healthy version release cadence and project activity because the last version was released a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Research
Security News
Socket’s threat research team has detected six malicious npm packages typosquatting popular libraries to insert SSH backdoors.
Security News
MITRE's 2024 CWE Top 25 highlights critical software vulnerabilities like XSS, SQL Injection, and CSRF, reflecting shifts due to a refined ranking methodology.
Security News
In this segment of the Risky Business podcast, Feross Aboukhadijeh and Patrick Gray discuss the challenges of tracking malware discovered in open source softare.