sitemap2docext
This module downloads all web pages listed in the Sitemap.xml file and compiles them into a single document.
Requires Node.js v20 or later.
Designed for AI Embedding Generation
Terminal
npm init -y && npm i sitemap2doc
Node index.mjs
import { Sitemap2Doc } from 'sitemap2doc'
const s2d = new Sitemap2Doc()
await s2d.getDocument( {
'projectName': 'test',
'sitemapUrl': 'https://...'
} )
Terminal
node index.mjs
Key | Type | Description | Required | Default |
---|---|---|---|---|
projectName | String | Set project name | true | |
sitemapUrl | String | Set sitemap source | true | |
silent | Boolean | Control terminal output | false | false |
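The table above can be read as a small validation contract. The following is a minimal sketch of that contract, not the package's own code: `normalizeOptions` is a hypothetical helper, and the exact error behavior of `getDocument` on bad input is an assumption.

```javascript
// Hypothetical helper mirroring the options table; not part of sitemap2doc.
// projectName and sitemapUrl are required strings, silent defaults to false.
function normalizeOptions( { projectName, sitemapUrl, silent = false } = {} ) {
    if( typeof projectName !== 'string' || projectName === '' ) {
        throw new TypeError( 'projectName is required and must be a string' )
    }
    if( typeof sitemapUrl !== 'string' || sitemapUrl === '' ) {
        throw new TypeError( 'sitemapUrl is required and must be a string' )
    }
    if( typeof silent !== 'boolean' ) {
        throw new TypeError( 'silent must be a boolean' )
    }
    return { projectName, sitemapUrl, silent }
}

const options = normalizeOptions( {
    'projectName': 'test',
    'sitemapUrl': 'https://example.com/sitemap.xml'
} )
console.log( options )
```

Omitting `silent` yields `false`, which matches the default column in the table.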
Example
import { Sitemap2Doc } from 'sitemap2doc'
const s2d = new Sitemap2Doc()
await s2d.getDocument( {
'projectName': 'test',
'sitemapUrl': 'https://...'
} )
Get Sitemap https://...
Get Pages 0 1 2 3 4 5 6 7 8 9
Merge 0
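The log above lists three phases: fetching the sitemap, fetching the pages, and merging. The sketch below illustrates the first and last phases on in-memory data. It is an illustration only; `extractSitemapUrls` and `mergePages` are hypothetical names, and the package's actual parsing and output format may differ.

```javascript
// Hypothetical helper: collect every <loc> URL from a sitemap XML string.
// A regex is enough for a sketch; a real XML parser is more robust
// (for example, this does not decode entities such as &amp;).
function extractSitemapUrls( xml ) {
    const urls = []
    const pattern = /<loc>\s*([^<\s]+)\s*<\/loc>/g
    for( const match of xml.matchAll( pattern ) ) {
        urls.push( match[ 1 ] )
    }
    return urls
}

// Hypothetical helper: join downloaded pages into one document,
// labelling each part with its source URL.
function mergePages( pages ) {
    return pages
        .map( ( { url, text } ) => `# ${url}\n\n${text.trim()}` )
        .join( '\n\n' )
}

const xml = `<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.com/</loc></url>
  <url><loc>https://example.com/docs</loc></url>
</urlset>`

const urls = extractSitemapUrls( xml )
// Placeholder page text stands in for the downloaded content.
const merged = mergePages( urls.map( ( url, i ) => ( { url, text: `Page ${i}` } ) ) )
console.log( merged )
```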
Returns the current config. The default config is located at ./src/data/config.mjs
import { Sitemap2Doc } from 'sitemap2doc'
const s2d = new Sitemap2Doc()
let config = s2d.getConfig()
config['download']['chunkSize'] = 4
s2d
.setConfig( { config } )
.getDocument( { ... } )
All module settings are stored in a config file; see ./src/data/config.mjs. This config can be completely overridden by passing an object during initialization.
import { Sitemap2Doc } from 'sitemap2doc'
const s2d = new Sitemap2Doc()
let config = s2d.getConfig()
config['download']['chunkSize'] = 4
s2d
.setConfig( { config } )
.getDocument( { ... } )
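The source does not say what `chunkSize` controls. Assuming it is the number of pages downloaded per batch, the sketch below shows one way such batching could work: items within a chunk run concurrently, and chunks run one after another. `toChunks`, `runInChunks`, and `fakeDownload` are hypothetical names, not part of the package.

```javascript
// Hypothetical helper: split a list into consecutive groups of `size` items.
function toChunks( items, size ) {
    const chunks = []
    for( let i = 0; i < items.length; i += size ) {
        chunks.push( items.slice( i, i + size ) )
    }
    return chunks
}

// Hypothetical helper: run `task` on every item, one chunk at a time.
// Items inside a chunk run concurrently; chunks run sequentially.
async function runInChunks( items, size, task ) {
    const results = []
    for( const chunk of toChunks( items, size ) ) {
        const part = await Promise.all( chunk.map( ( item ) => task( item ) ) )
        results.push( ...part )
    }
    return results
}

// Stand-in for a page download, so the sketch runs without network access.
const fakeDownload = async ( url ) => `content of ${url}`

const pageUrls = Array.from( { length: 10 }, ( _, i ) => `https://example.com/page-${i}` )
runInChunks( pageUrls, 4, fakeDownload )
    .then( ( pages ) => console.log( `downloaded ${pages.length} pages` ) )
```

A smaller chunk size limits how many requests are in flight at once, at the cost of a longer total run.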
The module is available as open source under the terms of the Apache 2.0 License.