Security News
Oracle Drags Its Feet in the JavaScript Trademark Dispute
Oracle seeks to dismiss fraud claims in the JavaScript trademark dispute, delaying the case and avoiding questions about its right to the name.
assetgraph-plugin-sitemap
Advanced tools
An Assetgraph plugin that adds support for robots.txt and sitemaps
A plugin of assets and relations relating to sitemaps.
This plugin is especially useful if you intend to crawl your entire website, which might contain landing pages that are not linked to in your site navigation structure.
With this plugin, you can initialize an Assetgraph from an XML sitemap or robots.txt
(with Sitemap:
-directives).
Supported sitemap types:
assetgraph-plugin-sitemap
har a peer dependency on assetgraph
npm install --save-dev assetgraph-plugin-sitemap assetgraph
Then use the plugin by adding the sitemap extensions to an existing Assetgraph instance:
const AssetGraph = require('assetgraph');
const extendWithSitemaps = require('assetgraph-plugin-sitemap');
const graph = new AssetGraph({ root: 'https://example.com' });
extendWithSitemaps(graph);
async function main() {
await graph.loadAssets('robots.txt', 'sitemap.xml');
await graph.populate({
followRelations: {
crossorigin: false
}
});
// You now have a fully populated graph based on your
// robots.txt Sitemap:-directives and your sitemap.xml
}
main();
Robots
- See robots.txtXmlSitemap
- See https://en.wikipedia.org/wiki/Sitemaps#File_formatTextSitemap
- See https://en.wikipedia.org/wiki/Sitemaps#Text_fileRobotsSitemap
- A relation to any sitemap format, starting from the Sitemap:
directive in robots.txt
XmlSitemapUrl
- A relation to a page, starting from the XML sitemap <url>
elementTextSitemapUrl
- A relation to a page, starting from a URL line in TextSitemap
robots.txt
is automatically upgraded from a Text
asset to a Robots
asset, which also discovers the Sitemap:
-directives in robots.txt
and adds RobotsSitemap
relations to the graph.
XML sitemaps are automatically upgraded from an Xml
asset to a XmlSitemap
asset based on the <urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
content. <url>
elements in the Xml sitemap automatically add XmlSitemapUrl
relations to the graph.
While TextSitemap
assets now exist, they cannot be automatically inferred based on their contents alone. This means that a Text
asset can only be automatically upgraded to a TextSitemap
asset based on an incoming RobotsSitemap
relation. It is recommended to always add a Sitemap:
-directive to your robots.txt
and initialize your assetgraph from there in order to have TextSitemap
work correctly. When a TextSitemap
is inferred, TextSitemapUrl
relations are added to the graph for each URL line in the text file.
FAQs
An Assetgraph plugin that adds support for robots.txt and sitemaps
The npm package assetgraph-plugin-sitemap receives a total of 811 weekly downloads. As such, assetgraph-plugin-sitemap popularity was classified as not popular.
We found that assetgraph-plugin-sitemap demonstrated a not healthy version release cadence and project activity because the last version was released a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Security News
Oracle seeks to dismiss fraud claims in the JavaScript trademark dispute, delaying the case and avoiding questions about its right to the name.
Security News
The Linux Foundation is warning open source developers that compliance with global sanctions is mandatory, highlighting legal risks and restrictions on contributions.
Security News
Maven Central now validates Sigstore signatures, making it easier for developers to verify the provenance of Java packages.