context-url-extractor

Methods for extracting URLs from HTML or text strings with surrounding context

Version: 1.0.1 (latest)
Source: npm
Maintainers: 1

When data mining content that contains URLs, it's far easier for a machine to categorise them if they are semantic (or friendly) URLs:

Bad URL

Good URL

This package provides a jumping-off point for data mining the context surrounding each URL found in the supplied content.

Install

npm install --save context-url-extractor

Usage

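// Assumption: the package's main export is the ContextUrlExtractor class.
const ContextUrlExtractor = require('context-url-extractor');
const extractor = new ContextUrlExtractor({ content }); // content: HTML or text string
const res = extractor.extractUrls();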

Custom Context Lengths

By default, the context strings captured before and after each URL are 170 characters long. Both lengths can be overridden in the constructor with the contextCharsBefore and contextCharsAfter options.

const extractor = new ContextUrlExtractor({ content, contextCharsBefore: 80, contextCharsAfter: 80 });
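To illustrate what the two context lengths control, here is a minimal sketch of context-window extraction. It is not the package's actual implementation: the function name `extractUrlsWithContext`, the `before`/`after` option names, and the deliberately naive URL pattern are all assumptions made for illustration.

```javascript
// Illustrative sketch only; NOT the package's implementation.
// extractUrlsWithContext, `before`, and `after` are hypothetical names.
function extractUrlsWithContext(content, { before = 170, after = 170 } = {}) {
  // Naive pattern: an http(s) scheme followed by any run of characters
  // that are not whitespace, quotes, or angle brackets.
  const urlPattern = /https?:\/\/[^\s"'<>]+/g;
  const results = [];
  for (const match of content.matchAll(urlPattern)) {
    const start = match.index;
    const end = start + match[0].length;
    results.push({
      url: match[0],
      // Up to `before` characters immediately preceding the URL.
      contextPre: content.slice(Math.max(0, start - before), start),
      // Up to `after` characters immediately following the URL.
      contextPost: content.slice(end, end + after),
    });
  }
  return results;
}

const sample = 'Please <a href="https://example.com/login">click here</a> to sign in.';
console.log(extractUrlsWithContext(sample, { before: 10, after: 12 }));
```

Smaller windows keep each result compact, while larger ones capture more of the surrounding sentence. The naive pattern would also swallow trailing punctuation in plain text (for example a full stop directly after a URL), which a real extractor would need to handle.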

Example Response

[
	{
		"url": "https://example.com/profile.aspx?section=99&trId=9877A4CF44987123AED90&rd=722108935",
		"contextPre": "nd. To log in to your profile please <a href=\"",
		"contextPost": "\">click here</a> and sign in with your email "
	}
]
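When the content is HTML, the context strings can contain whole tags as well as tags cut off at the window boundary, as in the response above. One possible post-processing step is to reduce each context string to plain text before categorising the URL. The helper below is a hypothetical sketch, not something the package provides, and its regex-based tag stripping is a rough approximation rather than a real HTML parser.

```javascript
// Hypothetical helper, not part of context-url-extractor.
function plainTextContext(fragment) {
  return fragment
    .replace(/<[^>]*>/g, ' ') // drop complete tags such as </a>
    .replace(/<[^>]*$/, ' ')  // drop a tag cut off at the end, e.g. <a href="
    .replace(/^[^<]*>/, ' ')  // drop the tail of a tag cut off at the start, e.g. ">
    .replace(/\s+/g, ' ')     // collapse runs of whitespace
    .trim();
}

// The example response from above, written as a JavaScript literal.
const res = [
  {
    url: 'https://example.com/profile.aspx?section=99&trId=9877A4CF44987123AED90&rd=722108935',
    contextPre: 'nd. To log in to your profile please <a href="',
    contextPost: '">click here</a> and sign in with your email ',
  },
];

for (const { url, contextPre, contextPost } of res) {
  console.log(url);
  console.log('  before:', plainTextContext(contextPre));
  console.log('  after: ', plainTextContext(contextPost));
}
```

Note that the third replacement would also remove legitimate text up to a literal `>` character in plain-text content, so this approach only suits content known to be HTML.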

Maintainers

@njhoran

Contributing

Small note: If editing the README, please conform to the standard-readme specification.

License

MIT © 2019 njhoran

Keywords

URL

Package last updated on 22 Aug 2019
