isbot

🤖 detect bots/crawlers/spiders via the user agent.

3.0.21

Version published: 3 years ago

Weekly downloads: 402K; decreased by 16.69%

Maintainers: 2

Install size: 17.0 kB

Created: 9 years ago

Changelog

3.0.21

Reduce pattern complexity

Readme

isbot 🤖/👨‍🦰

Detect bots/crawlers/spiders using the user agent string.

Usage

Simple detection

const isbot = require('isbot')

// Node.js HTTP server (request is an http.IncomingMessage, which exposes headers in lowercase)
isbot(request.headers['user-agent'])

// ExpressJS
isbot(req.get('user-agent'))

// User Agent string
isbot('Mozilla/5.0 (iPhone; CPU iPhone OS 6_0 like Mac OS X) AppleWebKit/536.26 (KHTML, like Gecko) Version/6.0 Mobile/10A5376e Safari/8536.25 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)') // true
isbot('Mozilla/5.0 (Windows NT 6.1) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2228.0 Safari/537.36') // false

Add crawler user agents

Add rules to the user agent match RegExp.

isbot('Mozilla/5.0') // false
isbot.extend([
    'istat',
    '^mozilla/\\d\\.\\d$'
])
isbot('Mozilla/5.0') // true
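The doubled backslashes in the rule above matter because the rule is a string, not a RegExp literal. The following is a minimal sketch of that escaping behaviour using only the built-in `RegExp` constructor. It assumes each rule string ends up in a case-insensitive RegExp; the variable names are illustrative and nothing here is taken from isbot's source.

```javascript
// Assumption: a rule string is compiled with the 'i' flag, roughly like this.
const escaped = new RegExp('^mozilla/\\d\\.\\d$', 'i');

// With single backslashes the string literal drops them before RegExp sees it,
// so the pattern becomes ^mozilla/d.d$ and no longer means "digit, dot, digit".
const unescaped = new RegExp('^mozilla/\d\.\d$', 'i');

console.log(escaped.test('Mozilla/5.0'));                  // true
console.log(escaped.test('Mozilla/5.0 (Windows NT 6.1)')); // false, because of the $ anchor
console.log(unescaped.test('Mozilla/5.0'));                // false
```

Anchoring with `^` and `$` keeps the rule from matching every browser user agent that merely starts with `Mozilla/5.0`.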

Remove matches of known crawlers

Remove rules from the user agent match RegExp (see the existing rules in the list.json file).

isbot('Chrome-Lighthouse') // true
isbot.exclude(['chrome-lighthouse']) // pattern is case insensitive
isbot('Chrome-Lighthouse') // false

Verbose result

Return the part of the user agent string that matched a bot rule.

isbot.find('Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Firefox/52.0 DejaClick/2.9.7.2') // 'DejaClick'
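To make the relationship between the detection function, `extend`, `exclude` and `find` easier to picture, here is a small self-contained sketch of a rule list compiled into a single case-insensitive RegExp. It is a hypothetical illustration, not isbot's implementation: `createMatcher`, `detect` and the three starter rules are invented for this example.

```javascript
// Hypothetical sketch, NOT isbot's source code.
function createMatcher(initialRules) {
  let rules = [...initialRules];
  let pattern = compile();

  // Join all rules into one case-insensitive alternation.
  // An empty rule list yields null so that nothing matches.
  function compile() {
    return rules.length > 0 ? new RegExp(rules.join('|'), 'i') : null;
  }

  // Boolean detection, analogous in shape to calling isbot(userAgent).
  function matcher(userAgent) {
    return typeof userAgent === 'string' && pattern !== null && pattern.test(userAgent);
  }

  // Add rules (skipping exact duplicates) and recompile.
  matcher.extend = (additions) => {
    rules = [...new Set([...rules, ...additions])];
    pattern = compile();
  };

  // Remove rules, comparing case-insensitively, and recompile.
  matcher.exclude = (removals) => {
    const drop = new Set(removals.map((rule) => rule.toLowerCase()));
    rules = rules.filter((rule) => !drop.has(rule.toLowerCase()));
    pattern = compile();
  };

  // Return the matched substring, or null when no rule matches.
  matcher.find = (userAgent) => {
    const match = pattern !== null ? String(userAgent).match(pattern) : null;
    return match ? match[0] : null;
  };

  return matcher;
}

const detect = createMatcher(['bot', 'crawler', 'dejaclick']);
const dejaClickAgent =
  'Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Firefox/52.0 DejaClick/2.9.7.2';

console.log(detect('Mozilla/5.0'));       // false
detect.extend(['^mozilla/\\d\\.\\d$']);
console.log(detect('Mozilla/5.0'));       // true
console.log(detect.find(dejaClickAgent)); // 'DejaClick'
detect.exclude(['DejaClick']);
console.log(detect(dejaClickAgent));      // false
```

Recompiling after every change keeps detection itself to a single RegExp test per request, at the cost of slightly more work when the rule list is edited.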

Definitions

Bot. An autonomous program that imitates or replaces some aspect of human behaviour, performing repetitive tasks much faster than human users could.
Good bot. An automated program that visits websites in order to collect useful information. Web crawlers, site scrapers, stress testers, preview builders and similar programs are welcome on most websites because they serve purposes of mutual benefit.
Bad bot. A program designed to perform malicious actions, ultimately hurting businesses. Examples include testing credential databases, DDoS attacks and spam bots.

Clarifications

What does "isbot" do?

This package aims to identify "Good bots": those that voluntarily identify themselves with a unique, preferably descriptive, user agent, usually sent in a dedicated request header.

What doesn't "isbot" do?

It does not try to recognise malicious bots or programs disguising themselves as real users.

Why would I want to identify good bots?

Recognising good bots such as web crawlers is useful for multiple purposes. Although it is not recommended to serve different content to web crawlers like Googlebot, you can still elect to:

Flag bot pageviews to consider in business analysis
Prefer to serve cached content and relieve service load
Omit third party solutions' code (tags, pixels)
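The three options above can be sketched as a single decision made once per request. This is only an illustration under stated assumptions: `planResponse` and its field names are hypothetical, and `detect` stands for any predicate with the same shape as the detection function shown in Usage (a user agent string in, a boolean out).

```javascript
// Hypothetical helper: decide how to treat a request based on bot detection.
// `detect` is any function (userAgent: string) => boolean.
function planResponse(userAgent, detect) {
  // A missing header is treated as "not a known bot" in this sketch.
  const bot = typeof userAgent === 'string' && detect(userAgent);

  return {
    flagPageviewAsBot: bot,      // mark the pageview for business analysis
    preferCache: bot,            // serve cached content to relieve service load
    includeThirdPartyTags: !bot, // omit third party tags and pixels for bots
  };
}

// Stand-in detector used only to keep this example self-contained.
const looksLikeBot = (userAgent) => /googlebot/i.test(userAgent);

console.log(planResponse('Mozilla/5.0 (compatible; Googlebot/2.1)', looksLikeBot));
console.log(planResponse(undefined, looksLikeBot));
```

Note that the page content itself stays the same for bots and people; only analytics flags, caching and third party scripts change.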

It is not recommended to whitelist requests for any reason based on the user agent header alone, because any client can set that header to an arbitrary value. Instead, add other methods of identification, such as a reverse DNS lookup.
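A reverse DNS check could look roughly like the sketch below, which uses Node's built-in `dns` promises API. The function name `verifyCrawlerIp`, the injectable `resolver` parameter and the `TRUSTED_SUFFIXES` list are assumptions made for this example; the suffixes shown are the ones commonly documented for Googlebot and should be checked against each crawler operator's own guidance.

```javascript
const dns = require('dns').promises;

// Assumption: hostname suffixes the operator publishes for its crawlers.
const TRUSTED_SUFFIXES = ['.googlebot.com', '.google.com'];

// Forward-confirmed reverse DNS:
// 1. resolve the IP to hostnames (PTR lookup),
// 2. keep only hostnames under a trusted suffix,
// 3. resolve each hostname forward and require it to map back to the same IP.
async function verifyCrawlerIp(ip, resolver = dns) {
  let hostnames;
  try {
    hostnames = await resolver.reverse(ip);
  } catch {
    return false; // no PTR record, or the lookup failed
  }

  for (const hostname of hostnames) {
    const name = hostname.toLowerCase();
    if (!TRUSTED_SUFFIXES.some((suffix) => name.endsWith(suffix))) continue;

    try {
      const records = await resolver.lookup(name, { all: true });
      // Plain string comparison; IPv6 addresses may need normalising first.
      if (records.some((record) => record.address === ip)) return true;
    } catch {
      // Forward lookup failed; try the next hostname.
    }
  }
  return false;
}
```

A caller would typically run this only for requests whose user agent already claims to be a crawler, and cache the result per IP address, since DNS lookups are slow compared with a RegExp test.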

Data sources

Crawlers user agents:

Non bot user agents:

user-agents npm package
Manual list (source: whatismybrowser.com)

Missing something? Please open an issue

Last updated on 23 Dec 2020
