Security News
Fluent Assertions Faces Backlash After Abandoning Open Source Licensing
Fluent Assertions is facing backlash after dropping the Apache license for a commercial model, leaving users blindsided and questioning contributor rights.
puppeteer-extra-plugin-recaptcha-advanced
Advanced tools
A puppeteer-extra plugin to solve reCAPTCHAs and hCaptchas automatically. This version has support for proxies.
A puppeteer-extra plugin to solve reCAPTCHAs and hCaptchas automatically.
yarn add puppeteer-extra-plugin-recaptcha
# - or -
npm install puppeteer-extra-plugin-recaptcha
If this is your first puppeteer-extra plugin here's everything you need:
yarn add puppeteer puppeteer-extra puppeteer-extra-plugin-recaptcha
# - or -
npm install puppeteer puppeteer-extra puppeteer-extra-plugin-recaptcha
🎁 Note: Until we've automated changelog updates in markdown files please follow the
#announcements
channel in our discord server for the latest updates and changelog info.
Older changelog:
3.1.9
3.1.6
3.1.5
for (const frame of page.mainFrame().childFrames()) {
await frame.solveRecaptchas()
}
3.1.4
Page
and Frame
object (e.g. page.solveRecaptchas()
).The plugin essentially provides a mighty page.solveRecaptchas()
method that does everything needed automagically.
// puppeteer-extra is a drop-in replacement for puppeteer,
// it augments the installed puppeteer with plugin functionality
const puppeteer = require('puppeteer-extra')
// add recaptcha plugin and provide it your 2captcha token (= their apiKey)
// 2captcha is the builtin solution provider but others would work as well.
// Please note: You need to add funds to your 2captcha account for this to work
const RecaptchaPlugin = require('puppeteer-extra-plugin-recaptcha')
puppeteer.use(
RecaptchaPlugin({
provider: {
id: '2captcha',
token: 'XXXXXXX' // REPLACE THIS WITH YOUR OWN 2CAPTCHA API KEY ⚡
},
visualFeedback: true // colorize reCAPTCHAs (violet = detected, green = solved)
})
)
// puppeteer usage as normal
puppeteer.launch({ headless: true }).then(async browser => {
const page = await browser.newPage()
await page.goto('https://www.google.com/recaptcha/api2/demo')
// That's it, a single line of code to solve reCAPTCHAs 🎉
await page.solveRecaptchas()
await Promise.all([
page.waitForNavigation(),
page.click(`#recaptcha-demo-submit`)
])
await page.screenshot({ path: 'response.png', fullPage: true })
await browser.close()
})
// `puppeteer-extra` and the recaptcha plugin are written in TS,
// hence you get perfect type support out of the box :)
import puppeteer from 'puppeteer-extra'
import RecaptchaPlugin from 'puppeteer-extra-plugin-recaptcha'
puppeteer.use(
RecaptchaPlugin({
provider: {
id: '2captcha',
token: 'ENTER_YOUR_2CAPTCHA_API_KEY_HERE'
}
})
)
// Puppeteer usage as normal (headless is "false" just for this demo)
puppeteer.launch({ headless: false }).then(async browser => {
const page = await browser.newPage()
await page.goto('https://www.google.com/recaptcha/api2/demo')
// Even this `Puppeteer.Page` extension is recognized and fully type safe 🎉
await page.solveRecaptchas()
await Promise.all([
page.waitForNavigation(),
page.click(`#recaptcha-demo-submit`)
])
await page.screenshot({ path: 'response.png', fullPage: true })
await browser.close()
})
If you'd like to see debug output just run your script like so:
DEBUG=puppeteer-extra,puppeteer-extra-plugin:* node myscript.js
Tip: The recaptcha plugin works really well together with the stealth plugin.
These days captchas are unfortunately everywhere, with reCAPTCHA having the biggest "market share" in that space (> 80%). The situation got really bad, with privacy minded users (tracking blocker, VPNs) being penalized heavily and having to solve a lot of reCAPTCHA challenges constantly while browsing the web.
The stated reasons for this omnipresent captcha plague vary from site owners having to protect themselves against increasingly malicious actors to some believing that we're essentially forced into free labour to train Google's various machine learning endeavours.
In any case I strongly feel that captchas in their current form have failed. They're a much bigger obstacle and annoyance to humans than to robots, which renders them useless. My anarchist contribution to this discussion is to demonstrate this absurdity, with a plugin for robots with which a single line of code is all it takes to bypass reCAPTCHAs on any site.
Note: Since
v3.3.0
the plugin will solve hCaptchas as well, as they've gained significant marketshare through their Cloudflare partnership.
I thought about having the plugin solve captchas directly (e.g. using the audio challenge and speech-to-text APIs), but external solution providers are so cheap and reliable that there is really no benefit in doing that. ¯\_(ツ)_/¯
Please note: You need a provider configured for this plugin to do it's magic. If you decide to use the built-in 2captcha provider you need to add funds to your 2captcha account.
Currently the only builtin solution provider as it's the cheapest and most reliable, from my experience. If you'd like to throw some free captcha credit my way feel free to signup here (referral link, allows me to write automated tests against their API).
You can easily use your own provider as well, by providing the plugin a function instead of 2captcha credentials (explained in the API docs). PRs for new providers are welcome as well.
page.solveRecaptchas()
the plugin will attempt to find any active reCAPTCHAs & hCaptchas, extract their configuration, pass that on to the specified solutions provider, take the solutions and put them back into the page (triggering any callback that might be required).reCAPTCHAs use a per-site sitekey
. Interestingly enough the response token after solving a challenge is (currently) not tied to a specific session or IP and can be passed on to others (until they expire). This is how the external solutions provider work: They're being given a sitekey
and URL, solve the challenge and respond with a response token.
This plugin automates all these steps in a generic and robust way (detecting captchas, extracting their config and sitekey
) as well as triggering the (optional) response callback the site owner might have specified.
page.solveRecaptchas()
is called.page.solveRecaptchas()
?page.solveRecaptchas()
on a page that has no reCAPTCHAs nothing bad will happen (😄) but the promise will resolve and the rest of your code executes as normal.DEBUG=puppeteer-extra,puppeteer-extra-plugin:* node myscript.js
By default the plugin will never throw, but return any errors silently in the { error }
property of the result object. You can change that behaviour by passing throwOnError: true
to the initializier and use try/catch
blocks to catch errors.
For convenience and because it looks cool the plugin will "colorize" reCAPTCHAs depending on their state (violet = detected and being solved, green = solved). You can turn that feature off by passing visualFeedback: false
to the plugin initializer.
interface PluginOptions {
/** Visualize reCAPTCHAs based on their state */
visualFeedback: boolean // default: true
/** Throw on errors instead of returning them in the error property */
throwOnError: boolean // default: false
/** Only solve captchas and challenges visible in the browser viewport */
solveInViewportOnly: boolean // default: false
/** Solve scored based captchas with no challenge (e.g. reCAPTCHA v3) */
solveScoreBased: boolean // default: false
/** Solve invisible captchas that have no active challenge */
solveInactiveChallenges: boolean // default: false
}
const {
captchas,
filtered,
solutions,
solved,
error
} = await page.solveRecaptchas()
captchas
is an array of captchas found in the pagefiltered
is an array of captchas that have been detected but are ignored due to plugin optionssolutions
is an array of solutions returned from the providersolved
is an array of "solved" (= solution entered) captchas on the pagepage.solveRecaptchas()
is a convenience method that wraps the following steps:
let { captchas, filtered, error } = await page.findRecaptchas()
let { solutions, error } = await page.getRecaptchaSolutions(captchas)
let { solved, error } = await page.enterRecaptchaSolutions(solutions)
By default the plugin will only solve reCAPTCHAs showing up on the immediate page. In case you encounter captchas in frames the plugin extends the Puppeteer.Frame
object with custom methods as well:
// Loop over all potential frames on that page
for (const frame of page.mainFrame().childFrames()) {
// Attempt to solve any potential captchas in those frames
await frame.solveRecaptchas()
}
In addition you might want to disable site isolation, so puppeteer is able to access cross-origin iframes:
puppeteer.launch({
args: [
'--disable-features=IsolateOrigins,site-per-process',
'--flag-switches-begin --disable-site-isolation-trials --flag-switches-end'
]
})
In case you're not using browser.newPage()
but re-use the existing about:blank
tab (which is not recommended for various reasons) you will experience a page.solveRecaptchas is not a function
error, as the plugin hasn't hooked into this page yet. As a workaround you can manually add existing pages to the lifecycle methods of the plugin:
const recaptcha = RecaptchaPlugin()
const pages = await browser.pages()
for (const page in pages) {
// Add plugin methods to existing pages
await recaptcha.onPageCreated(page)
}
page.solveRecaptchas
:await page.waitForSelector('iframe[src*="recaptcha/"]')
await page.solveRecaptchas()
Copyright © 2018 - 2021, berstend̡̲̫̹̠̖͚͓̔̄̓̐̄͛̀͘. Released under the MIT License.
FAQs
A puppeteer-extra plugin to solve reCAPTCHAs and hCaptchas automatically. This version has support for proxies.
We found that puppeteer-extra-plugin-recaptcha-advanced demonstrated a not healthy version release cadence and project activity because the last version was released a year ago. It has 1 open source maintainer collaborating on the project.
Did you know?
Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.
Security News
Fluent Assertions is facing backlash after dropping the Apache license for a commercial model, leaving users blindsided and questioning contributor rights.
Research
Security News
Socket researchers uncover the risks of a malicious Python package targeting Discord developers.
Security News
The UK is proposing a bold ban on ransomware payments by public entities to disrupt cybercrime, protect critical services, and lead global cybersecurity efforts.