googlesearch-python
Readme
googlesearch is a Python library for searching Google, easily. googlesearch uses requests and BeautifulSoup4 to scrape Google.
To install, run the following command:
python3 -m pip install googlesearch-python
To get results for a search term, simply use the search function in googlesearch. For example, to get results for "Google" in Google, just run the following program:
from googlesearch import search
search("Google")
googlesearch supports a few additional options. By default, googlesearch returns 10 results, but this can be changed. For example, to get 100 results from Google, run the following program:
from googlesearch import search
search("Google", num_results=100)
In addition, you can change the language Google searches in. For example, to get results in French, run the following program:
from googlesearch import search
search("Google", lang="fr")
If you want to turn off the safe search function (this function is on by default), you can do this:
from googlesearch import search
search("Google", safe=None)
To extract more information, such as the description or the result URL, use an advanced search:
from googlesearch import search
search("Google", advanced=True)
# Returns a list of SearchResult
# Properties:
# - title
# - url
# - description
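As a rough illustration of working with the three properties listed above, the sketch below formats result objects into readable lines. FakeResult and format_results are hypothetical names introduced here, not part of googlesearch; FakeResult only stands in for a search result so the example runs without a network connection.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass
class FakeResult:
    """Hypothetical stand-in exposing the same three properties as a result."""
    title: str
    url: str
    description: str


def format_results(results: Iterable) -> List[str]:
    """Turn objects with title, url, and description into numbered lines."""
    lines = []
    for rank, result in enumerate(results, start=1):
        lines.append(f"{rank}. {result.title} <{result.url}>: {result.description}")
    return lines


# Offline sample data; with the library installed, the results of
# search("Google", advanced=True) could be passed in instead.
sample = [FakeResult("Google", "https://www.google.com/", "Search the web.")]
formatted = format_results(sample)
for line in formatted:
    print(line)
```

Because format_results only reads the title, url, and description attributes, it should accept any iterable of objects that expose them.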
If requesting more than 100 results, googlesearch will send multiple requests to go through the pages. To increase the time between these requests, use sleep_interval:
from googlesearch import search
search("Google", sleep_interval=5, num_results=200)
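To get a feel for how sleep_interval affects total run time, the sketch below estimates the number of requests and the minimum time spent pausing between them. It assumes each request covers up to 100 results and that one pause happens between consecutive requests; the real paging logic inside googlesearch may differ, and estimate_requests and estimate_min_delay are hypothetical helpers, not library functions.

```python
import math

# Assumption: one request covers up to 100 results, based on the
# "more than 100 results" threshold mentioned in the text.
PAGE_SIZE = 100


def estimate_requests(num_results: int, page_size: int = PAGE_SIZE) -> int:
    """Estimate how many requests are needed for num_results."""
    return max(1, math.ceil(num_results / page_size))


def estimate_min_delay(num_results: int, sleep_interval: float,
                       page_size: int = PAGE_SIZE) -> float:
    """Estimate the minimum total pause, assuming one pause between requests."""
    return (estimate_requests(num_results, page_size) - 1) * sleep_interval


requests_needed = estimate_requests(200)
min_delay = estimate_min_delay(200, sleep_interval=5)
print(requests_needed, min_delay)
```

Under these assumptions, the call above with num_results=200 and sleep_interval=5 would involve two requests and at least one five-second pause.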
If you are using an HTTP rotating proxy that requires you to install its CA certificate, you can add ssl_verify=False to the search() call to skip SSL verification.
from googlesearch import search
proxy = 'http://API:@proxy.host.com:8080/'
j = search("proxy test", num_results=100, lang="en", proxy=proxy, ssl_verify=False)
for i in j:
    print(i)
FAQs
A Python library for scraping the Google search engine.
We found that googlesearch-python demonstrated a healthy version release cadence and project activity because the last version was released less than a year ago. It has 1 open source maintainer collaborating on the project.