MCPcopy Index your code
hub / github.com/pyppeteer/pyppeteer

github.com/pyppeteer/pyppeteer @2.0.0 sqlite

repository ↗ · DeepWiki ↗ · release 2.0.0 ↗
1,173 symbols 5,070 edges 67 files 326 documented · 28% 1 cross-repo links
README

Attention: This repo is unmaintained and has been outside of minor changes for a long time. Please consider playwright-python as an alternative.

If you would like to overhaul this code to bring it up to date, please contact me

pyppeteer

PyPI PyPI version Documentation CircleCI codecov

Note: this is a continuation of the pyppeteer project

Unofficial Python port of puppeteer JavaScript (headless) chrome/chromium browser automation library.

  • Free software: MIT license (including the work distributed under the Apache 2.0 license)
  • Documentation: https://pyppeteer.github.io/pyppeteer/

Installation

pyppeteer requires Python >= 3.8

Install with pip from PyPI:

pip install pyppeteer

Or install the latest version from this github repo:

pip install -U git+https://github.com/pyppeteer/pyppeteer@dev

Usage

Note: When you run pyppeteer for the first time, it downloads the latest version of Chromium (~150MB) if it is not found on your system. If you don't prefer this behavior, ensure that a suitable Chrome binary is installed. One way to do this is to run pyppeteer-install command before prior to using this library.

Full documentation can be found here. Puppeteer's documentation and its troubleshooting guide are also great resources for pyppeteer users.

Examples

Open web page and take a screenshot:

import asyncio
from pyppeteer import launch

async def main():
    browser = await launch()
    page = await browser.newPage()
    await page.goto('https://example.com')
    await page.screenshot({'path': 'example.png'})
    await browser.close()

asyncio.get_event_loop().run_until_complete(main())

Evaluate javascript on a page:

import asyncio
from pyppeteer import launch

async def main():
    browser = await launch()
    page = await browser.newPage()
    await page.goto('https://example.com')
    await page.screenshot({'path': 'example.png'})

    dimensions = await page.evaluate('''() => {
        return {
            width: document.documentElement.clientWidth,
            height: document.documentElement.clientHeight,
            deviceScaleFactor: window.devicePixelRatio,
        }
    }''')

    print(dimensions)
    # >>> {'width': 800, 'height': 600, 'deviceScaleFactor': 1}
    await browser.close()

asyncio.get_event_loop().run_until_complete(main())

Differences between puppeteer and pyppeteer

pyppeteer strives to replicate the puppeteer API as close as possible, however, fundamental differences between Javascript and Python make this difficult to do precisely. More information on specifics can be found in the documentation.

Keyword arguments for options

puppeteer uses an object for passing options to functions/methods. pyppeteer methods/functions accept both dictionary (python equivalent to JavaScript's objects) and keyword arguments for options.

Dictionary style options (similar to puppeteer):

browser = await launch({'headless': True})

Keyword argument style options (more pythonic, isn't it?):

browser = await launch(headless=True)

Element selector method names

In python, $ is not a valid identifier. The equivalent methods to Puppeteer's $, $$, and $x methods are listed below, along with some shorthand methods for your convenience:

puppeteer pyppeteer pyppeteer shorthand
Page.$() Page.querySelector() Page.J()
Page.$$() Page.querySelectorAll() Page.JJ()
Page.$x() Page.xpath() Page.Jx()

Arguments of Page.evaluate() and Page.querySelectorEval()

puppeteer's version of evaluate() takes a JavaScript function or a string representation of a JavaScript expression. pyppeteer takes string representation of JavaScript expression or function. pyppeteer will try to automatically detect if the string is function or expression, but it will fail sometimes. If an expression is erroneously treated as function and an error is raised, try setting force_expr to True, to force pyppeteer to treat the string as expression.

Examples:

Get a page's textContent:

content = await page.evaluate('document.body.textContent', force_expr=True)

Get an element's textContent:

element = await page.querySelector('h1')
title = await page.evaluate('(element) => element.textContent', element)

Roadmap

See projects

Credits

This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.

Core symbols most depended-on inside this repo

evaluate
called by 326
pyppeteer/page.py
get
called by 325
pyppeteer/multimap.py
goto
called by 290
pyppeteer/page.py
send
called by 113
pyppeteer/connection.py
close
called by 65
pyppeteer/page.py
setContent
called by 55
pyppeteer/page.py
set_result
called by 51
tests/base.py
newPage
called by 44
pyppeteer/browser.py

Shape

Method 958
Class 151
Function 64

Languages

Python99%
TypeScript1%

Modules by API surface

tests/test_page.py234 symbols
pyppeteer/page.py96 symbols
pyppeteer/frame_manager.py67 symbols
tests/test_frame.py64 symbols
tests/test_network.py59 symbols
tests/test_launcher.py57 symbols
pyppeteer/network_manager.py57 symbols
tests/test_input.py44 symbols
tests/test_element_handle.py41 symbols
tests/server.py37 symbols
pyppeteer/browser.py31 symbols
pyppeteer/element_handle.py25 symbols

Used by 1 indexed graphs manifest dependencies, hub-wide

For agents

$ claude mcp add pyppeteer \
  -- python -m otcore.mcp_server <graph>

⬇ download graph artifact