前言 Puppeteer is a JavaScript library which provides a high-level API to control Chrome or Firefox over the DevTools Protocol or WebDriver BiDi. Puppeteer runs in the headless (no visible UI) by default(官网 )
下载依赖
快速入门 1 2 3 4 5 6 7 8 9 const puppeteer = require ("puppeteer" );(async () => { const browser = await puppeteer.launch (); const page = await browser.newPage (); await page.goto ("https://example.com" ); await page.screenshot ({ path : "file.png" }); await browser.close (); })();
创建浏览器对象 不包含参数 1 const browser = await puppeteer.launch ();
包含参数
headless:是否启用GUI
true:缺省值,不启用GUIfalse:启用GUI
userDataDir:指定浏览器数据目录
1 2 3 4 const browser = await puppeteer.launch ({ headless : false , userDataDir : "<dir>" , })
创建一个标签页 1 const page = await browser.newPage ();
标签页访问URL
<url>:指定访问的URL
1 await page.goto ('<url>' );
关闭浏览器对象
截屏
<file>.png:指定截屏文件路径
1 await page.screenshot ({ path : '<file>.png' });
获取页面元素
<selector>:CSS选择器
1 const element = await page.$$eval('<selector>' , el => el.innerText );
等待元素的加载完成 1 await page.waitForSelector ('<selector>' );
完成 参考文献 哔哩哔哩——奇乐编程学院