Categories
WordPress

Scraping JavaScript rendered content from brightside.me – using Crawlomatic and HeadlessBrowserAPI

This is an advanced tutorial showing how to scrape JavaScript rendered content using the Crawlomatic plugin and HeadlessBrowserAPI. šŸ”Ž RESOURCES MENTIONED šŸ‘‡ Crawlomatic ā–ŗ https://1.envato.market/crawlomatic HeadlessBrowserAPI ā–ŗ https://headlessbrowserapi.com/ šŸ¤” Settings I used in the Crawlomatic plugin for correct scraping šŸ‘‡ FOR SINGLE SCRAPING: Scraper Start (Seed) URL: https://brightside.me/inspiration-health/10-daily-habits-that-can-make-deodorant-less-effective-802066/ Content Scraping Method To Use: Puppeteer (HeadlessBrowserAPI)…

This is an advanced tutorial showing how to scrape JavaScript rendered content using the Crawlomatic plugin and HeadlessBrowserAPI.
šŸ”Ž RESOURCES MENTIONED šŸ‘‡
Crawlomatic ā–ŗ https://1.envato.market/crawlomatic
HeadlessBrowserAPI ā–ŗ https://headlessbrowserapi.com/

šŸ¤” Settings I used in the Crawlomatic plugin for correct scraping šŸ‘‡
FOR SINGLE SCRAPING:
Scraper Start (Seed) URL: https://brightside.me/inspiration-health/10-daily-habits-that-can-make-deodorant-less-effective-802066/
Content Scraping Method To Use: Puppeteer (HeadlessBrowserAPI)
Headless Browser Wait Before Rendering Pages (ms): 5000
Strip HTML Elements by Tag Name: svg
Content Query Type: XPath/CSS Selector
Content Query String: //*[@data-test-id=’article-content’]
Strip HTML Elements by XPATH/CSS Selector: //*[@data-adunit-started=’true’]

FOR SERIAL SCRAPING:
Scraper Start (Seed) URL: https://brightside.me/
Do Not Scrape Seed URL: checked
Seed Page Crawling Query Type: XPath/CSS Selector
Seed Page Crawling Query String: //*[@data-test-id=’title-link’]
Content Scraping Method To Use: Puppeteer (HeadlessBrowserAPI)
Headless Browser Wait Before Rendering Pages (ms): 5000
Strip HTML Elements by Tag Name: svg
Content Query Type: XPath/CSS Selector
Content Query String: //*[@data-test-id=’article-content’]
Reverse Crawling Order: checked
Skip Scraping Post If Below Content Query Is Not Found: Checked
Strip HTML Elements by XPATH/CSS Selector: //*[@data-adunit-started=’true’]

šŸ’„ Join this channel to get access to member only videos and PERKS šŸ‘‡
https://www.youtube.com/channel/UCVLIksvzyk-D_oEdHab2Lgg/join

šŸ’„ Join my FREE newsletter to discover my insights (and also to get the YouTube Caption Scraper plugin for FREE) šŸ‘‡
https://coderevolution.mailchimpsites.com/

šŸ’» MY WORDPRESS PLUGINS šŸ‘‡
https://1.envato.market/coderevolutionplugins

ā–¶[SPECIAL OFFER] GET ALL MY PLUGINS AT ONCE! – https://1.envato.market/bundle

šŸ’» MY COURSES šŸ‘‡
https://coderevolution.teachable.com/

šŸ‘šŸ¼ Please help & give the video a like if you enjoyed it!
ā¤ļø Not Yet Subscribed? https://www.youtube.com/channel/UCVLIksvzyk-D_oEdHab2Lgg?sub_confirmation=1 />ā–¶ Check my Community of WordPress Experts by Joining CodeRevolution’s Facebook Group šŸ‘‰šŸ¼ https://www.facebook.com/groups/coderevo/
šŸ”” Hit the notification bell to ensure you get notified!

āœ… CAN I HELP YOU OR YOUR BUSINESS ā“
šŸŒ Become a member of my website today, to enjoy premium tutorials: https://coderevolution.ro/join-the-site/

āœ… EITHER WAY, CAN WE KEEP IN TOUCH ā“
šŸ”— Join the CodeRevolution VIP List here šŸ‘‰šŸ¼ https://coderevolution.mailchimpsites.com/

šŸ—£ļø TALK TO ME AND FOLLOW CODEREVOLUTION šŸ’„ ON SOCIAL MEDIA šŸ‘‡
Instagram ā–ŗ https://www.instagram.com/coderevolution_envato/
Facebook ā–ŗ https://www.facebook.com/CodeRevolution.envato/
Twitter ā–ŗ https://twitter.com/code2revolution
LinkedIn ā–ŗ https://www.linkedin.com/company/18002194
Pinterest ā–ŗ https://pinterest.com/caddy_lagaristu/coderevolution/

šŸ¤” ABOUT CODEREVOLUTION TV šŸ˜ƒ
Hello, Iā€™m Szabi, a 32 years old guy living with my wife and our beautiful 4 year old daughter Maya. I started my journey in WordPress plugin development back in 2017, when I quit my programmer job and became a full time stay at home WordPress plugin developer, entrepreneur, blogger and also daddy. Since then, I implemented over 100 WordPress plugins, earning my full time income from them.

I started this YouTube channel to share tutorials for my plugins with people who are using them, however, since then, the channel has evolved into a daily VLOG, besides of tutorials for my plugins, I am sharing here also my insights about how to be a successful entrepreneur in our current times.

On this YouTube channel, I publish new videos regularly (each time I have to tell you about something new)! If you don’t want to miss my videos, subscribe and hit the bell notification also!

šŸ“š RECOMMENDED RESOURCES šŸ‘‡
See what services I use to power my business online and help me earn as an affiliate
https://coderevolution.ro/recommendations/
Do you want to start autoblogging? Check my recommended resources below šŸ‘‡
https://coderevolution.ro/more-automation/

ā–¶ GEAR I USE IN MY VIDEOS šŸ‘‡
Microphone: Trust GXT 252+ Emita Plus Streaming Microphone:
https://amzn.to/2LB79sv
Web Cam: Logitech Brio 4K Stream Edition: https://amzn.to/2N5CNid

To your success,
Szabi – CodeRevolution.

DISCLAIMER: The information contained on this YouTube Channel and the resources available for download/viewing through this YouTube Channel are for educational and informational purposes only.ā€‹
This description may contain affiliate links. If you purchase a product through one of them, I will receive a commission (at no additional cost to you). I only ever endorse products that I have personally used and benefited from personally. Thank you for your support!

#CRAWLOMATIC #HEADLESSBROWSERAPI #SCRAPING #CRAWLING