Google Sheets import functions definitely needed a revamp so I've made ImportFromWeb and ImportJSON
ImportFromWeb is a powerful scraper in a G Sheets function. It even loads JS rendered pages:
https://nodatanobusiness.com/importfromweb/
I don't have a perfect suggestion to share here, but a few options that could work:
importXML
and importHTML
functions where you can specify which elements from the page you want to import. That'd work with public data, but wouldn't work with stuff behind an account/paywall.My fav is Instant Data Scraper. It's a chrome extension that can automatically parse any tabular data format and export it in CSV format. Link - https://chrome.google.com/webstore/detail/instant-data-scraper/ofaokhiedipichpaobibbnahnkdoiiah
I don't know you, you don't know me, but I love ya~
You saved me hours and hours of work and tears.
Thanks a lot <3
I would give Clay.run a shot! I've enjoyed working with their team and am happy to provide an intro.
Interesting, I didn't realize Clay had a data extraction tool! Can you have it watch for text changes on any site? If so, that would essentially let you turn any site into an API for automation...
Yes they have a chrome extension that easily allows you to scrape data from any website and import it into Clay bases. I haven't seen a way for them to watch for text changes on any site but perhaps we should get their CEO, Kareem, in here to let us know if that's on the roadmap :)
Ahhh gottcha, that'd work for one-time processes, but wouldn't work for automatically parsing sites and checking back to find changes over time.
You should invite them!
Haven't done scraping for a bit but used Octoparse in the past and it worked like a champ.
Some ideas are listed here: https://buildastack.com/product-category/personal-productivity/web-scrapper/ (Disclaimer: Build a Stack is my website)
Personally, I have used Listly and found it does the job well.
Looking for a better way to plan remote meetings across time zones, and keep up with events. What software is doing that best today?
We have 15k newsletter subscribers, and have around ~2k of them in a Slack group. We're starting to encounter issues in terms of community management - specifically, it's hard to pin content like c...
Google lets you subscribe to a calendar using a URL - although when using an Outlook 365 Calendar link, events are copied over once, and then the syncing stops. This seems to be a relatively new is...
Neat, thanks for sharing!