How to Extract Any Web Page Information and Export it to Excel


Some tools are so useful that I can’t believe I haven’t been using them all along. My latest discovery is one of those, and I can’t wait to share it with you!

OutWit Hub is a cool Firefox add-on that lets you extract information from any web page and export it to our favorite tool, Excel, for easier management and organization.

When launched, the tool shows you different kinds of data that can be extracted from the current webpage:

  • all the images on the page,
  • all page links,
  • email addresses,
  • page text,
  • RSS feeds found,
  • page tables, etc.
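If you are curious what this kind of extraction looks like under the hood, here is a minimal Python sketch. It is not how OutWit Hub itself is implemented; it only illustrates the idea of pulling links and email addresses out of a page and saving them as CSV, which Excel can open. The sample HTML, the function names, and the deliberately simple email pattern are all my own assumptions.

```python
import csv
import io
import re
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag it sees."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)


# Deliberately simple pattern (an assumption): real addresses can be more varied.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def extract_links_and_emails(html):
    """Return (links in page order, sorted unique email addresses)."""
    collector = LinkCollector()
    collector.feed(html)
    emails = sorted(set(EMAIL_RE.findall(html)))
    return collector.links, emails


def to_csv(rows, header):
    """Render rows as CSV text that a spreadsheet program can open."""
    buffer = io.StringIO()
    writer = csv.writer(buffer)
    writer.writerow(header)
    writer.writerows(rows)
    return buffer.getvalue()


# Hypothetical page content, used only for illustration.
SAMPLE_HTML = """
<p>Contact <a href="mailto:info@example.com">info@example.com</a></p>
<a href="https://example.com/faq">FAQ</a>
"""

links, emails = extract_links_and_emails(SAMPLE_HTML)
links_csv = to_csv([[link] for link in links], ["url"])
print(links_csv)
```

Saving `links_csv` to a file with a `.csv` extension gives you something you can open directly in Excel.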


Let me demonstrate its power using just a few examples:

1. Extract page lists:

Let’s try extracting the tool’s FAQ using one of two methods:

  • Navigate to that page and click the tool icon in the navigation bar;
  • Choose "Lists" and export it to Excel.

OR

  • Let the tool "guess" what to extract: click on "guess" and see all possible data compiled in the form of a handy table.
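For readers who like to see the idea in code, list extraction can be sketched roughly as follows. This is only an illustration, not the tool's actual logic: it assumes flat (non-nested) lists whose items have explicit closing `</li>` tags, and the class name and sample markup are made up.

```python
from html.parser import HTMLParser


class ListItemCollector(HTMLParser):
    """Collects the text of each <li> element (flat lists only)."""

    def __init__(self):
        super().__init__()
        self.items = []
        self._in_item = False
        self._parts = []

    def handle_starttag(self, tag, attrs):
        if tag == "li":
            self._in_item = True
            self._parts = []

    def handle_data(self, data):
        if self._in_item:
            self._parts.append(data)

    def handle_endtag(self, tag):
        if tag == "li" and self._in_item:
            # Join the text fragments and collapse runs of whitespace.
            text = " ".join("".join(self._parts).split())
            self.items.append(text)
            self._in_item = False


# Hypothetical FAQ markup, used only for illustration.
SAMPLE_HTML = (
    "<ul>"
    "<li>What is <b>OutWit Hub</b>?</li>"
    "<li>How do I export?</li>"
    "</ul>"
)

collector = ListItemCollector()
collector.feed(SAMPLE_HTML)
faq_items = collector.items
print(faq_items)
```

Each entry in `faq_items` would become one row in the exported spreadsheet.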

Outwit - extract lists

2. Extract all page images

  • Navigate to any page containing a lot of images;
  • Click the tool icon and then choose "images";
  • See the detailed table containing:
    • each image thumbnail,
    • image source URL,
    • image dimensions,
    • image alt text,
    • image file name.
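A table like this can be approximated in a few lines of Python. The sketch below is an assumption-laden illustration rather than a description of the add-on: it reads dimensions only from the `width` and `height` attributes (it never downloads the images, so there are no thumbnails), and the class name and sample markup are hypothetical.

```python
import posixpath
from html.parser import HTMLParser
from urllib.parse import urlparse


class ImageCollector(HTMLParser):
    """Builds one row per <img> tag: source, alt text, dimensions, file name."""

    def __init__(self):
        super().__init__()
        self.rows = []

    def handle_starttag(self, tag, attrs):
        if tag != "img":
            return
        attributes = dict(attrs)
        src = attributes.get("src") or ""
        self.rows.append({
            "src": src,
            "alt": attributes.get("alt") or "",
            # Only declared dimensions are available without fetching the image.
            "width": attributes.get("width") or "",
            "height": attributes.get("height") or "",
            # The file name is the last path segment, ignoring any query string.
            "file_name": posixpath.basename(urlparse(src).path),
        })


# Hypothetical markup, used only for illustration.
SAMPLE_HTML = (
    '<img src="https://example.com/img/logo.png" alt="Logo" width="120" height="40">'
    '<img src="/photos/cat.jpg?size=large" alt="">'
)

collector = ImageCollector()
collector.feed(SAMPLE_HTML)
image_rows = collector.rows
for row in image_rows:
    print(row)
```

The list of dictionaries maps naturally onto `csv.DictWriter` if you want to open the result in Excel.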

Outwit - extract images

3. Scrape Google Results

This one is a bit more complicated but it demonstrates how flexible and customizable the tool can be (kindly shared by Dale Stokdyk)!

First, you will need to create your own scraper. Here’s a screenshot that pretty much says it all; just do what is shown there:

Google Scraper:

snippets to create a google scraper with outwit hub

Here is detailed info on creating your first scraper, as well as the post where I found this cool tip.

  • Set Google to show 100 results per page (to have more data to export and analyze);
  • Search for any phrase;
  • Click the tool icon and click "scraper".
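To give a feel for what a custom scraper does, here is a small Python sketch. It rests on an assumption of mine: that a scraper is defined by the text that appears right before and right after each value you want to capture. The settings from the screenshot are not reproduced here, and the sample markup below is invented; it is not Google's real result markup.

```python
import re


def scrape_between(source, marker_before, marker_after):
    """Return every piece of text found between the two markers."""
    pattern = re.escape(marker_before) + r"(.*?)" + re.escape(marker_after)
    return re.findall(pattern, source, flags=re.DOTALL)


# Invented result markup, used only for illustration.
SAMPLE_HTML = (
    '<h3 class="title">First title</h3><cite>https://example.com/a</cite>'
    '<h3 class="title">Second title</h3><cite>https://example.org/b</cite>'
)

titles = scrape_between(SAMPLE_HTML, '<h3 class="title">', "</h3>")
urls = scrape_between(SAMPLE_HTML, "<cite>", "</cite>")

# Pair each title with its URL, one row per search result.
result_rows = list(zip(titles, urls))
for title, url in result_rows:
    print(title, url)
```

Markers like these break whenever the page markup changes, so expect to revisit them from time to time.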

Outwit - Scrape Google results

Ann Smarty
Ann Smarty is the blogger and community manager at Internet Marketing Ninjas. Ann's expertise in blogging and tools serves as a base for her writing, tutorials and her guest blogging project, MyBlogGuest.com.
  • http://www.stephanmiller.com Stephan Miller

    Outwit Hub is a great tool. Saves a lot of time.

  • http://www.staygolinks.com/ Barry Welford

    Great find, Ann. That will prove to be very useful, not least in developing blog posts about others you may see.

  • http://www.netmagellan.com Ash Nallawalla

    Great tweet, Ann. The plugin will give the Indian outsourcing industry a new lease on life. 🙂

  • http://www.3r.ie Marketing

    Great idea, let’s try it and check whether it will really save so much time – since we are now in the process of redesigning our website, and with over 600 pages, it would be a reasonable time…

  • http://www.productivity-magazine.info Jen

    another great post & tip Ann, I always look forward to reading your posts.
    🙂

  • http://www.uman.be Jen

    Dare I ask a stupid question?
    Is there a tool that can find keywords that you are ranking on “fairly well”, like second or third page? Meaning without starting off with a keywords list and then running a rank-checking tool like SEO toolbar, but just doing a search for your url and then discovering what keyword/phrase Google has you listed under?
    The logic being that if you are on page two or three then maybe there’s an opportunity to get that keyword/phrase moved up to page 1.

    Or am I completely in left-field here?

  • http://www.moscow.com.ru Moscow

    I think that it could be very helpful for many users. Thanks for the great stuff.

  • http://www.myseolearning.wordpress.com Norma

    Hi Ann,

    Looks like an awesome tool. I have installed in my system. Thank you for the information.

  • http://stewartmedia.biz/myblog/ Jimboot

    Handy little tool Ann. Great post. Our team is playing with it now 🙂

  • http://www.tag44.com Tag44

    Nice tool, am pretty much aware of this tool but not used it anyhow now i think i am going to use it.

  • http://shahzadsyed.blogspot.com Syed

    Great Tools thanks for sharing with us it will really help doing SEO work

  • Keonda

    Just tried the tool.
    Are you aware of a trick to download the links’ anchor text as well?

  • http://www.marketing2oh.com Dale Stokdyk

    Ann, thanks for mentioning and linking to my marketing2oh.com blog post about scraping search engine results with OutWit Hub!

    I hope it’s okay to mention that I also put together a video overview which can be found at YouTube (or the marketing2oh blog).

    Love the tools you’ve helped me discover — keep it up!

    • https://www.searchenginejournal.com Ann Smarty

      Thanks a lot for your find, Dale 🙂

  • http://www.williamaktin.com William Atkin

    I love this tool! Great find Ann. I use this thing all the time because it cuts my work in half.