1. This site uses cookies. By continuing to use this site, you are agreeing to our use of cookies. Learn More.

Extracting meta data from a website, Is this possible?

Discussion in 'Web Development' started by CarlyFiorina, Jul 13, 2011.

  1. CarlyFiorina

    CarlyFiorina Banned

    Joined:
    Jun 16, 2011
    Messages:
    31
    Likes Received:
    0
    Trophy Points:
    0
    Hello all,

    Is it possible to extract meta data from your website and get it all in a list, a client (With a huge site) has asked for a list of all the meta data and their corresponding pages but i'm not sure if there is a tool that would extract?

    Thank you in advance!
    Carly Fiorina.
     
    Last edited by a moderator: Jul 13, 2011
  2. pein87

    pein87 New Member

    Joined:
    Aug 6, 2010
    Messages:
    173
    Likes Received:
    47
    Trophy Points:
    0
    Occupation:
    Web Dev
    Location:
    Limbo
    Use a html scrapper. That of write your own that reads only the meta tags and turns them into a multi-dimensional array that can be iterated through and listed as a styled html page or grid.

    some links below will help you either make your own or provide a solution in php.

    http://stackoverflow.com/questions/26947/how-to-implement-a-web-scraper-in-php

    http://www.bradino.com/php/screen-scraping/

    http://www.phpclasses.org/package/1754-PHP-Extract-structured-data-from-remote-HTML-pages.html

    http://simplehtmldom.sourceforge.net/
     
    Last edited: Jul 14, 2011

Share This Page