Extracting meta data from a website, Is this possible?

Discussion in 'Web Development' started by CarlyFiorina, Jul 13, 2011.

  1. CarlyFiorina

    CarlyFiorina Banned

    Joined:
    Jun 16, 2011
    Messages:
    31
    Likes Received:
    0
    Trophy Points:
    0
    Hello all,

    Is it possible to extract meta data from your website and get it all in a list, a client (With a huge site) has asked for a list of all the meta data and their corresponding pages but i'm not sure if there is a tool that would extract?

    Thank you in advance!
    Carly Fiorina.
     
    Last edited by a moderator: Jul 13, 2011
  2. pein87

    pein87 Active Member

    Joined:
    Aug 6, 2010
    Messages:
    173
    Likes Received:
    47
    Trophy Points:
    28
    Occupation:
    Web Dev
    Location:
    Limbo
    Use a html scrapper. That of write your own that reads only the meta tags and turns them into a multi-dimensional array that can be iterated through and listed as a styled html page or grid.

    some links below will help you either make your own or provide a solution in php.

    http://stackoverflow.com/questions/26947/how-to-implement-a-web-scraper-in-php

    http://www.bradino.com/php/screen-scraping/

    http://www.phpclasses.org/package/1754-PHP-Extract-structured-data-from-remote-HTML-pages.html

    http://simplehtmldom.sourceforge.net/
     
    Last edited: Jul 14, 2011

Share This Page

  1. This site uses cookies to help personalise content, tailor your experience and to keep you logged in if you register.
    By continuing to use this site, you are consenting to our use of cookies.
    Dismiss Notice